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456497 AW967956 Hs.123648 ESTs, Weakly similar to AF1 08460 1 ublnu 3.5 

413786 AW613780 Hs.1 3500 ESTs 3.5 

430935 AW072916 zinc finger protein 131 (dona pHZ- 10) 3.5 

439904 AW892676 gb:CM3-NN00O4-2B030O-13U12 NN0004 Homo 3.5 

449909 AA004681 Hs.59432 ESTs 3.5 

445817 NMJJ03642 Hs. 13340 histone aceiyltransferase 1 3.5 

419594 AA013O51 Hs^1417 topolsomerase (DNA) tl binding protein 3.5 

417251 AW015242 Hs.99488 ESTs, Weakly similar to YK54_YEAST HYPOT 3.5 

423545 AP000692 Hs.129781 chromcsome 21 open reading frame 5 3.5 

438527 A1969251 Hs. 11 5325 RAB7, member RAS oncogene family-like 1 3.5 

453916 AW974874 Hs.75212 <mi»e decarboxylase 1 3.5 

430200 BE613337 Hs.234896 geminin 3.4 

437187 AL080208 Ks.306325 Homo sapiens mRNA; cONA DKFZp58 6C 1 523 (f 3.4 

423645 AI215632 Hs,147487 ESTs 3.4 

432370 AA308334 Hs^74424 r4^etytneuramlnic acid phosphate synlha 3.4 

434966 AA657494 gbmt66f04.s1 NCLCGAPJ>r3 Homo sapiens 3.4 

4 <W571 NM_015902':Homo sapiens progestin induce 3.4 

418727 AA227609 Hs.94834 ESTs 3.4 

436374 AA400709 Hs.96716 ESTs, Weakly similar to T17210 hypotheti 3.4 

452721 AJ269529 Hs.301871 sctoranierfan%37(gryceroWi)rio 3.4 

427472 AA522539 Hs.131250 transposorwJerived Buster3 r/arrsposase-l 3.4 

418245 AA088767 Hs.83883 transmembrane, prostate androgen induced 3.4 

402031 ENSP00000251 Q56*:Rasma membrane calcium 3.4 

407137 T97307 gb;ye53h05.s1 Soares fetal Over spleen 3.4 

449603 AI655662 Hs.197698 ESTs 34 

425592 D90041 Hs. 155956 W^cefyitransferase 1 (aryiamrne Nicety 3.4 

436805 AA731533 Hs.270751 ESTs 3.4 

4373B7 AA749316 Hs.271879 ESTs, Moderately similar to ALU 1 HUMAN A 14 

407908 BE379758 Hs.1 10853 uncharacterized hematopoietic stem/proge 3.4 

430144 A1732722 Hs.187694 ERGL protein; ERGIC-53-fike protein a4 

416292 AA1 79233 Hs.42390 nasopharyngeal carcinoma susceptibility 14 

423800 AA331156 gb:EST35034 Embryo, 6 week, subtracted { 3.4 

439375 AA689526 Hs.344249 steroid dehydrogenase homolog 3.4 

429084 AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD 3.4 

432886 BE15902B Hs.279704 chromatin accessibiGty complex 1 14 

408267 AW380525 Hs.343564 tubu&wpecific chaperone e 14 

408832 AW085690 Hs.63428 ESTs, Weakry similar to Z1 95 HUMAN ZINC 14 

422564 AI148006 Hs.222120 ESTs 14 

424842 AA034127 Hs.153487 signal transducing adaptor motecute (SH3 14 

429984 AL050102 Hs.227209 hypothetical protein FU21 61 7 14 

435256 AF193766 Hs.13872 cytoldne-fike protein C17 14 

436137 AI056769 Hs.133512 ESTs 14 

440348 AW015802 Hs.47023 ESTs 34 

442910 AJ365130 Hs.11307 ESTs, Weakry similar to T19326 hypolheQ 14 

443242 BE243910 Hs.9082 rrudeoporin p54 14 

445469 AW298370 Hs.153714 ccmplementclq tumor necrosis factowel 14 

449517 AW500106 Ks.23643 serine/threonine protein kinase MASK 14 

425354 U62027 Hs.155935 complement component 3a receptor 1 3.4 

433037 NNL014158 Hs.279938 HSPC067 protein 3.4 

423044 AA320829 Hs.97266 protocadherin 18 34 

434128 W93170 Hs.284164 protein x 0004 14 

449845 AW971183 Hs.9683 Dnal(rfep40)horndog,subfan^C l niembe 14 

423590 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Ekv/Sex- 14 

440370 AA884000 Hs.8173 hypothetical protein FU 10803 14 

438825 BE327427 Hs.79953 ESTs 14 

446874 AW988304 Ks.56156 ESTs 14 

408101 AW968504 Hs.123073 CDC2-felated protein kinase 7 14 

419131 AA406293 Hs.41167 ESTs 14 

423178 AI033140 Hs.124983 Homo sapiens mRNA; cDNA DKFZp564C142 (fr 14 

433764 AW753676 Hs.39982 ESTs 3.4 

440658 H29142 Hs. 143032 ESTs, Weakly similar to neuronal thread 14 

411558 AA102670 Hs.70725 garnma-ammobutyrfc acid (GABA) A recepto 13 

452815 AA418841 gbaw01e1U1 Soares_NhHMPu_S1 Homo sapi 13 

421234 AA907153 Hs.190060 ESTs 13 

424075 AI807320 Hs^27630 RE1<suenring transcripfon factor 13 

412045 AA099802 Hs.83883 transmembrane, prostate androgen Induced 13 

420041 AB005142 Hs.94592 Wotho 13 

426174 AA547959 Hs.115838 ESTs 13 

427083 NMJM6363 Hs.173497 Sec23 (S. cerevistae) homolog B 3.3 

441021 AW578716 Hs.7644 HI histone family, member 2 13 

436664 AW197887 Hs^53353 ESTs 13 

441124 T97717 Hs.119563 ESTs 3.3 

444169 AV648170 Hs.58756 ESTs 13 

421476 AW953805 HsJ1887 ESTs 13 

422165 AL041199 Hs.1481 hlstidine oecarboxytase 13 

455100 BE160198 gbK3V1-HT0413-0102(MM)59^03 HT0413 Homo 3.3 

450747 AI064821 Hs.3t8535 ESTs. Highly similar to 181 8357AEWS gen 3.3 

401197 ENSP00000229263*:HSPC213. 13 

433404 T32982 Hs.102720 ESTs 13 

422546 AB007969 Hs.301478 K1AA0500 protein 3.3 

443884 N20617 Hs.1 94397 lepfin receptor 13 

414341 080004 Hs.75909 KIAA01B2 protein 13 

425292 NMJJ05824 Hs.1 55545 37 kDa leucine-dch repeat (LRR) protein 13 
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414407 AA147026 Hs76704 ESTs 33 

420324 AF163474 Hs.96744 prostate sndrogen-reguiated transcript 1 3.3 

423453 AW450737 H$.128791 CO-09 prolan 33 

444170 AW613879 Hs.102408 ESTs 33 

445474 AI240014 Hs.259558 ESTs 13 

450582 AG39732 Groh RNAsequence binding factor 1 3.3 

451246 AW189232 Hs.39140 cutaneous T-ce3 lymphoma tumor antigen 3.3 

434194 AF119847 Hs.283940 Homo sapiens PR0 1 550 mRNA, partial ods 3.3 

444324 AI301330 Hs.143838 ESTs 3,3 

427581 NM.014788 Hs.179703 WAA0129 gene product 3.3 

429258 AM48755 gbzx10e09.r1 Soares total fetus Nb2HF8 3.3 

400880 NMJX)0611*:Homo sapiens C059 antigen p18 3.3 

445525 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger pro 3.3 

451968 H56196 Hs.26409 Homo sapiens mRNA; cDNA DKFZp547K204 (fr 3.3 

411190 AA306342 Hs.69171 protein kinase CJike 2 3.3 

410660 AI061118 Hs.65328 Fanconi anemia, complementation group F 3.3 

420493 AI635113 Hs.27Q366 ESTs, Weakly similar to 178885 serineAh 3.3 

423740 Y07701 Hs.293007 aminopepfidase puromycin sensitive 3.3 

428523 AW974540 Hs.98626 ESTs 3.3 

428743 AL080060 Hs.301549 Homo sapiens mRNA; cDNA DKFZp564H172 (fr 3.3 

432363 AA534489 gb:nf76g1 \M NCI CGAP Co3 Homo sapiens 3.3 

435102 AW899053 Hs.76917 F-box only protein 8 33 

438147 AW250553 H-2K binding factor-2 13 

445808 AV655234 ESTs, Moderately similar to PC4259 rem 3.3 

453058 AW612293 Hs.288684 HomosaplenscONAFU11750fis.cioneHE 3.3 

453286 AA034319 Hs.29041 HomosaptenscDNAFU14177tis,ctoneNT 3.3 

453802 AL134757 gb:DKFZp547G1Q90_r1 547 {synonym: hfbrl) 3.3 

422017 NJO03877 Hs.1 10776 STAT induced STAT inWbitor-2 3.3 

411373 BE326276 Hs.8861 ESTs 33 

435114 AA775483 Hs.288936 mitochondrial riboscmal protein 19 3.3 

443674 AI081330 Hs.1 45008 ESTs 33 

427528 AU077143 Hs.179565 midchromosome maintenance deficient (S. 3.3 

433800 AI034361 Hs.135150 lung type4 cell membrane-associated gly 3.3 

420969 AI636310 Hs.28310 ESTs 3.3 

421654 AW163267 Hs.106469 suppressor of varl (S.cerevisiae) Hike 3.3 

409542 AA5O3020 Hs.36563 hypothetical protein FU2241 8 3.3 

410119 F07841 Hs.13926 ESTs 13 

417379 AA196390 gb:zp99b10.s1 Stratagene muscle 937209 H 13 

423201 NM_000163 Hs.125180 growth hormone receptor 13 

442787 W93048 Hs.250723 hypothetical protein MGC2747 3.3 

407930 AA045847 Hs.188361 Homo sapiens cDNA FU12807 fis, clone NT 3.3 

423427 AL137612 Hs.285848 WAA1454 protein 3.3 

424903 T26477 Hs.22883 ESTs, Weakly similar to I38022 hypothec 3.3 

426775 AA384564 ESTs 33 

432378 AI493046 Hs.146133 ESTs 33 

438875 AA827640 Hs.189059 ESTs 33 

452959 AI933416 Hs.189674 ESTs 33 

453124 AI139058 Hs.125790 teucine-rich repeat-containing 2 3.3 

417560 U73338 Hs.82283 5^uiyltetrahydr(rfofatfrhornocysteineme 3,2 

433002 AF048730 Hs.279906 cydinTI 32 

430929 AA489166 Hs.1 56933 ESTs 32 

437444 H46008 Hs.31518 ESTs 32 

446297 AI346930 Hs.149728 ESTs 32 

407870 AB032990 Hs.40719 hypothetical protein WAA1164 3.2 

429377 AA813192 Hs.2O0596 WAA0547 gene product 32 

426030 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 32 

420583 H77859 Hs.65450 reficuton4 3.2 

425264 AA353953 Hs.20369 ESTs, Weatdy similar to gonadotropin ind 3.2 

442562 BE379584 doTudiyWiphosphoongosaccharide-protd 3 2 

418871 NVL001608 Hs.1209 acyl-Coenzyme A dehydrogenase, tong chal 12 

421029 AW057782 Hs.293053 ESTs 32 

452997 N64777 Hs.44656 ESTs • 3.2 

435035 BE568487 Hs.47668 x006pmtein 32 

408412 AW193033 ;Hs,124436 ESTs 32 

421684 BE281591 Hs.106768 hypothetical protein FU10511 3.2 

421887 AW161450 Hs.109201 CGI-86 protein 32 

432409 AA806538 Hs.1 30732 KIAA1 575 protein 32 

434378 AA631739 Hs.335440 EST 32 

409401 AI201895 Hs.181309 proteasome (prosome, macropain) subunit, 3.2 

420195 N44348 Hs.26243 Homo sapiens cDNA FU1 1177 fis, done PL 3.2 

410264 AK001853 Hs.61508 Homo sapiens cONAFU 10991 fis, done PL 3.2 

410296 AW630675 Hs.271946 ESTs 32 

413838 AV661185 Hs.75574 mltrxrKmdnalnTwsomal protein L1 9 3.2 

414342 AA742181 Hs.75912 WAA0267 protein 3.2 

414709 AA7O4703 Hs.77031 Sp2 transcription factor 3.2 

425475 W56339 Hs.107057 ESTs 32 

437814 AH388192 Hs.135474 ESTs, Weakly similar to DDX9 HUMAN ATP-0 12 

449897 AW819642 Hs.24135 transmembrane protein vezatin; hypolheti 12 

452295 BE379936 Hs.28866 programmed cell death 10 12 

437517 AI927675 Hs.244482 Homo sapiens, clone 1MAGE-.361 1719, mRNA, 12 

432960 AW150945 Hs.144758 ESTs 12 

452498 AK000101 Hs.29700 hypothetical protein FU20094 12 

427299 AA830210 Hs.214263 ESTs, Moderately similar to ALU1_HUMAN A 12 
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ESTs 12 

cryptic gene 3.2 

ESTs 12 

ESTs 3.2 

hypothetical protein 3.2 

ESTs,WeaWysirrilartoA47582B^gr 3.2 

WAA1181 protein 3.2 

ESTs, Moderately similar to ZN91_HUMAN Z 3.2 

ESTs 3.2 

ESTs 3.2 

zinc finger protein ZNF14(Mike protein 3.2 

ESTs 3.2 

ESTs 3.2 

ESTs 3.2 

Homo sapiens cDNA FU 12300 fis, done MA 3.2 

sema domain, imrnunogtobulin domain (Ig), 3.2 

Target Exon 3.2 

ESTs 3.2 

ESTs, Weakly similar to ALU8.HUMAN ALU S 3.2 

Homo sapiens cONA FU10674 fis, done NT 3.2 

zinc finger protein 161 3, 2 

NM.001 397:Homo sapiens endolhelin conver 3.2 

hypolheCcal protein FU 13346 3.2 

Homo sapiens cDNA: FU21000 fis, done C 3.2 

ESTs, Moderately similar to 178885 serin 3.2 

RAD1 (S. pombe) homotog 3,2 

Homo sapiens done 24407 mRNA sequence 3.2 

ESTs 3.2 

nuclear factor related to kappa B bindin 3.1 

hypotheflcal protein MGC13102 3.1 

phosphatidic acid phosphatase type 2A 3.1 

ESTs 3.1 

ESTs 3.1 

ESTs 3.1 

proenkephailn 3.1 

solute carrier family 30 (zinc transport 11 

ESTs, Weakly similar to ALUC.HUMAN !!!! 11 

deiodinase, iodo thyronine, type II 11 

Homo sapiens, Similar to RiKEN cDMA 5830 11 

COC14 (cell division cyde 14, S. cerevi 11 

ESTs 11 

RAB4, member RAS oncogene family 3.1 

Homo sapiens done 24405 mRNA sequence 3.1 

ESTs 3.1 

guanine nucleotide binding protein 4 3. 1 

ESTs 11 

ESTs 11 

ESTs, Weakly similar to ALU1_HUMAN ALUS 3.1 

Homo sapiens mRNA; cDNA DKFZp564B083 (fr 11 

ESTs 11 

pre-B-ceH leukemia transcription fador 11 

hypothetical protein FU12577 3.1 

sdute carrier famiry 2 (facilitated glu 11 

WAA1048 protein 11 

ESTs 11 

matrilin2 3.1 

ESTs, WeaWy similar to T24396 hypolheU 3.1 

guanine nucleotide binding protein 4 11 
ESTs 3.1 
hypothetical protein FU13149 11 
gbzo65aOZr1 Stratagene pancreas (93720 11 
coalomer protein complex, subunit beta 11 
ESTs 11 
ESTs 11 
ESTs 11 
hypothetical protein FU1 1273 3.1 
ESTs 11 
ESTs 3.1 
ESTs 11 
hypothetic^ protein FU21 159 11 
ESTs, Weakly similar to ALU4 JiUMAN ALU S 11 
prostate cancer associated protein 7 3.1 
ESTs 11 
WAA1627 protein 11 
ESTs, Weakly simBar to unnamed protein 11 
ATPase, Ca transporting, type 2C, member 11 
ESTs 11 
ESTs 3.1 
DKFZP564G202 prate* 11 
ESTs 11 
ESTs 11 
ESTs 3.1 
ESTs 11 
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hypothelica! protein FU 11 360; artemtsp 3.1 

ESTs ai 

ESTs 3.1 

NM_002425:Homo sapiens matrix rnetalopro 3.1 

RAN binding protein 7 3.1 

ubiquitin-acGvafing enzyme El (A1S9T an 11 

zinc finger protein 183 (RING finger, C3 3.1 

ESTs 3.1 

gb:nc26a07.s1 NCI_CGAP_Pr1 Homo sapiens 3.1 

Homo sapiens. Similar to RIKEN cDNA 28 1 0 3.1 

ESTs 3.1 

Homo sapiens cDNA FU 12187 Ms, clone MA 3.0 

HSPC034 protein 3.0 

intedeukin 7 receptor 10 

hypothetical protein MGC14141 3.0 

ESTs 3.0 

aldehyde dehydrogenase 1 famBy, member 3.0 

Homo sapiens, ctone IMAG&4298026, mRNA, 3.0 

Human DNA sequence from done RP5-876B10 10 

ESTs io 

ESTs 10 

OC32 10 

Homo sapiens done 24923 mRNA sequence 3.0 

ESTs 10 

transforming growth factor, beta 3 3.0 

phosphodiesterase 9A 10 

ESTs 3.0 

Homo sapiens mRNA; cDNA DKFZp586Ml418 (f 10 

hypothetical protein FU14751 3.0 

DKF2P434F091 protein 3.0 
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K1AA0586 gene product 10 
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ESTs 10 
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purinergic receptor (family A group 5) 10 

ESTs io 

catumenin 3.0 

hypothetical protein MGC4126 10 

ESTs io 

hypothetical protein FU 20274 3.0 

hypothetical protein FU10074 3.0 

ESTs io 

gb:QV-BT20tW1049W)07 BT200 Homo sapien 10 

ESTs io 

ESTs 10 

crystaJHn, zeta (qiinone redudase)-tik 10 

ESTs 3.0 

Homo sapiens, clone IMAGE:3507281, mRNA. 3.0 

ESTs io 

ESTs io 

Homo sapiens mRNA; cONA DKF2p556M063 (rr 3.0 

hed domain and RLD3 io 

UV-1 protein, estrogen regulated 10 

ESTs io 

ESTs 3.0 

hypothetical protein FU10422 10 

ESTs io 

DKFZP434G032 protein 10 

Homo sapiens, clone IMAG&41 79482, mRNA 3.0 

nudix (nucleoside diphosphate linked moi 10 

ESTs, Weakly similar to Homotog of rat Z 10 

K1AA1028 protein 30 

gb:zr97g10.r1 NCLCGAPJ3CB1 Homo sapiens 10 

hypothetical protein FU 1 1 021 stmflar to 10 

PFTAIRE protein kinase 1 10 

hypothetical protein FU1 1171 3.0 

ESTs 3.0 

ESTs 10 

RBP1 -tike protein 10 

ESTs, Weakly similar to A46010 X-Gnked 10 



TABLE 60B 



Pkey: Unique Eos probeset Wentifier number 



WO 03/042661 



PCT/US02/36810 



CAT 



Pkey 

411479 

411667 
414372 
414680 

415989 
416288 



416913 
417379 
418304 
416647 
418866 
419536 
419544 
423412 
423800 
426226 
426413 
426503 
426775 

426991 
428342 

429163 
429220 
429258 
430935 
430968 
431429 



432093 
432125 
432189 
432340 
432363 
432600 
432810 

434579 
434966 
435023 
437866 



438147 
439092 
439518 
439904 
440840 

441102 
442562 



443161 
445808 
447082 
448212 
449625 



Genbank accession numbers 
CAT Number Accessions 



450582 
452260 



452815 
453802 
455100 



1247077J 

1253334J 
143909.1 
147525.1 

156454.1 

1585983.1 

162718.1 

153001.1 
167238.1 
173658.2 
177521.1 
179788J 
185688 J 
185760.2 
228001.1 
232161J 
262918.1 
266650.1 
268283.1 
271683.1 

27415.1 
290035.2 

300543 1 
301384.1 
301917.1 
32577*J 
326269.1 
33313 1 



341283 1 
341776 1 
342819 1 
345248.1 
345469 1 
350959 1 
354375.1 



6.1 
396504 1 
398093 1 
44433.2 



45074.1 
468554.1 
47334.1 
479942 1 
50357.2 

509604J 
54500.2 



561305.1 

65133J 

707248.1 

755099.1 

8113.1 



450580 83929.1 



83933.1 
9074.1 



92338.2 



93255.1 

981589.1 

1253334J 



AW848047 AW848202 AW848631 AW848142 AW848702 AW8481 21 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069 
AW848905AW848214 

BE1601 98 AW935898 T1 1520 AW935930 AW856073 AW861034 
AA143854 AW753140 AA213770 AW970865 AA569075 AA492132 

AA743331 AA837388 AW664540 AA775711 AA150955 BE465475 AA968994 AA886905 AI141054 AW194991 AA252147 AA31 9500 All 84288 
AA708749 AA644620 AA652769 AA242975 AA151074 T19890 

A1267700 A1720344 AA191424 A1023543 AI469633 AA 172056 AW958465 AA172236 AW953397 AA355086 
H51299 H44619 H46391 R86024 H51892 T72744 

AI633044 AW016212 AW241143 AA769058 R43272 AW068958 AA210918 AA293774 A1748815 AI763294 AJ333114 A1277384 A1088297 A1468477 

AI824624 AW189606 A1631751 240749 AI984673 A1671316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597 

AW934714 BE161007 BE 1 62500 AW749902 AW749864 BE162498 BE 161 005 AA1 90449 AW513465 BE161006 BE162499 

AA196390 AA507837 AA196468 

AA215702 AA368006 AA215703 BE056555 BE006876 

AA226198AA226513AA3B3773 

T65754AA229857AA229658 

AA603305 AA244095 M244183 

AI909154 AA526337 AA244193 AI909153 

AF109300 A1299378 AI202654 

AA331 156 AA331157 AA331 155 

AA769045 AA372590 AW963633 

AA377823 AW954494 AI022688 

AA3801 53 AA380233 AW983529 

AA384564 AW966475 H02121 N41297 D63213 AA886888 AI922414 AW044240 AW196808 AI076736 AA599294 AI954433 AW1 17617 A1640323 
H98134 

AK001536 AA191092 AW510354 AI5542S6 AL353968 AA134266 

AI7391 68 AA426249 AI199636 AW5051 98 AW977291 AA824583 AA883419 AA724079 AI015524 AI377728 AW293682 AI928140 AA731438 

AI092404AI085630AA731340 

AA884766 AW974271 AA592975 AA447312 

AW207206 AW341473 AA448195 AI951341 

AA448765 C04967 C03045 AA658293 

AW07291 6 A118491 3 AA4891 95 AW466994 AW469044 N59350 AS819642 AI280239 AI220572 AA789302 AI47361 1 AW841 126 D60937 
AW972830 AA527647 AA489820 AA570362 

AF072813AF119297 M362885AF059524N^006054M157365AW163623 M056148AA227062 M418057^ 

D58629 AA232373 AA233577 T35955 BE618035 AA354497 AA359082 T32010 AA134519 BE299901 BE268096 BE396B26 AA324268 AL120308 

AA187561 AA311680 

H28383 AW972670 H28359 AA525808 

AW972567 AA526539 AI057032 AW167842 

AA527941 A18106O8 AI620190 AA635266 

AA534222AA632632 T81234 

AA534489 AW970240 AW970323 

AI821085 AW973464AA554802A1821831 AA657438 AA640756 AA650339 

AA863400 AI991439 AW016017 AW014704 AI367512 H17550 AA744752 R46187 AW471324 AI126670 M626033 AJ276287 AI094253 AI286003 

AI1471B3 AI911443 AW51261 2 AA972102 AA999975 AI684428 AI335035 063102 A1524234 A15391 56 AA565542 

T55958T57205AF147346 

AA657494 AI582663 A1581639 

A1692552 A1393343A1800510AI377711 F24263 AA661876 

AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE185091 AW468002 AA687333 AA81 1830 AA581806 AI866686 AI572124 AA043777 AA040926 020160 AI536733 

AA812489 AW874142 AI47188 

AW250553 L07876 Z36843 R30693 AI190097 AW965317 

AA830149 AW978407 M85983 AW503637 

W76326 AF086341 W72300 

AW892676 AA853877 D44747 

AW629666 AW959831 AW205739 BE620243 AA41 2367 AW300025 AW051920 AI28B591 AW2361 14 AI302852 AI038548 AA534496 A1797207 
AA921877 

AA973905 A1299888 AA917019 H63235 T90771 

BE379584 R3421 1 BE544768 AW973709 AI653056 A1653173 AI266043 A1656750 H741 80 A1492830 AI376090 AI472184 D59940 AW170056 

A1082443 AW021 142 AI167921 A1348677 A1278577 AW130886 AA761517 A1698203 AA1 15535 A1264790 R34328 D59939 AW205074 AA554902 

D62102AI0007 

AI038316 A! 344531 AJ261653 

AV655234 AW956332 AA340239 

T85314 AI3606B4 T85528 T91254 

AI475858AW969013 

NM.014253 AF100772 BE088769 AL022718 BE161779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 
N48674AI375997 R45432 059344 A1203107 F07491 R35360R25094AI913631 AM98402 T61382 AI016320 N45526 T61415 AA331486 
N40087 H12925 AA460779 AA096372 BE160847 AW81 6890 AW81 6893 AW81 6577 AW816891 AW816882 AW816868 AW816689 AW81 6940 
AW816892 AW81 6941 AW81 6578 AA029183 T83320 H86850 AA01Q295 BE160823 Z25353 AA150883 AW887764 AW023806 AW022095 
AA164518 AA730973 W0041 7 W65303 

AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878 

AA453208MMJJQ4251 U44103AI671547 N57463AW007521 AI479232 T82809 R67109 AW796099 AI833Q27 AT76539S AI763029 AI758228 
BE326331 AI934576 A1922378 AW276431 AI718466 N38566 AA904753 BE464245 AI338752 AI659875 AW272338 A14231 36 AI089270 AI160904 
AA664354BE1 

AB31594 AW970867 AW027959 AI1 29800 AI927949 AI650270 AI625105 AW514661 AI708393 AL138076 BE180510 AI926721 AI399955 AA749139 
AI862160 AW87401 1 AI242763 AA262795 AA039854 H73499 A1093249 BE245651 AI816834 N25206 AA828301 AI084565 A1302816 AA026905 
AA77255 

AA418841 AI452657 AI768876 AA028973 8E179873 C00215AA418930 
AL134757AW079131 

BE160198 AW935898 T11520 AW935930 AW856073 AW861034 
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TABLE 60C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers m this column are Genbank Identifier (Gl) numbers. "Dunham I. et a!.' refers to the publication entitled The DNA sequence of 
human chromosome 22/ Dunham I. el aL, Nature {1 999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntpositlon: Indicates nucleotide positions of pretficted exons. 



Ptey 


Ref 


Strand 


NUMStlion 


400664 


8118496 


Plus 


13558-13721,13942-14090,14554-14679 


400695 


7249150 


Plus 


160456-160567,164757-164873 


400880 


9931121 


Plus 


29235-29336,3636^-36580 


401197 


9719705 


Plus 


176341-176452 


401424 


8176894 


Plus 


24223-24428 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401558 


7139678 


Plus 


103510-104090 


402031 


7656761 


Plus 


33080-33263,33939-34094,361 0 W6507 


402802 


3287156 


Minus 


53242-53432 


402855 


9662953 


Minus 


59763-59909 


403046 


3540153 


Minus 


55707-55859,56369^6511 


403047 


3540153 


Minus 


59793-59968 


403790 


6084957 


Minus 


8782647947,89835-90002 


404571 


7249169 


Minus 


112450-112648 


404632 


9796668 


Plus 


45096-45229 


404641 


9796810 


Minus 


32247-32362 


404642 


9796810 


Pius 


102999-103145 


405523 


9454643 


Plus 


114550-114688.117265.117407,119490-119599.123237-123395,131140-131217 


405558 


1621110 


Plus 


4502-4544.5983-6083 



Table 61 A lists about 440 genes up-regulated in prostate cancer compared to normal prostate and normal adult tissues. These were selected from 59680 probesels on the 
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" prostate cancer to "average" normal prostate tissue was greater than or equal to 2.0 and the ratio to normal adult 
tissues was greater than or equal to 2.0. The "average" prostate cancer level was set to the 75* percentile amongst prostate cancers. The "average 1 normal prostate tissue level was 
set to the 95" percentte amongst prostate tissues. The "average" normal adult tissue level was set to the 85* percentile amongst non-malignant tissues. In order to remove gene- 
specific background levels of non-specific hybridization, the 1(P percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator 
before the ratio was evaluated 

TABLE 61 A: ABOUT 440 GEKES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE AND NORMAL ADULT TISSUES 



Ptey: Unique Eos probesel identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unlgene Title: Unigene gene title 

R1: Ratio of prostate tumor to normal adult body tissue 

R2: Ratio of prostate tumor to normal prostate tissue 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


444151 


AW972917 


Hs.1 28749 


atpra^lhytacyt-CoA racemase 


42.2 


42.2 


421566 


NMJXJ0399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


3.2 


26.5 


420729 


AWS64897 


Hs.290825 


ESTs 


3.7 • 


15.8 


401197 






ENSP00000229263*:HSPC213. 


3.0 


12.6 


450096 


AI682088 


Hs.79375 


hotocarboxylase synthetase (bto8n-[prop ' 


12.4 


12.4 


449156 


AF103907 


Hs.171353 


prostate cancer antigen 3, non-coding DD 


12.3 


12.3 


443271 


BE568568 


Hs.1 95704 


ESTs 


11.6 


11.6 


434078 


AW880709 


Hs.283683 


chromosome 8 open reading frame 4 


2.1 


11.2 


431117 


AF003522 


Hs.250500 


delta (DrosophHaHBcel 


9.4 


9.4 


417315 


AI080042 


Hs.180450 


ribosoma! protein S24 


2.3 


9.0 


416182 


NMJXM354 


Hs.79069 


cycCnG2 


6.4 


8.4 


421913 


AI934365 


Hs.109439 


osteoglycln (osteoinductive factor, mime 


Z2 


8.4 


434217 


AW014795 


Hs.23349 


ESTs 


a3 


a3 


425782 


U66468 


Hs.159525 


cell growth regulatory with EF-hand dome 


8.3 


83 


442501 


AA315267 


Hs.23128 


ESTs 


zo 


&3 


429984 


AL05Q102 


Hs^27209 


hypothetical protein PU21617 


ai 


ai 


449048 


Z45051 


Hs.22920 


similar to S68401 (cattle) glucose indue 


3.1 


ao 


417407 


AA923278 


Hs.290905 


EST s, Weakly similar to protease [rUapi 


34.0 


7.9 


420120 


AL049610 


Hs.95243 


transcription elongation factor A (Sll)- 


7.5 


7.5 


419078 


M93119 


Hs.89584 


insdinoma-asscdated 1 


7.4 


7.4 


425905 


AB032S59 


Hs.318584 


novel C3HC4 type Zinc finger (ring finge 


9.4 


7.3 


425018 


BE24S277 


Hs.154196 


E4F transcription factor 1 


7.2 


7.2 


452221 


C21322 


Hs.288057 


hypothetical protein RJ22242 


7.0 


7.0 


421552 


AF026692 


Hs.105700 


secreted mzzted-retated protein 4 


4.0 


6.6 


409799 


D11928 


Hs76845 


phosphoserine phosphatase-Cke 


6.3 


6.3 


411373 


BE326276 


Ks.8861 


ESTs 


3.2 


6.3 


423445 


NM.014324 


Hs.128749 


aipha-mettiylacyt-CoA racemaso 


&7 


a2 


422583 


AA410506 


Hs.27973 


K1AA0874 protein 


Z3 


6.2 


438209 


AL120659 


Hs.6111 


aryMiydrocarbon receptor nuclear trans) 


6.1 


61 


437147 


AL049964 


Hs.8358 


hypothetical protein FU20366 


2.6 


6.0 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



432415 


T18971 


Hs.289014 


ESTs, Weakly similar to A43932 mucin 2 p 


6.0 


6.0 


446896 


T15767 


Hs.22452 


Homo sapiens mRNA for K1AA1737 protein, 


6.2 


6.0 


449118 


R67477 


Hs.23103 


Bed (S. cerevisiae) homotog 


5.9 


5.9 


427472 


AA522539 


Ks.131250 


transpose rwterived Buster3 transposase-l 


3.3 


5.9 


431548 


A1834273 


Hs.9711 


novel protein 


15.7 


5.8 


417153 


X57010 


Hs.81343 


ccBagen, type II, alpha 1 (primary oste 


4.2 


5.8 


410889 


X91662 


Hs.66744 


twist (DrosophSa) homolog (acrocephalos 


3.5 


5.8 


429597 


NM.003816 


Hs.2442 


a cfisintegrin and metafioproteinase doma 


7.9 


5.8 


422538 


NM 006441 


Hs.118131 


5,10-meUienyitetrahydrofolate synthetase 


5.7 


5.7 


414812 


X72755 


Hs.77367 


monokine Induced by gamma interferon 


15 


5.6 


448552 


AW973653 


Hs.20104 


hypothetical protein FU00052 


5.6 


5.6 


416111 


AA033813 


Hs.79018 


chromatin assembly factor 1, subunit A ( 


5.5 


5.5 


414516 


AI307802 


Hs.135560 


ESTs, Weakly similar to T43458 hypotheti 


27 


5.5 


414343 


AL036166 


Hs.323378 


coated vesicle membrane protein 


5.4 


5.4 


451684 


AF216751 


Hs.26813 


COA14 


3.9 


5.4 


421470 


R27496 


Hs.1378 


annexin A3 


5.3 


5.3 


426350 


NM 003245 


Hs.2022 


transglutaminase 3 (E polypeptide, prate 


5.3 


5.3 


436476 


AA326108 


Hs.33829 


bHLH protein DEC2 


2.8 


5.3 


437571 


AA760894 


Hs. 153023 


ESTs 


5.2 


5.2 


447072 


061594 


Hs.17279 


tyrosyt protein suifotranslerase 1 


5.2 


5.2 


444917 


R68651 


Hs. 144997 


ESTs 


5.1 


5.1 


417318 


AW953937 


Hs.240845 


ESTs 


4.6 


5.1 


434170 


AA626509 


Hs.122329 


ESTs 


5.1 


5.0 


432B97 


AF155099 


Hs.279780 


NY-REN-18 antigen 


3.3 


5.0 


433560 


A1925195 


Hs.130891 


hypothetical protein M6C4400 


4.8 


5.0 


421823 


N40850 


Ks.28625 


ESTs 


5.0 


5.0 


451244 


AW008798 


Hs.343877 


hypothetical protein FU20039 


4.9 


4.9 


453370 


AI470523 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 


29.8 


4.9 


425242 


D13635 


Hs.1 55287 


K1AA0010 gene product 


12 


4.8 


439024 


R96696 


Hs.35598 


ESTs 


5.4 


4.8 


407235 


D20569 


Hs,169407 


SAC2 (suppressor of actin mutations 2, y 


4.8 


4.8 


421863 


AI952677 


Hs.108972 


Homo sapiens mRNA; cONA DKFZp434P228 (fr 


4.7 


4.7 


437916 


BE566249 


Hs.20999 


hypothetical protein FU23142 


16 


4.7 


451952 


AL1 20173 


Hs.301663 


ESTs 


4.7 


4.7 


431676 


AI685464 




gb:H88f04jc1 NCl_CGAP_Pr28 Homo sapiens 


9.1 


4.7 


452242 


R50956 


Hs.159993 


gycosyl transferase 


4.7 


4.7 


401519 






CI 5000476*:gi|12737279lref|XPJ)1 2163.1 1 


4.6 


4.6 


414342 


AA742181 


Hs.75912 


KIAA0257 protein 


3.4 


4.6 


426197 


AA004410 


Hs.100009 


acyt-Coenzyme A oxidase 1, patrriftoyl 


3.3 


4.6 


429467 


NMJJ04477 


Hs.203772 


FSHD region gene 1 


11 


4.6 


408380 


AF123050 


Hs.44532 


dhittqufGn 


4.2 


4.6 


425907 


AA365752 


Hs.1 55965 


ESTs 


2.2 


4.6 


427078 


AI676062 


Hs.1 11902 


ESTs 


4.8 


4.5 


419465 


AW500239 


Hs.21187 


Homo sapiens cDNA: FU 23068 Ms, clone L 


17 


4.5 


429250 


K56585 


Ks.198308 


tryptophan rich basic protein 


4.5 


4.5 


431585 


BE242803 


Hs.262823 


hypothetical protein FU1 0326 


2.5 


4.5 


443250 


AI041530 


Hs.132107 


ESTs 


8.1 


4.4 


423545 


AP000S92 


Hs.129781 


chromosome 21 open reading frame 5 


4.4 


4.4 


423698 


AA329796 


Hs.1098 


DKF2p434J1813 protein 


4.4 


4.4 


442281 


N34742 


Hs.170065 


Homo sapiens cONA FU13492 fis, clone PL 


4.4 


4.4 


428826 


AL048842 


Hs.194019 


attracOn 


2.0 


4.3 


447188 


H65423 


Hs.17631 


hypothetical protein DKFZp434E2135 


4.3 


4.3 


418701 


AA814948 


Hs.96343 


ESTs, WeaJdy similar to ALUC_HUMAN 1!!1 


4.2 


4.2 


429900 


AA460421 


HsJ0875 


ESTs' 


4.2 


4.2 


439518 


W76326 




gb:zd60d04.r1 Scares JetaLhearLNbHH19W 


12 


4.2 


409151 


AA306105 


Hs.50785 


SEC22, vesicle Mafficking protein (S. c 


3.5 


4.2 


420544 


AA677577 


HsJ8732 


Homo sapiens Chromosome 16 BAG done CIT 


10.0 


4.2 


444100 


AA383343 


Hs.22116 


CDC14 (cell division cycle 14, S. cerevi 


2.8 


4.2 


443622 


AI911527 


Hs.1 1605 


ESTs 


21 


4.1 


410037 


AB02072S 


Hs.58009 


WAA0918 protein 


4.1 


4.1 


421129 


BE439899 


Hs*9271 


ESTs 


4.1 


4.1 


428966 


AF059214 


Hs.194687 


cholesterol 25-hydroxytase 


3.9 


4.1 


448664 


AI879317 


Hs^34691 


sptictng factor 3a, subunit 1, 120kD 


2.8 


4.1 


432242 


AW022715 


Hs.162160 


ESTs, Weakly sirriar to ALU4_HUMAN ALU S 


2.0 


4.1 


433887 


AW204232 


Hs.279522 


ESTs 


4.1 


4.1 


436556 


AI364997 


Hs.7572 


ESTs 


4.1 


4.1 


400301 


XD3635 


Hs.1 657 


estrogen receptor 1 


4.1 


4.1 


443646 


AI085198 


Hs.1 64226 


ESTs 


4.1 


4.1 


430389 


AL117429 


Hs.240845 


DKFZP434D146 protein 


2.2 


4.0 


418432 


M14156 


Hs-85112 


insuSn*6ke growth factor 1 (somatomedl 


4.0 


4.0 


433409 


AI278802 


Hs^5661 


ESTs 
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4.0 


447247 


AW369351 


Hs.287955 


Homo sapiens cDNA FU 13090 fis, done NT 
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Hs.132390 


zinc finger protein 36 (KOX 18) 
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444755 


AA431791 


Hs.113823 


OpX (caseinolylic protease X, E. cofi) 


3.9 


19 
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Hs.76549 


ESTs, WeaWy similar to A56 1 54 Ab) subst 
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AK001745 


Hs.1 84628 


hypothetical protein FU10883 
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L4cymirenirtefaipria-arninoaa1pa!a aminctra 


19 


19 


423337 


NM 004655 
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axin 2 (conductin. axil) 
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18 


422661 


NM.014700 


Hs.1 19004 


KIAA0665 gene product 
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18 


407317 
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HsJ71461 


ESTs. Weakly similar to 138022 hypothec" 
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ESTs 
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422975 


AA347720 


Hs.122659 


K1AA0264 protein 


2.2 
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AW964806 


Hs. 38035 


Homo sapiens, Similar b RIKEN cONA 0610 


3.8 


18 


414664 


AA587775 


Hs.66»5 


rruilti-PDZ-dornarvcoolatning protein 


3.8 


18 


437162 


AW005505 


Ks.5464 


thyroid hormons rec6ptor coaclivating pr 


27 


3.8 


421828 


AW89196S 


Hs.279789 


histone deacetytase 3 


4.1 


18 
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AA436002 


Hs.183161 


ESTs 
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3.8 


422049 


W25760 


Hs.77631 


glycine cleavage system protein H (amino 


3.1 


3.8 


453160 


A1263307 


Hs.239884 


H2B histone family, member L 
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18 


424906 


A1566086 


Hs.1 53716 


Homo sapiens mRNA for Hmob33 protein, 3* 


17 


17 


446554 


AA1 51730 


Hs.301789 


nudix (nucleoside diphosphate linked moi 


17 


17 


452259 


AA317439 


Hs.28707 


signal sequence receptor, gamma (translo 


3.7 


17 


448807 


AI571940 


Hs.7549 


ESTs 


3.6 


3.7 


439209 


AF087993 


Hs.91954 


ESTs 
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3.6 


420077 


AW512260 


Hs.87767 


ESTs 


4.4 


3.6 


451009 


AA013140 


Hs.1 15707 


ESTs 


4.1 


3.6 


452281 


T93500 


Hs.28792 


Homo sapiens cDNA FU1 1 041 Ms, clone PL 
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452561 


AI692181 


Hs.49169 


K1AA1634 protein 
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3.6 


447033 


AJ357412 


Hs.1 57601 


ESTs 


7.7 


3.6 


427138 


N77624 


Hs.1 7371 7 


phosphatide ackJ phosphatase type 28 


3.6 


3.6 


442991 


BE281238 


Hs.8886 


hypothetical protein FU 20424 
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414680 


AA743331 




hemoglobin, alpha 2 


3.5 


15 


417380 


T06809 


Hs.332086 


ESTs 
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425710 
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Hs.159275 


sohits rarripr famitv mamhpr 4 


3.5 


3.5 


441247 
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Hs.1 28051 
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15 


3.5 


426931 


NM 003416 


Hs.2076 


zinc finger protein 7 (KOX 4, clone HF.1 


2.2 


3.5 


417873 


BE266659 


Hs.293659 


Homo sapiens. Similar to RIKEN cONA A430 


3.5 


15 


432101 


AI918950 


Hs.123842 


EphA3 


16.5 


3.5 


429302 


AU076674 


Hs.198899 


eukaryotic translation initiation factor 


3.3 


15 


435513 


AW404075 


Hs.42785 


OC11 protein 


2.2 


3.5 


440643 


M7B878 


Hs7314 


WAA0614 protein 


21 


3.5 


418293 


A1224483 


Hs.16063 


hypothetical protein FU21877 


6.7 
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425900 


AK000166 


Hs.288809 


hypothetical protein FU20159 


21 


3.4 


431689 


AA305688 


Hs.267695 


UDP-GaI:betaG!cNAc beta 1,3-galactosyltr 
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3.4 


440749 


W22335 


Hs.7392 


hypothetical protein MGC3199 
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3.3 


408374 


AW025430 


Hs.155591 


forkheadboxFI 


4.3 


3.3 
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24 


3.3 


432169 


Y00971 


Hs.2910 


phosphoribosyl pyrophosphate synthetase 


3.3 


3.3 


437872 


AK002015 


Hs.5887 


RNA binding motif protein 7 


3.3 


13 


443912 


R37257 


Hs.184780 


ESTs 


3.3 


3.3 


414565 


AA502972 


Hs.183390 


hypothetical protein FU13590 


9.9 


3.3 


448164 


R61680 


Hs.26904 


ESTs, Moderately simitar to Z195_HUMAN Z 


3.3 


13 


450094 


AI174947 


Hs.295789 


Homo saDtens mRNA* cDNA DKF7o564D1 164 ff 


4.4 


3.3 


424800 


AL035588 


Hs.1 53203 


MvoD famftv inhihifnr 


16 


3.3 


451367 


AA923729 


Hs 26322 
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445472 


AB0Q6631 


Hs.1 2784 


Homo sapiens mRNA for KIAA0293 gene, par 


11.5 


12 


408101 


AW968504 


Hs.1 23073 


COC2-re)ated protein kinase 7 
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12 


421246 


AW582962 


Hs.102897 


CGJ-47 protein 


3.2 


12 


426227 
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Hs.154299 


Human nrntftina^fi arjK/atpr| rprpnfnr-9 mR 
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3.2 


402802 
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Hs.33718 


Homo sapiens cDNA FU 1 2641 (is, done NT 


27 


3.2 


436207 
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Hs.12845 


hypothetical protein MGC13159 


3.2 


3.2 


453864 


AW021407 


Hs.21066 


hypothetical protein * 


3.2 


12 


448944 


AB014605 


Hs.22599 


atroprun~1 interacting protein 1j activi 


3.8 


3.2 


444489 


AI151010 


Hs.1 57774 


ESTs 


3.2 


3.2 


453124 


AI139058 


Hs.125790 


leudne-nch repeat-containing 2 


3.2 


12 


453078 


AF053551 


Hs.31584 


metaxin 2 


21 


3.1 


439444 


AE77652 


Hs.54578 


ESTs, Weakly similar to 138022 hypotheti 


10.6 


11 


413492 


087470 


Hs.75400 


K1AA0 280 protein 


28 


3.1 


434804 


AA649530 


Hs.348148 


gb*ns44f05.s1 NO CGAP _Aiv1 Homo sapiens 


21 


11 


445840 


A1277811 


Hs.146291 


ESTs 


3.1 


11 


423565 


AW976434 


Hs.3623 


hypothetical protein FU1 1220 


22 


11 


408161 


AW952912 


Hs.300383 


hypothetical protein MGC3032 


11 


11 


414922 


D00723 


Hs.77631 


glycine cleavage system protein H (amino 


11 


3.1 


430519 
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Hs.49210 


F-hox otitv nmtpin 4 

% *1A/A Will J piWldll 


3.1 


11 


430487 


D87742 


Hs.241552 


K1AA0268 protein - 


16.2 


11 


423554 


M90516 


Hs.1674 


glutarrtine-fructose-6-phosphate transamln 


3.1 


11 


430226 


BE245562 


Hs.2551 


adrenergic, beta- 2-, receptor, surface 


9.0 


3.1 


400296 


AA305627 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 


20.9 


3.1 


420154 


AI093155 


Hs.95420 


JM27 protein 


27.4 


11 


453293 


AA382267 


Hs.10653 


ESTs 


17 


3.1 


457718 


F18572 


Hs.22978 


ESTs, Weakty similar to ALU4.HUMAN ALU S 


10 


10 


447476 


8E293466 


Hs.20880 


ESTs, Weakly similar to 138022 hypotheti 


24 


10 


419440 


AB020689 


Hs.90419 


KIAA0882 protein 


29 


10 


433285 


AW975944 


Hs.237396 


ESTs 


7.7 


10 


450671 


AI356967 


Hs.430B6 


ESTs, Weakly similar to A46010 X-iinked 


10 


10 


445921 


AW015211 


Hs.146181 


ESTs 


4.3 


3.0 


416882 


AI633044 




tryptophan^ tRNA synthetase 2 (mitochon 


15 


3.0 


414052 


AW578849 


Hs, 283552 


ESTs, Weakly similar to unnamed proton 


25 


3.0 


450580 


N40087 




ESTs 


3.0 


3.0 


447726 


AL137638 


Hs.19368 


matrifin2 


19 


29 


447816 


NM.007233 


Hs.274329 


TP53 target gene 1 


7.2 


29 


420407 


AA814732 


Hs.145010 


lipopolysaccaride-specrfic response 5-6 


5.1 


29 


422299 


AK000181 


Hs.114556 


hypothetical protein FU20174 
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29 
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420297 A1628272 Hs.88323 ESTs, VSteaWy simBar to ALU IJ HUMAN ALUS 

427701 AA411101 Hs.243886 nuclear autoanSgenic sperm protein (his 

437812 AJ 582291 Hs. 15846 ESTs, WeaJdy similar to 04HUD1 debrisoqu 

445808 AV655234 ESTs, Moderately simHar to PC4259 fern 

5 428600 AW863261 Hs.242413 hypolhsticat protein DKFZp434K1421 

442320 AI287817 Hs. 129836 ESTs 

418407 AL044818 Hs.84928 nuclear transcription factor Y, beta 

433050 AI093330 Hs. 163440 Homo sapiens cDNA: FU21 000 fis. done C 

451796 AL133019 Hs.27038 Homo sapiens mRNA; cDNA OKFZp434G21 27 (f 

10 407783 AW996872 Hs.172028 a (fisinlegrin and metaltoproteinase doma 

459029 AA131376 Hs.343809 fibroblast growth factor 12B 

441676 BE564206 Hs.49889 ESTs 

439352 BE614347 Hs.169615 hypothetical protein FU20989 

453628 AW243307 Hs.83937 hypothetical protein 

15 412324 AW978439 Hs.69504 ESTs 

408063 BE086548 Hs.42346 raJrineurMnding protein calsarrin-1 

421928 AF013758 Hs.109643 poryadenytate binding protein-lnteractin 

421247 BE391727 Hs.102910 general trarecription factor tlH, polype 

_ _ 433865 N29862 Hs.44104 ESTs 

20 423201 NMJ00163 Hs.125180 growth hormone receptor 

450832 AW970602 Hs.105421 ESTs 

416913 AW934714 gb:RC1-DT0001-O31293-O11-a11 DT0001 Homo 

417251 AW015242 Hs.99488 ESTs, Weakly similar to YK54_YEAST HYPOT 

427144 X95097 Hs.2126 vasoactive Intestinal peptide receptor 2 

25 444668 AA654650 Hs.282906 ESTs 

434015 AAB4451B Hs.300876 hypothetical protein FU 13386 

432689 AB018320 Arg/AW-interacting protein ArgBP2 

423482 BE280172 Hs.129228 galactokinase 2 

448206 BE622585 Hs.3731 ESTs, Moderately similar to I38022 hypot 

30 450546 AA010200 Hs.175551 ESTs 

452270 AW975014 Hs.26 ferrochelatase (protoporphyria) 

458332 A1000341 Hs220491 ESTs 

436109 AA922153 Hs.132760 hypothetical protein MGC1 5729 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 

35 412576 AA447718 Hs.107057 ESTs 

413336 AI569936 Hs.296178 hypothetical protein FU22637 

433577 AW007080 Hs.284192 ESTs 

439093 AAS34163 Hs.5476 Homo sapiens, clone I MAGE:35301 23, mRNA, 

Ar . 442833 AA328153 Hs.88201 ESTs, Weakly similar to A Chain A, Cryst 

40 447401 BE618582 Hs.97661 ESTs 

409619 AK001015 Hs.55220 B(X2-associated athanogene 2 

432729 AK00Q292 Hs. 130732 hypothetical protein FU 20285 

452721 AJ269529 Hs.301871 solute carrier family 37 (glyceroW^ho 

447342 A1199268 Hs.19322 Homo sapiens. Similar to RIKEN cONA 2010 

45 409205 AI952884 Hs.14832 ESTs, Moderately similar to unnamed prat 

422805 AA436989 Hs.121017 H2A histone family, member A 

446238 T95143 Hs.14511 SCO (cytochrome oxidase deficient, yeast 

447313 U92981 Hs.18081 Homo sapiens clone DT1P1B6 mRNA, CAG rep 

411630 U42349 Hs.71119 Putative prostate cancer tumor suppresso 

50 410592 R94088 Hs.43569 ESTs 

417173 U61397 Hs.81424 ubiquiWke 1 (sentrin) 

417601 NM.014735 Hs.82292 WAA0215 gene product 

433419 A1830342 Hs.211Z72 ESTs 

447509 AF107454 Hs. 107537 chromosome 7 open reading frame 2 

55 424580 AA446539 Hs.339024 ESTs,WeakrysirnilartoA46010X^nked 

407252 AA659037 Hs. 163780 ESTs 

420061 AW024937 Hs.29410 ESTs 

424432 A6037821 Hs. 146858 protocadherin 10 

_ 417683 AW566008 Hs.239154 ankyrin repeat family A (RFXANK4ike), 

60 414312 AA155694 Hs.191060 ESTs 

417135 AA422067 Hs£0547 ESTs 

437748 AF234882 Hs.5814 suppression of tumorigenidty 7 

422728 AW937826 Hs. 103262 ESTs, Weakly simOar to ZN91_HUMAN ZINC 

r _ 453070 AK001465 Hs.31575 SEC63, endoplasmic reticulum translccon 

65 406627 T64904 Hs.163780 ESTs 

407151 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s 
425815 R94023 Hs.94560 ESTs, Moderately similar to 138022 hypot 
433805 AA706910 Hs.1 12742 ESTs 

„ 431474 AL133990 Hs.190642 ESTs 

/O 436204 AK002014 Hs.47546 rtornosapienscDNAFU11458fis,doneHE 
439328 W07411 Hs.118212 ESTs, Moderately similar to ALU3.HUMAN A 
449115 AW959952 Hs.37528 ESTs, Weakly similar to AF090944 1 PRO0B 
437531 AJ400752 Hs.112259 T ceil receptor gamma locus 
„ 445424 AB028945 Hs.12696 cortactto SH3 doman-binding protein 
/-> 403047 N^005656":Homo sapiens trarsmernbrane pr 

416198 H27332 Hs.99598 hypothetical protein MGC5338 
417576 AA339449 Hs.82285 phosptranTx^IycinamMe formyitransfer 
430607 AW973521 Hs.247324 nitochondrte! rfbcsomal protein S14 
OA 436401 AJ087958 Hs29088 ESTs 
OO 450654 AJ245587 Hs25275 KruppeMype zinc finger protein 

426216 N77630 Hs.13895 Homo sapiens cDNA FU11654 fis, done HE 
431325 AW026751 Hs.5794 ESTs. WeaWy similar to 21 09260A Bcefl 
410102 AW248508 Hs.279727 Homo sapiens cONAFU14035 fis, clone HE 
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446791 


AI632278 


Hs.195922 


ESTs 


2.5 


25 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 
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419791 


AI579909 


Hs.105104 


ESTs 
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447210 


AF035269 


Hs.17752 


phosphatidytserine-spedfic phcsphoOpas 
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25 
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AF033827 


Hs. 155553 


HNK-1 sulfotransferase 
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452741 


BE392914 


Hs.30503 


Homo sapiens cDNA FU11344 fis, clone PL 
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AI420256 


Hs.161271 


ESTs 


21 


24 


412043 


BE156622 


Hs.333371 


Homo sapiens done TA40 untranslated mRN 
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421413 


AI826128 


Hs.57637 


ESTs, Weakly similar to A49364 59 protei 
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ESTs 
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24 


429258 
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gb2x10e09j1 SoaresJctaLfetus_Nb2HF8_ 
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24 


435102 
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Hs.76917 


F4xw only protein 8 


24 


2.4 
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BE264645 


Hs.282093 


hypothetical protein FU21918 


3.1 


24 


407021 


U52077 




gb:Human marinerl transposase gene, comp 


7.3 


24 
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Hs.244482 


Homo sapiens, clone 1MAGE:3611719, mRNA, 
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Hs.343564 


tobuim-sperine chaperone e 


4.3 


24 


447713 


A1420733 


Hs.207083 


ESTs 


2.4 


24 


456848 


AL1 21087 


Hs.296406 


KIAA0685 gene product 


22 


24 


437617 


AI026701 


Hs.5716 


K1AA031 0 gene product 


26 


24 


432378 


A1493046 


Hs.1 461 33 


ESTs 


5.2 


24 


443160 


R15875 


Hs.258576 


daudin 12 


10.1 


24 


445929 


AI089660 


Hs323401 


dpy-30-like protein 
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24 


424701 
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Hs.151988 


mitog en-activated protein kinase kinase 
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Hs.12477 


synaptosomal- associated protein, 91 kOa 


25 


24 
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Hs.56145 


thymosin, beta, identified h neuroblast 
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Hs.107056 


CED-6 protein 
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AW206373 


Hs.16443 


Homo sapiens cDNA: FU21721 fis, clone C 
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Hs.89433 


ATP-blnding cassette, sub-family C (CFTR 
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451071 


AW138807 


Hs.204939 


ESTs, Highly similar to A54677 homeotlc 
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putative G protein-coupled receptor 


3.1 
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2.3 
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Hs.151046 


hypothetical protein RJ11 193 
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AW450461 


Hs.203965 


ESTs 


23 
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452387 


AI680772 


Hs.306094 


trinucleotide repeat containing 1 2 
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23 


412045 


AA0998Q2 


Hs.83883 


transmembrane, prostate androgen induced 


4.1 


23 


428180 


A1129767 


Hs.182874 


guanine nucleotide binding protein (G pr 


20 
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AA406400 


Hs. 12482 


glycerone phosphate O-scyttransf erase 


27 
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443194 


AI954968 


Hs.279009 


matrix Gla protein 


21 


23 


414869 


AA157291 


Hs.21479 


utriraicteinl 
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2.3 


407300 


AA102616 


Hs.120769 


gb:zn43e07.s1 Stratagene HeLa cell s3 93 


2.3 
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436396 


AI683487 


Hs.152213 


wingless-type MMTV Integration site fami 


23 


23 


439221 


AA737106 


Hs.32250 


ESTs, Moderately similar to I78885 serin 


23 


23 


453802 


AL134757 




gb:DKFZp547G1090j1 547 (synonym: hfbrl) 


4.1 


23 


421059 


A3654133 


Hs.30212 


thyroid receptor interacting protein 15 


7.9 


23 


426413 


AA377823 




gb:EST908Q5 Synovial sarcoma Homo sapien 


22 


23 


431147 


AT767751 


Hs.2O300 


ESTs 


5.8 


23 


442677 


AJ 557914 




peroxisomal acyl-CoA (hioesterase 


2.5 


23 


446091 


AW022192 


Hs.200197 


ESTs 


4.0 


23 


431563 


A1027643 


Hs.120912 


ESTs 


22 


23 


439478 


AF049460 


Hs.6574 


deformed epidermal autoregulatory factor 


21 


23 


413049 


NM.002151 


Hs.823 


hepsln (transmembrane protease, serine 1 


3.5 


23 


405685 






C2002829:gil4507689|reflNP_003298.1| tra 


23 


23 


407917 


U63139 


Hs.41587 


RAD50 (S. cerovisiae} homolog 


23 


23 


450378 


AW249181 


Hs.19954 


ESTs, Weakly similar to T1 9873 hypotheO 


23 


23 


429303 


AW137635 


Hs.44238 


ESTs, Weakly similar to S65657 alpha-1C- 


23 


23 


417248 


AA329449 


Hs.247302 


twisted gastrutation 


23 


22 


429277 


AW452016 


Hs.127863 


ESTs, Weakly similar to SFR4_HUMAN SPUC 


26 


22 


431555 


AI815470 


Hs.260024 


Cdc42 effector protein 3 


4.5 


22 


440099 


AL080058 


Hs.6909 


DKFZP564G202 protein 


29 


22 


416653 


AA768553 


Hs.193145 


metallothbnein IE (functional) 


3.4 


22 


431542 


H63010 


Hs.5740 


ESTs 


7.0 


22 


401558 






ENSP00000220478*:SECRETOGRANIN III. 


22 


22 


418281 


U09550 


Hs.1154 


ovWuctal glycoprotein 1, 120xO (mucin 9 


22 


22 


422295 


AF051151 


Hs.1 14408 


toll-like receptor 5 


2.2 


22 


432453 


AI885537 


Hs.27172 


ESTs, Moderately similar to PC4259 ferri 


22 


22 


447499 


AW262580 


Hs.147674 


protocadherin beta 16 


22 


22 


452994 


AW962597 


Ks.31305 


KIAA1547 protein 


4.0 


22 


450325 


AI935962 


Hs.26289 


ESTs 


11.9 


22 


422390 


AW450893 


Hs.1 21830 


ESTs, Weakly similar to T42682 hypolheti 


21 


22 


448789 


BE539108 


Hs.22051 


hypothetical protein MGC15548 


26 


22 


436679 


AJ1274B3 


Hs.120451 


ESTs, Weakly similar to unnamed protein 


11.8 


22 


425263 


NM.001197 


Hs.155419 


B02-interacting killer (apoptosls-fnduc 


12 


22 


451939 


U80456 


Hs.27311 


single-minded (Droscphaa) homolog 2 


13.6 


22 


410252 


AW821182 


Hs.61418 


microfibrillar -associated protein 1 


4.2 


22 


439699 


AF086534 


Hs.187561 


ESTs, Moderately similar to ALU1.HUMAN A 


4.8 


22 


422576 


BE548555 


Hs.1 18554 


CGI-83 protein 


21 


22 


433293 


AF007835 


Hs.32417 


hypothetical protein MGC4309 


11.0 


22 


410733 


D84284 


Hs.66052 


C038 antigen (p45) 


3.1 


22 


421379 


Y15221 


Hs.103982 


smafl Inducible cytokine suWamBy B (Cy 


21 


22 


442995 


AA532511 


Hs.288455 


Homo sapiens cDNA: FU23270 fis, clone C 


22 


22 


408833 


AW612232 


HsJ254835 


ESTs 


3.0 


22 
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422424 


AI1B6431 


Hs.296638 


prostate differentiation factor 


4.4 


22 




433444 


AW975324 


Hs.129816 


ESTs 


11.7 


22 




446354 


AW449650 


Ks.346335 


ESTs 


2.6 


22 


5 


438869 


AF075009 




gb:Homo sapiens fuQ length insert cONA 


5.9 


22 


425628 


NMJXM476 


Hs.1915 


folate hydrolase (prostate-specific memb 


12.8 


22 




424036 


AA770688 




H2A histone family, member L 


3.8 


22 




440146 


AW014231 


Hs.90790 


Homo sapiens cDNA; FU 22930 fis. done K 


4.8 


22 




404641 






NM_021 965*: Homo sapiens phosphogtucomuta 


22 


22 


10 


407253 


AA411175 


Hs.141939 


ESTs. Moderately similar to S65657 alpha 


12 


22 


406068 






C2002008:giI73C3957fe biAAF59000. 1 1 (AE00 


ai 


21 




415162 


AF03571B 


Hs.78061 


transcription factor 21 


2.1 


21 




423349 


AF010258 


Hs.1 27428 


homeobox A9 


5.9 


21 




424534 


D87682 


Hs.150275 


K1AA0241 protein 


13 


21 


15 


407756 


AA1 16021 


Hs.38260 


uMquitin specific protease 18 


2.5 


21 


440658 


H29142 


Hs.143032 


ESTs, WeaWy similar to neuronal thread 


2.5 


21 




416547 


H62914 


Hs.268946 


ESTs, Weakly similar to PC4259 ferritin 


2.1 


21 




432281 


AK001239 


Hs.274263 


hypothetical protein FU 10377 


Z1 


21 




447533 


NM 004786 


Hs.18792 


Ihioredoxbvlflte, 32kD 


21 


21 


20 


448966 


AW372914 


Hs.86149 


phosphoinositol 3-phcsphate-binding prat 


21 


21 


449897 


AW819642 


Hs.24135 


transmembrane protein vezatin; hypolhefi 


2.1 


21 




452959 


AI933416 


Hs.189674 


ESTs 


2.1 


21 




427083 


NM.006363 


Hs.173497 


Sec 23 (S. cerevisiae) rtornotog B 


2.8 


21 




408291 


AB023191 


Hs.44131 


WAA0974 protein 


1\ 


21 


25 


418727 


AA227609 


Hs.94834 


ESTs 


21 


2.1 


440491 


R35252 


Hs.130558 


ESTs, Weakly similar to 2109260A B cell 


1\ 


21 




433466 


AA508353 


Hs.105314 


reiaxin 1 (H1) 


17.3 


21 




432695 


063480 


Hs^78634 


K1AA0146 protein 


2.3 


21 




427982 


NM.016156 


Hs.181326 


KIAA1073 protein 


3.2 


21 


30 


445873 


AA250970 


Hs.251946 


poty(A}-binding protein, cytoplasmic 14 


15 


2.1 


434569 


AI311295 


Hs.344478 


K1AA0196 gene product 


1\ 


21 




441791 


AW372449 


Hs.61271 


hypothetical protein FU21159 


2.1 


21 




438520 


AA706319 


Hs.98416 


ESTs 


27 


21 




414269 


AA298489 


Hs.303171 


olfactory receptor, family 51, subfamily 


17.3 


21 


35 


410660 


AI061118 


Hs.65328 


Fanconl anemia, complementation group F 


3.5 


21 


456660 


AA909249 


Hs.112282 


solute carrier family 30 (zinc transport 


7.4 


21 




450861 


AI523898 


Hs.17617 


ESTs 


24 


21 




419193 


029643 


Hs.34789 


dolichyl-diphosphootigosaccharide-protei 


21 


2.1 




423784 


AK000039 


Hs.132826 


Homo sapiens cDNA FU14913 fis, done PL 


3.2 


21 


40 


416208 


AW291168 


Hs.41295 


ESTs, Weakly similar to MUC2.HUMAN MUCIN 


6.4 


21 


407182 


AA312551 


Hs.230157 


ESTs 


21 


21 




421689 


N87820 


Hs.10682S 


KIAA1696 protein 


21 


21 




432833 


N51075 


Hs.1 10028 


ESTs 


21 


21 




425170 


AU077315 


Hs.154970 


transcription factor CP2 


26 


20 


45 


436278 


BE396290 


Hs.5097 


synaplogyrin 2 


28 


20 


438719 


AA357129 


Hs.239625 


Integra) membrane protein 2B 


23 


20 




432728 


NM_006979 


Hs.278721 


HLA class U region expressed gene KE4 


3.6 


20 




410310 


J02931 


Hs.62192 


coagulation factor ill (thromboplastin, 


20 


20 




429716 


R25685 


Hs.211933 


collagen, type XIII, alpha 1 


3.6 


20 


50 


424081 


NM.006413 


Hs.139120 


ribonuclease P (30kD) 


20 


20 


433800 


AI034361 


Hs.135150 


lung type-t ceil membrane-associated gfy 


20 


20 




413627 


BE182082 


Hs.246973 


ESTs 


3.3 


20 




414133 


AW022188 


Hs.41167 


ESTs 


23 


20 




411096 


U80034 


Ks.68583 


mitochondrial intermediate peptidase 


14.7 


20 


55 


450244 


AA007534 


Hs.125062 


ESTs 


3.1 


20 


422493 


AW474183 


Hs.250173 


hypothetical protein FU13158 


26 


20 




409219 


AA393383 


Hs.1 33331 


ESTs 


21 


20 




419986 


A1345455 


Hs.78915 


GA*brnding protein transcription factor, 


20 


20 




421969 


AF052217 


Hs.1 10099 


core-binding factor, runt domain, alpha 


20 


20 


60 


434128 


W93170 


Hs.284164 


protein x 0004 


27 


20 


434503 


T96231 


Hs.17762 


ESTs 


20 


20 




443292 


AK000213 


Hs.9196 


hypothetical protein 


20 


20 




443555 


N71710 


Hs.21398 


ESTs, Moderately similar to A Chain A, H 


2.0 


20 




448172 


N75276 


Hs.135904 


ESTs 


6.0 


20 


65 


452039 


AI922988 


Hs.172510 


ESTs 


20 


20 


400290 


H18836 


Hs.31608 


hypothetical protein FU 20041 


6.2 


20 




449625 


NMJH4253 




odz (odd Ozrterwn. Drosophita) homolog 1 


10.1 


20 




441406 


Z45957 


Hs.7837 


phosphoprotein regulated by mitogenic pa 


23 


20 




451369 


AA017321 


Hs.269691 


ESTs 


21 


20 


70 


434011 


AW953437 


Hs.5486 


clone FLB5214 


21 


20 


408806 


AW847814 


Hs.289005 


Homo sapiens cDNA: FU21532 Ms, done C 


5.4 


20 




433388 


A1432672 


Hs.288539 


hypothetical protein FU22191 


27 


20 




432653 


N62096 


Hs.293185 


ESTs. WeaWy similar to JC7328 ammo aci 


5.5 


20 




446783 


AW138343 


Hs.141867 


ESTs 


4.4 


20 


75 


437323 


AA371145 


Hs.194397 


leptin receptor 


25 


20 


410076 


T05387 


Hs.7991 


ESTs 


29 


20 




421501 


M29971 


Hs.1384 


O^rethytguanine-ONA rneihyttransferase 


21 


20 




419733 


AW362955 




Homo sapiens cDNA FU1441 5 fis, clone HE 


25 


20 



80 
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Pkey: Unique Eos probeset Identifier number 

CAT number Gene duster number 

Accession: Genbank accession numbers 



Pkey 


CAT Numher 


Accession 


414680 


147525 1 


AA74TTC1 AAfWifffl AWK64S40 AA77571 1 AA150965 BE465475 AA968994 AA8B6905 AJ 141054 AW194991 AA252147 AA319500 AJ184288 






AA708749 AA644620 AA652769 AA242975 AA151074 T 19890 


416882 


162718 1 


A1S33044 AW01621 2 AW241 143 AA76905B R43272 AW068958 AA210918 AA293774 AI748B15 A1763294 A1333114 AI277384 AI088297 AI468477 






AlfttAR94 AWtfiQWIR AIR31751 740749 AIQR4fi71 AIR71^16 AA1fW0?4 AW235412 RF1784 


416913 


1R3001 1 


AW334714 RF 161 007 RF1fi?«5Q0 AW749M2 AW7498S4 BE 162498 BE161005 AA190449 AW51346S 6E161Q06 BE162499 






aiqiyhcvi aawcw aaoa^iqi aiqtiqi^ 


419733 




Awncoacc uccmcr AiiWtfiKfi lA/CAQCQ H 977^1 Tfiifi^ M7<i71<i wnAQC7 WR3fi7fi AAftfOfW AA514302 W63789 BE 0464 12 T91396 






AlQ^l Q7H AWnA4777 W?fVl1R AWfiftU4ATQM14 A11WM7 AAflOQR43 AAR4R719 AA5R1Q66 AA789002 


424036 


23460.1 


AA770688 H15373 AW161070 BE304523 BE378517 AA989300 AA904029 BE25421 1 AA449148 A1268420 A1300495 AI21 5637 AJ300494 AI268551 






AA928971 AA179427 AA947684 BE393792 H98018 AJ885781 AI188567 AI290658 C15404 


426413 


266650.1 


AA377823 AW954494 A1022688 


429258 


301917J 


AA448765 C04967 C03045 AA658293 


431676 


33641 1J 


AJ685464 AW971 336 AA51 3587 AA525 142 


432689 


35275.1 


AB018320 H56457 AA247916 N83488 N87920 AA095653T19858 AL134279 AA094167 AI673378 AI000340 R47500 W16595 AW1 52297 A1625937 






AA002027 A1814851 AA902666 AI039729 AW975053 BE302243 AI240793 AA193203 N5581 


438869 


46651.1 


AF075009 R63109R63068 


439518 


47334.1 


W7632SAF086341W72300 


442677 


548626.1 


AJ557914 W81031 AW473764 AI814081 W81068 AW182826 AW173296 AI376594 AI220500 BE257195 BE246486 R55637 C20788 A1014407 






AJ248353AW028015 


445808 


65133.1 


AV655234 AW966332 AA340239 


449625 


8113.1 


NM.014253 AF100772 BE088769 AL022718 BE161779 AW863569 BE161640 AL039060 BE1 68542 AW296554 AA323193 AA235370 AW779760 






N48674 AI375997 R45432 D59344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T613 


450580 


83929.1 


N40087 H12925 AA460779 AA096372 8E1 60847 AW816890 AW816893 AW816577 AW81 6891 AW81 6882 AW816868 AW81 6889 AW816940 






AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 225353 AA150883 AW8 


453802 


981589J 


A1134757AW079131 



TABLE 61 C 



Pkey: Unique number cones ponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. Dunham I. et al." refers to the publication entitled The DNA sequence of 
human chromosome 22." Dunham 1. et al., Nature (1939) 402489-495. 
Strand: Indicates DMA strand from which exons were predicted. 

Nt position: indicates nucleotide positions of predicted axons. 



Pkey 


Ref 


Strand 


NLposition 


401197 


9719705 


Plus 


176341-176452 


401519 


6649315 


Phis 


157315-167950 


401558 


7139678 


Pius 


103510-104090 


402802 


3287156 


Minus 


53242-53432 


403047 


3540153 


Minus 


59793-59968 


404641 


9796810 


Minus 


32247-32362 


405685 


4508129 


Minus 


37956-38097 


406068 


9114084 


Plus 


382-543 



Table 62A lists about 600 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the AffymetrWEos Hu03 GeneChip 
array such that the ratio of 'average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0, The "average" prostate cancer level was set to the 75* 
percentile amongst prostate cancers. The "average" normal prostate tissue level was set to the 95 ft percentile amongst normal prostate tissues. In order to remove gene-specific 
background levels of non-specific hybridization, the 10* percenQe value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the 
ratio was evaluated. 

TABLE 62A: ABOUT 600 GENES UP-REGUIATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE TISSUES 

Pkey: Unique Eos probeset Identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnlgenelD: Unigene number 
Unigene Title: Unigene gene title 



R1: 


Ratio of prostate tumor to normal prostate tissue 




Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


444151 


AW972917 


Hs.128749 


alpha^thytacyt-CoA racemase 


412 


421566 


NM_000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


26.5 


427274 


NM.005211 


Hs.174142 


colony stimulating (actor 1 receptor, to 


16.0 


420729 


AW964897 


Hs.290825 


ESTs 


15.8 


412326 


R07566 


Hs.73817 


small inducible cytokine A3 (homologous 


14.7 


416188 


BE157260 


Hs79070 


v-myc avian myetocytomatosis viral oncog 


13.6 


417696 


BE241624 


Hs.82401 


C069 antigen (p60, early T-cell acfivafi 


13.1 


424247 


X14008 


Hs.234734 


lysozyme (renal amyloidosis) 


12.6 


401197 






ENSP00000229263**.HSPC213. 


12.6 


426793 


X89887 


Hs.172350 


HIR (histone cell cyde regulation defec 


116 


450096 


AJ682088 


HS79375 


rtdccarboxyiase synthetase (biottn-{prop 


114 


449156 


AF103907 


Hs.171353 


prostate cancer antigen 3, non-coding DO 


113 


443271 


BE568568 


Hs.195704 


ESTs 


11.6 


434078 


AW880709 


Hs.283683 


chrornosorrte 8 open reading frame 4 


\\2 


423396 


A082555 


HS.1Z7950 


bromodomaiiVGonlairjinQ 1 


1U 
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10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



406038 


Y14443 


426559 


AB001914 


419553 


N34145 


418038 


237976 


410023 




421684 


RC001CQ1 
OC40 I09t 




KIM 001fi91 


437866 


AA1557A1 
AAI0O/O1 


440274 


R945Q5 
r\44030 


90 1 11/ 


ATUU0044 


/nci At 
4U0I41 


VI 4441 
T 14440 




AB 00 2367 


4171K 


A1UOUU44 


433332 


A11R71A7 
AIOO/O**/ 


jJCCC 14 


A\/£5inn 

AVOOOl 1U 




a Ann 17*10 

AAUUI/04 


432606 


KIM no?i04 


417381 


AF1A4147 
MT 104144 


495174 

4401/4 


U0/4OU 




ArUOOOC I 


** 10104 




44340U 


i I07nifl 

U3/UI0 


°iC 131 J 


AJ 304000 


41491 7 
4044 I / 


AWU14/yO 


41 71fi1 
41/000 


AVAM9Q157 
AW 14300/ 


A 947A9 
**40 /04 


1 tec/ eft 
U00408 


449V11 


AA11R9R7 
AA0104D/ 


^ccfOUO 


AA1O4101 
AA034100 


/ 900(5/ 

443304 


Al ncA1M 
ALU0U1U4 


44 9»17 
440O0/ 


AI304040 


443U<J0 


74k/v;i 

4.40U01 


*H*400U 


AW i lOOOO 


417407 


AA3404/0 


41P.QP.9 
40D304 


A\A/177114 
AWO/ fOI4 


450111 
40U0 10 


AJU00303 


420120 


Al A4QRin 
ALU430 IU 


4.1RR11 
*WOO 10 




419078 


MQ111Q 
MOOl 13 


4950(15 
4403V/0 


AR0190CQ 
ABU04339 


451900 


ADA011QQ 


4477RQ 
44//03 


AU/0717fM 
AWO/0/U4 


ACOA1P. 

404400 


DCU//O40 


40400? 


DClO/443 


440UI0 


DC 0.4 COT7 
DC4404// 


447717 
44/ /Of 


Ai/nnAeJi 
AKUUU040 


424897 


□53216 


433oo7 


AK000596 


44/OUo 


D26067 


jIIOICO 

401003 


AA0/D049 


404Z4l 


U41044 


416051 


AAooooDo 


441 1 01 


kill |V\CC7j( 

NI\^JuU5574 


VMQIV7C 
44oU/o 


AJ 100 140 


49A7CO 
440/03 


AI09U4U1 


41G9Q9 
410434 


AA17Q91 < l 
AA1/3400 


417109 
41/ 0U4 


DCO/CO10 
DC440014 


417022 


NM (114717 


451957 




452859 


A1300S55 


413431 




408405 


AK001339 

MINUU IMC 


421552 


ACflOCCQO 

ATU40094 


426108 




4C3H3U 


At071111 
A13/ 1 101 


45491 Q 
4044 13 


Y75049 
A/0U44 


441090 
•♦40U4U 


Aiicnnca 
AlOOUUOO 


410/04 


DC11/C14 
DCJ14044 


414404 


AI43930U 


4i70an 


DCZ41030 


AAA'iQi 
444001 


DCOO/009 


44R01Q 


AWQ471R4 
AW34/104 


409799 


011928 


424308 


AW975531 


411373 


BE326276 


423445 


NKL014324 


422283 


AA410508 


418196 


AI745649 


438209 


AL120659 


437147 


AL049964 


432415 


T16971 


446896 


T15767 


408242 


AA251594 


447965 


AW292577 



zinc finger protein 200 1 1.1 

Hs.170414 paired basic amino add cleaving system 10.6 

Hs.10177 ESTs, Moderately similar to ZN91..HUMAN Z 10.4 

Hs. 83337 latent trartsforrning growth factor beta b 10.3 

I&57929 sSt (DrosophDa) homotog 3 10.1 

Hs.106768 hypothetical protein RJ1051 1 9.9 

Hs.170087 aryl hydrocarbon receptor 9.9 

metaflothtoneln IE (functional) 9.5 

Hs.7122 scrapie responsive protein 1 9.4 

Hs.250500 delta Prosoprrilaj-fikel 9.4 

zinc finger protein 200 9.2 

Hs.21355 doublecortin and CaM kinase^ke 1 9.2 

Hs.180450 ribosomal protein S24 9.0 

Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequ 9.0 

Hs. 106650 hypothetical protein FU20533 8.9 

Hs. 173233 hypothetical protein FU 10970 &9 

Hs.3066 granzyme K (serine protease, granzyme 3; 8.8 

Hs.82042 solute carrier family 23 (nucleobase tra 8.6 

Ks.154978 KIAA0261 protein 8.6 

Ks.13131 lymphocyte adaptor protein 8.5 

Hs.79069 cyclinG2 8.4 

Hs. 12451 echboderm microtubule-associated protei 8.4 

Hs.1 09439 osteoglydn (osteoinductive factor, mime 8.4 

Hs.23349 ESTs 8.3 

Hs.329700 ESTs 8.3 

Hs.1 59525 cell growth reguJatory with EF-hand doma 8.3 

Hs.23128 ESTs 8.3 

Hs.26873 ESTs a2 

Hs.227209 hypothefcal protein FU21 61 7 ai 

Hs.9884 spindle pole body protein 8.0 

Hs.22920 similar to S68401 (cattle) glucose Indue 8.0 

Hs.154150 ESTs 7.9 
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Hs.332633 Bardet-Bledl syndrome 2 7.6 

Hs.95243 transcription elongation factor A (Sit)- 7.5 
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Hs.89584 insulinoma-associated 1 7.4 
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Hs.27207 KIAA0982 protein 7.3 

Hs.320831 Homo sapiens cDNA FU 14597 fis, done KT 7.3 

Hs^1447 ESTs, Moderately simflar to A46010 X-Bn 7.3 

Hs.29206 hypothetical protein MGC14376 7.2 

Hs.154196 E4F transcription factor 1 7.2 

Hs.19404 0KFZP564L0862 protein 7.2 

Hs.153684 frizzlakelated protein 7.1 

Hs.3618 hippocateu>like1 7.1 

Hs.174905 K1AA0033 protein 7.1 

Hs.274415 Homo sapiens cDNA FU 10229 fis, done HE 7.0 

Hs.288057 hypothetical protein FU22242 7.0 

Hs.25253 mannosidase, alpha, class 1A, member 1 6.9 

Hs.1 84585 UM domain only 2 (rhomboiWike 1) 6.9 

Hs.20196 adenytate cydase 9 6.9 

Hs.21213 ESTs 6.8 

Hs.42390 nasopharyngeal card noma susceptibility 6.8 

Hs.8941 ESTs 6.8 

Hs.80905 Ras association (RalGDS/AF-6) domain fam 6.7 

Hs.1 0299 Homo sapiens cONA FU 13545 fis. done PL 6.7 

Hs.288158 hypothetica) protein FU23591 6.7 

Hs.75355 ubiquitirwoniugafjng enzyme E2N (nomolo 6.7 
Hs.44672 hypothetical protein FU10470 6.7 
Hs.1 05700 secreted frizzled-reiated protein 4 6.6 
Hs.1 66468 programmed cell death 5 6.6 
Hs.23889 ESTs, Weakly similar to ALU7_HUMAN ALU S a6 
Hs.44313 v-rel avian reticutoendothetiosis viral 6.5 
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Hs. 78776 putative transmembrane protein 6.4 
Hs.334885 mitochondrial GTP binding protein 6.4 
Hs.82848 sdectin L (lymphocyte adhesion molecule 6.4 
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Hs.1 54443 truruchromosorne maintenance deficient (S. 6.3 
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Hs.289014 ESTs, Weakly stmSar to A43932 mucin 2 p 6.0 
Hs.22452 Homo sapiens mRNA for K1AA1737 protein, 6.0 
Hs.43913 PiBFI gene product 5.9 
Hs.94445 ESTs S9 
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chromatin assembty factor 1 , submit A ( 5.5 
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coated veside membrane protein 5.4 

Homo sapiens cONA: FU22528 fis, clone H 5.4 
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COA14 5.4 
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annexinA3 5.3 

eukaryoSc translation elongation factor 5.3 

Homo sapiens mRNA; cONA DKFZp56400472 (f 5.3 
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K1AA0187 gene product 5.1 
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ESTs 5.1 

ESTs. Moderately similar to 138022 hypot 5.1 
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ESTs 5.0 
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topoisomerase (DNA) II alpha (170k0) 4.9 

hypothetical protein FU20039 4.9 
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hypothetical protein FU10914 4.9 
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RAS p21 protein activator (GTPase activa 4.8 
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inositol potypriasphate-4-phosphatase, ty 4.8 

ESTs 4.8 

SAC2 (suppressor of actin mutations 2, y 4.8 

zinc finger protein 131 {done pHZ-10) 4.7 

ESTs 4.7 
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ESTs 

ESTs, WeaWy similar to T34036 rrypothetl 

activating transcription factor 3 

step U spCcing factor SLU7 

ESTs 

ESTs . 

ESTs 

gb:tl88f04.x1 NCLCGAPJ>r28 Homo sapiens 
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gycosyttransferase 
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ESTs, Highly similar to cytokine recepto 

ESTs 

ESTs 
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eukaryorJc translation initiafion factor 
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chromosome 21 open reading frame 5 

DKFZp434J1813 protein 

ATPase, aminophosphofipidtransporter-ll 
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ESTs 

716 



4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.6 
4.6 

4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4,5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.1 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



ATCcrrc 
tOOO/D 


AJ45o2i3 


Hs.77542 


*WO30O 


DC 0 Hoon 


Hs, 10029 


41fWJ7 


AB 020725 


Hs£8009 


411QCQ 
t300 


AI2D7410 


Hs.69280 


**< i i/y 


oc4o9o99 


Hs.89271 




DC »7Cnnfl 


Hs.1 04916 


4/oybo 


AHJ59214 


Hs.1 94687 




A1861854 


Hs.210778 




A1567669 


Hs.40342 


440400 


AT765890 


Hs.16341 


440004 


AIo73o17 


Hs.334691 


404390 






4UOOID 


AW957974 


Hs.25485 


417512 


X76534 


Hs.82226 


414000 


AL039185 


Hs.77558 




AW970043 


Hs.238039 


432242 


AW022715 


Hs. 1621 60 


40000/ 


AW204232 


Hs.279522 


400030 


AJ304997 


Hs7572 


430157 


BE348706 


Hs.278543 


VIC 1 1CQ 

4ouoy 


NM_00o052 


ll Ar*A J r» 

Hs.26146 


456776 


AV654978 


Hs.19904 


400301 


X03635 


Hs.1657 


443846 


AI085198 


Hs.164226 


451491 


AI972094 


Hs.286221 


452234 


AW084176 


Hs.223296 


430389 


AL1 17429 


Hs.240845 


413275 


W94748 


Hs.48821 


418432 


M14156 


Hs.85112 


432261 


R42216 


Hs.12342 


433409 


AI278802 


Hs.25661 


40/iob 


AA1 20761 


Hs.28307 


417187 


AB011151 


Hs,334659 


421921 


no 33 S3 


Hs.6820 


AATJA7 


AWo09351 


Hs.287955 


415710 


J04543 


HsJ8637 


457794 


AA589292 


Hs.246850 


**0 1 IZ4 


All 00 AW 


II- A4 inn 

Hs.31432 


AO MM 


UR7*>On 


Hs. 169681 




t\fJQ07AQA 

Awyy/4o4 


Hs.5003 


44£f0l 


Al boo 167 


Hs.131044 


410030 


BtoooYoO 


Hs.86978 


423749 


U09848 


Hs.1 32390 


439394 


AA149250 


Hs.55105 


442287 


AW952703 


Hs.8182 


436758 


AW977167 


Hs.1 55272 


451030 


H40697 


Hs.181426 


405934 






439231 


AW581935 


Hs.141480 


444755 


AA431791 


Hs.1 13823 


447276 


AL049795 


Hs.17987 


431912 


Al 660552 


Hs76549 


4£049o 


AK001745 


Hs.184628 


412978 


AI431708 


Hs.820 


41490Z 


AF273304 


Hs.235376 


44/4oZ 


A DA44ArA 

AB 033059 


Hs.1 8705 


/MQGQA 

443oyu 


AAO84070 


Hs^68835 


4bUZUo 


Ar 097994 


Hs.301528 


450899 


T77447 


Hs.177864 


423337 


«M_004003 


Hs. 127337 


452588 


A A OQ01 on 


Hs.1 10637 


4££00l 


NMJ/14700 


Hs.1 19004 


40/ZOl 


BE303000 


Hs.288929 


450506 


A til rtnll^A 

NM.C04460 




407317 


Al 204033 


Hs^71461 


410330 


AW023630 


Ks.159425 


444143 


AW747996 


Hs.160999 


415156 


XB49Q8 


HsJ8060 


422975 


AA347720 


Hs.122669 


403100 






422449 


AF168418 


Hs.1 16784 


424163 




Hs.1 42245 


424806 


AA382523 


Hs.105689 


425878 


AW9S480S 


Hs.38085 


432363 


AA534489 




443441 


AW291196 


Hs.92195 


432529 


AI989507 


Hs.162245 


432908 


AI861896 


Hs.304505 


424723 


BE4098.13 


Hs.152337 


424855 


AW204725 


Hs.25560 


414664 


AA587775 


Hs.66295 


437162 


AW005505 


Hs.5464 


421828 


AW891965 


Hs.279789 



ESTs 4.1 

cathepsinC 4.1 

KIAA0918 protein 4.1 

Homo sapiens, done IMAGE:3636299, mRNA. 4.1 

ESTs 4.1 

hypothetical protein FU21940 4.1 

cholesterol 25-hydnwyiase 4.1 

hypothetical protein FU 10989 4.1 

putative nudear protein 4.1 

MAWD binding protein 4.1 

splicing factor 3a, subunit 1, 120kD 4.1 

Target Exon 4.1 

hypothetical protein FU22341 4.1 

glycoprotein (transmembrane) nmb 4.1 

thyroid hormone receptor Interactor 7 4.1 

hypothetical protein FU 11090 4.1 

ESTs, Weakly similar to ALU4_HUMAN ALU S 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs 4.1 

Down syndrome critical region gene 3 4. 1 

cystathionase (cystathionine gamma-lyase 4. 1 

estrogen receptor 1 4.1 

ESTs 4.1 

Homo sapiens cDMA FU 1 3741 (is, done PL 4.1 

ESTs, Weakly similar to I38022 hypotheti 4.1 

DKFZP434D146 protein 4.0 

ESTs 4.0 

insulin-like growth factor 1 (somatomedi 4.0 

Homo sapiens done 24538 mRNA sequence 4.0 

ESTs 4.0 

WW domain rinding protein 4 (formin bind 4.0 

hypothetical protein MGC14139 4.0 

translocase of inner mitochondria) membr 4.0 

Homo sapiens cDNA FU 13090 fis, done NT 4.0 

annexinA7 4.0 

ESTs 4.0 

cardiac ankyrtn repeat protein 4.0 

death efTector domahvcontaining 4.0 

K1AA0456 protein 4 0 

ESTs 4.0 

prolyl endopeptidase 3.9 

zinc finger protein 36 (KOX 1 8) 3.9 

ESTs 3.9 

synaptic nuclei expressed gene 1 b 3.9 

ESTs 3.9 

ESTs. Moderately similar to YSHUT threon 3.9 

Target Exon 3.9 

Homo sapiens mRNA; cONA DKFZp434N079 (fr 19 

CIpX (caseinolytic protease X,E.coli) 3.9 

hypothetical protein MGC1203 3.9 

ESTs, Weakly similar to A561 54 AW subst 3.9 

hypothetical protein FU10883 3.9 

homeoboxC6 3.9 

XPMC2 protein 3.9 

KIAA1233 protein 3.9 

ESTs 19 

L-kynurenine/alpha-aminoadipate arro'notra 3.9 

ESTs 3.9 

axln 2 (conducfln, axil) 3.8 

homeoboxAlO 3.8 

WAA0665 gene product 3.8 

hypothetical protein FU13258 similar to 3.8 

fibroblast activation protein, alpha 3.8 

ESTs,WeakrysirnJlartol3W22hypotheti 3.8 

ESTs 3.8 

ESTs, Moderately similar to A561 94 Oirom 3.8 

phosphorytase kinase, beta 3.8 

K1AA0264 protein 3.8 

C2C0102r.-gll7296271|gb|AAF5156Z1|(AEO 3.8 

thyroid hormone receptor interactor 4 18 

HEfW-HLmassodating3 3.8 

MSTP031 protein 18 

Homo sapiens. Similar to R1KEN cDNA 0610 3.8 

gtefif76g1U1Na_CGAP.Co3Homo sapiens 3.8 

ests as 

ESTs a8 

ESTs 3.8 

protein argjnme N-methyltransferase 3(h 18 

ESTs as 

muIl^DZ-dariairH^taining protein 18 

thyroid hormone receptor coacthvaiing pr 18 

rustonedeacely1ase3 18 
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451524 
418821 
422049 
453160 
416072 
424906 
446554 
449523 
452259 
417348 
430289 
435401 
448807 
413403 
418323 
449125 
408089 
428242 
439209 
434927 
431735 
456619 
458082 
411213 
414709 
415339 
418203 
427283 
452260 
439506 
420077 
451009 
436473 
413198 
443067 
452281 
452561 
433269 
445817 
447033 
414171 
427138 
442991 
434974 
443399 
401807 
413979 
414680 
417380 
425710 
441247 
445183 
426931 
417873 
419594 
446416 
449008 
456719 
432101 
453565 
449217 
412922 
450066 
429302 
451246 
415528 
435513 
440643 
409088 
439132 
451173 
422673 
401597 
401744 
432723 
449455 
450628 
432125 
435411 
434629 
437158 
416375 
419441 



AK001466 Hs.26516 hypothetical protein FU 10604 

AA436002 Hs.183161 ESTs 

W25760 Hs.77631 glycine cleavage system protein H (amino 

AI263307 Hs.239884 H2B htstone family, member L 

AL1 10370 Hs.79000 growth associated protein 43 

Hs.153716 Homo sapiens mRNA for Hmob33 protein, 3* 

Hs.301789 nudix (nucleoside diphosphate Bnked mol 

Hs.54443 chemokine (C-C motif) receptor 5 

Hs.28707 signal sequence receptor, gamma (translo 

Hs.31 8526 hypothetical protein FU12661 

Hs.238039 hypothetical protein FU1 1090 

Hs.10056 hypothetical protein FU14621 

Hs.7549 ESTs 

Hs.198425 ESTs, Moderately sirrflar to S65657 alpha 

Hs.1 1 62 major histocornpafiblHty complex, class 

Hs.196029 Homo sapiens mRNA for K1AA1657 protein, 

Hs.42644 thioredoxin-Uke 

Hs.2250 leukemia inhibitory factor (cholinergic 

Hs.91954 ESTs 

Hs.293815 Homo sapiens HSPC285 mRNA, partial cds 

Hs.75968 thymosin, beta 4, X chromosome 

Hs.107153 inhibitoT of growth family, member 1-Ck 

Hs.314451 ESTs. Weakly similar to ALU LHUMAN ALU S 

Hs.69285 neuropil 1 

Hs.77031 Sp2 transcription factor 

Hs.78398 KIAA0071 protein 

Hs.83758 COC28 protein kinase 2 
Hs.174185 



AA151730 
NM_000579 
AA317439 
AI940507 
AK001952 
R44477 
AJ571940 
AA129105 
NMJW2118 
A1671439 
H59799 
H55709 
AF087993 
H46612 
AW977724 
AV647917 
AW978811 
AA676939 
AA704703 
NNL015156 
X54942 
AL1 19796 
AA453208 
'AI361238 
AW512260 
AA013140 
AI193122 
AW157712 
AI077389 
T93500 
AI692181 
AI343543 
NM.003642 
AI357412 
AA360328 
N77624 
BE281238 
AA778711 
AM52659 

BE218090 
AA743331 
T06809 
AF030880 
AW118681 
AB007877 
N^.003416 



RAB9, member RAS oncogene family 

Ks.41136 ESTs 

Hs.87767 ESTs 

Hs.115707 ESTs 

Hs.132275 ESTs 

Hs.47534 ESTs, Weakly similar to 138022 hypotheti 

Hs.269818 ESTs,WeaHysimaartoZ195 HUMAN ZINC 

Hs.28792 HomosaplenscDNAFU11041 fis,cbnePl 

Hs.49169 WAA1634 protein 

Hs.126890 ESTs 

Hs.13340 histone acetyttransferase 1 

Hs.157601 ESTs 

Hs.865 RAP1A, member of RAS oncogene family 

Hs.173717 phosphatidic acid phosphatase type 28 

Hs.8886 hypothetical protein FU20424 

Hs.4310 eukaryotic translation initiation factor 

Hs.61 153 proteasome (prosome, macropain) 26S subu 

C7001350:gi|6578126|gb|AAF17706.1|AF0496 

Hs.279453 ESTs 

hemoglobin, alpha 2 

Hs.332086 ESTs 

Hs.159275 solute carrier family, member 4 

Hs.128051 Homo sapiens thymic stromal rymphopoletl 

Hs.12385 KIAA0417gene product 

Hs.2076 zinc finger protein 7 (KOX 4, clone HF.1 

Hs.293659 Homo sapiens, Similar to RIKEN cDNA A430 

AA013051 Hs.91417 topoiscmerase (DMA) II binding protein 

AV658299 Hs.1 63959 ESTs 

AW578003 Hs.22826 tropomodutln 3 (ubiquitous) 

ZWto ankyrin 3, node of Ramder (ankyrin G) 

AI918950 Hs.123642 EphA3 

BE298808 Hs.33363 DKFZP434N093 protein 

AA278536 Hs.23262 ribonudease, RNase A family, k6 

M50721 Hs.74870 H2.0 (DrosophllaHike homeo box 1 

H56499 Hs.252692 ESTs, Weakly similar to 138022 hypotheti 

AU076674 Hs.198899 eukaryotic translation Initiation factor 

AW189232 Hs.39140 cutaneous T-ceH lymphoma tumor antigen 

R17236 gb.yg12c04.r1 Soares infant brain 1NIB H 

AW404075 Hs.42785 DC11 protein 

M78878 Hs.7314 K1AA061 4 protein 

AW236991 Hs.102495 ESTs, Moderately similar to 138022 hypol 

H72803 Hs.33363 ESTs 

AI765082 Hs.48317 ESTs 

N59027 gfcyv59d1 1x1 Soares fetal Ever spleen 

AA172106 Rag C protein 
Target Exon 

D29677 Hs.3085 KIAA0054 gene product; Helicase 

NM.004380 Hs^3598 CREB binding protein (Rubinstein-Taybi s 

AW382884 Hs^04715 ESTs 

AW972667 Homo sapiens cDNA FU12300fis, clone MA 

AW444619 Hs.138211 ESTs 

AA789081 Hs.4029 gtoa-ampiified sequence-41 

AW090198 KIAA1150 protein 

H95567 Hs.124700 ESTs 

AW023731 Hs.274368 MSTP032 protein 



18 
3.8 
3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3,7 
3.7 
3.7 
3.7 
17 
3.7 
3.7 
3.7 
37 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
16 
16 
3.6 
16 
16 
3.6 
16 
16 
16 
16 
16 
16 
3.6 
16 
16 
15 
15 
15 
3.5 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
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432741 AI732358 Hs.185118 ESTs, Moderately similar to A374 13 catbl 3.4 

439192 AW970536 Hs.105413 ESTs 3.4 

440474 AI207936 Hs.7195 gamma-antinobutyric aid (GABA) A recepto 3.4 

428456 AF151063 Hs.184456 hypothetical protein 3.4 

5 447397 BE247676 Hs.18442 E-1 enzyme 3.4 

426828 NM_000020 Ks.172670 acfivin A receptor type H-fike 1 3.4 

418293 A1224483 Hs. 16063 hypothetical protein RJ 21 877 3.4 

424779 AL046851 Hs.153053 C037 antigen 3.4 

. 415015 NM 002109 Hs.77798 histidyl-tRNA synthetase 3.4 

10 425900 AK0O0166 Hs.288809 hypothetical protein FU20 159 3.4 

408113 T82427 Hs.194101 Homo sapiens cDNA: FU20869 fis, clone A 3.4 

452852 AK001972 Hs.30822 hypothetical protein FU1 1 1 10 3.4 

414671 AA832479 Hs.7208O ESTs 3.4 

431689 AA305688 Hs.267695 UDP-Gal:betaQcNAc beta 1,3-galactosyttr 3.4 

15 444480 AI150730 ' Hs.265640 ESTs, Moderately similar to A47582 B-cel 3.4 

403389 C3001393*:gi|3327090|dbi|BAA31613.1HAB 3.3 

440749 W22335 Hs.7392 hypothetical protein MGC3199 3.3 

408374 AW025430 Hs.155591 forkheadboxFI 3.3 

410494 M36564 Hs.64016 protein S (alpha) 3.3 

20 426997 BE620738 Hs.173125 peptidylprolyl isomerase F (cyclophllin 3.3 

433505 AW504027 Hs.15301 Homo sapiens cDNA FU 12596 fis, clone NT 3.3 

400277 Eos Control 3.3 

411031 W37943 Hs.34892 KIAA1323 protein 3.3 

417295 AW993524 Hs.43148 ESTs 3.3 

25 438459 T49300 Hs.35304 Homo sapiens cDNA FU13655 fis, clone PL 3.3 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 3.3 

437872 AK002015 Hs.5B87 RNA binding motif protein 7 3.3 

443912 R37257 Hs.1 84780 ESTs 3.3 

445270 AI762154 Hs.54982 Homo sapiens cDNA FU14014 fis, done HE 3.3 

30 425836 AW955696 Hs.90960 ESTs 3.3 

414565 AA502972 Hs.183390 hypothetical protein FU 13590 3.3 

418700 AI963808 Ks.86970 ESTs, Moderately similar to ALU5_HUMAN A 3.3 

428186 AW504300 Hs£95605 mannosidase, alpha, class 2A, member 2 3.3 

436499 AJ276678 Hs.283102 HEF like Protein 3.3 

35 448164 R61680 Hs.26904 ESTs. Moderately similar to Z195.HU MAN Z 3.3 

426848 H72531 Hs.36190 ESTs 3.3 

439653 AW021103 Hs.6631 hypothetical protein RJ20373 3.3 

450094 A1174947 Hs.295789 Homo sapiens mRNA; cDNA OKFZp564D1 164 (f 3.3 

402474 N^004079:Homo sapiens cathepsin S (CTSS a3 

40 424800 AL035588 Hs.153203 MyoD family inhibitor 3.3 

430280 AA361258 Hs.237868 interleukin 7 receptor 3.3 

408968 A1552236 Hs.49376 hypothetical protein FU20644 3.3 

453548 AL079983 Hs.1 16774 Integrin, alpha 1 3.3 

A _ 451367 AA923729 Hs.26322 cell cycle related kinase 3.2 

45 444445 AA342329 Hs.1 15920 Homo sapiens cONA: FU 22816 fis, done K 3.2 

445472 AB006631 Hs.12784 Homo sapiens mRNA for WAA0293 gene, par 3.2 

400658 ENSP00000237081*:KIAA1217 PROTEIN (FRAGM 3.2 

408101 AW968504 Hs.123073 CDC2-related protein Mnase 7 3.2 

410366 A1267589 Hs.302689 hypothefical protein 3.2 

50 411048 AK001742 Hs.67991 hypotheticai protein DKFZp434G0522 3.2 

421246 AW582962 Hs.102897 CGI-47 protein 3.2 

422763 AA033699 Hs.83938 ESTs, Moderately slmflar to MAS2_HUMAN M 3.2 

426227 U67058 Hs.154299 Human proteinase activated receptor-2 mR 3.2 

439047 AW979177 gb:EST391287 MAGE resequences, MAGP Homo 3.2 

55 444451 AV650179 Hs.282431 ESTs 3.2 

420568 F09247 Hs.247735 protocadherin alpha 10 3.2 

, 406973 M34996 Hs.198253 major histocompatiblfity complex, class 3.2 

402802 NM 001397:Homo sapiens endothelin conver 3.2 

431304 8E157283 gb:RC4-HT0373.13020f>011.a03HT0373Homo 3.2 

60 406247 Target Exon 3.2 

411653 AF070578 Hs.71168 Homo sapiens clone 24674 mRNA sequence 3.2 

412639 AW961284 Hs.296235 ESTs 3.2 

419168 A1336132 Hs.33718 Homo sapiens cDNA RJ 12641 fis, done NT 3.2 

433561 BE540937 Hs.20104 hypothefical protein FU00052 3.2 

65 436207 AA334774 Hs.12845 hypothetical protein MGC13159 3.2 

438461 AW075485 Hs.286049 phosphoserine aminotransferase 3.2 

455473 AW9B4788 gb:RC1-HN0015.1204004)21-c07HN0015Homo 3.2 

441683 BE564214 Hs.1 02946 ESTs 3.2 

„ 417386 AL037228 Hs.82043 0123 gene product 3.2 

70 427286 AW732802 Hs.2132 epidermal growth factor receptor pathway 3.2 

442355 AA456539 Hs.8262 lysoscrnaVasscdaied membrane protein 2 3.2 

422473 U94780 Hs.1 17242 meningioma expressed antigen 6 {coiled-c 3.2 

407816 AW500857 Hs.40137 anaphase-promoling compiex 1; meiotic ch 3.2 

nc 458809 AW972512 Hs.20985 sin3-essociated polypeptide, 30kD 12 

75 453864 AW021407 Hs.21068 hypothefical protein 3.2 

448944 AB014605 Hs.22599 atrophin-1 interacting protein 1; activi 3.2 

435129 AI381659 Hjl267086 ESTs 3.2 

424894 H83520 Hs.153678 reproductions 3.2 

415959 D28459 Hs.80612 ubiquifirvconjugafing enzyme E2A (RAD6 h 3.2 

80 410185 BE294068 Hs.737 snmediate early protein 3.2 

451149 ALD47586 Hs. 10283 RNA binding moth* protein 8B 3.2 

410531 AW752953 gb:QVT>a0224-26109W35^02CT0224Horno 12 

411954 AA652523 Hs.269496 ESTs, Weakly simSar to 138022 hypothec 3.2 
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423748 


AI149048 


Hs.30211 


hypothetical protein FU 2231 3 


12 


444489 


AI151010 


Hs.157774 


ESTs 


12 


453124 


AI139058 


Ks. 125790 


leudne-rich repeat-containing 2 


3.2 


452413 


AW082633 


Hs.212715 


ESTs 


11 


409390 


A1927212 


Hs.3734 


ESTs 


3.1 


453078 


AF053551 


Hs.31584 


metaxin 2 


11 


453024 


AW846787 




gb:QV3-CT0194-181099-004-e01 CT0194Homo 


3.1 


439444 


AI277652 


Hs.54578 


ESTs, Weakly similar to I38Q22 hypolheti 


3.1 


441124 


T97717 


Hs.119563 


ESTs 


11 


416128 


AA173632 




COC14 (cell division cycle 14, S. cerevi 


11 


445101 


T75202 


Hs.12314 


Homo sapiens mRNA; cONA DKFZp586C1019 (f 


11 


407949 


W21874 


Hs.247057 


ESTs, Weakly similar to 2109260A B ceQ 


3.1 


413492 


087470 


Hs.75400 


WAA0280 protein 


3.1 


421044 


AF061871 


Hs.101302 


Human DNA sequence from done RP1-238D1 5 


11 


422603 


BE242587 


Hs.118651 


hematopoieCcaHy expressed homeobox 


11 


427399 


NM.014883 


Hs.177664 


WAA0914 gene product 


11 


434804 


AA649530 


Hs.348148 


gb:ns44fl&s1 NCLCGAPJUvl Homo sapiens 


11 


442906 


AW296888 


Hs.170939 


ESTs 


3.1 


445840 


AI277811 


Hs.146291 


ESTs 


3.1 


446084 


AL137658 


Hs.13768 


Homo sapiens mRNA; cONA DKFZp434H216 (f 


3.1 


435293 


AI040777 


Hs.117170 


ESTs 


3.1 


400750 






TargetExon 


3.1 


416852 


AF283776 


Hs.80285 


Homo sapiens mRNA; cDNA DKFZp586C1723 (f 


3.1 


423566 


AW976434 


Hs.3623 


hypothetical protein FU 1 1 220 


11 


409558 


BE253407 


Hs.72363 


Homo sapiens mRNA for FU00 1 1 6 protein, 


3.1 


400194 






NNL003763:Homo sapiens syntaxin 16 (STX1 


3.1 


400479 
402895 






Target Exon 

ENSP00000252117:MyeDn transcription tao 


3.1 
11 


403423 






TargetExon 


3.1 


408161 


AW952912 


Hs.300383 


hypothetical protein MGC3032 


11 


410886 


AW809324 




gb:MR4-ST0121-141099^)1(W306_1ST0121 Ho 


3.1 


414922 


D00723 


Hs.77631 


glycine deavage system protein H (amino 


11 


420121 


AW968271 


Hs.191534 


ESTs, Weakly similar to ALU1JWMAN ALU S 


11 


430519 


AF 129534 


Hs.49210 


F-box only protein 4 


11 


447147 


AA910353 


Hs.292815 


ESTs. Weakly similar to T23482 hypolheti 


11 


422481 


AL050163 


Hs.117339 


DNAX-acQvation protein 10 


11 


430487 


087742 


Hs.241552 


K1AA0268 protein 


11 


409549 


AB029015 


Hs.54886 


phosphoDpase C, epsiton 2 


11 


423554 


M90516 


Hs.1674 


glutairine-mjctose-tphosphate transamin 


3.1 


431286 


AW263476 


Hs.44268 


myeTin gene expression factor 2 


11 


430226 


BE245562 


Hs.2551 


adrenergic, beta-2-, receptor, surface 


11 


408051 


AI623351 


Hs.172148 


ESTs 


11 


420090 


AA220238 


Hs.94986 


nbonudeaseP(38kD) 


11 


400296 


AA305627 


Hs.139335 


ATP-bindlng cassette, sub-family C (CFTR 


3.1 


414279 


AW021691 




GCN5 (general control of amino-acid synt 


11 


440426 


AI159B00 


Hs.7181 


Homo sapiens cONA FU 1 3663 fis, done PL 


11 


423872 


AB020316 


Hs.134015 


uronyl 2^suIfotransferase 


3.1 


420154 


AJ093155 


Hs.95420 


JM27 protein 


11 


447560 


AF065214 


Hs.18858 


phosphoHpase A2, group IVC (cytosdic, 


11 


453293 


AA382267 


Hs.1G653 


ESTs 


3.1 


434398 


AA121098 


Hs.3838 


serum-indudble kinase 


3.0 


414135 


NM.004419 


Hs.2128 


dual specificity phosphatase 5 


10 


441527 


W19504 


Hs.7884 


sdute carrier family 21 (organic anion 


3.0 


423957 


AW978309 


Hs.136235 


Homo sapiens cONA FU13542 fis, done PL 


10 


444001 


AI095O87 


Hs.152299 


ESTs, Moderately simflar to S65657 alpha 


3.0 


411315 


AW836547 




gb:PM!LT0032-O30100-006*08 LT0032 Homo 


3.0 


414178 


AW957372 


Hs.46791 


ESTs, Weakly simflar to I38022 hypotheti 


10 


441401 


Al 824338 


Hs.126891 


ESTs , 


10 


457718 


F18572 


Hs.22978 


ESTs, Weakly similar to ALU4_HUMAN ALU S 


10 


429500 


X78565 


Hs.289114 


hexabrachion (tenasdn C cytotacfln) 


10 


424240 


AB023185 


Hs.143535 


caldum/catmoduiirHlependent protein kin 


3.0 


447476 


BE293466 


Hs.20880 


ESTs, Weakly similar to I38022 hypotheti 


10 


419440 


AB020689 


Hs.90419 


WAA0882 protein 


3.0 


421786 


AI188653 


Hs.21351 


ESTs 


10 


433285 


AW975944 


Hs^37396 


ESTs 


10 


409997 


A1906055 


Hs.57749 


synaptic nuclei expressed gene 2; WAA10 


3.0 


417874 


BE616160 


Hs.82829 


protein tyrosine phosphatase, noiwecept 


3.0 


420956 


AA351584 


Hs.100543 


Homo sapiens clone 24505 mRNA sequence 


10 


445345 


AW003850 


Hs.12532 


chromosome 1 open reading frame 21 


3.0 


450571 


AI356967 


Hs.43086 


ESTs, Weakly similar to A4601 0 X-flnked 


10 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cDNA OKFZp564N0763 (f 


10 


434953 


BE0491Q2 


Hs.179982 


ESTs. Weakly sMar to TRHY.HUMAN TRICH 


10 


436469 


AK001455 


Hs.5198 


Down syndrome critical region gene 2 


3.0 


428130 


AW444985 


Hs.77603 


ESTs 


3.0 


414839 


X63692 


Hs.77462 


DNA (cytostne^wrotrr/l transferase 1 


10 


445784 


A1253155 


Hs.146065 


ESTs 


3.0 


446677 


AI800311 


Hs.156291 


ESTs 


3.0 


426448 


R06054 




gb^e89g07j1 Scares fetal liver spleen 


3.0 


445921 


AW015211 


Hs.146181 


ESTs 


3.0 


406673 


M34996 


Hs.198253 


major histocompatibiliry complex, class 


10 


430387 


AW372884 


Hs.240770 


nuclear cap binding protein subunit 2. 2 


3.0 


427157 


U51166 


Hs.173824 


thymtne-DNA glycosytase 


10 


435907 


AW304025 


Hs.258397 


ESTs 


10 
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jfl222 Hs ' 41185 ^ s ^e"smRNA;cONADKFZp56401262(f 10 

tryptophan/! (RNA synthetase 2 (mfiochcn ao 

^ fiKf 9 HS ' 283552 ESTs -^^ similar b unnamed protein ao 

*)DUOoU N4UUQ7 ESTs 2 o 



TABLE 62B 



1 /\ Ptey. Unique Eos probeset Identifier number 
1 U CAT number. Gene duster number 

Accession: Genbank accession numbers 
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55 



65 
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75 
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Pkey 
410531 



CAT Number Accession 



411315 
411962 
414279 

414660 

415528 
416128 
416882 

422673 
426448 
430935 
430968 
431304 
431676 
432125 
432363 
437158 



1207200 1 
1225822 1 
1238570 1 
126744J 
143227J 

147525.1 

1539409 1 

157163J 

162718J 

219674 1 
267323 1 
325772J 
326269 1 
331286 1 
336411 1 
341776 1 
345469,1 
43392.5 



437866 44433J 



439047 
439518 
450506 

450580 

452260 

453024 

455473 

456719 

TABLE 62C 



4681 39 J 
47334J 
836J 

83929J 

9074J 

944876_28 

129291 7 J 

222707J 



AW752953 H88044BE156092 
AW809324 BE144977 8E144956 
AW836547 AW836513 AW836587 
AA099050AA099526 T47733 

tSEISl^S? f ? 45431 ^BW«*I741845 M67448B R44190 R52535 R52617 AI220925 AI979148 AI744688 AW242437 AA618148 
AJ983837 AA399623 AI676204 AI420077 N24944 051042 AA282786 AA137264AW236107 AW769 

R17236 R52580F11642 

AA173632 AI174858 AA581361 AI700024AA1 73988 BE165417 AJ366964 

AW241 143 M769Q58 R43272 AW068958 AA210918 AA293774 AI7488 1 5 Al 763294 A1333114 AI277384 AI088297 AI468477 
AI824ffi4AW189606AI631751 Z40749AI984673AI671316AA189024AW235412BE1784 
N59027 AA314694 N53937 R08100 
R06054AA378789AW956453 

BE157283 BE157287 AA502438 
Ai685464 AW971336 AA51 3587 AA525142 
AW972667 AA526539 AI057032 AW167842 
AA534489 AW970240 AW970323 

AJ801 808 AW61 1634 AI393606 AW235356 AW000736 AW468599 AI582546 AA952057 AA523012 AW51 

^!£2! ^SSSS AA024963 M778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 
A^481 AW468444BE185091 AW4^2AA687333AA811830 AA581806AJK6686AI572124AA04 
AW979177 AA846994 AA829672 
W76326AF086341 W72300 

^^}f™ U76833 AW630055 AW471133 C02434 W45237 AW793518 BE0701 12 A1587479 A1624429 AW1 90535 AI446661 AM78772 
AW022667AA528235AA599775AW613820AI435793AW594230AI05ra ^ 

AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA01 0295 BE1 60823 Z25353 AA150883 AW8 

IJSt? 04251 044103 mim? N57463 AW007521 AI479232 T82809 R67109 AW796099 AI833027 A1765395 A1763029 AI758228 
BE326331 AJ934576A1922378AW276431 AI718466 N36566 AA904753 BE464245 AI338752AI 

AW839043 AW839049 AW903372 AW846755 AW846767 AW903368 AW846766 BE146826 AW839056 AW846802 AW 

7 JmSl^-Moo? 4 ?^ AW948021 AW984802 AW984800 AW984799 AW984825 AW984792 AW984821 AW984820 AW 

AW1 96918 AA948267 A1953735 AJ263703 AA319159 AW964436 AJ903440 AW5941 71 AI867447 AW2 



Ptey: 
Ref: 

60 Strand: 

NLpcsllion: 



Unique number corresponding to an Eos probeset 
Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. Dunham I. et at." refers to the 
human chromosome 22/ Dunham I. et aL. Nature (1999) 402:48*495. 
Indicates DMA strand from which exons were predicted. 

Indicates nucleotide positions of predicted exons. 



publicafion entitled The DNA sequence of 



Pkey 


Ref 


Strand 


400479 


8439766 


Minus 


400658 


8118459 


Minus 


400750 


8119067 


Plus 


401197 


9719705 


Plus 


401519 


6649315 


Plus 


401597 


3293210 


Plus 


401744 


2576349 


Plus 


401807 


7331536 


Plus 


402474 


7547175 


Minus 


402802 


3287156 


Minus 


402895 


9957547 


Plus 


403100 


8954402 


Minus 


403389 


9438331 


Minus 


403423 


7105492 


Ptus 


403851 


7708872 


Plus 


404390 


8887030 


Minus 


405141 


8980911 


Plus 


405934 


6758795 


Plus 


406038 


8389537 


Plus 
Minus 


406247 


7417725 



NLposifai 

115386-116348 

73525-73644 

198991-199168,199316-199548 

176341-176452 

157315-157950 

65838-66031 

14595-14751 

152325-152912 

53526-53628,55755-55920,57530-57757 
53242-53432 

85537-85671.8637W6469 

13683-13874 

163415-163634 

6934069615 

22733-23007 

3962440072 

99861-100054 

159913-160605 

3776447877 

4623446461 



721 



WO 03/042661 
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Table 63A fists about 368 genes significantly down-regulated In prostate cancer compared to normal prostate. These were selected from 59680 probesets on the Aflymetrix/Eos Hu03 
GeneChlp array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was greater than or equal to 2.5. The "average" normal prostate level was set to 
the 75 a percentile of normal prostate tissues. The "average* prostate cancer level was set to the 75* percentile amongst tumor samples. In order to remove gene-specific 
background levels of non-specific hybridfeafion, the 1 0* percentile value amongst all the tissues was subtracted from both the numerator and the denominator before the raBo was 
evaluated. 



TABLE 63A: AB OUT 366 GENES SIGNIFICANTLY DOWN-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE 

Pkey: Unique Eos probesel identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Un'tgene Title: Unigene gene title 

R1: Ratio of normal prostate tissue to prostate tumor Bssue 

Pkey 
425932 
407245 
425545 
426752 
400440 
412519 
413778 
420813 
415337 
407013 
433331 
432117 
453863 
431847 
408855 
448059 
403612 
405001 
441490 
435805 
401917 
455649 
450218 
404606 
432326 
459708 
440808 
458339 
454278 
417032 
434352 
439175 
439749 
417364 
400831 
416935 
442082 
452625 
440965 
456815 
430611 
415981 
441040 
442764 
411426 
433081 
401896 
445323 
456895 
417729 
431627 
420721 
449519 
442089 
436781 
402797 
404267 
442931 
418626 
423772 
457136 
436404 
458840 
403649 
435866 
423871 

722 



ExAccn 


Uninene ID 


semenogelin 1 


R1 


M81650 


Hs.1968 


89.4 


X90568 


Hs.1 72004 






N98529 


Hs.158295 


Homo saolerts donsMGC'12401 mRNA.com 


32.2 


X69490 


Hs.172004 


titin 


OQ C 
cu.3 


XB3957 


Hs.63870 


nebuUn 


20.7 


AA196241 


Hs.73980 


tmnnnin T1 cVplptart slrnw 


1^Q 

10.3 


AA090235 


Hs.75535 


myosin, light polypeptide 2, regulatory, 


12.0 


X515Q1 


Hs.99949 


prolsc tin-induced protein 


11.8 


Z44881 


Hs.9012 


CO IS, WtJaniy dJUTUcJ (0 OlDOOU Ulxn-OUHJI 


m 7 


U35637 




gtj.nutnan ncuuiin rnr\iv\, paruai cos 


y.i 


nil wUU 1 *# 






a q 

o.y 


AL036195 


Hs.2909 


protamine 1 


0.0 


X02544 


Hs.572 


orosomucoid 1 




AI791314 


Uc 110QXC 


Qu^xosog iz.yo oirauayene lung carcinorria 


7 A 


lOOUOl 


rtomo sapiens mKNA lor ruAAi/Z7 protein, 


7.1 


AI459021 


ns. i fwzD 


PQTc 
CO IS 


6.8 






Tom At Pvnn 


R A 

D.H 


U58196 




interieukin enhancer binding factor 1 


6.2 


N46901 


Hs 266720 


ESTs 


ft 9 


AW470260 


Hs. 48496 


Colo 


c n 

0.0 


AL050149 




RAM KinHirvi nr-rttatn *\ 
rvMM LBIIUli ry JJlUlcin O 




BE065051 




nVRC1-RTffi13-11fWnn.ni7.rfM Unmn 


«; 7 


R02018 


Ks.168640 




0.0 






Tarn el Pjton 


9.0 


A1280308 


Hs 274381 ' 


dnBiuiiuo-o«nsiuve coiion cnannei ne 


0.0 


AA776881 


Hs 1fiRA19 


Lfnmn cantons rnMA CI I1A117 fie r*\nna u A 

nuntjj sapiens guiyh cuiiij^ ns, clone ma 


0.0 


AK001339 


Hs.7432 


hvnnrhptird nrnioln CI 110477 
iiyuuuiuuccs protein ru \\r\f i 


0.0 


AW976853 


Hs.172843 


ESTs 


f\ 7 

0.£ 


AF217525 


Hs 49002 




3.4 






CO 15 


5.2 






email nwienla nmlaU Y Kntw) 

smau muscie protein, AHinxed 


5.1 


AF086021 


Hs 971113 


ESTs 


s n 


AL009 aHL 


nS.lD/ /Dd 


Unmn Gtmlt%n& mDMA full Innnlh 1...J .nil 

nomo sapiens mKNA tuii lengin insert cuN 


4.9 


N73749 


Hs 777A75 


CO 15 


4.8 


AA190712 






4.U 




Suj410r1u3.11 ou aicgeno nuLB ceu so «m 


X 0 


R41823 


Hs.7413 


CCTc 
CO IS 


4.8 


AA724771 


Hs*61425 


ESTs 


4.7 


AI523646 


Hs.169859 


ESTs 


4.7 


NMJJ13348 


Hs.144011 


potassium inwardly-rectlrying channeJ.su 


4.7 


AA837120 


Hs.156461 


ESTs 


4.6 


R35694 




Qb7g67b04j1 Soares Want brain 1 NIB H 


4.6 


AW449782 


Hs.178803 


ESTs 


4.6 


AJ762254 


Hs.131122 


ESTs 


4.6 


BE141714 




gb:CM)-HT0101-(B1099^32-c04HT010i Homo 


4.6 


Z85986 


Hs.188757 


Homo sapiens, done MGC:5564, mRNA, comp 


4.6 






Target Exon 


4.5 


AW207282 


Hs.213049 


ESTs 


4.5 


A1535663 


Hs.39379 


ESTs 


4.5 


Z43798 


Hs.6777 


ESTs 


4.5 


AW609720 


Hs.265540 


HSPC042 protein 


4.5 


AA927802 


Hs.159471 


ZAP3 protein 


4.4 


W04244 


Hs.49829 


ESTs 


4.4 


AI801500 


Hs.128457 


ESTs 


4.4 


AI914535 


Hs.221377 


ESTs 


4.4 






Target Exon 


4.4 






NM»004348*:Homo sapiens runt-related tra 


4.3 


AI024376 


Hs.150473 


ESTs 


4.3 


AW299508 


Hs.135230 


ESTs 


4.3 


AA306637 




EAP30 subunit of EU complex 


4.3 


AA428240 


Hs.126083 


ESTs 


4.3 


AW968556 


Hs.137240 


Homo sapiens mKNA for partial 3TTTR, seq 


4.3 


AI580296 


Hs.174782 


ESTs, Weakly similar to KIAA1437 protein 


4.3 






Target Exon 


42 


AA704538 


Hs.119740 


ESTs 


4.2 


AA331906 




gb:EST35805 Embryo, 8 week 1 Homo sapien 


4.2 
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438190 


AK001059 




gb:Homo sapiens cONA FU10197 (is, done 


4.2 


400164 






Eos Control 


4.1 


435519 


AI218950 


Hs.1 25461 


hvooihetical nrnfmn PI .11 1 533 


4.1 


425094 


AJ 955956 


Hs.21417 


ESTs 


4.1 


415928 


R46799 


Hs.23966 


ESTs 


4.1 


415746 


AA167670 


Hs.21413 


solute carrier family 1 2, (potasshmvchl 


4^1 


426663 


AJ028767 


H&262603 


ESTs 


4.1 


453752 


AL120800 




gb:DKFZp762E152_r1 762 (synonym; hmel2) 


4.1 


404260 






Target Exon 


40 


453412 


AJ003290 




gb AI003290 Selected chromosome 21 cONA 


4.0 


401411 






ENSP00000247172*:HYPOTHETICAL \2&2 kDa 


4.0 


415925 


H09474 


Hs.202341 


Homo sapiens cDMA FU23573 (is, clone L 


4.0 


402817 


H24185 






4.0 


449233 


BE048401 


Hs.196511 


ESTs 


3.9 


435457 


AA682421 


Hs.59125 


ESTs 


3.9 


444105 


AW1 89097 


Hs. 166597 


ESTs 


3.9 


430610 


AI821465 


Hs 188810 


ESTs Weakh/ similar tn AI I Jfi HUMAN At 11 S 


39 


411320 


AW836646 




ah*PM3-l T0fW7/Klfl1ftnJ1A8-pn$ 1 Tfiftt? Homo 

y O.r IVWl. 1 vwtT/jU 1 ULrUUO-CvO L 1 wJt null ID 


0.0 


409679 


BE250521 




roc hAmnbvi ftefio famih/ momhar A 

i to itotnotog gene icurniy, rn&niDcr m 


1 ft 

J.O 


415386 


Z43087 




gb;HSC13A121 normalized infant brain cDN 


3.8 


446175 


AL03S5S8 


Hs.291 


glutamyl aminopeptidase (aminopeptidase 


0.0 


459707 


AA631362 




yu.upoauu i .;> i HL>i_LAj/-vr_ i fiy i nuiiiu sapiens 


^ ft 

0.0 


406917 


X65964 




oh'H SfioiftrK rmiin npnn 


3.8 


455887 


BE 1541 73 

Dt l\Tt Ifv 




yu.rWl I*n i UOHU-Z.U 1 taiT-vlHW 14 rl 1 UMU nomo 


t ft 


443313 


AI796730 


Hs 55513 


ESTs 


^ ft 
o.o 


452351 


AA025647 




gbze85dG1.r1 Scares JetaLhearLNbHH19W 


3.8 


400499 






C10001fi5fl-QilfifJ7qi94lrpJlNP 039759 11 na 


3.7 


411829 


AW865749 




ab*OV3-Sf\K)03t-1005rj0.18S^D3 KN0091 Homo 


3.7 


408229 


AW176091 




gb;OV0-8T0107-250899-007-b08 BT0107 Homo 


3.7 


427318 


Af 186081 


Hs.1 75783 


zinc trsnsportfir 


3.7 


417620 


RQ2530 


Hs.191198 


ESTs 


3.7 


404660 






C9000841*-j)ni2S54691lablAAH01185 1IAAH0 


3.7 


409144 


AW341187 


Hs.279714 


ESTs 


3.7 


436524 


AA922236 


Hs.221037 


ESTs 


3.7 


459200 


Y09306 


Hs.30148 


hcrneodomairHnteracting protein kinase 3 


3.7 


418833 


AW974899 


Hs.292776 


ESTs 


3.7 


424887 


AA348112 


Hs 259461 


ESTs 


^7 

o.t 


404984 






Target Exon 


3.7 


454549 


AW806910 




gb:QV4-ST0023-160400-172-f04 ST0023 Homo 


3.7 


426736 


AA43161S 


Hs.1 30722 


ESTs 


3.7 


434552 


AA639618 


Hs.325116 


Homo sapiens, done MG&2962, mRNA, comp 


3.6 


438040 


AI222422 


Hs.121846 


ESTs 


3.6 


443618 


T82009 


Hs 300700 


iiypuj icuuaJ piuicin ruituf n 


0.0 


442097 


AW015799 


Hs.1 28474 


ESTs 


3.6 


435186 


All 19470 


Hs 145631 


ESTs 


0.0 


429853 


AB020625 


Hs 225949 


hi rtvmnhifi nJiVft ? 


0.0 


456521 


AW373450 


Hs 286212 


hvmfhptiral nrrrfpJn PI I1179Q 


o.o 


453282 


AK00GD43 


Hs.32922 


hvoolhetical orotein FL120036 
NfyL003105*iHomo sapiens sortilin- related 


3.6 


400749 






3.6 


450295 


A1766732 


Hs.210628 


ESTs 


3.5 


421154 


AA284333 


Hs.287631 


Homo saoiens cDNA FU 14269 fis. dona PL 


3.5 


434378 


AA631739 


Hs.335440 


EST 


o.o 


421342 


AA504749 




yu.ddooikXj.f i nw_VA3^r_V3\xj i nuiiiu sapiens 


o.o 


406704 


M21665 


Hs.929 


myosin hftaw noh/nonHrfa 7 rsnflar mm 


o.o 


429043 


AI824977 


Hs.1 45319 


ESTs 


3.5 


438211 


T08401 


Hs.290856 


ESTs, Moderately similar to AlU4_HUMAN A 


3.5 


405258 






Target Exon 


3*4 


440207 


AJ371978 


Hs 128326 


ESTs 


O. 1 * 


445045 


AI 652676 


Hs 147256 
■ w. iti con 


ESTs 




406177 






Tompt Pyrin 


o.*f 


438535 


L09078 




yu.nmifu sapiens mruNA nay in en i 


OA 


405735 








•iA 
O.** 


445797 


A1253414 




Gb*aa14fD4j(1 Stanley Pmnial NS oool 2 


3.4 


444286 


A1625304 


Hs.190312 


ESTs 


3^4 


442027 


A1652926 


Hs.1 28395 


ESTs 


3\4 


447889 


AW4B9180 




ESTs 


o.t 


405152 






Target Exon 


3.4 


402460 






C1001261*:gi|2695979|emb|CAA70854.1| (Y0 


3.4 


448516 


AW898595 




ab-RC1-NN0073-260400-011-<i09 NN0073 Homo 


5\4 


450586 


AAO10319 


Hs.60389 


ESTs 


3.4 


447600 


AI420990 




ESTs 


14 


406085 






Target Exon 


3.3 


417959 


AW977642 


Hs.291742 


ESTs 


3.3 


418672 


L44284 


Hs.12915 


ESTs 


3.3 


446593 


W79572 


Hs.13277 


hypothetical protein FU22054 


3.3 


446516 


AW996692 


Hs.257557 


ESTs. Weakly similar b 138022 hypotheti 


3.3 


415896 


K08311 


Hs.14822 


ESTs, Weakly similar to 178885 serfne/th 


3.3 


410140 


AL134435 


Hs.247837 


neurexin 3 


3.3 


458539 


AI733837 


Hs.145661 


ESTs 


3.3 


422399 


AW410380 


Hs.1 16056 


mesenchymal stem cell protein DSC43 


3.3 


413381 


BE090690 




gbilC1-BT0720.280300.011-g02 BT0720 Homo 


3.3 


429422 


AK001494 


Hs.202596 


Homo sapiens cONAFU 10632 fis, done NT 


3.3 


428561 


AW973652 


Hs.283105 


ESTs 


3,2 
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433942 
400461 
442640 
438814 
438361 
422482 
407142 
454319 
442907 
456075 
438690 
452570 
408493 
447556 
451632 



AI830851 

AW002425 

H08623 

AW974812 

AW235786 

AA421773 

AW815333 

AI423317 

R45154 

H45377 

AI053501 

AJ010063 

AW970512 

AA059246 

AW937063 

AU38411 

A1681234 

AB037798 

AA961613 
BE048255 

AW293237 
AI219833 

BE165819 
AA352090 
AB040919 
AI418464 
AA236867 



445608 
449017 
415908 
457733 
449340 
428134 
434407 
429973 
418092 
411355 
443366 
422069 
420778 
403451 
451686 
423837 
422731 
450049 
428656 
401278 
418087 
421813 
402490 
448001 
445316 
405150 
413784 
425198 
433224 
421894 
419386 
402422 

435849 BE305242 Hs.16098 
405422 
420543 
410627 
409079 
406600 
408025 
426349 
444576 
430661 
441335 
459647 
435713 
407275 
456103 
435200 
449245 
428132 
443454 
423732 
411944 
432718 
435534 
430348 
410289 
453126 
459421 
441543 
442252 
419254 
443727 
419896 
453898 
415098 
415131 



AA278221 
AA181339 
W87707 

AI692784 

A! 308855 

A1400974 

AC005551 

BE222470 

R34107 

AA699313 

AI364186 

Z39430 

AA670310 



AA421765 

AIQ57494 

AF058056 

AW877139 

AA563943 

AA830927 

AA476915 

AW901618 

AA032155 

AA233154 

AI733014 

AI733395 

AI469453 

225389 

Z99362 

AW003512 

D59687 

D61119 



AI205646 

AA826278 

AA805666 

AI439905 

AA412535 

AW247736 

Al 023763 

N73442 

AA815031 

AW861293 

BE206854 

AI885187 

BE005934 



Hs.200014 

Hs.224142 

Hs^2833 

HsJ»1971 

Hs.195359 

Hs.161008 

Hs.164680 
Hs338439 

Hs^78869 
Hs.343603 



Hs.110293 
Hs275150 

Hs.258509 
Hs.188790 

Hs.127838 



Hs^02037 
Hs.166767 

HsJM7684 
Hs.128003 
Hs.210958 
Hs.190836 



Hs.173344 

Hs.929 

Hs.820S5 

Hs.41767 

Hs.301497 

Hs.182045 

Hs.130714 

Hs.150825 

Hs.321450 

Hs.1 14071 

Hs.72350 
Hs.145903 
Hs.224296 

Hs.133421 
Hs.132183 

Hs.244371 

Hs.117306 

Hs,189225 

Hs.61935 

Hs.61622 

Hs.12532 

Hs^69715 

Hs.129124 

Hs.222760 

Hs.18459 

Hs.232770 



AW272166 Hs.123465 



Hs.147220 
Hs^49596 
Hs. 146217 
Hs.344476 

Hs.101617 
Hs.79707 

Hs. 123593 

Hs.336940 

Hs.46039 

Hs.210710 

Hs.310625 



ESTs. WeaWy similar to ACIDIC PROUNE-R 3.2 

ESTs 3.2 

ESTs 3.2 

ESTs 12 

hypothetical protein MGC10954 3.2 

ESTs 3.2 

gb:GV0$T021546010(W)83^01 ST0215 Homo 3.2 

ESTs 3.2 

ESTs 3.2 

gb:yn99ri03.r1 Scares adul! brain N2b5HB5 3.1 

ESTs, Moderately similar to 21C926QA B c 3.1 

fifin-cap (tetelhonin) 3.1 

gb:EST382593 MAGE resequences, MAGK Homo 3.1 

Target Exon 3 1 

ESTs 3.1 

gb:PM3^)T0037-23129*O01-g11 DT0037 Homo 3.1 

gb:OKFZp434A1229_r1 434 (synonym: htes3) 3.1 

EST 3.1 

WAA1377 protein 3.1 

Target Exon 3.1 

ESTs 3.1 

gbiz49b05.y1 NCI_CGAP_Bm52 Homo sapien 3.1 

Target Exon 3.1 

ESTs 3.1 

ESTs 3.1 

Target Exon 3.1 

ESTs 31 

hypothetical protein FLJ21 213 3.1 

WAA1486 protein 3.1 

ESTs 3.1 

ESTs, Weakly similar to I38022 hypotheU 3. 1 

Target Exon 3.1 

daudln 2 3 1 

ENSP00000216658*:HYPOTHET1CAL 133.5 kOa 30 

ESTs 3.O 

myosin, heavy polypeptide 7, cardiac mus 3.0 

interleukin 6 signal transducer (gp1 30, 3.0 

Target Exon 3.0 

PTD002 protein 3.0 

amjnyltransferase 1 3.0 

ESTs 3.O 

Homo sapiens HSPC323 mRNA. partial cds 3.0 

ESTs, Weakly similar to antigen NY-CO-33 3.0 

pregnancy specific beta-1-grycoprotein 1 3.0 

ESTs 3.0 

gbxjw34ri07.x1 NCLCGAPJM Homo sapiens 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.O 

gtau24g02s1 Soares_NhHMPu_S1 Homo sapi 3.0 

ESTs 2.9 

solute carrier family 16 (mooocarboxyfic 2.9 

gbX)V2-PT0010-160400-13^01 PT0010 Homo 2.9 

ESTs 2.9 

ESTs 2.9 

ESTs, Weakly similar to I38022 hypoiheti 29 

Homo sapiens mRNA; cDNA DKFZp761 1071 (fr 29 

ESTs 2.9 

chromosome 1 open reading frame 21 2.9 

ESTs 2.9 

ESTs 2.9 

ESTs 2.9 

ESTs 2.9 

gb:HSZ99362 DKFZphamyl Homo sapiens cDNA 2.9 

arachidonale lipoxygenase 3 2.9 

gb:HUM056E10B Ctantech human fetal brain 2.9 

gb:HUM158C11B Oontech human fetal brain 29 

Target Exon 2.9 

ESTs 2.9 

Target Exon 29 

ESTs 29 

ESTs 29 

Homo sapiens cDNA: FU23077 fis, clone L 2.9 

gb:C57g08jc1 NCI_CGAP_Lym12 Homo sapien 2.8 

gb:zt99b10.s1 SoaresJesfisJ^HT Homo sap 2.8 

ESTs, Weakly similar to T32527 hypolhefi 28 

ESTs 2.8 

gkyz31h09.fi So3resjnulupJe_sderosls_ 28 

ESTs 28 

Homo sapiens cDNA: RJ20861 fis, done A 28 

phosphogtycerate mutase 2 (muscle) 28 

ESTs,Moo^ratelysflrtotoAUJ6„HUMANA 28 

EST 28 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 
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426481 

401556 

407269 

435754 

433565 

451004 

425223 

405770 

456227 

454445 

419494 

427639 

428079 

406337 

401884 

406881 

428986 

450044 

403630 

445514 

446362 

432492 

430889 

434316 

413155 

433329 

446523 

449923 

453826 

405678 

432789 

455791 

449109 

436255 

415984 

402844 

456666 

414620 

404979 

412318 

424361 

412542 

441975 

457021 

457861 

439204 

415642 

446847 



AW963941 

AJ245210 
M700752 
AA599763 
AA044967 
AA352825 

T84239 

AW749432 

W01060 



Hs.117341 
Hs.1 12520 

Hs.146065 

Hs.189788 

Hs.34382 



AW444530 Hs. 105362 
AA421020 Hs.208919 



016154 

M442884 

R66444 

AI241280 

AW612481 

AW275110 

U22491 

AW411330 

BE067952 

AF015041 

NMJ303063 

BE258051 

AL138129 

D26361 



Hs.201201 
Hs.51891 

Hs.148906 
Hs.104105 
Hs.271106 
Hs.248117 
Hs.118796 

Hs.199291 
Hs.334629 



Hs.3104 



AW270992 Hs.120949 

F01143 Hs.284284 

R19046 Hs.5010 

AA452512 Hs.76719 

AA150120 Hs.244621 



447336 
449045 
422185 
436030 
449589 
436092 
415054 
412908 
409583 
410483 
435383 
426629 
415831 
412281 
434898 
422229 
447518 
458546 
438648 



420833 
453903 
443650 
427419 
423741 
451577 
441358 
402706 
436054 
402749 
442472 
445762 
441420 
405564 



AW935911 

AK001551 

AW961516 

AW173248 

AW968934 

AW450205 

AF087987 

U19878 

T51454 

AI792583 

AW139383 

BEQ72483 

AL1 17530 

R02287 

AW752437 

AI345995 

AI733907 

AA121913 

AW440117 

BE163567 

R61083 

A1203933 

H15145 

AI810054 

AW500458 

AF134414 

TB0061 

AI215667 

AA813125 

AW511049 

R47948 

AW299606 

AI698330 

NMJXXBOO 

AA330362 

N69101 

AW173212 



Hs.326729 

Hs.145944 

Hs.95097 

Hs.344285 

Hs.173108 

Hs.305890 

Hs.42696 

Hs.336224 

Hs.82845 

Hs.135354 

Hs.245437 

Hs.278337 

Hs.1 12822 

Hs.121052 

Hs.135258 

Hs.127383 



Hs.256879 



Hs.97142 
Hs.30509 
Hs.14119 
Hs. 29956 
Hs.113271 

Hs.175044 
Hs.146335 
Hs.202007 
Hs.188732 
Hs.232777 
Hs.151444 
Hs.177888 

Hs.40730 
Hs.129041 



AI076262 Hs.119813 



gb:EST376014 MAGE resequences. MAGH Homo 
Target Exon 

gb:Homo sapiens mRNA for irnmimoglobufin 

ESTs 

ESTs 

gb2J53d09.f1 Soares retina N2b4HR Homo 
Qb:EST60880 Activated T-ceDs XX Homo sa 
NM_002362Hcmo sapiens melanoma antigen, 
ESTs 

gb:RC3-BT038«01299-O1 1-a09 BT0386 Homo 
ESTs 

Homo salens, done MG&18257, mRNA, com 
ESTs 

C14000021:gi|7242973|dbj|BAA92547.lj (AB 
Target Exon 

gbcHuman gene for cytochrome P-450c1 1. e 

ESTs 

ESTs 

C3001708*:gl|4758028|refINP_(S)4360.1| co 

ESTs 

ESTs 

ESTs 

G protein-coupled receptor 7 
annexinAS 

gb:CM0-BT0365461299-122-g09 BT0365 Homo 
numb {Drosophna) homotog-fike 



AW806859 

A1734002 

AA932872 



Hs.264590 



gb:601111Q34F1 NiH_MGC 16 Homo sapiens c 
gb:DKFZp547F152_r1 547 (synonym: MbM) 
CX001454:gi|8393794|reflNP.058681.1| myo 
KIAA0042 gene product 

gb:RC1-eT0720-280300-01 1-f08 BT0720 Homo 
ESTs, Weakly simitar to ALU7_HUMAN ALU S 
zinc finger 1111 

gb.-yg21f11.r1 Soares infant brain 1NIB H 
C1000118*:gi{9951913|reqNP 062832.11 pr 
U6 snRNA^ssociated Sm-!ike protein 
ribosomal protein S14 
Target Exon 

hypolhefical protein MGC11082 

Homo sapiens cDNA FU 10689 fe, done NT 

ESTs 

EST 

Homo sapiens cDNA: FU21897 fe, done H 

BCL24ike1 

EST 

transmembrane protein with EGF-Jike and 
Homo sapiens cDNA: FU21930 fis, done H 
ESTs 
ESTs 

Homo sapiens cDNA FU1 1537 fis, clone HE 

DKFZP434B172 protein 

ESTs 

ESTs, Weakly similar to ALU7 JWMAN ALU S 
ESTs 

gbzo86n09.y5 Stratagene ovarian cancer 
pregnancy-associated plasma protein-E 
ESTs 

gb:QV3-HTO46O-230200-101-b08 HT0460 Homo 

weel (S. pombe) homolog 

ESTs 

ESTs 

ESTs 

WAA0460 protein 

ABO Wood group (transferase A, alpha 1- 

gb:yd22f0&s1 Soares fetal liver spleen 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

histatin 3 

gb|EST34074 Embryo. 12 week D Homo sapi 
ESTs 

Target Exon 
ESTs 

Target Exon 

gb^RO^TOrm081199404<03 ST0020 Homo 
ESTs, Moderately similar to ALU5.HUMAN A 
gbxo57d07.s1 NCLCGAP_Lu5 Homo sapiens 
Target Exon 
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Z7 
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17 
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27 

27 
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27 

27 

27 

27 

27 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 

26 
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26 

26 

26 

26 

26 

26 

26 

26 

26 
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406003 






Target Exon 


26 


459584 


AI910884 


Hs.346429 


ESTs 


26 


441597 


AW135032 


Hs.203625 


ESTs 


26 


411280 


N50617 


Hs.80506 


small nuclear ribonucteoprotein pdypept 


26 


420509 


M83554 


Hs.1314 


tumor necrosis factor receptor superfanti 


26 


445060 


AA830811 


Hs.282908 


ESTs 


16 


436260 


BE172762 


Hs.292710 


ESTs, Weakly sircar to ALUS HUMAN ALU S 


IS 


428717 


T78001 


Hs.133546 


hypothetical protein RJ21120 


2.6 


401716 


W03955 




C16000902:gil403440lgb|AAA7316M| (M817 


26 


416628 




gbza62d04.r1 Soares fetal fiver spleen 


16 


443864 


N37059 


Hs.36250 


ESTs, Weakly similar to I38022 hypoM 


2.6 


440702 


AA904178 


Hs.148233 


ESTs 


16 


456310 


AA225522 




gb:nc25c06.r1 Na_CGAP_Pr1 Homo sapiens 


26 


451255 


AA020857 


Hs.90744 


ESTs 


26 


455737 


BE072246 




gb:OV4-BT0536-271299459-b12 BT0536Homo 


26 


445326 


AI220Q72 


Hs.344672 


ESTs 


26 


408432 


AW195262 




Ob*jcn67b05j(1 NCI.CGAP.CML1 Homo sapiens 


26 


445874 


8E326671 


Hs.170058 


ESTs ■ 


2.6 


441063 


AA913819 


Hs.188025 


ESTs 


26 


455505 


AW970640 


Hs.309071 


ESTs 


2.6 


453491 


AL040177 




gb:DKFZp434F0213.r1 434 (synonym: htes3) 


26 


455048 


AW852749 




gb:PM1-CT0247-08010£W08-h09 CT0247 Homo 


26 


456990 


NM_004504 


Hs.171545 


HIV-1 Rev binding protein 


26 


444130 


AJ125263 


Hs.170410 


ESTs 


25 


422210 


BE269319 


Hs.171937 


steroid dehydrogenase-Uke 


25 


424241 


AW995948 


Ks.162339 


Homo sapiens pyruvate dehydrogenase kina 


25 


451606 


AA018791 


Hs.7945 


AIE-75 binding protein protein 
histamine H4 receptor 


25 


400427 


AB044934 


Hs.287388 


25 


410443 


BE062906 


Hs.28338 


K1AA1546 protein 


25 


455210 


AW866599 




gb^V4-SN0O24-21040O-l81-b10 SN0024 Homo 


25 


429909 


AW977090 


Hs.184360 


CQ-203 prolan 


25 


441191 


AI693930 


Hs.148816 


ESTs 


25 


413489 


BE144228 




gb:MR(WHT016S140200-009-d04 HT0165 Homo 


25 


448215 


N34740 


Hs.6658 


ESTs 


25 


419354 


M62839 


Hs.1252 


apolipoprotein H (bela-2-glycoprotem I) 


25 


447610 


AW296286 


HsJ55534 


ESTs 


25 


450724 


R55428 




gb.-yj79b05.r1 Soares breast 2NbHBst Homo 


25 


414523 


AU076633 


Hs.76353 


serine (or cysteine) proteinase inhibito 


25 


414776 


AA155598 


Hs.212839 


hypothetical protein FU14195; WAA1714 


25 


447730 


AI421251 


Hs.114085 


Homo sapiens mRNA for K1AA1755 protein, 


25 


434152 


AI792665 


Hs.291190 


ESTs 


25 


412671 


AW977734 


Hs^7931 


gb:EST389963 MAGE resequences, MAGO Homo 


25 


456401 


W28146 




gb:43f11 Human retina cDNA randomly prim 


26 


404678 






Target Exon 


25 


408520 


AA225063 


Hs.161614 


ESTs 


25 


411332 


AW837212 




gb:QV2-LT0038-140300-081<01 LT0038Homo 


25 


448920 


AW408009 


Hs.22580 


alkyiglycerone phosphate synthase 


25 


429906 


AU080137 


Hs.193743 


ESTs 


25 


433712 


AF090887 


Hs.306562 


Homo sapiens clone HQ0085 


25 


438353 


BE539951 


Hs.306996 


Homo sapiens, clone IMAGE:3447073, mRNA, 


25 


446224 


AW450551 


Hs. 13308 


ESTs 


25 



TABLE 63B 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


.CAT Number 


Accession 


408229 


10484S2J 


AW176091 H24234 


408432 


1058667J 


AW195262R27868AW811262 


409679 


114787J 


BE250521 AA076837 BE249870 AA984291 AW502442 AW501551 A1221491 AA194239 D63046 AA193426 AA773243 AA193293 


410483 


1204995 1 


BE163567 BE073689 BE073747 BE073780 BE073739 BE073748 BE163495 AW750178 BE163491 BE073763 BE073671 


411320 


1238624 1 


AW836646 AW836580 AW836610 AW836636 AW836603 AW836832 


411332 


1239102 1 


AW83721 2 AW837408 AW837265 AW837380 AW83721 3 AW83741 1 AW837418 


411356 


1240273 1 


H45377H21137AW838640 


411426 


1245515.1 


BE141714 AW845993 AW845989 


411829 


1260309 1 


AW865749BE179419BE179492 


411944 


1266482J 


AW8771 39 AW877135 AW877018 AW991835 AW877128 AW877108 AW877017 AW877107 


413155 


1351148 1 


BE067952 BE067945 BE067942 BE067943 BE067949 BE067954 BE057944 BE067953 BE067956 BE067946 


413381 


1365950 1 


BE090690 BE090688 BE090881 BE090693 BE090675 


413489 


1373392.1 


BE144228BE144291 


415054 


151827 1 


AI7339O7AA159708A1732614 


415098 


1522174 1 


059687 D59694 D59656 D59589 


415131 


1523680 1 


D61119 D81508 081734 


415386 


1535560 1 


Z43087 F07410 H15506 H54108 R95033 H98000 


415981 


1564242.1 


R35694H12035 R53312 


416628 


1604848 1 


W03955 H82332 H69247 H72486 


4t6935 


163179.1 


AA190712 AA190665 AA252554 


419386 


184356.1 


AA236867 AA237066 AA354236 AW957759 H08961 


419896 


t888662J 


Z99362 299363 


420778 


196389.1 


AW970512AA280251 AI652287 BE466438 AI650725AA551854 AA281574AW571481 
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421342 
421813 
422731 
423741 
423772 



423871 
426481 
428132 
431847 
434407 
435383 
436190 
438535 
441420 
442472 
445797 
447518 
447600 
448516 
449923 
450724 
451004 
452351 
453412 
453491 
453752 
453826 
454445 
454549 
455048 
455210 
455649 
455737 
455791 
455887 
456075 
456310 
456401 



201427.1 
207654J 
220507.1 
231582.1 
23188 1 



232749J 

267878.1 

287430.1 

338402.1 

385744.1 

405360.1 

41555.1 

45946.1 

516748.1 

543371.1 

650943.1 

724787.1 

728288 1 

766241 1 

81926J 

844565.1 

85453 1 

91233 1 

966264.1 

969172.-1 

979899.1 

982669.1 

1204468.1 

1223789.1 

1250599.1 

1260650.1 

1348708.1 



1365954.1 
1380836.1 
1476756.1 
177089,1 
1844649.2 



AA504749AA287498 
BE048255AA313083AA298419 
AL138411 AL138412AA315860 
AA330362 AW962525 H87796 

AA306837 NM.0O7241 AF156102 BE388339 BE614027 AW935016 AA315261 AW376416 AW304951 AI859820 AA367436 AW516957 BE614519 
AI4531 52 AI453149 AI4531 39 AW1 68378 AI139491 AI538358 AW468227 AI680027 AW090513 AA662830 F30995 AI351 985 AI424349 AW009599 
C02215AI6525 
AA331906AA332484 

AW963941 AW963944 AA379825 AA379564 
AA421765 AA456076 AI290275 AA455579 
AI791314 AI791434 AA516511 
AW815333 AW815409 AA632563 
R61083 R13743 AA679174 AA679193 Z42903 
AK001059AA633055 

L09078 L03145 L09094 L09098 L03165 L09102 
AA932872 W28068 W28643 T96110 
AW806859 AW806852 AF049582 
AI253414AI366014 R34822 
T80061 AI382804 
AW20990AI399725AM01757 

AW898595 AW898588 AW898590 AW898663 AW898592 AJ525093 

BE258051 R45758AA004732BE255126 

R55428 A1820704 A1732283 R54983 

AA044967 H86327 AA0 13079 AA058776 BE242713 AA019987 

AA025647 R45716AW753786 

AJ003290 AJ003288 AW276947 

AI040177 

AL1 20800 BE378580 
AL138129AL138179BE064231 
' AW749432 AW749434 R47332 

AW806910 AW866461 AW866396 AW866373 AW866611 AW866616 
AW852749 AW852755 AW852620 
AW866599 AW866294 AW866468 AW866467 

BE065051 BE155165 BE064764 BE155231 BE064648 BE064671 BE064636 
BE072248 BE072229 BE072225 BE072210 BE072221 BE072256 BE072211 BE072242 
BE090689 BE090685 BE090697 BE090680 BE090591 BE090696 BE090698 BE090686 
BE154173BE154099 BE154096 
N73442 R98100 BE410380 

AA225522 AA225465 AI820979 AW973985 A1791935 AA558735 
W28146W28187 



TABLE 63C 



Pkey: Unique number corresponding to en Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et aL" refers to the purification entitled The 0NA sequence of 

human chromosome 22." Dunham L et at., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLPQsition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLpositlon 


400461 , 


9929654 


Pfus 


32727-32846,3292943051 


400499 


9796071 


Minus 


148495-148806 


400749 


7331445 


Minus 


9162-9293 


400831 


8576271 


Minus 


56502-57407 


401278 


9799936 


Pius 


98428-98573 


401411 


7799787 


Minus 


144144-144329 


401656 


9100664 


Minus 


1-382 


401716 


6715703 


Plus 


174722-174911 


401884 


B140731 


Minus 


89182-90053 


401896 


8569194 


Phis 


115129-115294 


401917 


9502466 


Plus 


25054-25229 


402422 


9796344 


Minus 


3284343008 


402460 


9796884 


Minus 


108901-109254,110246-110581,113613-113960 


402490 


9797648 


Plus 


149982-150929 


402706 


8894426 


Minus 


148640-148805 


402749 


9212740 


Minus 


68787-68882,76602-76768 


402797 


3421043 


Minus 


15758-15930 


402817 


6822166 


Minus 


48811-49012 


402844 


9369286 


Phis 


5495*55313 


403451 


9838240 


Plus 


77382-78300 


403612 


8469060 


Minus 


94723-94859 


403630 


8569999 


Minus 


13909-14466,15251-15760,16898-17431,4174242440 


403649 


6705159 


Minus 


27141-27247 


404260 


9366879 


Plus 


51396-51513 


404257 


9581792 


Minus 


12209-12313,18241-18397 


404606 


9212936 


Minus 


22310-23269 


404660 


9797068 


Plus 


168215-166916 


404678 


9797204 


Plus 


115196-115448 


404979 


4160139 


Minus 


87762-88217 


404984 


6939882 


Plus 


87221-67505 


405001 


6015406 


Minus 


104646-104819 


405150 


9929758 


Plus 


126475-126773 
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405152 


9965561 


Minus 


137662-137969 


405258 


7329310 


Phis 


129930-130076 


405422 


7243869 


Minus 


101938-102079,102261.102443,102896-103202 


405564 


2114222 


Minus 


16766-17344 


405676 


4079570 


Plus 


151821-152027 


405735 


9931101 


Minus 


29854-29976 


405770 


2735037 


Plus 


61057-62075 


406003 


8247800 


Phis 


42079^2516 


406085 


9123888 


Plus 


18655-18843 


406177 


7279760 


Minus 


18930-19148 


406337 


9213455 


Plus 


90117-90337 


406398 


9256276 


Mnus 


118691-118959 


406600 


8248616 


Minus 


36296-36610 



Table 64A lists about 703 genes up-regulated in 8PH compared to norma) adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array 
such thai (he ratio of 'average - BPH to •average* norma! adult tissues was greater than or equal to 3.0. The 'average' BPH level was set to the 75* percentile amongst BPH tissues. 
The 'average' normal adult tissue level was set to the 85° percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, 
the 10* percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

TABLE 64A: ABOUT 703 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unlgene Title: Unigene gene title 

R1: Ratio of BPH tissue to normal adult body tissue 



Pkey 


ExAccn 


Unigene ID 


Unigene Title 


R1 


420154 


AI093155 


Hs.95420 


JM27 protein 


49.6 


426747 


AA535210 


Hs.1 71995 


kaliikrein 3, (prostate specific antigen 


47.2 


419526 


AI821895 


Hs. 193481 


ESTs 


43.6 


432441 


AW292425 


Hs.163484 


ESTs 


42.7 


431616 


AA508552 


Hs.195839 


ESTs, Weakly similar to I38022 hypotheti 


30.2 


407202 


N5B172 


Hs. 109370 


ESTs 


26.1 


432101 


A1918950 


Hs. 123642 


EphA3 


25.8 


400299 


X07730 


Hs.1 71995 


kaDikretn 3, (prostate specific antigen 


25.5 


425075 


AA506324 


Hs.1852 


add phosphatase, prostate 


24.6 


414569 


AF 109298 


Hs.1 18258 


prostate cancer associated protein 1 


24.4 


410929 


H47233 


Hs.30643 


ESTs 


21.1 


400287 


S39329 


Hs.1 81350 


kaliikrein 2, prostatic 


20.3 


446057 


A1420227 


Hs.149358 


ESTs, Weakly similar b A46010 X-iinked 


19.8 


423073 


BE252922 


Hs.1 231 19 


MAD (mothers against decapentaplegic, Or 


18.6 


415989 


A1267700 




ESTs 


17.8 


428336 


AA503115 


Hs.183752 


rrttcroserranoprotein, beta- 


17.3 


450693 


AW450461 


Hs.203965 


ESTs 


16.7 


400298 


AA032279 


Hs.61635 


six transmembrane epithelial antigen of 


16.7 


407168 


R45175 


Hs.117183 


ESTs 


15.5 


408369 


R38438 


Hs.182575 


solute carrier family 15 (H??? transport 


15.5 


454119 


BE549773 


Hs.40510 


uncoupling protein 4 


14.5 


428819 


AL135623 


Hs.193914 


K1AA0575 gene product 


14.5 


400302 


N48056 


Hs.1915 


folate hydrolase (prostate-specific memb 


14.4 


400297 


AI127076 


Hs.306201 


hypothetical protein DKFZp56401278 


14.3 


418432 


M14156 


Hs.85112 


insuIin-Cke growth factor 1 (scmatomedi 


14.2 


432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to 138022 hypotheti 


14.2 


433444 


AW975324 


Hs.129816 


ESTs 


13.8 


432240 


A16947S7 


Hs.129179 


Homo sapiens cDNA FU 13581 rts, clone PL 


13.5 


428342 


AI739168 




Homo sapiens cDNA FU 13458 Ms, clone PL 


\23 


401424 






NM-001 172:Homo sapiens arginase, type II 


12.7 


432435 


BE218886 


Hs.282070 


ESTs 


12.5 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF4ke and 


12.3 


446100 


AW967109 


Hs.13804 


hypothetical protein dJ462023.2 


12.0 


425154 


NM_001851 


Hs.154850 


collagen, type IX, alpha 1 


11.9 


407275 


AI364166 




gtKqw34h07jc1 NCLCGAP.UI4 Homo sapiens 


11.8 


452340 


NM.002202 


Hs.505 


ISL1 transcription factor, UM/homeodoma 


11.7 


432473 


AJ202703 


Hs.152414 


ESTs 


11.3 


410330 


AW023630 


Hs.159425 


ESTs 


11.2 


431679 


AW971082 


Hs.222886 


ESTs, Weakly similar to TRHY.HUMAN TRtCH 


11.2 


452792 


AB037765 


Ks.30652 


WAA1344 protein 


11.2 


418848 


AI820961 


Hs.193465 


ESTs 


10.9 


400292 


AA250737 


Hs.72472 


BMP-R1B 


10.9 


433647 


AA603367 


Hs.222294 


ESTs 


10.8 


453160 


A1263307 


Hs.239884 


H2B histone family, member L 


10.8 


409262 


AK00C631 


Hs.52256 


hypothetical protein FU20624 


10.6 


431474 


AL133990 


Hs.190642 


CEGP1 protein 


10.3 


429220 


AW207206 




ESTs 


10.3 


428134 


AA421773 


Hs.161008 


ESTs 


10.2 


408622 


AA056060 


Hs.202577 


Homo sapiens cONA FIJI 2166 fe, clone MA 


10.1 


456497 


AW967956 


Hs.123548 


ESTs, WeaWy sirrilar to AF1084S) 1 uto 


9.8 


434792 


AA649253 


Ks.132458 


ESTs 


9.7 


433466 


AA50S353 


Hs.105314 


relax* 1(H1) 


9.5 


439176 


AI446444 


Hs.190394 


ESTs, WeaWy similar to 828096 fine-1 or 


9.5 


428398 


AJ249368 


Hs.98558 


ESTs 


9.4 



728 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



429290 


AF203032 


Hs 198760 


nBuiuQus i CuL flcaVj poiypcputjo 


9.3 


407709 


AA456135 


Hs. 23023 


ESTs 


Q 9 
9.0 


415293 


R49462 


Hs. 106541 


ESTs 


9.1 


429918 


AW873986 


Hs.113383 


ESTs 


9.1 


440260 


AI972867 


Hs,7130 


copineiV 


9.1 


453096 


AW294531 


Hs.11325 


ESTs 


q i 


410467 


AF 102546 


Hs. 63931 


dachshund (DrosopWta) ticmoJog 


q q 


456516 


BE172704 


Hs!222746 


WAA1 610 prolan 


8.6 


430722 


AW968543 


Hs.203270 


ESTs, Weakly similar to ALU 1 .HUMAN ALU S 


8.8 


458072 


AI890347 


Hs^71923 


Hnmn eantpne H*)MA* PI I977ftft fie Hnrtn K 


ft R 
0.0 


453005 


AI362575 


Hs.303171 


ESTs 


ft 7 

0./ 


418339 


AA639902 


Hs.104215 


ESTs Morterrfplv similar tn 9PPN HUMAN <i 


ft7 

0.' 


420424 


AB033Q36 


Hs.97594 


iwmu iu proieui 


fi 7 

0./ 


450542 


R39773 


Hs.7130 


copine IV 


ft 7 
Cf 


434036 




U- 1 077 99 

ns. is/roo 


hypothetical protein MGC2849 


o.f 


428927 


AA441837 


Hs 90250 


ESTs 


ft ft 

O.D 


418829 


AAft1fift31 
rWO 1030 1 


Lie CCQQQ 


na nomeooox [urosopniiaj, tamny 3, a 


8.5 


420345 


AW295230 




co is 


o.o 


446336 


fWYOIOUOO 


Me 

ns. 19 1401 


cols 


8.4 


419743 


AW408762 


Hs.5957 


Hnmn eaniflne rinnp 9441 fi mRMA eoni i ft nrn 




428728 


NM 016625 


Hs. 191381 


hvnflJhfifirj^l nrnlpin 


8.3 


417169 


R13550 


Hs.246773 


ESTs 


ft 9 


453387 


A1990741 


Hs.252809 


ESTs 


8.2 


434666 


AF151 103 


Hs 1 1 2259 


1 veil Icwepwi yoNUIld KJUUo 


ft 1 

0. 1 


417958 


AA767382 


Hs.193417 


ESTs 


8,0 


433433 


AI692623 


Hs.121513 




7 Q 


444609 


AW571659 


Hs.278081 


ESTs 


7.8 


424099 


AF071202 


Hs 139336 




7 ft 

1.0 


433923 


AI823453 


Hs. 146625 


ESTs 


77 


437587 


AI591222 


Hs.72325 


Mirman DMA efintroni*o fmm /4nna PP1 1fl7 11 4 
nuindn urw sequence irain aon8 r\r i* iof J U 


7 7 
l.l 


441690 


R81733 


Hs 33106 


ESTs 


7 ft 


453370 


AI470523 


Hs.139336 


ATP-hindina c%snttp entvfpmifv C. fHPTR 


7.6 


441247 


AW1 18681 


Hs. 128051 


Mnmn eantpne fhvmrr sfmmal hrmnhnnnfRfi 


7.5 


439444 


AI977fiM 


He ftAJ»7R 


co i s, vveawy similar to uouzz nypouicu 


7 C 
1.0 


431448 


AL137517 


He 3nfi901 


himnlhofWH nmloln n^C7riCC/ni 77Q 

nypwieucai piuttan ut\r£poovji£.i o 


7 A 


413597 


AW302885 


Hs 117183 
no. i 1 1 i o<j 


ESTs 


7 A 


404592 








7.3 


409557 


BE 182896 


Hs.211193 

no»4 ill 9v 


ESTs 


7 9. 
/.0 


420352 


BE258835 




ab-601 1 17374F1 NIH MCC 16 Hnmn sflnipne r 


7.3 


438231 


AW594539 


Re I'tffiftQ 


PQTe 
CO 15 


7 9 
1.0 


440529 . 


AW207640 


He 1fi47fl 


Mfimrt eaniQne rniJA* PI 19171ft fie rlnna P 


7 1 


449300 


Al 556959 


Hs.346514 


ESTs 


7 1 


417332 


AW972717 


Hs.288462 


hvoolhptif^l nmfptn PU91S1 1 

iijpwuicuua pjvlCBI < U£ lull 


7.1 


453930 


AA419466 


Hs 3B797 




7 < 


449249 


T52285 


Hs.193115 


Homo saniens mRMA for KIAA1 7ft4 rtmtpin 


7.0 


430187 


AJ 799909 


Hs 158989 

• i %rvouo 


ESTs 


7.0 


434217 


AW014795 


Hs.23349 


ESTs 


7.0 


431217 


NM^013427 


Hs 250830 


Rho R I Pasfl nrtiualrnn nmfpJn R 


6.9 


430188 


AL049242 


Hs.234794 


Homo saolens mRMA- cDNA DKP7o564RnR3 {ft 


6 Q 


415786 


AW419196 


Hs^57924 


hvrvilhpflra! nmtpln PI I1t7ft9 


o.a 


421059 


Aifift4i33 

WWS IOO 


He 9rt91 9 


thyroid receptor interacting protein 15 


c o 
o.o 


426384 


A1472078 


Me TmRRO 
ns.OUODOc 


PQTe 


<: fi 

0.0 


458509 


AA654650 




ESTs 


ft R 
0.0 


449821 


AIR71141 


Me 91 1 1 90 


CCTe 

Co is 


C Q 

0.8 


407118 


AA1567Q0 


Me 9fi?fl3fi 


PQTe Woakru clmtlar 7994 Ml IUAKI 71 MP 
CO 1 S, vVcaniy Similar 10 tZZJ_nUIWMN 


ftft 
O.O 


443250 


AI041530 


He 1391P7 
ns. i j£ lur 


ESTs 


fi7 
0./ 


428330 


L22524 


Hs.2256 


matnx metaUoptotBinase 7 (matnlysin, 


0./ 


408197 


AA282262 


Hs.1 07410 


ESTs WeakJvarnibff to A46Q10 XJinkpii 


6.6 


454457 


AW753456 




oh-OV2-CT0?B1-?S1OQ<Ul1 1^111 HTOTfil Hnmn 


6.6 


421154 


AA284333 


Hs 287631 


Mnmn eaniAne rQWA PI I1A9RQ fie Hnno Pt 


ft ft 
o.o 


429433 


AA452899 


Hs 213586 


ESTs. WepJdv similar to K1AA1 3*J3 nmtpfn 


6.5 


431467 


N71831 


Hs 25639ft 


Mnmn snninne mRMA* tflMA nKF7n4j)4pnc9ft /f 


ft ^ 

0.0 


424433 


H04607 


Hs.9218 


ESTs 


6.5 


450184 


AJ239923 


Hs.63931 


ESTs 


ftft 
o.o 


423566 


AW976434 


Hs.3623 


hvanlhpfieal nmMn PLH199Q 


6.5 


450497 


K64159 


Hs 15328 


ESTs 


6.5 


425312 


AA354940 


Hs.1 45958 


ESTs 


6.4 


426140 


AP13179B 

nr ivi i90 


Re 9497ftfi 
nj».o*tjf do 


nomo sapiens aone rnruv\ sequence 


ft A 


440911 


AA909536 


Re 143569 


PQTe 
CO IS 


R4 
0.1 


412350 


AJ 659306 
Al 821085 




nfnfeln K/merna nhftenhataeA nnn_m^ont 

piuusn lyiueow pnospnoioSci noirrewpi 


fi 1 

O.J 


432600 




atn»0>5a12v5NPJ PAAP Pr3 Hnmn enniens 


6.3 


435375 


AI733610 


Hs.187832 


ESTs 


6\3 


416370 


N90470 


Hs.203697 


ESTs, Weakly similar to 138022 hypotheti 


6.3 


431359 


AW993522 


Hs.292934 


ESTs 


6^ 


424736 


AF23C877 


Hs.152701 


mlcrotubu Wnterading protein that ass 


6.1 


403667 






Target Exon 


6.1 


424846 


AU077324 


Hs.1832 


neuropeptide Y 


6.1 


433569 


AW602166 


Ks.222399 


CEGP1 protein 


6,1 


436396 


AJ683487 


Hs.152213 


wingless-type MMTV integrafion site tarnJ 


6.1 


448004 


AW451477 


Hs.257456 


ESTs 


6.1 


431770 


BE221B80 


Hs.268555 


S'-S* exoribonudease 2 


6.0 


453861 


AI026838 


Ks.30120 


ESTs. WeaWy simaar to NUCU_HUMAN NUCUE 


6.0 


415621 


AI648602 


Hs.55468 


ESTs 


6.0 
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436873 


N23874 


Hs.50477 


RAB27A, member RAS oncogene family 


6.0 


442481 


N99828 




gb2a32c04.r1 Soares fetal Over spleen 


6.0 


432527 


AW975028 


Hs.102754 


ESTs 


5.9 


447156 


AW274731 


Hs.157920 


ESTs 


5.9 


404003 






Target Exon 


5.9 


424853 


BE549737 


Hs.132967 


Human EST clone 1 22887 mariner transpose 


5.9 


427726 


AI359144 


Hs.143688 


Homo sapiens cDNA: FU23031 fis, clone L 


5.9 


433043 


W57554 


Hs.125019 


lymphoid nuclear protein (LAF-4) mRNA 


5.9 


438138 


R98299 


Hs.177502 


ESTs 


5.9 


440129 


AA865818 


Hs.1 74936 


ESTs, Weakly similar to S71886 Ste2043c 


5.8 


434973 


AW449285 


Hs.313636 


EST 


5.8 


408527 


AL135018 


Hs.33074 


Homo sapiens, clone IMAGE3606519, mRNA, 


5.8 


459241 


AA032276 


Hs.99010 


ESTs, Moderately similar to T14342 NSD1 


5.8 


434485 


AI623511 


Hs.1 18567 


ESTs 


5.8 


429716 


R25685 


H&211933 


collagen, type XIII. alpha 1 


5.7 


443960 


A1093577 


Hs.255418 


hypothetical protein FU21986 


5.7 


453200 


AA033832 


Hs.212433 


ESTs 


5.7 


428002 


AA418703 




gb2v98c01s1 Soares_NhHMPu_S1 Homosapi 


5.7 


432956 


AA650114 


Hs.325198 


ESTs 


5.7 


448148 


NMJ)16578 


Hs.20509 


HBV pX associated protein-8 


5.7 


420948 


AB016898 


Hs.100469 


myeloWryrnphoid or mixed-lineage leukem 


5.7 


425810 


AI923627 


Hs.31903 


ESTs 


5.6 


414312 


AA155694 


Hs.191060 


ESTs 


5.6 


404571 






N^015902*:Homo sapiens progestin Induce 


5.6 


424850 


AA151057 


Hs.1 53498 


chromosome 18 open reading frame 1 


5.6 


439092 


AA830149 




gfcoc44rMs1 NCLCGAPJ3C81 Homo sapiens 


5.6 


427078 


AI676062 


Hs.1 11902 


ESTs 


5.6 


422805 


AA436989 


Hs.121017 


H2A histone family, member A 


5.6 


417511 


AL049176 


Hs.82223 


chordin-likB 


5.6 


449625 


NMJJ14253 




odz (odd Oz/lerwn, Drosophila) hornotog 1 


5.6 


433927 


AI557019 


Hs.1 16467 


small nuclear protein PRAC 


5.5 


441876 


BE564206 


Hs.49889 


ESTs 


5.5 


421513 


X00949 


Hs.105314 


relaxin1(H1) 


5.5 


432682 


AI376400 


Hs.1595B8 


ESTs 


5.5 


435021 


AAS22192 


Hs.54709 


ESTs 


5.5 


454171 


AW854832 




gbK!V2-CT0261-201099O11-fl)5CT0261 Homo 


5.5 


426581 


AB040956 


Hs.13S890 


KIAA1 523 protein 


5.5 


439079 


AF085937 


Hs.38348 


ESTs 


5.5 


443635 


A1080230 


Hs.134214 


ESTs 


5.5 


400080 






Eos Control 


5.5 


410196 


AI936442 


Hs.59838 


hypothetical protein FU10808 


5.5 


436578 


AI091435 


Hs.134859 


ESTs 


5.5 


424051 


AL1 10203 


Hs.138411 


Homo sapiens mRNA; cDNA DKFZp586J1922 (f 


5.4 


450325 


AI935962 


Hs.26289 


ESTs 


5.4 


421863 


AI952677 


Hs.108972 


Homo sapiens mRNA; cDNA DKFZp434P228 (fr 


5.4 


432161 


AK000400 


Hs.341181 


ESTs, Weakly similar to envelope [H.sapi 


5.4 


435072 


AW592176 


Hs.1 16932 


ESTs 


5.4 


428647 


AA830050 


Hs.124344 


ESTs 


5.4 


441111 


Al 806867 


Hs.126594 


ESTs 


5.4 


433087 


AI720686 


Hs.152520 


ESTs 


5.3 


450244 


AA007534 


Hs.125062 


ESTs 


5.3 


400294 


N95796 


Hs.278695 


Homo sapiens proslein mRNA, complete cds 


5.3 


415539 


A1733881 


Hs.72472 


BMP-R1B 


5.3 


437267 


AW511443 


Hs.258110 


ESTs 


5.3 


441916 


AA993571 


Hs.129075 


ESTs 


5.3 


452784 


BE463857 


Hs.151258 


hypothetical protein FU21062 


5.3 


446715 


AI337735 


Hs.1 7391 9 


ESTs, Moderately similar to ZN91JWMAN Z 


5.3 


452531 


AA429462 


Hs.293946 


ESTs, WeaWy simitar to 138022 hypolhefi 


5.3 


426503 


AA380153 




gb:EST93093 Skin tumor 1 Homo sapiens cD 


5.3 


423101 


M83941 


Hs.123642 


EphA3 


5.3 


445704 


A1493742 


Hs.167700 


ESTs, Moderately similar to 138022 hypot 


5.3 


450813 


A1739625 


Hs.203376 


ESTs 


5.3 


409079 


W87707 


Hs.82065 


interleukln 6 signal transducer (gp130, 


5.2 


415890 


H08225 


Hs.268712 


ESTs 


5.2 


454968 


AW849046 




tfxIL3^T0214-l5030(W85-H06 CT0214 Homo 


5.2 


441054 


AA913591 


Hs.126480 


ESTs 


5.2 


408385 


AF055634 


Hs.44553 


unc5 (Celegans homoiog) c 


5.2 


437752 


AA767376 


Hs^91631 


ESTs, Moderately similar to S65657 alpha 


5.2 


448072 


A1459306 


Hs.24908 


ESTs 


5.2 


419536 


AA603305 




gb:np12d11.s1 NCLCGAPJT3 Homo sapiens 


5.2 


421040 


M715026 


Hs.135280 


ESTs 


5.2 


446495 


D60923 


Hs.1 53460 


ESTs 


5.2 


415263 


AA948033 


Hs.130853 


ESTs 


5.2 


426748 


ALM8409 


Hs.97177 


ESTs, Weakly similar to ALU1.HUMAN ALU S 


5.2 


436338 


W92147 


Rs.118394 


ESTs 


5.2 


445238 


AA883971 


Hs.187506 


ESTs 


5.1 


450382 


AA397656 


Hs.60257 
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Homo sapiens mRNA; cDNA DKF2p727C191 (fr 


3.6 




423349 


AFO 10258 


Hs.1 27428 


horooo box A9 


3.6 




426917 


AA913814 


Hs. 172854 


DKFZP586B0923 orolsin 


3.6 




429430 


AI381837 


Hs.155335 


ESTs 


3.6 


10 


433563 


AI732637 


Hs.277901 


ESTs 


16 


425465 


11 8964 


Hs.1904 


protein kinase C, iota 


3.5 




452498 


AK000101 


Hs.29700 


hypothetica! protein FU 20094 


3.5 




401132 






C12000517*^ij4758712lref|NP_004659.1| a 


3.5 




421105 


AA766501 


Hs 125113 


ESTs 


o.o 


15 


435177 


AI018174 


Hs.42936 


ESTs 


3.5 


449343 


A1151418 




(ffuuaii piiw|Mioui5<3 o \tuiiiicny £Oj, cm 


0.3 




457653 


AI820719 


Hs 154662 


Hnal fHcf\4fl1 hnnWtvi ejihfamthi A momhfl 
ui icw ina^Huj iimiwiuy, suuiqiiuij t\ uiciiiuo 






419563 


rv%3£Q£03 




Unmrv contort* i»nMA CI fi« rinna UC 

nomo sapiens cuna ru nyoo ns, ctone nt 


0.0 




419994 


AA2828A1 


ro. ouuor 


CCTr 

CO 1 5 


J.D 


20 


427304 


AA7fi15?f\ 


He. IfiWtt 
no. iuoo30 


CO IS 


*o 


434763 


AA648618 






3.5 




447497 


AW1 67254 




ESTs 






449988 


AW372068 


Hs.201420 


ESTs Moderatelv similar to ALU7 HUMANA 


3.5 




422321 


AA906427 


rid. id i uoj 


hvnnfhpltrol nirAnm MfirHTQR 
iiypoinsucaj pruiBin ivhjv>i i£oo 


0.3 


25 


418723 


AA504428 


Hs, 10487 


Hnmn Kanrpnc: pJnnp IMAfiP ^CK41^9 mRNA 


0.3 


438680 


AA906121 


Hs.173421 


ESTs 


35 




452903 


Al 953425 


Hs.345291 


ESTs, Weakly similar to 138022 hypotheti 


as 




443273 


Al 042063 


Hs.132156 


ESTs 


3^ 




403510 






Target Exon 


as 


30 


435681 


AA694192 


Hs. 148979 


ESTs 


3.5 


451722 


H86374 


Hs.40861 


ESTs 


3.5 




435981 


H74319 


Hs.1 88620 


ESTs 


is 




449845 


AW971183 


Hs.9683 


DnaJ (Hsp40) homotog, subfamily C membe 


3.5 




436024 


AI800041 


Hs 190555 


ESTs 


35 


35 


429643 


AA455889 


Hs. 167279 


FYVF-ffnnflr-mntaininn Rshft pfffvlrw run 


3.5 


443324 


R44013 


Hs 164225 


ESTs 


3.5 




458332 


AI000341 


Hs.220491 


ESTs 


3.5 




435688 


H72286 


Hs 128387 


ESTs 


14 




435047 


AA454985 


Hs.54973 


cadherin-like protein VR20 


3.4 


40 


432606 


NM.002104 


Hs.3066 


njanzyme K (serine protease, Qranzyme 3; 


3.4 


409047 
404848 


AW961434 




ESTs 

ENSPfl00M24fl769* , BG153O3 1 f similar to P. 


o.*« 
34 




412494 


AL133900 


Hs.792 




3.4 




417435 






caiuonjc annyurase 111, musue spsuuc 


O.Q 


45 


430403 


AF039390 


Hs 241382 


tttmnr rwrn<;ic fa'trtr ninnmf\ etinorfami 

UJlllvJ IRJWUMD WUIU ^UytDHJJ BUpBlMIIH 


3.4 


436340 


R42246 


Ur oicnc 
ns.£iouo 


CQTe 
Co IS 


0.4 




441596 


AA939300 


Hs 206758 


ESTs 


3.4 




442231 


W02434 


Hs.222413 


ESTs 


3.4 




447124 


AW976438 

of f wi \rtvv 


Hs.1 7428 






50 


449517 


AW500106 


Hs. 23643 


<«prvtpAhfpnntrtA rtfnfpin Irinaep MASK 


3.4 


450297 


AW901347 


He lftW9 




^ A 




453682 


T79703 




ah*vH71pfiR r1 RnarM fptnl Hvpt enlpan 


3.4 




456995 


T89832 


Hs.170278 


ESTs 


14 




444324 


AI301330 


Hs.143838 


ESTs 


3.4 


55 


400379 


NM.018432 




HnmA fiiif^pnQ m/flrian canror rolataH nmf 
nmiiu oopiciid wVoiioii uoms ickjiuu uiui 


T4 

O.H 


419964 




Re 990Q11 


ESTs 


o.*» 




424026 


AI798295 


no. io# of o 


lUXAWidJ piuisiii uw pscuQogerto i 


0.** 




427033 


AI457449 


H<5 192817 


ESTs 


?4 
0.»t 




453942 


AW190920 


Hs.19928 


hypothetical protein SP329 


3.4 


60 


431421 


AW969118 


Hs.1 081 44 


EST s WaaWv filmflar In nnn^mpd nmtoin 


3.4 


408784 


AW971350 


H<5 633flfi 


ESTs 


?4 

O.H 




420184 


AA188408 




hvrvifhsKral nmfo!n 






420721 


AA9278Q2 


Hs 159471 


7AP^ nmfptn 






422728 


AW937826 


Hs 103262 


CCTc Wpakfv ^milartn 7WQ1 HUMAN 71 MP 


O.H 


65 


436703 


AW880614 


Hs. 14 6381 


^WA bindinQ mo^f protein^ X ctvomosome 


3.4 


429165 


AW009886 


Hs.1 18258 


prostate cancer associated protein 1 


3.4 




427674 


NMJJ03528 


Hs.2t78 


H2B histone lankly, msnibef Q 


3.4 




412505 


AA974491 


Hs.21734 


P<?Tc 

COlO 


0.** 




431408 


AA504757 


Hs.105738 


ESTs 


3.4 


70 


438801 


AA825971 


Hs.124284 


ESTs 


14 
o.t 


445432 


AV653771 




nh'AVR 53771 Gl C Hnmn <?3mpir<; rHMA Hnnn 


14 




418575 


AA225313 


Hs. 222886 


ESTs, Weakly similar to TRHY HUMAN TRICH 


13 




436345 


AA873008 


Ks.121572 


ESTs 


3.3 




423023 


N50128 


Hs.173400 


ESTs 


13 


75 


427615 


BE410107 


Hs.179817 


CQ-82 protein 


13 


429588 


AI080271 


Hs.134533 


ESTs 


3.3 




400362 


AF068294 


Hs.272414 


Homo sapiens HDCMB45P mRNA, partial cds 


13 




425523 


AB007948 


Hs.158244 


KIAA0479 protein 


13 




426237 


AK001104 


Hs.168241 


hypothetical protein FU10242 


13 


80 


427473 


AW274439 


Hs.252709 


ESTs 


13 


434520 


AA205273 


Hs.177011 


hypothetica) protein 


13 




447282 


AI989963 


Hs.197505 


ESTs 


3.3 




433554 


AW957666 


Hs.8108 


disabled (Drosophila) homolog 1 


13 




414818 


BE541217 


HsJ3606 


ESTs 


13 



734 



WO 03/042661 



PCT/US02/36810 





418985 


AI042330 


Hs.87128 


hypotheCca) protein FU23309 


3.3 




436854 


AA749167 


Hs.173911 


ESTs 


13 




430865 


AI073424 


Hs.5232 


HSPC125 protein 


13 


J 


447182 


BE241868 


Hs.1 7585 


WAA0801 gene product 


3.3 


423545 


AI215632 


Hs.147487 


ESTs 


3.3 




429227 


AI961456 


Hs.21275 


hypothetical protein FU 11 011 


3.3 




411928 


AA888624 


Hs.197289 


rab3 GTPase-acuVaSng protein, noivcata 


3.3 




418051 


AW192535 


Hs.19479 


ESTs 


3.3 


1 A 

10 


418719 


AW975590 


Hs.161707 


ESTs 


13 


437714 


AA766346 


Hs.293242 


ESTs 


3.3 




455710 


BE072049 




gb:PM«T0532-170100^X)4-ffl6 BT0532 Homo 


13 




421590 


AF0O4715 


Hs.105940 


jerky (mouse) homolog-fike 


13 




445707 


AI248720 


Hs.1 14390 


ESTs 


3.3 


15 


411436 


AW846433 




gb:OV0-CT017947030(M43-b02 CT0179 Homo 


3.3 


426657 


NM.015865 


Hs.171731 


solute carrier family 14 (urea transport 
prostate cancer associated protein 5 


3.3 




423412 


AF109300 




13 




436962 


AW377314 


Hs.5364 


DKFZP564I052 protein 


3.3 




415319 


AA659823 


Hs.34955 


Homo sapiens cDNA FU 1 3485 fis, done PL 


13 


OA 

20 


429318 


AW861930 


Hs.1 02500 


hypothetical protein dJ51 1E16.2 


3.3 


433213 


AW665130 


Hs.137190 


ESTs 


3.3 




453973 


AI291895 


Hs.61993 


ESTs, Weakly similar to ALU1.HUMAN ALU S 


3.3 




418245 


AA088767 


Hs.83883 


transmembrane, prostate androgen induced 


3.2 




428043 


T92248 


Hs.2240 


uteroglobin 


3.2 


25 


407385 


AA610150 


HSJ272072 


ESTs, Weakly similar to 138022 hypothec 


12 


423595 


R82826 


Hs^20702 


ESTs 


3.2 




407021 


U52077 




gb:Human marinerl transposase gene, comp 


3.2 




418986 


AI123555 


Hs.81796 


ESTs 


3.2 




438118 


AW753311 


Hs.346690 


ESTs 


12 


30 


447516 


W05355 


Hs.102971 


hypothetical protein FU14751 


3.2 


449570 


AA001793 




gb:zh85cGS.r1 Soares JetalJveLSp!een_ 


12 




450687 


AA495800 




gbzw05b07.s1 Soares_NhHMPu_S1 Homo sapi 


12 




425557 


T89839 


Hs.119471 


ESTs 


3.2 




459646 


AW883968 


Hs-321190 


gb:QV3OT0063-290300-13frc04 OT0063 Homo 


12 


35 


408480 


AI350337 


Hs.164568 


fibroblast growth factor 7 (keratinocyte 


12 


451025 
401416 


AW028689 


Hs.301985 


ESTs 

C14000338*:gi|7459502|pir||S74665 outer 


3.2 
3.2 




421928 


AF013758 


Hs.109643 


polyadenytate binding protein-interactin 


3.2 




454860 


AW835767 




gb:QV4-LT0016-24020O-110-b08 LT0016 Homo 


3,2 


40 


436995 


AI160015 


Hs.118112 


ESTs 


12 


428736 


AK001331 


Hs.192662 


hypothetical protein FU10469 


12 




420120 


AL049610 


Hs.95243 


uanscnpuon elongation factor a [buy 


12 




422352 


AA766296 


Hs.99200 


ESTs 


12 




429037 


X81895 


Hs.194765 


Rsapiens GENX-5624 mRNA, 3T UTR 


12 


45 


432625 


AI243596 


Hs.94830 


ESTs, Moderately similar to T03094 A-kin 


3.2 


445218 


T79971 


Hs.12432 


Homo sapiens clone 24407 mRNA sequence 


12 




447597 


A1886035 


Hs.213675 


ESTs 


12 




437866 


AA156781 




metaOothionein 1E (functional} 


3.2 




416882 


AI633044 




tryptophanyt IRNA synthelase 2 (mitochon 


3.2 


50 


410870 


U81599 


Hs.66731 


homeo box B13 


3.2 


436899 


AF085833 


Hs.135624 


ESTs 


3.2 




415862 


R51034 


Hs, 14451 3 


ESTs 


12 




420969 


AI636310 


Hs-28310 


ESTs 


12 




415467 


R60891 


Hs.260274 


ESTs 


3.2 


55 


412666 


AL080116 


Hs.74420 


origin recognition complex, subunit 3 (y 


3.2 


417380 


T06809 


Hs.332086 


ESTs 


12 




418319 


AW611703 


Hs.190173 


ESTs, WeaWy similar to A46010 X-tinted 


12 




419088 


Al 538323 


Hs^2620 


integrin, beta 8 


12 




422165 


AL041199 


Hs.1481 


histjdine decarboxylase 


3.2 


60 


448044 


AI458682 




gb:tk13e01j(1 NO_CGAP_U24 Homo sapiens 


3.2 


452862 
405548 


AW378065 


Hs.8687 


ESTs 

Target Exon 


3.2 
3.1 




4395B4 


AA838114 


Hs.221612 


ESTs 


11 




419839 


U24577 


Hs.93304 


phosphoiipase A2, group VII (platelet-ac 


11 


65 


427359 


AW020782 


Hs79881 


Homo sapiens cDNA: FU23006 fis, done L 


It 


429569 


AA454993 


Hs.138343 


ESTs, Weakly similar to 178885 serine/th 


11 




434512 


AW139932 


Hs.188941 


ESTs 


11 




452102 


U04343 


Hs.27954 


CD 88 antigen (C028 antigen Itgand 2, B7- 


11 




425856 


AA364908 


Hs.98927 


hypothetjcat protein FU 1 3993 


3.1 


70 


416836 


054745 


Hs.80247 


cholecystoldnin 


11 


432589 


AL135725 


Hs.131708 


ESTs 


11 




420512 


AA262866 


Hs.143817 


ESTs 


3.1 




417206 


AA291183 


Ks.81648 


hypothetical protein FU 1 1021 simitar to 


3.1 




410821 


AI1 14811 


Hs.92526 


ESTs, Weakly similar to T00355 hypothec 


3.1 


75 


423855 


AA331761 


Hs.254859 


ESTs 


11 


425905 


AB032959 


Hs.318584 


novel C3HC4 type Zinc finger (ring fmge 


3.1 




4Z7773 


AA412290 


Hs.98124 


ESTs 


11 




441817 


AW969706 


Hs.293332 


ESTs 


11 




456660 


AA909249 


Hs.112282 


solute canter family 30 (zinc transport 


11 


80 


421823 


N40850 


HsJ8625 


ESTs 


11 


416812 


H91010 


Hs.44940 


ESTs 


11 




445784 


A1253155 


Hs.1 46055 


ESTs 


3.1 




434384 


AA631910 


Hs.162849 


ESTs 


11 




416140 


AI918035 


Hs.301198 


roundabout (axon guidance receptor, Dros 


11 



735 



WO 03/042661 



PCT/US02/36810 



407426 


AF 129533 




grxHomo sapiens F-box protein FbDb {FBI 


3.1 


416423 


H54375 


Hs 268921 


ESTs 


3.1 


418037 


A1990212 


Hs.86447 


ESTs 


11 


419197 


N48921 


Hs.27441 


K1AA1615 protein 


3.1 


420179 


N74530 


Hs.21168 


ESTs 


3.1 


433610 


AA806822 


Hs.1 12547 


ESTs 


3.1 


436295 


N73895 




gb2362d06.s1 Soares fetal liver spleen 


3.1 


444800 


AW1 19071 


Hs. 153287 


ESTs 


3.1 


418858 


AW9616Q5 


Hs.21145 


rrypotheOcai protein RG083M05.2 


3't 


446469 


BE094848 


Hs.15113 


homogentisate 1 ,2-dtaxygenase (homogentl 


3.1 


431992 


Nfol_002742 


Hs.2891 


protein kinase C, mu 


3.1 


405510 






ENSP00000233779*:HypoOielcal 68.0 kDa p 


11 


407349 


AA825449 


Hs. 83332 


Homo sariens cDNA: RJ22437 fa done H 


3.1 


410859 


AW808361 




Gb'MR1"ST01 11-1 110994103404 ST0111 Homo 


11 


425354 


U62027 


no. 199903 


compjemeni component 3a reoepior 1 


4.1 


425480 


AB0231Q8 


U« 1 CQ1 < IC 
no. 130 133 


iwvuijoi protein 


0.1 


441492 


AM 49998 


Hs.146346 

rid* I^MrnJ 


ESTs 


3.1 


447078 


AW885727 


Hs.9914 


ESTs 


3.1 


459324 


AW080953 




oh-xr2fic19 x1 MPJ fiftAP f!Mfl Homo santon<i 


3.1 


433852 


A137835Q 


no. 1 tDOia 


CO 15 


^ n 

3.U 


448658 


H71739 


Hs 200227 


FST^ Mnrfftrstph/ similar trt AS3QTO fhrnm 
lu is, nnjwstousij oilmen n-i hjjjjj uiiuiii 


10 


452242 


R50956 


no. 1 


cjycosyf transferase 


1 n 


424690 


BE53835S 


Hs 151777 
no. 1 9 1 f » » 


OfllVflmnftf* tranctsltftn initial Inn fsrlnr 


J.U 


405264 






NM 030S1 3*"Homo sanlens stinnrpssnr of on 


3.0 


407253 


AA411175 


Hs 141939 


ESTs. Mnrferatplv similar (n SS5fi57 atnha 
CO la, lYlULRZl alciy ouiuicn W OOJUJ/ aJpiia 


10 


452234 


AW084176 


Hs.223296 


ESTs Weaktv similar to 138022 hvoothsG 


10 


434497 


AI821803 


Hs.136580 


ESTs 


10 


420355 


AW958263 


Hs 123126 


ESTs 


3.0 


403461 






Taroet Fxon 


10 


412988 


6E046680 




oh*rvd?hn^ xi hid f fiAP RDF? Hnmn carton* 


10 


452679 


242387 


Hs.83883 


transmembrane, prostate androgen induced 


10 


416642 


T96118 


Hs.226313 


ESTs 


3.0 


418948 


A1217097 




gb^d43h07j(1 Soares_fetal_hearLNbHH19W 


10 


426174 


AA547959 


Hs.1 15838 


ESTs 


3.0 


430459 


BE178539 


Hs 278634 


ESTs 


3.0 


439182 


AF086030 


Hs.21621 


hvoothetical nmtein DKFZn762O076 


10 


446258 


AI283476 


Hs.263478 


ESTs 


3.0 


448686 




He 334719 


Hj|juuicuuoi protein ru \m *m 


J.U 


453455 


AA063553 


Hs 221931 


ESTs. Weflklv similar to 101087 RNA hftlic 


10 


414441 


AA234759 


Uq 113050 
no. I\?co*aj 


ESTs 


3.U 


427302 


AA400540 


Hs. 135282 


HomosanienscDNAFLJ11554fa dortfiHF 


10 


437048 


AA743240 


H.S.915B2 


ESTs 


3.0 


450963 


AI664668 


Hs.48832 


ESTs 


3.0 


432336 


NM rcwiQ 


Lie 


protein kinase, interferon-inducible don 


^n 


459535 


AV654907 




nh'AVfi < iAQn7 C \ P Mnmrt carvpnc r DKJA rlnnp 


3.0 


448734 


RFfii4ri7r> 

ULU 1 «f II 1 u 


iuj.jzW ID 


noino sapiens rrtr\iv\ cuna ur\r£p3D*}nt3 id \\ 


1 n 


451468 


AW503398 


Hs 293663 


ESTs. Moderslelv similar to 138022 hvont 


10 


407829 


AA045084 


Hs 99795 


hvnnt hp final nrrrfp'tn Ft JH1Q7 


10 


420407 


AA814732 


Hs.145010 


Gpopolysdccsi We-speciftc response 5*0 


3.U 


426743 


AA383833 




CO IS 


3.U 


442326 


H92962 


Hs.1 24813 




3.U 


449913 




rui.oaou 10 


CQTc 
CO IS 


3.U 


454096 






quaaw-vi UIU3- ituoyj-uof-gu/ v*iuiuo norno 


1 n 


437323 


AA371145 


ITS. I*7*t03/ 


tepQn receptor 


3.U 


407137 


T97307 




gojo33nu3^ i ooares Tela] iwer speeri 


1 n 


450580 


N40087 




ESTs 


3.0 


429084 


AJ001443 


Hs.195614 


spGcing factor 3b, subunit 3, 130k0 


10 


418365 


AW014345 


Hs.161690 


ESTs 


3.0 


423784 


AK000039 


Hs.132826 


Homo sapiens cDNA FU 14913 fa, done PL 


3.0 


435677 


AA694142 


Hs.293726 


ESTs, Weakly similar to TSGA RAT TESTIS 


3.0 


414951 


AW794931 


Hs.100861 


hypothetical protein FU 14600 


3.0 


418819 


AA228776 


Hs.191721 


ESTs 


10 


428634 


AA811845 


Hs.105290 


Keich motif containing protein 


3.0 


431869 


AA521136 


Hs.190176 


ESTs 


3.0 


435008 


AF150262 


Hs.1 62898 


ESTs 


10 


448880 


AW205507 


Hs.32360 


ESTs, Highly simitar to 138587 retroviru 


10 


451391 


AA017410 


Hs.40568 


ESTs 


10 


452959 


AI933416 


Hs.189674 


ESTs 


3.0 



TABLE 64B 



Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 

Accession: Genbank accession numbers 



Pkey CAT Number Accession 

410790 1221131J AW803357 AW803423 AW812233 R06814 

410869 1225123J AW808361 AW808404 AW8083B6 AW808594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934 
AW808829 AW808385 AW803422 AW808401 AW808409 AW80S760AW808863AW808521 AW808539 AW808609 AW808472 AWB08739 
AW808704 AW808558 AW806714 AW808420 AW8 

411436 1245660J AW846433AW846159AW846377 AW846528 



736 



WO 03/042661 



PCT/US02/36810 



411479 1247077J 



412988 
413081 
413525 
415389 
416009 
416288 
416882 

418259 
418866 
418948 
419536 
420111 
420352 
423412 
424200 
426413 
426503 
426991 
428002 
428342 

429163 
429220 
430535 
432600 
432765 
433523 
434763 
436295 
436326 



439092 
439306 
440840 

440947 
442481 
445432 
448044 
449343 



449570 
449625 

450317 
450580 



450582 
4506B7 
452462 
453682 
454037 
454096 
454171 
454457 
454860 
454968 
455276 
455646 
455710 
457374 
45B912 



1342150J 
1348563 1 
1374635J 
156454J 
1566379 J 
1585983 J 
162718.1 

173388J 
179788.1 
180808 J 
185688L1 
190755.1 
192979.1 
228001.1 
236595.1 
266650.1 
268283.1 
27415.1 
285602.1 
290035.2 

300543.1 

301384.1 

319843.1 

350959.1 

353907.1 

368873.1 

392847.1 

41733.1 

41795.2 



437866 44433J 



468554.1 
47088 1 
50357J 

505904 1 

543588.1 

63943.1 

747196.1 

80517J2 



81018.1 
81 13 J 

831956.1 
83929.1 



83933.1 

84327.1 

918580.1 

977454.1 

996287J 

1007449.1 

1049240.1 

1207274.1 

1237732.1 

1247029.1 

1272541.1 

1348557.1 

1352368 1 

328758.1 

823104.1 



AW848047 AW848202AW848631 AW848142 AW848702 AW848121 AW848632AW848140 AW848571 AW848009 AW848067AW848069 

AW848905AW848214 

OEM 6380 6E046738 BE044958 

BE064415 BEO 54430 BE064448 

BE145899 BE145848 BE145849 BE145853 BE145927 BE145925 

A1287700 AI720344 AA191424 Al 023543 AI469633 AA 172056 AW958465 AA1 72236 AW953397 AA355086 
Z43062R13213H14422 

H51299 H44619 H46391 R86024 H51892T72744 

A1633044 AW016212 AW241143 AA769058 R43272 AW068958 AA210918 AA293774 AI748815 AI763294 AI333114 AI277384 AI088297 AJ468477 
AJ824624 AW189606 AI631751 Z40749 AI984673 AI671316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597 
AA215404 AI990909 BE464132 AW271459 N74332 AI262061 
T65754AA229857AA22965B 

A1217097 AW886090 W38035 W38792 AA232835 AW936043 
AA603305 AA244095 AA244183 
AA255652 AA28091 1 AW967920 AA262684 

BE258835 AW968316 AA258918 AW843305 R14744 AJ580388 BE071923 R36280 

AF109300A129937BAI202654 

AA337221 AA336756AW966196 

AA377823 AW954494 AI022688 

AA380153 AA380233 AW963529 

AK001536AA191092AW510354AI554256AL353968AA134266 
AA418703AA418711BE0719158E071920BE071912 

AI739168 M426249 AI199635 AW505198 AW977291 AA824583 AA883419 AA724079 AI015524 A1377728 AW293682 AI928140 AA731438 
AK)92404 AI085630 AA731340 
AA884766AW974271 AA592975AA447312 
AW207206 AW341473 AA448195 AI951341 
AW9S8485 AW968670 AA480922 BE350425 

AI821085 AW973464 AA554802 AI821831 AA657438 AA640756 AA650339 
AJQ03429 AJ003367 AA564825 

H29B82 AW665533 AW1 49901 AI572917 AA598500 AI686466 AI336390 AW854390 AW864320 

AA648618AW974389 H51771 

N73895AJ001872 

BE085236 BE085317 X04236 AA577934 AA578392 AA502836 AA595852 AA578258 AW270791 AA507151 AA559152 T5704O BE503281 
AW593405 AI825755 AI350499 AI655710 AI972281 A1654949 BE073961 BE073962 BE041399 AW75021 4 AA228488 BE07401 6 AI908706 
AW270601 AW873282 

AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE185091 AW468002 M687333 AA811830 AAM^^ 

AA812489AW874142AJ47188 

AA830149 AW978407 M85983 AW503637 

BE220199 W0181 3 AF086118 N70760 BE221405 

AW629666 AW959831 AW205739 BE620243 AA412367 AW300025 AW051920 AI288591 AW2361 14 Al 302852 A1038548 AA534496 AI797207 
AA921877 

AA910403 A1815593 W58361 AW162520 AJ816550 
N99828 BE079873 A!1 10738 AF074645 
AV653771 BE089370 

A1458682 H24240 R14537 R18426 AW867082 

A1151418 W60401 AW631238 AI346936 AA862855 W60310 N72501 H90060 BE150445 AW380821 A! 540906 C04881 W03542 AA641764 H97053 
AW889353 AA521308 AA001203 W92828 AI207798 AA746655 R78710 W24617 AA024605 C01747 AW1 73095 W61229 W92685 AA742467 
H00789 R76925AW1828 
AA001793AA001871 

NM>014253 AF100772 BE088769 AL02271 8 BE161779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 
N48S74 AI375997 R45432 053344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T61382 AI016320 N45526 T61415 AA331486 
A1692689 R14223 R18395 

N40087 H12925 AA460779 M095372 BE1 60847 AW816890 AW81 6893 AW816577 AW816B91 AW816882 AW816868 AW816889 AW816940 
AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 Z25353 AA150883 AW887764 AW023806 AW022095 
AA164518 AA730973 W00417 W65303 

AC39732 AA010300 AW515041 AA768334 N29860 AA425874 AA4251 1 6 AA865829 AW936878 

AA495800 AA495737 AA010736 AA654716 AA640726 

BE173515 BE173560 AI902860 

T79703T95307AL079725 

AW998716AW022148N68020 

AW062757 AW17689Q AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892 
AW854832 AW854798 AW854857 AW854816 AW854834 AW854817 
AW753456 AW753036 AW854868 AW854862 
AW835767 AW835537 BE160187 

AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033 

BE176479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362 

BE064420 BE064435 BE0S4429 BE064414 BE064400 BE064517 

BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 8E069486 BE069464 

AA493662 AWB97396 BE154814 

A191 1 066 A1933734 A16B0888 AJ003599 



TABLE 64C 



Pkey. Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham L et at" refers to the publicafion entitled The DNA sequence of 
human chromosome 22." Dunham I. et at, Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLpositioo: indicates nucleotide tj)c^licm of predicted exons. 



Ptey Ref Strand NLposifion 
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WO 03/042661 



PCT/US02/36810 



400533 


RQR1 R2fi 


Minus 


277132-977 


400746 


7329328 


Minus 


147703-147806 

Iff IV«rltr09u 


401132 


8705350 


Minus 


6S679-8579S 


401403 


7710966 


Plus 


146180-146294 


401416 


7452889 




121456-121R9R 


401424 


8176894 


Plus 


24223-9442R 


403242 


7637817 


Minus 


11297.12111 


403481 


9965004 


Plus 


93496-93633 


403510 


7652047 


Plus 


61866-62027 


403667 


6850483 


Minus 


1344-1442,1545-1697 


403696 


3135242 


Minus 


143467-143634 


404003 


8655948 


Plus 


198349-199096 


404561 


9795980 


Minus 


69039-70100 


404571 


7249169 


Minus 


112450-112648 


404592 


9943965 


Minus 


39067-39225 


404848 


8248647 


Minus 


23955-24034,25143-25264 


405264 


7329374 


Phis 


28556-28684 


405348 


2914717 


Minus 


43310-43462 


405510 


7630909 


Minus 


101028-101174 


405548 


1532158 


Phis 


11552-11686 



Table 65A Bsts about 347 genes up-regulated in BPH compared to prostate cancer tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChrp array 
such that the ratio of 'average* BPH to 'average* prostate cancer tissues was greater than or equal to 3.0. The "average* BPH level was set to the 75* percentile amongst BPH 
tissues. The "average* prostate cancer tissue level was set to the 85" percentile amongst malignant prostate tissues. In order to remove gene-specific background levels of non- 
specific hybridization, the 1 0 s * percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

TABLE 65A: ABOUT 347 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER TISSUES 

Pkey: Unique Eos probeset Identifier number 

ExAccm Exemplar Accession number, Genbank accession number 

UnigenelO: Unigene number 

Unigene Title: Unigene gene title 

_R1j Ratio of BPH tissue to prostate tumor tissue 



Pkey 

428134 

446336 

456373 

458072 

400533 

418310 

404592 

454457 

400080 

459646 

420352 

438231 

418387 

442481 

449249 

457653 

404967 

430535 

454860 

423789 

412988 

400440 

454171 

400086 

440911 

425312 

426140 

419015 

4537B9 

424940 

403667 

429014 

417758 

419999 

405348 

404003 

453200 

428002 

432319 

445444 

443361 

411518 

439079 

422081 

408197 

423523 



AA814100 Hs.86693 



ExAccn UnigenelO Unigene Tide 
AA421773 Hs.161Q08 ESTs 
AW815036 Hs.151251 ESTs 

BE247706 Hs.89751 membrane-spanning 4-domains. subfamily A 
A1890347 Hs.271923 Homo sapiens cDNA: FU2Z785 Ms, clone K 

ENSP00000209376*:PRED65 protein {Fragmen 
ESTs 

NM_022739*:Homo sapiens E3 ublquitln Bg 
AW753456 gb:QV2-CT0261-281099^11-d11 CT0261 Homo 

Eos Control 

Hs.321190 gb:Qv^Tfj063-290300-135<04OT0063Homo 
gb:6011 17374F1 N1H.MGCJ6 Homo sapiens c 
AW594539 Hs.155689 ESTs 

R1808S gb:yg16b12.rt Scares infant brain 1N1BH 

N99828 gb.za32c04.r1 Scares fetal fiver spleen 

T52285 Hs.193115 Homo sapiens mRNA for KIAA1764 protein, 
A1820719 Hs.154662 DnaJ (Hsp40) homolog, subfamily A, membe 
Target Exon 

AW968485 gb:EST380561 MAGE resequences, MAGJ Homo 

AW835767 gb:QVUT0016-2402WM 10408 LT0016 Homo 

AK002084 Ks.132851 hypothetical protein FU11222 
BE046680 gb:hn42h03j(1 NCLCGAP_RDF2 Homo sapiens 

X83957 Hs.83870 nebulin 

AW854832 gb:CW2-CT0261-20109W)1 1-f05 CT0261 Homo 

Eos Control 
AA909536 Hs. 143562 ESTs 
AA354940 Hs.145958 ESTs 

AF131798 Hs.343768 Homo sapiens done 251 1 9 mRNA sequence 
T79262 Hs.14463 ESTs 
AA628517 Hs.118502 ESTs 
AA985308 Hs.283902 ESTs 

Target Exon 
AI800518 Hs.118158 ESTs 
U27699 Hs.82535 sohite carrier family 6 (neurotransmffle 
AI760942 Hs.191754 ESTs 

C7(X)1664:gi|12698061|dbilBAB21849.1! (AB 

Target Exon 
AA033832 Hs.212433 ESTs 

AA418703 gbzv98c03.s1 Soares„NhHMPu_S1 Homo sapi 

AW510770 Hs.128388 ESTs 

AA380876 Hs.270 pleckstrin homology, Sec7 and coiied/coi 
AI792628 Hs.133273 ESTs 

AW850246 gML*CT021 9-291 099421-E07 CT0219 Homo 

AF085937 Hs.38348 ESTs 
AW136820 Hs.196011 ESTs 

AA282262 Hs.107410 ESTs, Weakly similar to A46010 X-tinked 
T87318 Hs.120411 ESTs 

738 



R1 
9.4 
9.3 



a7 
a7 

8.1 
7.7 
7.4 
7.4 
7.3 
7.3 
7.2 
7.1 
7.0 
6.9 
6.9 
6.9 
6.9 
6.8 
6.8 
6.6 
6.6 
6.4 
6.4 
6.4 
6.4 
6.3 
6.2 

ai 

6.1 
6.0 

ao 

6.0 

ao 
a9 
5.7 
a7 

5.7 
5.6 
5.6 
5.6 
5.5 
5.5 
5.5 

as 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65' 

70 

75 

80 



436578 


AI091435 


Hs.134859 


450920 


AA011626 


Hs. 133324 


435072 


AW592176 


Hs. 11 6932 


414403 


AW969551 


Hs.76064 


443744 


A! 084326 


Hs.271548 


433097 


Al 720686 


Hs.152520 


441916 


AA993571 


Hs.1 29075 


414818 


BE541217 


Hs.23606 


452531 


M429462 


Hs.293946 


454968 


AW849046 




415890 


H08225 


Hs.268712 


408385 


AF055634 


Hs.44553 


437752 


AA767376 


Hs.291631 


446495 


060923 


Hs.153460 


435375 


A1733610 


Hs.187832 


426748 


AL048409' 


Hs.97177 


424994 


AW954525 




405321 






416706 


AA314676 


Hs 288945 


438206 


AA7803B5 


Hs 187885 


445238 


AA883971 


Hs 187506 
r»i (Of wuu 


455747 


BE074910 




420533 


AI 8095 10 


Hs 118971 


457374 


AA493662 




440354 


AA889386 


Hs. 125468 


440388 


AJ693520 


Hs.223000 


421188 


AA2846S8 


Hs.261493 


403481 






438132 


AA907076 


Hs. 122060 


403333 






450317 


AI692689 




433597 


AA708205 


Hs. 100343 


430172 


M468591 


Hs.161889 


452843 


AI796769 


Hs.208320 


452031 


AA741314 


Hs.865 


427302 


M400540 


Hs.13S282 


442123 


Al 697790 


Hs.159961 


438727 


AW978756 


Hs.205679 


420931 


AF044197 


Hs.1 00431 


442160 


A1337127 


Hs.156325 


442295 


A1827248 


Hs.224398 


434589 


AF1 47383 




404995 






422906 


U80773 


lid. l£ldOU 


405549 






430697 


AA484207 


Hs-211867 


443998 


A1620661 


Hs M6276 


455801 


8E140643 




444800 


AW1 19071 


Hs 1S3287 


403371 






438431 


AW207860 


Hs 293116 


421926 


AA300591 




439752 


T78968 


Hs.14411 


414441 


AA234759 


Hs 132950 


454665 


AW8 12866 




454585 


BE059128 




452978 


AA029994 


Hs.61523 


438042 


AW296971 


Hs 180610 


418059 


AA2115B6 




451469 


NM.014809 


Hs.26441 


433072 


AI928037 


Hs 158832 


443318 


AI051603 


Hs.1 33141 


424686 


AA345504 




416312 


W02640 


Hs.16247 


415467 


R60891 




459045 


N69101 


Hs 40730 


415417 


F12038 


Hs 140970 


409111 


AL043362 


Hs.7984 


441620 


R59595 


Hs.26675 


427908 


AA417272 


Hs^4122 


432765 


AJ 003429 




448108 


AW300021 


Hs.170685 


436345 


AA873008 


Hs.121572 


409500 


U08098 


Hs.54576 


413525 


BE145899 




403305 


NM_006825 




436338 


W92147 


Hs.118394 


427521 


AW973352 


Hs .290585 


430124 


AW204994 


Hs.253450 


410790 


AW803357 




434360 


AW015415 


Hs.127780 


416385 


H54253 


Hs.205241 


442786 


H50733 


Hs.256261 



ESTs 5.5 

ESTs 5.5 

ESTs 5.4 

ribosoma) protein L27a 5.4 

ESTs, Weakly simflar to I78885 seiineAh 5,4 

ESTs 5.3 

ESTs 5.3 

ESTs 5.3 

ESTs, Weakly simitar to 138022 hypctheti 5.3 

gMl3-CT0214-15O300-O85-HQ6 CT0214 Komo 5.2 

ESTs 5.2 

unc5 (C.elegans homoJog) c 5.2 

ESTs, Moderately similar to S65657 alpha 5.2 

ESTs 5.2 

ESTs 5.2 

ESTs, WeaWy similar to ALU1.HUMAN ALU S 5.2 

gb:EST366595 MAGE resequences, MAGC Homo 5.2 

Target Exon 5.1 

hypothetical protein FU13448 5.1 

ESTs 5.1 

ESTs 5.1 

gb:RC5ST0580.170300-021-F12 BT0580 Homo 5.1 

ESTs 5.1 

gbmh05d12j;1 NO.CGAP.Thy1 Homo sapiens 5.0 

ESTs 5.0 

ESTs 4.9 

ESTs 4.9 

Target Exon 4.8 

ESTs 4.8 

NMJ>02518*:Homo sapiens neuronal PAS dom 4.8 

gb:wd86g05jc1 NCl_CGAP_Lu24 Homo sapiens 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4,7 

RAP1A, member of RAS oncogene family 4.7 

Homo sapiens cDNA RJ11554 fis, done HE 4.7 

EST 4.7 

ESTs 4.7 

small inducible cytokine B subfamily (Cy 4.6 

ESTs 4.6 

Homo sapiens cONA RJ 1 1469 fis, clone HE 4.6 

gb:Homo sapiens full length insert cDNA 4.6 

ErJSP0000025189(r:Moruxytic leukemia zinc 4.6 

Human EST clone 42944 mariner iransposon 4.5 

C7001976*:gi|4758712ireliNP_004659.1) al 4.5 

ESTs 4.5 

ESTs 4.5 

gb:RCO-HT0015-310599416 HT0015 Homo sap 4.5 

ESTs 4.5 

Target Exon 4.5 

ESTs 4.5 

gb:EST1 3437 Testis tumor Homo sapiens cD 4.5 

ESTs 4.5 

ESTs 4.5 

gb:RC3-ST0186-300100-01 7-b03 ST0186 Homo 4.5 

gb:QV3-BT0379-31010a-071-g06 BT0379 Homo 4.4 

ESTs 4.4 

ESTs 4.4 

gb2n56d05^1 Stratagene muscle 937209 H 4.4 

WAA0319 gene product 4.4 

ESTs 4.4 

ESTs 4,4 

gb:EST51 529 Gall bladder II Homo sapiens 4.3 

ESTs, Weakly similar to 2004399A chromos 4.3 

ESTs 4.3 

ESTs 4.3 

ESTs, Weakly similar to ALU6_HUMAN ALU S 4.3 

pteckstrin homology, Sec7 and cofled/coi 4.3 

ESTs 4.2 

ESTs 4.2 

gb AI003429 Selected chromosome 21 cDNA 4.2 
ESTs 4.2 
ESTs 4.2 
sutfotransferase, estrogen-preferring 4.2 
gb:MR04ff0208.221299-20*b10 HT0208 Homo 4.2 
transmembrane protein (63kD), endoplasmi 4.2 
ESTs 4.2 
ESTs 4.2 
ESTs 4.2 
gb:lL2-UM007W}9O30O-O50-A08 UM0079 Homo 4.1 
ESTs 4.1 
ESTs, Weakly simflar to S65657 atpha-IC- 4.1 
ESTs, Moderately similar to ALU8.HUMAN A 4.1 
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WO 03/042661 



445206 


AQ50199 




424200 


AA337221 




448152 


AI741053 


He. 170770 


453713 


R20640 


H* 7Q1'tt 

ns.'3 IOO 


400746 






413081 






442338 




no. 1 ouuow 


455333 






455276 


RP17R47Q 

DCIf D4f 3 




421312 


HM04404/ 


He 9Q1C7H 
ns.43 10/u 


431421 


AWQRQ1 1fi 
MVT303 1 JO 


no. 1 Uu 1 44 


440004 


MJUU1301 


rlS.444U4 


441 900 


AJ7 33322 


ns.u/1 /b 


433040 


DbUo444U 




.A9/i7CC 

444/03 


AA428211 


HS.2B4256 


434U44 


AA33034/ 


HS.43031J/ 


43000/ 


At 00717X1 

AI33U/4I 


lie monfj 
Hs.2S2oQ9 


404444 


AC11QQQG 
AT 1 13000 


US. 400341 


ASAMVl 
4340110 


AM/ft79X0ft 
AVY07440U 


Ue 97*57 JlO 

nS.£/J/4o 


443UDO 


HM000003 


Uc 170999 

ns.1 faczt 


A0CR7R 
4030/0 


rvUOOOU 


nS.4U134 


453055 


AW2Q140R 
MVY43I400 


ns.oi3i / 


447530 


AVW 134U00 


Ue 94flftfiC 
rlS -440003 


407834 


AYVUOM J3 1 


r»*B t uu 


400398 


AF1371QR. 

/%T 10/ 030 


Uc 90*1070 


421353 




Ue 9CC19n 
no. 4 00 1 ou 


401459 






403433 






444911 


unfit 17 


Me 9C/I 


4-JMCA 


A A719fifi1 
rv\f I ODD I 


Ue 191001 
nS.l4lU3l 


447930 


r\440/4 


Me 107C10 


1 1U333 


AW7CA1Q9 
ftVV/34134 




452320 


H/\U440/ 0 


U c icnAi9 

nS. I0U4 14 


4H914C 
4U4 143 






4CRA0R 

430400 


AHA1C9H 
A114134U 


Uc 1C1/ICi( 

ns. I314M 


44U43U 


aioooioq 
ftlOOO 1 43 


Ue irCR1i17 

nS.iDOl*»/ 


407CR7 


A IC0 1909 
ftl33 1444 


U c 79*J9C 

nS./4043 


41finftQ 
4I0UU3 


740flR9 
4.4 Oil 04 




434081 


AAAHR0A 

AAD01004 




410004 


nD3U40 


Me 99/QR1 
nS, 444301 


433523 


n43004 




411436 


/vr¥ 04D4OO 




44400U 


AWW7fl4Rn 
AVV4/U0DU 


Ue 1R0911 

ns. 1 030 1 1 


431447 


AACnCIOR 


Uc 901041 
nS^3 \ 04 1 


40x009 

4OO304 


WU034O 


Uc 14JIC9/ 
MS. 114344 


449447 
44*44/ 


A A 00 07 99 
/VV933 /40 


Ue 1 9GC/T7 
nS.l430U/ 


A10R91 
413001 


AW440yoU 


Ue CAAC 


AfW)A1 
4u\i444 






4CC>IQA 
4334SU 


AUI0C9477 

AvvyooA// 




AAQOCA 
443404 


Albo/043 


nS.ISoluo 


440000 


AJ080230 


Hs.134214 




AIU03044 


Ue flOOQO 

nS.boooii 


404404 


DC171C1C 

0 CI 7351 5 




41R7C0 
410/03 


AA227879 


1 1- 407C91 

nS.lo/Wi 


4C(Vt07 
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URAi CO 


Ue 4C09Q 

rlS.13028 


44//03 


Al /M17AC 


Ue oAtmc 
n&oAUo/O 


451746 


MRR17R 
moo 1 fa 


Ue 01 19CO 
nS.Ol 1430 


416321 


H94331 


Me OJA94 
n3.04U44 


448135 


AI47ftfl74 


He 0A07QQ 
nS>040/33 


405510 






407437 


AF220264 




451859 


H44491 


Ue 9C9Q0R 
rlo.4O4300 


455135 


AW857989 




409169 


AA1 25984 




420300 


AA256245 


Ue 197C70 

ns. 14/0/ 0 


427726 


rUO03 144 


He Id^RRfl 

ns. 140000 


4040U0 


AUUnR17C7 


Ue AAOAi 
nS.44441 


ACME/I 
400004 


vc&tn 

AO041 I 


Ue 1910 

rlS.l4iy 




XQ7876 


Ue Q0701 


444033 


D16471 


Hs.121571 


447458 


rtl/4 IU04 


U« 1CBQC1 

ns. 1 0030 1 


411421 


BE272110 


Hs.21177 


439075 


AF085933 


Hs.292620 


440947 


AA910403 




417565 


AJ203405 


Hs.47831 


430096 


U91935 


Hs.233321 


448658 


H71739 


Hs.200227 


404561 






429073 


AA446167 


Hs.47385 


419002 


T78625 


Hs.268594 


450630 


AA010429 


Hs.191939 


455067 


AW854538 




412768 


AW996044 


Hs.26239 



ESTs 

9b:EST4l944 Endomebial tumor Homo sapie 
ESTs 

cadherin 8, type 2 
Target Exon 

gb:RC4-BT0311-241199-012-b03 BT0311 Homo 
ESTs 

gb:QVO-DT0O2O-O9t)2OC^10&fl05 DT0020 Homo 
gb:RCo^hT03^1603C0-022-b09 HT0585 Homo 
ESTs 

ESTs, Weakly similar to unnamed protein 
protease, serine, 12 (neuro trypsin, moto 
ESTs 

gb:RC4-8T031 1-241 t99-012<C8 BT0311 Homo 
hypothetical protein FU 14033 similar to 
hypothetical protein FU23403 
ESTs 

Homo sapiens PR02591 mRNA, complete cds 

ESTs 

ESTs 

ESTs 

Homo sapiens, done MGC3658, mRNA, comp 

ESTs, Moderately similar to JC5238 gatac 

ESTs 

ubiquiEn3 

ESTs 

a«J0046^:gi|97M241lrerlNPJ62628.1| S 

NIA-001622:Homo sapiens alpha.2-HS^Iycop 

xanlhene dehydrogenase 

ESTs 

ESTs 

gb:RC2-CT0321-131299-012-a04 CT0321'Homo 
ESTs 

Target Exon 

ESTs, WeaWy sirriar to AUJC_HUMAN \\\\ 
ESTs 

Human DNA sequence from done RP1-187J 1 1 
gb:HSC12E041 normalized Infant brain cDN 
gb.-np77n05.s1 Na_CGAPJ>r2 Homo sapiens 
Homo sapiens cONA RJ 14366 fis, clone HE 
ESTs 

gb:QVG^CTO179^7030O-143M)O2 CT0179 Homo 

ESTs. Weakly similar to putative p1 50 (H 

ESTs 

ESTs 

ESTs 

ESTs 

Target Exon 

gb:EST365547 MAGE resequences, MAGB Homo 

ESTs 

ESTs 

ESTs 

gb:RC2-m056U21020(W12-103 HT0560 Homo 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ENSP00000233779*:hypo9iefcal 68.0 kDa p 

gbJHomo sapiens MOST-1 mRNA, complete cd 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

gb"PM2-CT0328-281 299^03-604 CT0328 Homo 

gb2n27h06^1 Stratagene neuroepitheSum 

Homo sapiens FKSG41 (FKSG41 ) mRNA, compl 

Homo sapiens cONA: FU23031 fis, clone L 

Homo sapiens cONA: FU21447 fis, done C 

alcohol dehydrogenase 4 (class II), pi p 

wingless-type MMTV integration site farrd 

Human mRNA, Xq terminal portion 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Retina-derived POlWomain factor-1 

ESTs, Moderately similar to A53959 throm 

trtcrwrhinophalangeal syndrome I gene (T 

ESTs 

ESTs 

ESTs 

gb:RC3-CTO255-20O1 00-024^02 CT0255 Homo 
Human DNA sequence from done RP1 1-438B2 
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439481 
419622 
449821 
451193 
412701 
419611 
433563 
427235 
449045 
401132 
421105 
405264 
417675 



AA766501 Hs.125113 



434763 
416379 
407557 
437956 
403510 
451722 
435681 
427125 
424287 
434497 
453682 
404848 
417435 
442231 
450297 
436340 
441596 
456995 
456304 
427033 
400379 
410551 
423357 
450582 
437662 
442388 
445004 
450597 
438801 
445432 
450003 
431408 
428923 
407344 
425523 
427473 
437115 
434520 
447282 
453328 



449256 
432550 
423101 
424853 
410700 
418719 
402703 
455710 
452011 
453973 
420355 
418898 
418985 
449570 
447516 
408304 
402454 
411552 
407896 
435786 
447597 
432625 
459535 
447183 
426629 
447892 
457136 
443585 



AF086294 Hs.1 25844 ESTs 

AA452054 Hs.119338 ESTs 

AI671141 Hs.211122 ESTs 

N29850 Hs.44098 ESTs 

AW984757 9b^C1-HN0015^)4O400-011-g10HN0015Homo 

AB031479 Hs.91600 SEEK1 protein 

AI732637 Hs.277901 ESTs 

AI126288 Hs.192232 ESTs 

BE072483 Hs.278337 Homo sapiens cONA FU1 1537 (is, done HE 
C120005ir:gil4758712!ref|NP 004659.11a 
ESTs 

NM„030813':Homo sapiens suppressor of po 

AI808607 Hs.3781 similar to murine leutine-nch repeat pr 

AW372068 Hs.201420 ESTs, Moderately similar to ALU7_HUMAN A 

AA648518 gb:ns07a1 1.r1 NCI CGAP Ew1 Homo sapiens 

N38857 Hs.203933 ESTs 

Z83803 gb:H^apiens mRNA for axonemal dynein he 

AA773283 Hs.203559 hypothetical protein FU 12701 

Target Exon 

H86374 Hs.40861 ESTs 

AA694192 Hs.148979 ESTs 

AA683362 Hs.97612 ESTs 

AL133105 Hs.144633 hypothetical protein DKFZp434F2322 

A1821803 Hs.136580 ESTs 

T79703 gb:yd71e08.rt Scares fetal liver spleen 

ENSP00000240769':BG153Ol1 {similar to C 

NM_0051 81 Hs.821 29 carbonic anhydrase Hi. muscle specific 

W02434 Hs.222413 ESTs 

AW901347 Hs.38592 hypothetical protein FU23342 

R42246 Hs.21606 ESTs 

AA939300 Hs.206768 ESTs 

T89832 Hs.170278 ESTs 

A1820973 gb:nc21c02.y5 NCLCGAPJYI Homo sapiens 

AJ457449 Hs.192817 ESTs 

NM_018432 Homo sapiens ovarian cancer related prot 

R36730 Hs.21816 ESTs 
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AI339732 G<ich RNA sequence binding factor 1 

AA765387 Hs.145095 ESTs 

AW663442 Hs.129485 ESTs 

A1204616 Hs.148701 ESTs 

AT701635 Hs.207077 ESTs 

AA825971 Hs.124284 ESTs 

AV653771 gbAV653771 GLC Homo sapiens cDNA clone 

AA777809 Hs.191995 ESTs 

AA504757 Hs.105738 ESTs 

BE047698 Hs.188785 ESTs 

AI038025 Hs.271418 gbxw29t07j(1 Soares_totalJetus_Nb2HF8 

AB007948 Ks.158244 WAA0479 protein 

AW274439 Hs.252709 ESTs 

AA744703 Hs.129030 ESTs 

AA205273 Hs. 1770 11 hypothetical protein 

AI989963 Hs.197505 ESTs 

AW292635 Hs.346145 ESTs 

A1042330 Hs.67128 hypothetical protein FU23309 

AA059050 Hs.59347 ESTs 

AW297206 Hs.164018 ESTs 

M83941 Hs.123642 EphA3 

BE549737 Hs.1 32967 Human EST clone 1 22887 mariner trans peso 

AA352335 Hs.65641 hypothetical protein FU20073 

AW975590 Hs.161707 ESTs 

Target Exon 

BE072049 gb:PW-BT0532-17010OO04-f06 BT0532 Homo 

AW628911 Hs.211429 ESTs 

AI291895 Hs.61993 ESTs. Weakly simflar to ALU^HUMAN ALUS 

AW968263 Hs.123126 ESTs 

T66847 Hs.1 94040 ESTs, Weakly similar to I38022 hypothec 

AI123555 Hs.81796 ESTs 

AA001793 gb2h86c06/1 Soares Jetaijver_spleen_ 

W05355 Hs.1 02971 hypothetical protein FU14751 

AW810279 p*:MR4-ST0125-151299429-a09 ST0125 Homo 
C1002501*3gi|129092|sp|P23270lOLn_RAT 0 

AW851255 gb:(L3-CT022T>16020W)66.H02 CT0220 Homo 

D76435 Hs,41 1 54 2c family member 1 {odd-paired Drosophi 

H09175 Hs.26085 ESTs 

A1886036 Hs.213675 ESTs 

AJ243596 Hs.94830 ESTs, Moderately similar to T03094 A-kin 

AV654907 gbAV654907 GIC Homo sapiens cDNAdone 

AI554733 Hs.173182 ESTs 

AK03933 Hs.97142 ESTs 

AI435848 Hs.172978 ESTs 

AA428240 Hs.126083 ESTs 

AW466983 Hs.283949 enameSn 
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451399 
450687 
416812 
452102 
434512 
458251 
439950 
429569 
403805 
422666 
408799 
429350 
430454 
441817 
427773 
415901 
445758 
403291 
436295 
455772 
418037 
438142 
416423 
448765 
457021 
420301 
420721 
436013 
441492 
410869 
419386 
435628 
444326 
413774 
434352 
434269 
416422 
416642 
439182 
418948 
433796 
427813 
405733 
426743 
449913 
454096 
440129 
410929 
458187 
429430 
451391 
414951 



AL042110 

AA495800 

H91010 

U04343 

AW139932 

AL040927 

AW937417 

AA454993 

AA677981 

AA059412 

AI754634 

AW469011 

AW969706 

AA412290 

H08396 

R60715 

N73895 

W28799 

AI990212 

T90309 

H54375 

R15337 

AW968934 

AA767526 

AA927802 

AA703419 

AI149998 

AW808361 

AA236867 

W88732 

AI939357 

AA131782 

AF129505 

AK001991 

H60457 

T96118 

AFO86O30 

A1217097 

AA810867 

D29833 

AA383833 

AA004696 

AW062757 

AA865818 

H47233 

D56919 

A1381837 

AA017410 

AW794931 



Hs^26728 ESTs 3.1 

gbzw05b07.s1 Soares.NhHMPu.S1 Homo sapi 3.1 

ESTs 3.1 

CD86 antigen (CD28 antigen Ggand 2, B7- 3.1 

ESTs 3.1 

ESTs ^ 3.1 

ESTs 11 

ESTs, Weakly similar to 176885 serlne/th 3.1 

Target Exon 3.1 

SMC2 (structural maintenance of chromoso 11 

hypothetical protain MGC10940 11 

ESTs 11 

ESTs 11 

ESTs 11 

ESTs 11 

ubiQuitin cartxwyWermtnaJ esterase L1 11 

ESTs 11 

Target Exon 11 

gbza62d06.s1 Soares fetal liver spleen 3.1 

gb:52g 1 1 Human retina cDNA randomly prim 3. 1 

ESTs 11 

ESTs 11 

ESTs 11 

Homo sapiens mRNA; cDNA DKFZp547D086 {fr 11 

Homo sapiens cDNA: FU21897 fis, clone H 11 

paired box gene 5 {B-ceH lineage specif 11 

2AP3 protein H 

Homo sapiens cDNA: FU23593 fis, clone L 11 

ESTs 11 

gb:MR1-STO111.111099^03-fO4ST0111 Homo 11 

ESTs, WeaWy similar to I38022 hypotheti 11 

ESTs 10 

ESTs 10 

ESTs 10 

small muscle protein, X-linked 3.0 

similar (0 murine leurine-rich repeat pr 3.0 

ESTs, Moderately similar to ZN91 .HUMAN Z 10 

ESTs 10 

hypothetical protein DKFZp762O076 10 

gb:qd43h07jc1 SoaresJetalJiearl_NbHH19W 3.0 

ESTs 3.0 

safivaryproline-rich protein 3.0 

NM.021 140*:Homo sapiens ubiquitously tra 3.0 

ESTs 3.0 

ESTs 10 

gb:CMO-CT0103-120899437-g07 CT0103 Homo 3.0 

ESTs, Weakly stmaar to S71886 Ste2(Mik 10 

ESTs 3.0 

myomegalin 3.0 

ESTs 3.0 

ESTs 10 

hypothetical protein FU14600 10 



Ks.44940 

Hs.27954 

Hs.188941 

Ks.210422 

Ks.293561 

Ks.138343 

Hs.119023 

Hs.47986 

Hs.131987 

Hs,105635 

Hs^93332 

Hs.98124 

Hs.76118 

Hs.25804 



Hs.86447 

Hs.269651 

Hs^68921 

Hs.21958 

Hs.173108 

Hs.22030 

Hs.159471 

Hs.287749 

Hs. 146346 



Hs.36107 

Hs.270710 

Hs.182314 

Hs.86492 

Hs.3781 

Hs.226313 
Hs.21621 

Hs.186997 
Hs.2207 

Hs.245022 
Hs.333016 

Hs.174936 

Hs.30643 

Hs.265848 

Hs.155335 

Hs.40568 

Hs.100861 



TABLE 65B 



Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
408304 



409189 

410559 
410790 
410889 



411436 
411518 
411552 
412701 
412988 
413081 
413525 
416009 
416422 
418059 
418387 
418948 



CAT Number 
1050848J 



110687J 

1208283 J 
1221 131J 
1225123J 



1245660.1 
1248692J 
1249255.1 
1322288 J 
1342150 1 
1348563.1 
1374635.1 
1566379.1 
1593811 1 
171879 1 
174731 1 
180808.1 



Accession 

AW810279 8E1 46684 BE146693 BE146694 BE146679 AW810472 AW810208 AW810356 AW810193 AW178838 AW178837 AW178857 AW810515 
AWB10330 AW810514 AWB10441 AW81 0358 AW1 78852 AW810359 AW810322 AW810327 AW81021 1 AW178835 AW810635 AW810288 
AW810263AW810325AW810443AW8 

AA125984 AA127189 AA065075 AA070377 AA100017 AA079891 AA1 13255 AA075168 AA082764 AA083380 N84829 AA084752 AA076512 
AA085119 AA085208 AA085045 

AW754192 W00554 AW857797 AW754203 AW754197 AW754193 
AW803357 AW803423 AW812233 R06814 

AW808381 AW8084MAW8083WAW808594AW808654AW808813AW808551 AW808576AW808350AW808406AW808694AW808934 

AW808829AW808385AW808422AW808401 AW803409 AW808760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739 

AW808704 AW808558 AW80S71 4 AW808420 AW8 

AW846433 AW8461 59 AW846377 AW846528 

AW850246AW850251 AW850302 

AW851255 AW851432 AW850955 

AW984757 AW984797 AW984734 AW984745 

BE046680 BE046738 BE044958 

BE064415 BE064430 BE064448 

BE145899 BE145848 BE145849 BE145853 BE145927 BE145925 

Z43062R13213H14422 

H60457 K68709 H73528 K54335 R87154 

AA211586F35799AA211641 F29720 AW937387AW937408 

R18085AA219028 R 177 12 Z44345 

A1217097 AW886090 W38035 W38792 AA232835 AW936043 
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457374 


326758.1 


AA493662 AW897396 BE154814 



TABLE 65C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et ai." refers to the publication entilled The DNA sequence of 

human chromosome 22." Dunham 1. el al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted, 

Nt position: Indicates nucleoBde positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400533 


6981826 


Minus 


277132-277595 


400746 


7329328 


Minus 


147703.147896 


401132 


8705350 


Minus 


85679-85795 


401459 


9212270 


Minus 


182001-183323 


402145 


8018280 


Plus 


113086-114800 


402454 


7534025 


Minus 


14826-15803 


402703 


8705069 


Minus 


15335-15500 


403242 


7637817 


Mnus 


11297-12511 


403291 


7230870 


Pius 


95177-95435 


403305 


8099945 


Plus 


114632-114805 


403333 


8568833 


Partus 


124794-124941 


403371 


9087278 


Pius 


105655-106050 


403433 


9719611 


Minus 


72225-72437 


403481 


9965004 


Plus 


93496-93633 


403510 


7652047 


Plus 


61866-62027 


403667 


6850483 


Minus 


1344-1442.1545-1697 


403805 


8140491 


Minus 


51483-51742,53429-53511 


404003 


8655948 


Plus 


198343-199096 


404561 


9795980 


Minus 


69039-70100 


404592 


9943965 


Minus 


39067-39225 


404848 


8248647 


Minus 


23955-24034,25143-25264 


404957 


7523744 


Minus 


89944-90729 


404995 


6006247 


Minus 


154015-154123 


405264 


7329374 


Ptus 


28556-28684 


405321 


3419846 


Minus 


44654-45210 


405348 


2914717 


Minus 


43310-43462 


405510 


7630909 


Minus 


101028-101174 


405549 


1552494 


Rus 


10878-11048 


405733 


9884689 


Plus 


124832-125051 



743 



WO 03/042661 



PCT7US02/36810 



Table 66A fists about 370 genes up-regulated In 8PH compared to prostate cancer and norma! adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 
GeneChip array such (hat the ratio of 'average" BPH to "average* prostate cancer and normal adult tissues was greater than or equal to 3.0. The "average* BPH level was set to the 
75* percentile amongst BPH tissues. The "average* prostate cancer and normal adult tissue level was set to the 85* percentile amongst malignant prostate tissues and normal adult 
tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10* percentile value amongsl non-malignant body tissues was subtracted from both the 
numerator and the denominator before the ratio was evaluated. 

TABLE 66A: ABOUT 370 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER AND NORMAL ADULT TISSUES 

"PRey! unique Eos probesel tdentilter 1 number ' — — 

ExAccrc Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Ungene Title: Unigene gene title 

_B1 Ratio of BPH tissue to prostate tumor and normal body tissue 



Pkpv 


ExAccn 


unigene \u 


Unigene Title 


D4 


410929 


H47233 






91 1 

£1.1 


450693 






PQTe 
Co is 


1C 7 


418432 


M14156 


Hs 85112 


jn^tifinJiVp nmu/th fevlnr 1 fenmafannoHi 

U wUUlrtlftt; ySUVYUI icJUvLH 1 ^OUMUUUlllCUl 


1i 9 


432473 




ns. 10*414 


CCTe 
COlS 


11.3 


446336 


AW8 15036 


Hs.151251 


CO IS 


m q 


407275 


MtOO*> IOO 




gD.qwo^nuf j(i Nu_Lw\r_uw Morno sapiens 


10.7 


428134 


AA421773 


no. i u i uuo 


ESTs 


in o 


400297 


All 27076 


no.ouoiU i 


nypouicacat proiem ur\rt.poo*iui4(0 


3.0 


433466 


AA5ftR3&3 

rv\3UDOOJ 


ns, tuoo i*t 


mfavln 1 /U1\ 


3.0 


*H0£30 


QAQAM 


ns. iuo04i 


CCTo 

bolS 


9.1 


458072 


nlOSUOS f 


He 97109** 


Unmn eon'pane /»nM A« CI I097PC fie jtlnna 1/ 

nomo saptens cuna. ruz^/oo ns, cione t\ 


O.0 


428927 


AA441837 


nb.SUtOvl 


CO 15 


0.0 


*»^uo*+o 




ns./o^ol 


CCTe 

to IS 


8.5 


«*oooo/ 




nSYozouy 


ESTs 


8.2 


ACAACT 
40440/ 


AW753456 




gb^V2-CT0261-26109S^011-d11 CT0261 Homo 


7.7 


44 1*4/ 


A\AM 4 OCQ1 

AWllbWi 


Hs.1 28051 


Homo sapiens thymic stromal lymphopoleti 


7.5 


4010/0 


M76665 


LI* mmi e- 

Hs.275215 


hydroxysteroid (1 1-beta) dehydrogenase 1 


7.4 


4UUU0U 






Eos Control 


7.4 


431448 


At 1W17 


ns.ouoiu i 


nypouieiicai protein UKrcpoo4uuf a 


7 A 


404592 






Nwt_uzi/oy .nomo sapiens cd UDtqutun tig 


7 O 
/.0 


420352 


DCtOOOOO 




gD.oui 11 / o/4r l jMin_Mou_ t o nomo sapiens c 


7 *a 
7.3 


HOOZj 1 


AVVOs4303 


nS.lobbbi* 


ESTs 


7.3 


M IUOOU 


AWUZOOJU 


nS.l034£D 


COT* 

colS 


7.2 


449300 


Al 656959 


Ue 


CCTe 

colS 


7.1 


449249 


T52285 


n5. 13J1 ID 


nomo sapiens mruNA ror imaai i m protein, 


7 ft 

IS) 


426384 




nS.OUoDDZ 


nypouieucai protein ru loio? truu 109} 


C Q 
0.0 


454171 






nWCMO f v m9fi1 9niftQQ_ni1 ffIC rTn9R1 Unmn. 


R ft 
0.0 


408197 


AA282262 


Ue 107410 


CO 1 o, VVcdlUy SniuldT lO MtDU I U A-UTInCU 


o.o 


«*Z3/ OJL 


inniCR 
UZUJDo 


Up 1/1 QQ 

ns.i'+oo 


rympnocyte cytosoitc protein 2 (SH2 doma 


6.6 


**C 1 IO*J 


AA9JMW4 
AAZ040J0 


Ue 0Q7C51 


Homo sapiens cuNA RJ 14259 fis, clone PL 


6.5 


HO l«tuf 


lM/1001 


Up oceioa 


Homo sapiens mRNA; cDNA DKrZp4o4c0528 (f 


6.5 






Ue 001 Q 

ns.szio 


CCTe 

co IS 


6.5 


AA1AM 

44Z401 


N33oZ0 




gb^a32c04/1 Scares fetal liver spleen ( 


6.4 


49CJ19 




Up 1JCOCQ 
tIS. 140300 


ESTs 


6.4 


air*. An 


AriJJ/SO 


n 5.343768 


Homo sapiens clone 25119 mRNA sequence 


6.4 




AAOftOCTC 

aajws&oo 


HS.143562 


ESTs 


6.4 


1UUDOJ 






cNSP0000X)209376^:PkcD65 protein (Fragmen 


6.2 


418310 


AA814100 


Hs.86693 


ESTs 


6.2 


403667 






Target Exon 


ai 


436396 
404003 


AJ683487 


Hs.152213 


wingless-type MMW integration site {ami 


6.1 






Target Exon 


5.9 


424853 


BE549737 


Hs.132967 


Human EST done 1 22887 mariner trans peso 


5.9 


438138 


R98299 


Hs.177502 


ESTs 


5.9 


424940 


AA985308 


Hs.283902 


ESTs 


5.8 


434485 


AJ623511 


Hs.1 18567 


ESTs 


5.8 


453200 


AA033832 


Hs.212433 


ESTs 


5.7 


428002 


AA41B703 




gbzv98c03.s1 Soares_NhHMPu_S1 Homosapi 


5.7 


424850 


AA151057 


Hs.153498 


chrornosome 18 open reading frame 1 


5.6 


445444 


AA3B0876 


Hs.270 


pieckslrin homology, Sec7 and coDedAxi 


5.6 


443361 


A1792628 


Hs.133273 


ESTs 


5.6 


421513 


X00949 


Hs.105314 


rdaxin 1 (H1) 


5.5 


439079 


AF085937 


Hs.38348 


ESTs 


5.5 


430535 


AW968485 




gb:EST3B0561 MAGE resequences, MAGJ Homo 


5.5 


436578 


AI091435 


Hs.134859 


ESTs 


5.5 


424051 


AL1 10203 


Hs.138411 


Homo sapiens mRNA; cDNA DKFZp586J 1922 (f 


5.4 


421863 


AI952577 


Hs.108972 


Homo sapiens mRNA: cONA DKFZc434P228 {fr 


5.4 


435072 


AW592176 


Hs.116932 


ESTs 


5.4 


435375 


A1733610 


Hs.1 87832 


ESTs 


5.4 


444509 


AW571659 


Hs.278081 


ESTs 


5.4 


416602 


NM_006159 


Hs.79389 


nel{chictenH^2 


5.4 


433087 


AI7206S6 


Hs.152520 


ESTs 


5.3 


439092 


AA830149 




gb:oc44f08s1 Na.CGAP.GCB1 Homo sapiens 


5.3 


437267 


AW511443 


Hs.258110 


ESTs 


5.3 


441916 


AA993571 


Hs.129075 


ESTs 


5.3 


452784 


BE463857 


Hs.151258 


hypothetical protein FU21062 


5.3 


452531 


AA429462 


Hs.293946 


ESTs. Weakly similar to I38Q22 hypotheti 
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Mooy4 1 


lie I^OCAO 

MS. 1 ISAi 


EphA3 


5.3 




40fbo7 


A1591222 


Hs.72325 


Human DNA sequence from clone RP1-187J1 1 


5.2 




415850 




Mc ocnrrn 

rtS.4D0f l£ 


CCTe 
bo IS 


C 9 


5 




AWo4304b 




gML3^T0214-150ow0oo-H0o CT0214 Homo 


5.2 


408385 


ArUOODJ'} 




unc5 (Celegans horoobg) c 


c *> 






A A7C7'37C 

AArb/j/o 


HS.Z3 lool 


cSTs, Mooeratefy similar to So5657 alpha 


5.2 




446495 


UOU3£J 










422906 


UWl » 0 


ns. i l loou 


Human EST done 42944 mariner trans poson 


O.C 


10 


427726 


AI359 1 44 


Hq lA^fiftft 

nb, I'+OuOO 


Unmn eonlane /»HM A* CI l77fV?1 fie Mnna 1 

noma sapiens cuiw ruzouo i ns, ount) l 


O.c 




AI l\AQAPiQ 
ALU404U3 


nS.97 if / 


CCTe Wa«Uu •fanlla* l« Al 111 UlIMAMAlllC 

to i s, weawy stmnar to alu 1 .human alu o 


c o 
5.2 




436338 


W92147 


ns. 1 1 oos4 


cols 


J.Z 




410/U5 


AAoi4o fb 


u_ noon ii c 

HS.2BB945 


nypoineucal protein FU 13448 


5.1 




441/149 


A AQCEO40 


Hs. 174936 


ESTs, WeaWy similar to S71 886 Ste20-uk 


5.1 


15 


443100 


A A 000074 

AAboo97l 


HS.lo750o 


ESTs 


5.1 




awoqtm 

A10O9/OZ 




G-rich RNA sequence binding factor 1 


5.1 




420533 


ATOAAf 4 A 

AI809510 


Hs.118971 


ESTs 


5.1 




AtQAAt 


AiuoZoBd 


Hs.30732 


mypofneucal protein FU 13409; K1AA1711 


S.1 




44(054 


AA889386 


Hs.1 25468 


ESTs 


5.0 


JL\J 


452891 


N75582 


Hs.2 12875 


ESTs. WeaWy similar to DYH9_HUMAN QUA 


5.0 


457374 


AA493662 




gb:nh05d1ls1 NCI_CGAP„Thy1 Homo sapiens 


5.0 




408829 


klkf /WtC/lJO 

NMJJ06042 


HS.4B384 


heparan sulfate (glucosamine) 3-O-sulfot 


5.0 




44cU41 


AW29Z769 


Hs. 206228 


ESTs 


5.0 




457653 


AJB20719 


Hs.154562 


DnaJ (Hsp40) homolog, subfamily A, membe 


4.9 


25 


^ RIM 07 


mm ioy 


Ue 

nS.lbozo 


to IS 


4.8 


*}J0 1 it 




Ue 190nCn 


EOT* 
CO IS 


4.o 




*H*1010 


DC04 IZlf 


Ue T3Cnfi 


CCTe 

co IS 


/ O 

4.0 




429433 




ni.Z I000D 


co < s, weawy similar id iwv\ i ooo pjuiBin 


A Ft 




4oooZ0 


UOOOflO 

HZyooZ 




ESTs 


4.8 




4PV017 


A 1 COO COO 

Albsiboy 




gbwaoogOojcl Nd_CGAP - Lu24 Homo sapiens 


4.8 


440b.» 


AlUoUZoU 


HS.134214 


ESTs 


4.8 




430172 


AA468591 


Hs.161889 


ESTs 


4.8 




AK'KA'i 
£040 


AI7QC7CO 




CCTe 

tolS 


4.7 




427302 


A A Af\f\CAf\ 

AA40U540 


Hs,1352B2 


Homo sapiens cDNA FU1 1 554 its, done HE 


4.7 




4bZUol 


AA741314 


Hs.865 


RAP1A, member of RAS oncogene famDy 


4.7 


423789 


Al\Qu20o4 


Hs. 132851 


hypothetical protein FU11222 


4.7 




/M7173 
41 f 1 10 


1 101 0O7 


HS.81424 


uDiquiun-Rte i (sentnn) 


4.6 




AOftcni 

IZlftWI 


ArU44lSf 


ns.iUU4ol 


small Inducible cytokine B subfamily (Cy 


4.6 




454024 


AA993527 


Hs.293907 


hypothetical protein FU23403 


4.6 


AO 


405348 






C7TO16W:gil12698061|db]IBAB21849.1| (AB 


4.6 


419511 


AA429750 


Hs.75113 


general transcription factor II IA 


4.6 




447058 


Ai 939456 


Hs. 160870 


ESTs 


4.6 




42621 B 


A A A1A ICC 

AA424256 


Hs. 123642 


EphA3 


4.6 




A "30007 


AA4o4ZuY 


U_ OHOC7 


CCTe 
tblS 


4.5 


45 


HOOZOJ 


AA0014U4 


Ue 0000*37 

nS.ZarUo/ 


Homo sapiens cDNA FU 14135 lis, done MA 


4.5 


*wyo£ i 


AtG711/M 

AlOf 1141 


Ue OHIO) 


ESTs 


A C 

4.5 




a inane 


AA5213U7 


Hs. 186651 


ESTs 


4.5 




A307C} 

4Ja/bz 


T78968 


Hs.14411 


ESTs 


4.5 






AJ07/401 


HS.141b9o 


rrypotneucai protein mgoiobdo 


4.5 


50 


4oUlao 


ai OAOivii 
AL049242 


Hs^34794 


Homo sapiens mRNA; cDNA DKFZp564BQ83 (fr 


4.4 


4ZOO0Z 




HS. 193576 


ESTs 


4.4 






AWJOC071 


Ue <QmC40 

nS.loUolu 


CCTe 

to IS 


A A 

4.4 




452978 


AAQ2S994 


Hs.61523 


ESTs 


4.4 




/oni ha 


Aiuyoioo 


Ue QCiOA 

HS.954Z0 


JM27 protein 


4.4 




4ZoZ43 


A A1inQ1jt 

AAlOUy 14 


Ue IQOOO'I 

HS. 1oo291 


zinc finger protein 268 


4.4 


442242 


AV647908 


Hs.90424 


Homo sapiens cDNA: FU23285 ns, done H 


4.4 




452498 


AKG00101 


Hs.29700 


hypothetical protein FU20094 


4.4 




459527 


AW977555 


Hs .291 735 


ESTs, WeaWy similar to 178885 serine/lh 


4.3 




40039/ 


A A 70Q00C 

AA708ZQ5 


Hs. 100343 


ESTs 


4.3 


f\C\ 
ou 


416312 


W02640 


Hs.16247 


ESTs, Weakly similar to 2004399A chromes 


4.3 


420111 


AA255652 




gb2s21h11.r1 NCI_CGAP_GCB1 Homo sapiens 


4.3 




403U40 


Mfiomi 

Noyiui 


Ue ^A7*50 


CCTe 

to IS 


A O 

4.0 




409705 


M37762 


Hs.56023 


brain-«ienved neurotrophic factor 


4.3 




*+Oo400 


AlftOOCM 

Aioyzozo 


Ue 

nS.iziolo 


Homo sapiens done ?3-1 placenta expres 


A 0 

4.Z 




42790B 


AA417272 


Ks.24122 


ESTs 


4.2 


432765 


AJ 0034 29 




gbAJ0O3429 Selected enromosome 21 cDNA 


4.2 




413525 


Bt145o99 




gb:MR0- HT 0208-221 2S>204-b 10 HT0208 Homo 


4.2 




420355 


AW958263 


Hs. 1231 26 


ESTs 


4.2 




A97C01 
4Z/0Z1 


AWg/OObZ 


Ue OOACQC 

Hs.290585 


ESTs 


4.2 


70 




AlsllUbO 




ESTs 


42 




AWOUOOOf 




itk.tl O 1 IVirtft7Q AOAIOO OCA AOQ 1 ILJnOTfl Uf-jvu. 

QD:lL24JM00/y4)yUOW4J5(M^ UMU079 Homo 


4.1 




415245 




Ue 070C1 


CCTe 

tolS 


A 1 

4.1 




434360 


AW015415 


Hs.127780 


ESTs 


4.1 




442786 


H50733 


Hs.256261 


ESTs, Moderately similar to ALU8JiUMAN A 


4.1 


75 


450597 


AI701635 


Hs.207077 


ESTs 


4.1 


424200 


AA337221 




gb:EST41944 Endometrial tumor Homo sapie 


4.1 




445208 


AI350199 


Hs^69990 


ESTs 


4.1 




438431 


AW207860 


Hs.293116 


ESTs 


4.1 




448152 


AI741053 


Hs.170770 


ESTs 


4.1 


80 


438875 


AA827640 


Hs.189059 


ESTs 


4.1 


400746 






Target Exon 


4.1 




413081 


BE064415 




gb:RC4-8T0311-2411SS012-b03BT0311 Homo 


4.1 




404967 






Target Exon 


4.0 




427203 


AW629517 


Hs.244855 


ESTs 


4.0 
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442338 


AI761976 


Hs.156080 


ESTs 


4.0 




455276 


BE176479 




gb:RC3-HT0585-160300-022-b09 HT0585 Homo 


4.0 




452903 


AI953425 


Hs.345291 


ESTs, Weakly similar to 138022 hypotheli 


4.0 


5 


435136 


R27299 


Hs.10172 


ESTs 


4.0 


416760 


H85182 


Hs.191327 


ESTs, Highly similar to KIAA1 102 protein 


4.0 




421312 


AA824627 


Hs.291670 


ESTs 


4.0 




441568 


A1733322 


Hs.127176 


ESTs 


4.0 




441736 


AW292779 


Hs.8182 


ESTs 


4.0 


10 


448882 


AJ001531 


Hs.22404 


orotfiasfi. sp.ririft 12/neumlrvnsin mntn 


4.0 


434222 


AF 11 9886 


Hs.283941 


Homo sapiens PR02591 mRNA, complets cds 


3.9 




420301 


AA767526 


Hs.22030 


na'rrpri box npn« 5 fB-epfl finpjwa ^Derif 


3.9 




429066 


AA868555 


Hs.1 78222 


ESTs 


3.9 




435878 


R08330 


Hs.20152 


ESTs 


3.9 


15 


447530 


AW1 92063 


Ks.248865 


ESTs Moderately similar to JC5238oalac 


3.9 


412988 


BE046680 




ah'hn4?h0lx1 NCI CGAP RDF2 Homo salens 


3.9 




410196 


AI936442 


H&59838 


hypothetical protein FU 10808 


3.9 




411084 


T18987 


Hs.125472 


ESTs Moderalerv similar to KtAA0877 oro 


3.9 




419629 


AB020695 


Hs91662 


K1AA0888 protein 


3.9 


20 


455646 


BE06442O 




gb:RC4-BT031 1-241 199412-C08 BT031 1 Homo 


&9 


414441 


AA234759 


Hs.132950 


ESTs 


3,9 




425810 


AI923627 


Hs.31903 


ESTs 


3.9 




409111 


AL043362 


Hs.7984 


nfpckstfin hnmofnnv Sp/*7 snri nv\f*rifmi 


3.9 




411479 


' AW848047 




gb:IL3-CT0214-251299-052-A12 CT0214 Homo 


3.9 


25 


440450 


AI333129 


Hs,156147 


ESTs 


3.8 


428342 


AI739168 




Homo sapiens cDNA RJ13458 fis, clone PL 


3.8 




431421 


AW969118 


Hs.1 08144 


ESTs, Weakly similar to unnamed protein 


3.8 




416009 


Z43062 




nh*HRni?Pfl41 nnrmalrTRfi tnfanl hrsin rDN 


3.8 




416534 


H69043 


Hs.224961 


Homo sapiens cDtiA FU 14366 fis, clone HE 


3.8 


30 


407198 


H91679 




Qh*wfl4a07 <:1 Snarps fptal llvpr srJppn 


3.8 


419831 


AW448930 


Hs.5415 


ESTs 


3.8 




424830 


AW270580 


Hs 189311 

1 lO* I ViW 1 1 


ESTs WeaWv similar to Dutafive o150 1H 


3.8 




431447 


AA505138 




ESTs 


3.8 




435932 


W03928 


Kb 114524 


ESTs 


3.8 


35 


442447 


AA999723 


Hs.1 29607 


ESTs 




403242 








as 




433908 


AW298141 


Hs.1 57975 


ESTs 


3.8 




454037 


AW998716 




gb:PM4-BNOOS7-250300-002-f1t BN0067 Homo 


3.7 




432101 


AI918950 


Hs.1 23642 


EphAS 


3.7 


40 


418759 


AA227879 


Hs.1 87621 


ESTs 


3.7 


452462 


BE173515 




gb:RC2-HT0560-210200-01 2-TO3 HT0560 Homo 


3.7 




448568 


AA149121 


Hs.71947 


ESTs 


3.7 




447785 


AL041765 


Hs.340375 


ESTs 


3.7 




451746 


M86178 


Hs 31 1258 


ESTs 


3,7 


45 


436345 


AA873008 


Hs 121572 


ESTs 


3.7 


446862 


AV660697 


Hs.282700 


ESTs 


3.7 




419875 


AA853410 


Hs 93557 


nmpnkpnhalin 

pi VK3I UtCpl I all 1 1 


3,7 




421040 


AA715026 


Hs.1 35280 


ESTs 


3.7 




454860 


AW835767 




nh-OVAJ Tflnifi-94n9nfL1 tfl-hftR I TOMR Hnmo 


3.7 


50 


450687 






nh*7wn^hn7e1 Qnanx; NhMMPii Unmncnri 
go«cwuoDur.5i owww_jNnniVuU_o i narnosapi 


^ 7 
O.I 


407437 


AF220264 




nh'HfHun onntpfrc MnRT-1 mRNA rnmnlpfp rri 

yU.nmiKJ oafJtClU) IVIWOI-I HUMvA, CUIll)JiCLO 


3.7 




431474 


AL1 33990 


Ue iqnfi42 
ru>. i aUCK+t 


wcur) p rale in 


0,f 




448004 


AW4M477 


nb.£Of HOO 


nolo 


17 




419261 


X07876 


Hs.89791 


urinotp^s-hrtiR MMTV infnnrfifinn <d(p fami 


3.7 


55 


422899 


016471 


Hs.121571 


Human mRNA, ternunal portion 


3.7 


439075 


AF085933 


Hs 292620 


ESTs 


3.7 




440947 


AA910403 




ESTs 


3.7 




447458 


A1741082 


Hs.1 58961 


ESTs 


3.7 




403481 






Target Exon 


3.6 


60 


404561 






(ri(2horhinopha]angeal syndrome 1 gene (T 


3.6 


417565 


AI203405 


Hs.47831 


ESTs 


3.6 




430096 


U91935 


Hs.233321 


Retina-derived POU-domaln factor- 1 


3.6 




448658 


H71739 


Hs.200227 


ESTs Moderately similar to A53S59 Uirom 


3.6 




449655 


AI021987 


Hs.59970 


ESTs 


3.6 


65 


450630 


AA010429 


Hs.191939 


ESTs 


3.6 


455067 


AW854538 




ab*RC3-CT0255-200100-024-b02 CT0255 Homo 


3.6 




423566 


AW976434 


Hs,3623 


hypothetical protein FU 1 1220 


3.6 




451193 


N29850 


Hs.44098 


ESTs 


36 




420026 


A1831190 


Hs.166676 


ESTs 


3.6 


70 


426917 


AA913814 


Hs 172854 


ntCF7KA6Rfl993 nmlpin 
urvrtrOQoowjto ptuuflii 


3.6 


429430 


rUJO 100/ 


rv>. iooooo 


CO IS 


3.6 




433563 


AJ 732637 


Hs^77901 


ESTs 


a6 




443744 


AI084326 


Hs^71548 


ESTs, Weakly similar to 178885 serineAh 


3.5 




456373 


BE247706 


Hs.89751 


rnembrane-spannlng 4-domains, subfamily A 


3.5 


75 


419589 


AW973708 


Hs31925 


Homo sapiens cDNA FU 13446 fis, clone PL 


as 


401132 






CI 2000517-:9lJ47587 12lreflNP_004659.1 i a 


15 




421105 


AA766501 


Hs.125113 


ESTs 


3.5 




435177 


AI018174 


Hs.42936 


ESTs 


3.5 




434763 


AA648S18 




gb:ns07a11j1 Na.CGAP„Ew1 Homo sapiens 


3.5 


80 


449988 


AW372068 


Hs.201420 


ESTs, Moderately similar to ALU7 HUMAN A 


3.5 


450216 
403510 


AA873345 


Hs.60226 


Homo sapiens, done IMAGE:3621 638, mRNA. 
Target Exon 


3.5 
3.5 




435681 


AA694192 


Hs.148979 


ESTs 


3.5 




451722 


H86374 


Hs.40861 


ESTs 


as 



746 



WO 03/042661 



PCT/US02/36810 





430865 


AKJ73424 


Hs.5232 


H5PC125 protein 


3.5 




444800 


AW1 19071 


Hsll 53287 


ESTs 


3.5 




428647 


AAB30050 


Hs. 124344 


ESTs 


3.5 


5 


412775 


AA709046 


Hs.27552 


Homo sapiens mRMA; cDMA 0KFZp586N2424 (f 


3.5 


429643 


AA455889 


Hs.167279 


FYVE-finger-contairung Rab5 effector pro 


3.5 




443324 


R44013 


Hs, 164225 


ESTs 


3.5 




430124 


AW204994 


Hs.253450 


ESTs 


3.4 




430701 


A1760833 


Hs.293971 


ESTs 


3.4 


10 


436714 


AA728964 


Hs.293399 


ESTs 


3.4 


404848 






ENSP00000240769*:BG153011 (similar to C 


3.4 




408480 


A1350337 


Hs.1 64568 


fibroblast growth factor 7 (keratinocyte 


3.4 




412494 


All 33900 


Hs.792 


AOP-nhosvlatinn farfnr domain nrnfptn 1 

nwi ~ liAjoj iquVvH i iquwi u vi i tail i yu vit-ti i i 


3.4 




417435 


NM_005181 


Hs.82129 


carbonic anhydrase ill, muscle specific 


3.4 


15 


430403 


Af 039390 


Hs^41332 


tumor necrosis factor (fifjand) superfami 


14 


436340 


R42246 


Hs,21606 


ESTs 


3.4 




441536 


AA939300 


Hs.206768 


ESTs 


**4 

0.** 




442231 


W02434 


Hs 99941 ^ 


ESTs 


<J.H 




447124 


AW976438 


Hs.17428 


RBPHIks protein 


14 


20 


450297 


AW901347 


Hs.38592 


hypothetical protein FU23342 


3.4 


453682 


779703 




flh*vrf71 pflft r1 QnntP^ falsi llvpf cnjppn 


3,4 




456995 


T89832 


Hs.1 70278 


ESTs 


34 




405510 






PN^Pnnnnrm^77Q*'Hvnnihpf1nat fiR fl kna n 
Liioruuuiwtoj/ 1 ? .ny puuiciicdi oo.u kuo p 






400379 


NM 018432 




Homo sapiens ovanan cancer related prot 


^14 


25 


419964 


AA811657 


Hs.220913 


ESTs 


3.4 


427033 


AI457449 


Hs!l92817 


ESTs 


3^4 




422321 


AA906427 


Hs.1 81035 


hvoolhefical Droteln MGC1 1 296 


3.4 




430829 


AW451999 


Hs.1 94024 


ESTs ' 


3.4 




420721 


AA927802 


Hs.159471 


2AP3 protein 


3.4 


30 


433628 


AI821784 


Hs.1 88578 


ESTs 


3.4 


436703 


AW880614 


Hs.146381 


RNA binding motif protein, X chromosome 


3.4 




429716 


R25685 


Hs.211933 


collagen, type XIII, alpha 1 


3.4 




427235 


AI126288 


Hs.1 92232 


ESTs 


3.4 




459646 


AW883968 


Hs.321190 


ob*GV3-OT0063-290300-135-e04 OT0063 Homo 


3.4 


35 


428923 


BE047698 


Hs.1 88785 


ESTs 


3.4 


431408 


AA 504757 


Hs 105738 


ESTs 


3.4 




438801 


M825971 


Hs.124284 


ESTs 


14 




445432 


AV653771 




ab:AV653771 GLC Homo saoiens cDNA done 


3.4 




450003 


AA777809 


Hs 191995 


ESTs 


3.4 


40 


448108 


AW300021 


Hs.170685 


ESTs 


3.3 


425523 


AB007948 


Hs.1 58244 


WAA0479 protein 


13 




427473 


AW274439 


Hs 252709 


ESTs 


13 




434520 


AA205273 


Hs.1 77011 


hvonthfitir^l nmtftin 


13 




447282 


AI989963 


Hs.1 97505 


ESTs 


13 


45 


447182 


8E241866 


Hs.1 7585 


KIAA0801 gene product 


13 


432229 


AW290976 


Hs.1 43587 


ESTs 


13 




418985 


AI042330 


Hs.87128 


hvDothefica] orotsin FU23309 


3.3 




418719 


AW975590 


Hs.1 61707 


ESTs 


13 




455710 


6E072049 




ab*PM4-BT0532-170l WHW4-f06 BT0532 Homo 


13 


50 


420300 


AA258245 


Hs 127573 


Homo saniens FKSS41 fFKSGdll mRNA. mmrd 


3.3 


416662 


T25853 


Hs.7538 


ESTs 


13 




411436 


AW846433 




gb:QVO-CT0179^030(M43-b02 CT0179 Homo 


13 




420851 


AA281062 


Hs.29493 


hYDOthetkal orotein FU20142 


3.3 




428715 


AW293716 


Hs.53126 


ESTs 


13 


55 


429318 


AW861930 


Hs.102500 


hypothetical protein dJ511E16.2 


13 


444246 


H93281 


Hs 10710 


hvmlhptfml nrrrfpin PI I9fl417 


33 




453973 


A1291895 




ESTs Wfiflldv similar to AL til HUMAN Al U R 


3.3 




434497 


A1821803 


Hs 136580 


ESTs 


32 




439306 






WTWn Rfntoin C5w\\ 


^ 9 


60 


420608 


BE548277 


Hs 103104 


ESTs 


3.2 


418986 


AJ 123555 


Hs.81796 


ESTs 


3.2 




438118 


AW753311 


Hs.346690 


ESTs 


3.2 




447516 


W05355 


Hs 102971 


hvnnthpf'tfuH rvntpin Fl 1 147^1 


3.2 




449570 


AA001793 




gbzh86c06j1 SoaresjfetaUiver.spleen. 


3.2 


65 


424994 


AW954525 




gb:EST366595 MAGE resequences, MAGC Homo 


12 


417675 
402145 


AI808607 


Hs.3781 


similar to murine leucine-rich repeat pr 
Target Exon 


12 
12 




448131 


AI675054 


Hs.200481 


ESTs 


12 




422352 


AA766296 


Hs.99200 


ESTs 


12 


70 


432625 


AI243596 


Hs.94830 


ESTs, Moderately similar to T03094 A-kin 


3.2 


447183 


AI554733 


Hs.1 731 82 


ESTs 


12 




447597 


AI886036 


Hs.213875 


ESTs 


12 




459535 


AV654907 




gb:AV654907 GLC Homo sapiens cDNA done 


12 




415467 


R60891 


Hs.260274 


ESTs 


3.2 


75 


434408 


AI031771 


Hs.1 32586 


ESTs 


12 


456354 


X56411 


Hs.1219 


alcohol dehydrogenase 4 [class II), pi p 


3.2 




453789 


AA628517 


Hs.1 18502 


ESTs 


3.2 




412666 


AL080116 


Hs.74420 


origin recognition complex, subunil 3 (y 


3.2 




418319 


AW611703 


Hs.190173 


ESTs, Weakly similar to A46010 X-Cnked 


3.2 


80 


419088 


A1538323 


Hs.52620 


Integrin, beta 6 


12 


420397 


NM.007018 


Hs.97437 


csentrosomal protein 1 


12 




422165 


AUJ41199 


Hs.1481 


htsfjdine decarboxylase 


3.2 




448044 


AI458682 




gb:tx13e01j(1 Na_CGAP_Lu24 Homo sapiens 


12 




454665 


AW812866 




gb:RC35T0186-30010XM)17-b03 ST0186 Homo 


3.1 



747 



WO 03/042661 



PCT/US02/36810 



4(35321 






Target Exon 


3.1 


429569 


AA454993 


\ 1 4 AAA J M 

Hs.138343 


ESTs, Weakly similar to 178885 senneAn 


3.1 


434512 


AW1 39932 


Ks,188941 


ESTs 


3,1 


452102 


U04343 


Hs.27954 


C086 antigen (CD28 antigen Dgand 2, B7- 


3.1 


433444 


AW975324 


Hs.129816 


ESTs 


3.1 


410821 


AI1 14811 


Hs.92526 


ESTs, Weakly similar to T00365 hypothefl 


11 


415861 


243123 


Hs.1 445 13 


ESTs 


11 


422299 


AK000181 


Hs.1 14556 


hypothetical protein RJ20174 


11 


432527 


AW975028 


Ks.1 02754 


ESTs 


11 


427773 


AA412290 


Hs.98124 


ESTs 


11 


441817 


AW969706 


Hs.293332 


ESTs 


11 


416812 


H91010 


Hs.44940 


ESTs 


3.1 


417958 


AA767382 


Hs.193417 


ESTs 


3.1 


407426 


AF 129533 




gb:Homo sapiens F-box protein Fbl3b (FBL 


3.1 


416423 


H54375 


Hs.268921 


ESTs 


11 


418037 


AI990212 


Hs.86447 


ESTs 


11 


419197 


N48921 


Hs.27441 


K1AA1 615 protein 


3.1 


420179 


M74530 


Ks.21168 


ESTs 


3.1 


436295 


N73895 




gbza62d06.s1 Soares fetal Ever spleen 


11 


405548 






Target Exon 


3.1 


423595 


R82826 


Hs.220702 


ESTs 


3.1 


412533 


AA679863 


Hs.69606 


ESTs 


3.1 


434072 


H70854 


Hs.283059 


Homo sapiens PRO1082 mRNA, complete cds 


11 


405264 






NM_030813*:Homo sapiens suppressor of po 


11 


410869 


AW808361 




gb:MR1-ST01 11-11 1099-003-104 ST01 1 1 Homo 


3.1 


425354 


U62027 


Hs.155935 


complement component 3a receptor 1 


11 


441492 


A 1 4 A fWWSA 

A1149998 


114 1MAJA 

Hs.146346 


ESTs 


11 


447078 


AW885727 


Hs.9914 


ESTs 


11 


435021 


AA922192 


Hs.54709 


ESTs 


3.0 


408832 


AW085690 


Hs.63428 


ESTs, Weakly similar to Z195.HUMAN ZINC 


10 


450580 


N40087 




ESTs 


3.0 


432319 


AW510770 


Hs.1 28386 


ESTs 


3.0 


453713 


R20640 


Hs.79133 


cadherin 8, type 2 


10 


445784 


AI253155 


Hs.146065 


ESTs 


10 


416642 


T96118 


Hs.226313 


ESTs 


3.0 


418948 


AI217097 




gb:qd43h07.x1 Soares_fetaLhearLNbHH19W 


3.0 


433796 


AA810867 


il 4 orivn 

Hs.1 86997 


ESTs 


10 


439182 


AF086030 


Hs.21621 


hypothetical protein DKFZp762O076 


3.0 


404995 






ENSPO(XX>0251890*:Ktonc<^fcleukerniazinc 


10 


444794 


AI419991 


Hs.145225 


ESTs 


3.0 


443634 


H73972 


Hs.1 34460 


ESTs 


10 


420133 


AA426117 


Hs.1 55543 


ESTs 


10 


407829 


AA045084 


Hs.29725 


hypothetical protein FU13197 


10 


426743 


AA383833 


Hs.245022 


ESTs 


10 


442326 


H92962 


Hs.124813 


hypothetical protein MGC14817 


10 


449913 


AA004696 


Hs.333016 


ESTs 


10 


454096 


AW082757 




gb:CMO-CT0103-120899437-g07 CT0103 Homo 


10 


419622 


AA452054 


Hs.119338 


ESTs 


10 


449745 


AI668593 




gbyt38a05.x5 Soares breast 3NbHBst Homo 


10 


428412 


AA428240 


Hs.126083 


ESTs 


10 


428200 


AI039624 


Hs.98388 


ESTs 


10 


414951 


AW794931 


Hs.100861 


hypothetical protein aJ14600 


10 


431869 


AA521136 


Hs.190176 


ESTs 


10 


451391 


AA017410 


Hs.40568 


ESTs 


10 


452959 


AI933416 


Hs.189874 


ESTs 


10 



TABLE 66B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank acces sion numbers 



Pkey, 

410790 
410869 



411436 
411479 

412988 
413081 
413525 
416009 
418948 
420111 
420352 
424200 
424994 
428002 



CAT Number Accession 



1221131.1 
1225123J 



1245660J 
1247077.1 

1342150.1 
1348563J 
1374635.1 
1566379.1 
180808 1 
190755 1 
192979.1 
236595.1 
245786.1 
285602,1 



AW803357AW803423 AW812233 R06814 

AW808361 AW808404 AW808386 AW808594 AW808654 AW808B13 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934 
AW808829 AW808385 AW808422 AW808401 AW808409 AW808760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739 
AW808704 AW80B558 AW808714 AW808420 AW8 
AW846433 AW846159 AW846377 AW846528 

AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848671 AWB48009 AW848067 AW848069 
AW848905AW848214 
BE046680 BE046738 BE044958 
BE064415 BE054430 BE064448 

BE145899 BE145848 6E145849 BE1458S3 BE145927 BE145925 
Z43062R13213H14422 

AI21 7097 AW886090 W38035 W38792 AA232835 AW936043 
AA255652 AA28091 1 AW96792D AA262684 

BE258835 AW968316 AA258918 AW843305 R14744 A1580388 8E071923 R36280 

AA337221 AA336756AW966196 

AW354525 AI372685 AA349501 AJ372687 H10564 

AA418703AA418711BE071915BE071920BE071912 



748 



WO 03/042661 



PCT/US02/36810 



428342 290035,2 



430535 
432765 
433523 
434763 
436295 
439092 
439306 
440947 
442481 
445432 



449570 
449745 
450317 
450580 



450582 
4506B7 
452462 
453682 
454037 
454096 
454171 
454457 
454665 
454860 
454968 
455067 
455276 
455646 
455710 
457374 
458912 



319643 1 

353907J 

368873.1 

392847.1 

41733 1 

468554.1 

47088 1 

505904.1 

543588 1 

63943.1 

747196.1 

81018.1 

814534 1 

831956.1 

83929 1 



83933 1 
84327.1 
918580 1 
977454 1 
996287.1 
1007449 1 
1049240 1 
1207274 1 
1228599 1 
1237732.1 
1247029.1 
1252050.1 
1272541 1 
1348557.1 
1352368 1 
328758.1 
823104 1 



AI739168 AA426249 A1199636 AW505198 AW977291 AA824583 AA883419 AA724079 A1015524 AI377728 AW293582 AI928140 AA731438 
AI092404AI085630AA731340 
AW968485 AW968670 AA480922 BE350425 
AJ003429 AJ003367 AA564825 

H29882 AW685533 AW149901 AI572917 AA598500AI686466AI336390AW864390 AW864320 

AA648618AW974389 H51771 

N73895AJ001872 

AA8301 49 AW978407 M85983 AW503637 
BE2201 99 W0181 3 AF0861 18 N70760 BE221405 
AA910403 AI815593 W58361 AW1 62520 AI81 6550 
N99828 BE079873 AI1 10738 AF074645 
AV653771 BE089370 

AI458682 H24240 R14537 R18426 AW867082 
AA001793AA001871 

A1668593 AI820774 R86205 H39971 H22177 H26241 
AI692689 R14223 R18395 

N40087 H12925 AA460779 AA096372 BE160847 AW816890 AWB1 6893 AW81 6577 AW816891 AW816882 AW816868 AW81 6889 AWB16940 
AW816892 AW816941 AW816578 M029183 T83320 H88850 AA010295 BE1 60823 Z25353 AA 150883 AW887764 AW023806 AW022095 
AA16451 8 AA730973 WOW 17 W65303 

AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118AA865829AW936878 

AA495800 AA495737 AA010736 AA654716 AA640726 

BE173515 BE173560 A1902860 

T79703T96307AL079725 

AW998716AW022148 N68020 

AW062757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892 

AW854832 AW854798 AW854857 AW854816 AW854834 AW854817 

AW753456 AW753036 AW854868 AW854862 

AW812866 AW812746 AW812747 AW81 2884 AW81 2763 AW812722 

AW835767 AW835537 BE160187 

AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033 
AW854538 AW854418 AW854412 

BE175479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362 
BE064420 BE064435 BE064429 BE064414 BE064400 BE054517 

BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464 

AA493662 AW897396 BE154814 

A!91 1066 AI933734 AI680888 AJ003599 



TABLE 66C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Q) numbers. *Ounham 1. et al.' refers to the publication entitled The DNA sequence of 

human chromosome 2T Dunham I. et aL. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposilion 


400533 


6981826 




277132-277595 


400746 


7329328 


Minus 


147703-147896 


401132 


8705350 


Minus 


8567^85795 


402145 


801B280 


Plus 


113086-114800 


403242 


7637817 


Minus 


11297-12511 


403481 


9965004 


Plus 


93496-93633 


403510 


7652047 


Pius 


61666-62027 


403667 


6850483 


Minus 


1344-1442,1545-1697 


404003 


8655948 


Plus 


198349-199096 


404561 


9795980 


Minus 


69039-70100 


404592 


9943965 


Minus 


39067-39225 


404848 


8248647 


Minus 


23955-24034.251 43-25264 


404967 


7523744 


Minus 


89944-90729 


404995 


6006247 


Minus 


154015-154123 


405264 


7329374 


Plus 


28556-28684 


405321 


3419846 


Mnus 


44654-45210 


405348 


2914717 


Minus 


4331CM3462 


405510 
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TABLE 67A: 689 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Table 67A fists about 689 genes up-regulated in prostata cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChlp 
array that showed some expression to human xenograft tumors and cell Tines. The 689 genes were selected such that the ratio of "average* prostate cancer to 'average" normal adult 
tissues was greater than or equal to 2.5. The "average' prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average* normal adult tissue level was set 
to the 85th percentile amongst non-malignarrt tissues. In order to remove gene-specific background levels of nonspecific hybridization, the 10lh percentile value amongst non- 
malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 
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5.3 


421470 


Kef 43b 


LI B 4<170 


OJUw?AIII rw 


5.3 


418555 


AM 1704 c 


Up 074 CO 


hvrvrfhpliml ninf pin Ft J 15*^77 


5.2 


404632 






NM_022490:Homo sapiens hypothetical prot 


5.1 


450861 


AICHROQ 

Aio/JHao 


rtS.i I 01 / 


ESTs 


5.1 


431583 


AuMiOlj 






5.1 


400303 


AA4J77CQ 


rlS.791.5b 


1 rV-1 nrnjpln patron en refloated 

ui i pjw\wiJi r cjuuycii ic^uiam 


5.1 


400290 


H18836 


iL 14CAO 


hypothetical protein FU 20041 


5.1 


410037 


AB020725 


Hs.58009 


K1AA0918 protein 


5.1 


410240 


AL1 57424 


HS.61289 


synaptojanin 2 


5.0 


403046 






NM_005656*:Homo sapiens transmembrane pr 


5.0 


419647 


A A4JBA17 


nS.91o1o 


hvnnthpltcfll nmlp?n 


5.0 


450203 


ARJ97994 




1 JcvnifTpnlnp/alnha-fltTiJnfladlnatfi arranntra 
unvjituidiuicvoi^ii^oJiuiiwAJipoio aiitMiuua 


5.0 


450164 


Al 239923 


Hs.63931 


ESTs 


5.0 


417318 


AW953937 


Hs.240845 


ESTs 


4*9 


417153 


X57010 


Hs.81343 


pfitlanpn hi nn II ainha 1 fnrimarv ncln 


4^9 


412350 


A16593Q6 


Hs.73826 


nrnlHn h/imlnp nhnsnhala^p nnn^pcpnl 


4.9 


45608R 


BE177320 


Hs.156148 




4.9 


433852 


AI378329 


Hs.126629 


ESTs 


4.9 


452679 


242387 


Hs.83883 


trans membrane, prostate androgen induced 


4.9 


430387 


AW372884 


Hs.240770 


nuclear cap Dinoing proiein suDunii c, z 




416276 


U41060 


Hs.79136 


uv-i pfoiein, estrogen reyuiaicvj 


4 A 


441021 


AW578716 


Hs.7644 


n i nisione wrruy, rrwnoer z 


4.8 


453171 


R76472 


Hs.65646 


ESTs 


4.8 


416795 


AI497778 


Hs.20509 


HRV nX a^rraalpH nrntpin-fl 


4.8 


427615 


BE410107 


Hs.179817 


CGLP2 DTotein PSDR1 

ruurvi 


4.8 


419440 


AB020689 


Hs.90419 


K1AA0882 orotfiin 


4.8 


422048 


KM.012445 Hs.288126 


<;oflndin 2 Pirfr^eBiilar matrix nrolein 


4.8 


453082 


H18835 


Hs.31608 


hypothetical protein FU20O41 


4.8 


420948 


AB016898 


Hs.100469 


rTtyeloid/r/rnprioId or rnJxed-fineags leukem 


4.8 


442592 


BE566623 


Hs.29899 


ESTs, WeaWy similar to 602075 transcrip 


4.8 


421685 


AF189723 


Hs.105778 


A i kska %jA frAnfimvfinn fvnfi /fl ntf^iiupr 


4.7 


431724 


AA514535 


Hs.283704 


ESTs 


4.7 


423242 


AL039402 


Hs.125783 


DFMFJ5 nmtpin 


4.7 


434485 


AI623511 


Hs.118567 


ESTs 


4.7 


407103 


AA424881 


Hs.256301 


hypothetical proteh MGC13170 


4.6 


414085 


AA114016 


Hs.75746 


flMphvrin HphvrtmnpnacA 1 farrrih/ wtwnhPf 
wjciijuo uciijuiuyciiado i wuiui/i iiictinxa 


4.6 


422486 


BE514492 


Hs.1 17487 


OAfw npar kin nn inlfi 3 unfh hnmnirviv in 


4.6 


422083 


NM.001141 Hs.111256 


arachUonato 15^jpoxy^enas6i second ^fp 


4.6 


439735 


AI635365 


Hs.142846 


hypothetical protein 


4.6 


420039 


NM.004605 Hs.94581 


<a lifinfra n^fpra^A famflv rvtnsnDr % 


4.6 


449845 


AW971183 


Hs.6019 


Hna t /T-lcruim hnmnlrvi cuhfamiKi r 1 momho 

mioJ iKNTHNug, suoiarniry v», utaiiuc 


4.5 


423445 


NM.014324 Hs.128749 




4.5 


433730 


AK002135 


Hs.3542 


hvnnfhfitical nmtpin Fl .1 1 1 273 


4.5 


426006 


R49031 


Hs^2527 


ESTs 


4.5 


451982 


F13036 


Hs.27373 


Homo sapiens mRNA; cONA OKF2p56401 763 (f 


4.5 


415082 


AA16O0D0 


Hs.137396 


ESTs. Weakly sim3ar to JC5238 gatactosy 


4.5 


409799 


D11928 


Hs.76845 


phosphoserlne phosphatase-Oke 


4.5 


423583 


AL122055 


Hs.1 29836 


KIAA1028 protein 


4.4 


452017 


AF109302 


Hs^7495 


prostate cancer associated protein 7 


4.4 


437162 


AW0055O5 


Hs.5484 


thyroid hormone receptor ooacQvating pr 


4.4 


451468 


AW503398 


Hs.293663 


ESTs, Moderately similar to 138022 hypot 


4.4 


429467 


NM.004477 Hs^03772 


FSHD region gene 1 


4.4 


451752 


AB032997 




WAA1171 protein 


4.3 


430294 


A1538226 


Hs^2976 


guanine nucleotide binding protein 4 


4.3 


452259 


AA317439 


Ks.28707 


signal sequence receptor, gamma (translo 


4.3 


456362 


AW973003 


Hs.179909 


rrypothelical protein FU22995 


4.3 
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449048 
440300 
427308 
435706 
416854 
451406 
410076 
410275 
419239 
442501 
436761 
420380 



245051 
N39760 
D26067 
W31254 
H40164 



Hs^2920 



Hs.174905 

Hs.7045 

Hs.80296 



458440 
446416 
417873 
430598 
421305 
429299 
407894 
414664 
408418 
430945 
432363 
447574 
447595 
453857 
444108 
422890 
417379 
418818 
410102 
418821 
438825 
433233 
415276 
440146 
407819 
450402 
432527 
419733 
422134 
432435 
450680 
436420 
430122 
411031 



407813 
417067 
410227 
417958 
427176 
448826 
447966 
432S75 
419713 
427479 
443162 
413950 
439963 
419083 
448148 
419465 
432274 
447620 
430589 
418004 
416655 
459284 
421829 
437252 
437967 
417061 
423337 
418838 
453469 
408063 
422072 
420297 
420522 
408833 
426647 



AJ694320 Hs.6295 

T05387 Hs.7991 

U85658 Hs.61796 

AA468183 Hs.184598 

AA315267 Hs.23128 

AI817776 Hs.236557 

AA540891 Hs.102408 

AI364997 Hs.7572 

A1095468 Hs.135254 

AV658299 Hs.163959 

BE266659 Hs.293659 

AK001764 Hs.247112 

BE397354 Hs.324830 

AI620463 Hs.347408 

AJ278313 Hs.41143 

AA587775 Hs.66295 

AW963897 Hs.44743 

U80669 Hs.55999 
AA534489 

AF162666 Hs.1B895 

AW379130 Hs.18953 

AL080235 Hs.35861 

R55784 Hs.140942 
Z43784 
AA196390 

AA228899 Hs.101307 

AW248508 Hs.279727 

AA436002 Hs.183161 

BE327427 Hs.79953 

AB040927 Hs.301804 

U88666 Hs.78353 

AW014231 Hs.90790 
R42185 

BE218027 Hs.89969 

AW975028 Hs.102754 
AW362955 

AW179019 Hs.112110 

BE218886 Hs.282070 

AF131784 Hs.25318 

AA443966 Hs.31595 
NIVL013342 HsJ233765 

W37943 Hs.34892 

AA420450 Hs^92911 

AL120247 Hs.40109 

AJ001417 Hs.81086 

AB009284 Hs.61152 

AA767382 Hs.193417 

AW381569 Hs.40334 

AI580252 Hs.293246 

AA340605 Hs. 105887 
AI791855 

AW968058 Hs.92381 

BE410092 Hs.178471 

T49951 Hs.9029 

AA249096 Hs.32793 

AW247529 Hs.6793 

AI479560 Hs.98613 
NM-016578 Hs^0509 

AW500239 Hs.21187 

AK000382 Hs.274251 
AW290951 

AJ002744 Hs.246315 

U37519 Hs.87539 

AW968613 Hs.79428 

AF155660 Hs.300496 

AB018330 Hs.108708 

A1433633 Hs.164159 
BE277414 Hs.5947 
AI675944 Hs. 188691 
NM_004655 Hs.127337 
AW385224 Hs.35198 
AB014533 Hs33010 
BE086548 Hs.42346 
AB018255 Hs.111138 
AI628272 HsJ8323 
AW957137 Hs.98541 
AW512232 Hsu254835 
AA243464 Hs.294101 



similar to S68401 (cattle) glucose indue 

Homo sapiens. Similar to R1KEN cDNA 5830 

WAA0033 protein 

GL004 protein 

Puridnje cefl protein 4 

ESTs, WeaWy similar to T1 7248 hypotheti 

ESTs 

transcription (actor AP-2 gamma (acfivat 

Homo sapiens cDNA: FU 23241 fis, done C 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 1 Lhrombospondin mRNA 
ESTs 

Homo sapiens, Similar to RiKEN cONA A430 
hypothetical protein FU10902 
diptheria toxin resistance protein requi 
hypothetical protein MGC13102 
phosphrinositide-spedfic phospholipase 
multi-PDZ-<k)mairvcontainin9 protein 
WAA1435 protein 

NK homeobox (Orosophila), family 3, A 

gb:nf76g1U1 NCI_CGAP_Co3 Homo sapiens 

tousled-like kinase 1 

phosphodiesterase 9A 

DKFZP58SE1621 protein 

ESTs 

ankyrin 3, node of Ranvier (ankyrin G) 
gb:zp99b10.s1 Stratagene muscle 937209 H 
Homo sapiens HUT1 1 protein mRNA, partial 
ESTs; homologue of PEM-3 (C3ona savignyi 
ESTs 
ESTs 

KIAA1494 protein 

SFRS protein kinase 2 

Homo sapiens cDNA: FU22930 fis, done K 

ESTs 

ESTs 

ESTs 

Homo sapiens cONA FU14415 Cs. clone HE 
mitochondrial ribosomal protein L42 
ESTs 

Homo sapiens clone 25194 mRNA sequence 
ESTs 

TCF3 (E2A) fusion partner (In childhood 
KIAA1323 protein 
Rakophflin 
K1AA0872 protein 

solute carrier family 22 (extraneuronal 
exostoses (multiple)* 2 
ESTs 
ESTs 

ESTs, Weakly similar to putative p1 50 [H 
ESTs, V\teakJy similar to Homdog of rat Z 
ESTs 

nudix (nucleoside diphosphate finked moi 
WAA0798 gene product 
DKFZP434G032 protein 
ESTs 

platelet-activating factor acetythydrota 

Homo sapiens cDNA FU 12292 fis, done MA 

HBV pX associated protein-8 

Homo sapiens cDNA; FU23Q88 fis, done L 

hypothetical protein FU20375; KIAA1797 

ESTs 

UDP-N-acetyl^ha-D^alactcsamIne:polyp 
aldehyde dehydrogenase 3 family, member 
8CL27adenovirus E1B 19kWnteracting pro 
mitochondrial solute carrier 
caldunifcatniodutnvdeperident protein kin 
ESTs, WeaWy slmflar to AUI1 JHUMAN ALU S 
met transforming oncogene (derived from 
Homo sapiens cONA RJ12033 fe, done HE 
axin 2 (conducfti. axil) 
ectonudeoOde pyrophosphatase/phosphcxS 
WAA0633 protein 

caldraunrvbtndlng protein calsardn-1 

KlAA0712gena product 

ESTs, Weakly similar to ALU 1 .HUMAN ALU S 

nypoineuca protein 

ESTs 

pre-8-ceII leukemia transcription factor 



4.3 

4.3 

4.3 

4.3 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

3.9 

3.9 

3.9 

3.9 

3.9 

3.9 

3.9 

3.9 

&9 

3.9 

3.9 

3.9 

3.9 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

38 

18 

18 

3.8 

3.8 

3.8 

3.8 

3.8 

Z.7 

3.7 

3.7 

37 

3.7 

3.7 

3.7 

3.7 

37 

3.7 

36 

3.6 

3.6 
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456177 ML012391 Hs.79414 prostate epilheGurrhspecinc Els transcr 3.6 

425689 W16480 Hs.24283 E$Ts, Moderately sim3ar to reduced expr 3.6 

432370 AA308334 Hs.274424 N-acetytneurarnirfc acid phosphate synlha 3.6 

_ 422424 A1185431 Hs^96638 prostate difTerenfiafcn factor 16 

5 446157 BE270828 Hs.131740 Homo sapiens cDNA: FU22S62 fis, done H 3.6 

433339 AF019226 Hs.8036 glioblastoma overexpressed 3.6 

421887 AW161450 Hs.109201 CG-86 protein 3.6 

435114 M775483 Hs.288936 mitochondrial rfoosomai protein L9 3.6 

, _ 412520 AA442324 Hs.795 H2A histone farnfly, member 0 16 

10 436476 AA326108 Hs.33829 bHLH protein 0EC2 3.6 

441224 AU076964 Hs.7753 catumenin 3.5 

404922 NMJ)03071:Homo sapiens SWI/SNF related, 3.5 

432278 AL137506 Hs.274256 hypothefca! protein FU23563 3.5 

452747 BE153855 Hs.61460 tg superfamily receptor LNIR 3.5 

15 429686 A1871613 Hs28538 Homo sapiens cONA: FU21086 fis, clone C 3.5 

408001 AA046458 Hs.95296 ESTs 3.5 

428330 L22524 Hs.2256 matrix metallcproteinase 7 (matftysin. 3.5 

418113 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 3.5 

428055 AA420564 Hs.101760 ESTs 3.5 

20 418827 BE327311 Hs.47166 HT021 3.5 

431103 M57399 Hs.44 ptetotrophin (heparin binding growth fac 3.5 

427701 AA411101 Hs.243886 nuclear autoanfigenic sperm protein (his 3.5 

419168 AI336132 Hs.33718 Homo sapiens cDNAFU 12641 fis, clone NT 3.5 

409151 AA306105 SEC22, vesicta trafficking protein ($. c 3.5 

25 439671 AW162840 Hs.6641 kinesln family member 5C 3.4 

452721 AJ269529 Hs^01871 solute carrier family 37 (gtycerd^pho 3.4 

449933 AW157098 Hs.324104 Human DNA sequence from clone RP1-63M2 o 3.4 

432615 AA557191 Hs.55028 ESTs, Weakly simBar to 154374 gene NF2 3.4 

443884 N20617 Hs.194397 leptin receptor 14 

30 403752 NM_002753*:Hcmo sapiens rrctogen-activate 3.4 

427723 A13S5260 Hs^79789 histone deacetylase 3 3.4 

421878 AA299652 Hs.1 11496 Homo sapiens cDNA FU1 1643 fis, clone HE 3.4 

440494 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 3.4 

„ 426716 NM.006379 Hs.171921 sema domain, immunoglobulin domain (tg), 14 

35 433647 AA803367 Hs.222294 ESTs 3.4 

407137 T97307 gfcye53h05.s1 Scares fetal liver spleen 3.4 

432886 BE159028 Hs.279704 chromatin aaessibaity complex 1 3.4 

427715 BE245274 Hs.180428 WAA11B1 protein 13 

437617 A1026701 Hs.5716 K1AA0310 gene product 13 

40 447818 W79940 Hs.21906 Homo sapiens clone 24670 mRNA sequence 13 

414407 AA147025 Hs.76704 ESTs 13 

421537 BE383488 Hs.105547 neural proTrferatlon, differentiation an 13 

435655 AW105663 Hs.6947 HSPC069 protein 13 

427871 AW992405 Hs.59622 Homo sapiens, clone 1MAGE:3507281, mRNA, 13 

45 421662 NA/L014141 Hs.106552 cell recognition molecule Caspr2 13 

407204 R41933 Hs.140237 ESTs, Weakly similar to ALU LHUMAN ALUS 13 

407908 BE379758 Hs.1 10853 uncharactertzed hematopoietic stem/|proge 13 

407846 AA426202 Hs.40403 Cbp/p30tWnleracting transactivator, wit 3.3 

420568 F09247 Hs747735 protocadherin alpha 10 13 

50 436873 N23B74 Hs.50477 RAB27A, member RAS oncogene family 3.3 

411145 BE439553 Hs.250528 Homo sapiens, done IMAGE:4098694, mRNA, 13 

435017 AA336522 Hs.12854 angiotensin II, type I receptor-assoctat 13 

435020 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 3.3 

__ 443991 NM.002250 Hs.1 0082 potassium Intermediate/small conductance 3.3 

55 409960 BE261944 hexoWnase 1 13 

433891 AA613792 gb:no97h0ls1 Na_CGAP_Pr2 Homo sapiens 13 

430389 AL1 17429 Hs.240845 DKFZP434D1 46 protein 13 

432242 AW022715 Hs.162160 ESTs, Weakly similar to ALU4_HUMAN ALU S 13 

419972 AL041465 Hs.182982 gotgin-67 3.3 

60 445707 A1248720 Hs.1 14390 ESTs 3.3 

412628 AI972402 Hs.306051 hypotheBcal protein MGC2648 13 

425263 NM_001197 Hs.155419 BCL24nteractlng killer (apoptosis-lnduc 3.3 

419879 Z17805 Hs.93564 Homer, neuronal immediate early gene. 2 13 

433345 AI681545 Hs.1 52982 rrypothefcai protein FU 131 17 13 

65 417515 L24203 Hs.82237 ataxia4elangledasia group ^associated 3.3 

414922 D00723 glycine cleavage system protein H (amino 3.3 

437898 W81260 Hs.43410 ESTs 12 

426126 AU18747 Hs.26691 ESTs 3.2 

422089 AA523172 Hs.103135 ESTs, Weakly similar to SFR4_HUMAN SPUC 12 

70 450649 NMJD01429 Hs.25272 E1A binding protein p300 3.2 

404210 NM_005938:Homo sapiens myeloW/lyrnphoid 12 

434954 AF161455 Hs.284295 Homo sapiens HSPC337 mRNA. partial cds , 12 

401519 C15000476*:gl|12737279|reqXP_012163.1| 3.2 

„ 419517 AF052107 Hs.90797 Homo sapiens done 23620 mRNA sequence 12 

75 431585 BE242803 Hs.262823 hypothetical protein FU 10326 3.2 

427461 AA531527 Hs.332040 hypothetical protein MGC13010 12 

453288 AW583292 Hs.274412 similar to yeast Upf3, variant A 12 

412513 AA322S99 Hs^163 ESTs, Weakly similar to AF1 51 8401 COB 12 

0 _ 432579 AF043244 Hs.278439 nucleolar protein 3 (apoptosis repressor 12 

80 439593 BE073597 Hs.124863 ESTs 12 

451945 BE504055 Hs^11420 ESTs 12 

434614 AI249502 Hjl29669 ESTs 12 

445525 BE149866 Hs.1 4831 Homo sapiens, Similar to zinc finger pro 12 
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419991 


AJ000098 


Hs.94210 


eyes absent (DrosophHa) homotog 1 


3.2 


408101 


AW968504 


Hs.123073 


CDC2-fdated protein kinase 7 


3.2 


425810 


AI923627 


HsJ1903 


ESTs 


3.2 


443123 


AA094538 


Hs.272808 


putative transcription regulation nudea 


3.2 


447439 


AA313565 


Hs.145020 


ESTs, Weakly similar to WAA12Q5 protein 


' 3.2 


452167 


N75238 


Hs.13075 


Homo sapiens cDNA: FU2301 3 fis, done L 


3.1 


440789 


AB007857 


Hs.7416 


WAA0397 gene product 


3.1 


436087 


BE300296 


Ks.5054 


CGM 33 protein 


3.1 


414222 


AU 35173 




sorbitol dehydrogenase 


3.1 


419749 


X73608 


Hs.93029 


sparc/osteonectin. cwcv and kazaVfike d 


3.1 


418559 


AA225048 


Hs.104207 


ESTs 


3.1 


408784 


AW971350 


Hs.63386 


ESTs 


3.1 


424285 


BE207168 


Hs.144630 


nuclear receptor subfamily 2, group F, m 


3.1 


417193 


A1922189 


Hs.288390 


hypothetical protein FU22795 


3.1 


428738 


NMJJ00380 Hs.192803 


xeroderma pigmentosum, complementation g 


3.1 


436489 


AJ272269 


Hs.121429 


zinc-binding protein Rbcc72B 


3.1 


433006 


BE242758 


Hs.190223 


ESTs, Moderately similar to T29265 hypot 


3.1 


426991 


AK001536 




Homo sapiens cDNA FU10674 fis, done NT 


3.1 


445943 


AW898533 


Hs.181574 


ESTs 


11 


434194 


AF119847 




Homo sapiens PRO1550 mRNA, partial cds 


11 


432426 


AW973152 


Hs.31050 


ESTs 


3.1 


416974 


AF010233 


Hs.80667 


RALBP1 associated Eps domain containing 


3.1 


451131 


A1267586 


Hs.268012 


fatty-acid- Coenzyme A Bgase, long-chain 


3.1 


423453 


AW450737 


Hs.128791 


CGI-09 protein 


3.1 


417215 


BE253181 


Hsi1687 


non-metastatic cells 3, protein expresse 


11 


436278 


BE396290 


Hs£097 


synaptogyrin 2 


3.1 


432908 


AI861896 




ESTs 


3.1 


432952 


AA813887 


Hs.188173 


Homo sapiens cDNA FU1 2187 fis, done MA 


3.1 


438705 


AI049624 


Hs.283390 


ESTs, Weakly similar to 2109260A B cell 


3.1 


427982 


NM_016156 Hs.181326 


WAA1073 protein 


3.1 


431578 


AB037759 


Hs.261587 


GCN2 eiF2aipha kinase 


3.1 


409757 


NM.00189B Hs.123114 


cystatin SN 


3.1 


407136 


T64896 


Hs.287420 


Homo sapiens cONA FU11533 fis. done HE 


3.1 


434293 


NMJXJ4445 Hs.3796 


EphBS 


3.1 


410001 


AB041036 


Hs.57771 


kailikrein 11 


3.1 


434970 


AW272262 


Hs.225767 


ESTs 


3.1 


432800 


BE391046 


Hs.278962 


AIM-1 protein 


3.1 


423392 


AA195037 


Hs.169341 


HTPAP protein 


3.1 


432205 


AI806583 


Hs.125291 


ESTs 


3.1 


448807 


AI571940 


Hs.7549 


ESTs 


3.1 


439979 


AW600291 


Hs.6823 


hypothetical protein RJ 10430 


3.1 


435561 


AA351978 


Hs.4943 


hepatocellular carcinoma associated prot 


3.0 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cONA DKFZp554N0763 (f 


3.0 


410082 


AA081594 


Hs.158311 


Musashi (Drosophila) homotog 1 


3.0 


432621 


A1298501 


Hs.21192 


ESTs, Weakly similar to T46428 hypotneti 


3.0 


429638 


A1916662 


H&211577 


kinectin 1 (kinesin receptor) 


10 


452908 


AB001451 


Hsl0965 


neuronal She adaptor homotog 


10 


426030 


BE243933 


Hs.108642 


zinc finger protein 22 (KOX 15) 


10 


407688 


W25317 


Hs37616 


Human D9 splice variant B mRNA, complete 


10 


434958 


T99949 


Hs.303428 


Homo sapiens cDNA FU14832 fis, done OV 


10 


416610 


AW245993 


Hs.223394 


hypothetical protein MGC2742 


3.0 


451815 


AW974911 


Hs.184793 


Homo sapiens cDNA: FU21880 fis, clone H 


3.0 


433556 
413010 


W56321 


Hs.111460 


cafchim/cfitmodutin-ftenfinripnt nmtein kin 


10 


AA393273 


Hs.75133 


transcription factor 6-fike 1 (mitochond 


3.0 


449459 


BE546846 


Hs.195048 


ESTs 


3.0 


438523 


H66220 


Hs278177 


ESTs 


10 


434263 


N34895 


Hs.44648 


ESTs 


3.0 


446825 


BE266822 


Hs.344097 


filamin A, alpha (acfin-binding protein* 


10 


408661 


AW953853 


Hs.292833 


ESTs, Weakly similar to 138022 hypotneti 


10 


429966 


BE081342 


Hs.283037 


HSPC039 protein 


3.0 


412652 


AI801777 




ESTs 


10 


442409 


BE208843 


Hs, 129544 


hypothetical protein MGC15438 


3.0 


450832 


AW970602 


Hs.105421 


ESTs 


10 


440074 


AA863045 


Hs.10669 


ESTs, Weakly similar to T00050 hypotneti 


3.0 


436032 


AA150797 


Hs.109276 


latexin protein 


10 


419594 


AA013051 


Hs.91417 


topoisomerase (ONA) II binding protein 


10 


421674 


T10707 


Hs.296355 


hypothetical protein FU23138 


2.9 


432302 


AA345857 


Hs.274307 


WAA1442 protein 


2.9 


415172 


AF079529 


Hs.78106 


phosphodiesterase 83 


2.9 


412926 


AJ879076 


Hs.75061 


marmnhano immctnut^Mi alarrfna_iw* C 
rnduupnoyo niynoCuyioi&Q otwi&^tcii w 


2.9 


413142 


M81740 


Hs.75212 


ornithine decarboxylase 1 


19 


437179 


AA393508 




serologically defined colon cancer anOg 


19 


418372 


AA311833 


K&84318 


repficatton protein A1 (70kD) 


19 


439609 


AW971945 


Hs^93236 


ESTs 


2.9 


427398 


AW390020 


Hs.20415 


chromosome 21 open reading frame 11 


19 


447958 


AW796524 


Ks.68644 


Homo sapiens microsomal signal peptidase 


2.9 


409340 


BE174629 


Hi321130 


hypothetical protein MGC2771 


19 


420061 


AW024937 


Hs.29410 


ESTs 


19 


431663 


NMJJ16569 Hs^67182 


TBX^so protein 


19 


417622 


AW298163 


Hs.82318 


WAS protein family, member 3 


19 


420344 


BE463721 


HsJ7101 


putative G protein-coupled receptor 


19 


446791 


AI632278 


Hs.195922 


ESTs 


19 


433313 


W20128 


Hs.296039 


ESTs 


19 
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428465 


AW970976 


Hs.293653 


ESTs 


Z9 


457489 


AI693815 


Hs 127179 


ciyptic qca0 


2.9 


452092 


BE245374 


Hs.27842 


hypothetical protein FU11210 


Z9 


446880 


AI811807 


Hs 108646 


Homo sapiens cONA FU 14934 lis, clone PL 


2.9 


419829 


AI924228 


Hs.1 15185 


ESTs, Moderately similar to PC4259 ferri 


2.9 


437396 


BE140396 


Hs.21621 


hypothetical protein DKFZp7620076 


Z9 


413125 


BE244589 


Hs.75207 


gtyoxalaset 


2.9 


401785 






NM_002275*:Homo sapiens keratin 15 (KRT1 


2.9 


433495 


AW373784 


Hs.71 


alpha-2-glyooprotein 1, zinc 


Z9 


452099 


BE612992 


Hs.27931 


hypothetical protein FL1 10607 similar to 


Z9 


437296 


AA350994 


Hs.20281 


K1AA1700 


Z9 


421254 


AK001724 


Hs.1 02950 


coat protein Qammacop 


2.9 


445109 


AF039916 


Hs.12330 


ectonudeoskJe triphosphate diphosphohyd 


2.9 


423551 


AA327598 


Hs 89633 


ESTs 


2.9 


421154 


AA284333 


Hs.287631 


Homo sapiens cDNA FU 14269 fis, done PL 


Z9 


410193 


AJ 132592 


59757 




Z9 


435854 


AJ3781M 


Hs.4996 


puiduVQ auRji unejiBdt conujunng prow 


2.9 


423396 


AI382555 


H«i 197QW 


brornodornauv-containinQ ^ 


2.8 


442202 


RF979flfi9 


Hs 106534 




2.8 


441345 


AU/ncoc7Q 
AVlUOOO 1 3 


He 77RO. 


Hrimn cranio rx: mRM&< rTlMA fWP7n WiAOTD ffr 

nomo oopisrts rnruxn, curort utvtijkjosaui c [» 


Z8 


444367 


H54892 


Hs 10974 


hvnnthpfirfll nmtpm Fl 199^00 


2.8 


422522 


AID9349& 


Hs.34549 


FCTe Hfnhh/ similar tn ^QA^I 1 rinnn A 
CO 1 S, ruyiuy SUIuldJ \Jj flyWi 1 1 CUXIo ** 


2.8 


443337 


Y07604 


Hs 9235 


rmn-mptusbtfir rj>tk 4 nrntpf n pxtttpssp 


2.8 


416166 


AI754416 




nric49 effector nrntein 3 


2.8 


448734 


BF614070 


Kk 396416 


HorrtA «y»n!pjw mRMA- cDNA DKF2ofifi4Ht916 ff 
nun iv oafJiBiKi uirvrirt, tuiv^ ur\rfcj*i>o*tn is iu \i 


2.8 


413550 


W03011 


Ks.306881 


MSTP043 protein 


2.8 


426170 


BE161065 


Hs 167531 


mpthvleintanovl-CnfinTvmfi A eartoiwvtase 9 


2.8 


444101 


R19175 


Hs.1 69793 


ribosomat protein L32 


Z8 


425320 


U29344 


Hs.83190 


fattv acid ^vnthaso 


Z8 


431631 


AA548906 


Hs.122244 


ESTs 


Z8 


448604 


AW512213 


Hs.342849 


ADP-ribosytation factor-like 5 


Z8 


431416 


AA532718 




ESTs 


Z8 


447881 


. BE620886 




GCN1 (general control of amino-add synt 


Z8 


445309 


AL157474 


Hs.12504 


likely ortholog of mouse Arkadia 


2.8 


453186 


AK001708 


Hs.32271 


hvootheticat orotein FU 10346 


Z8 


439778 


AL109729 


Hs.99364 


putsiivG tf snsfftfinittffln^ protein 


2.8 


425010 


T16837 


Hs.4241 


ESTs 


Z8 


432840 


AK001403 


Hs.279521 




2.8 


450546 


AA010200 


Hs.175551 


ESTs 


2.7 


431674 


AA098901 


Hs.301642 


G*proteJn couptBd rBceptor 


Z7 


422017 


NM_003877 Hs.1 10776 


STAT induced STAT inhibitor-2 


Z7 


443181 


AI039201 


Hs.283316 


ESTs 


2.7 


448913 


AA194422 


Hs.22564 




2.7 


440193 


AW902312 


Hs.7037 


Hrtmo qanipns HonA 9409^ mRWA Rflflitpnr.fi 


2.7 


452941 


AL110347 


Hs.31074 


fJ.ciilf/vi!iifW!am^r>A RiilfinhvrlrnlncA fRidfa 
ii'dUituyiuwjociiuiHS ouiiuiijuiuiaAO ^buiui 


2.7 


451338 


AW612322 


Hs.19131 


trgngrrinftfin faftftr Hn_9 fF9F rflmorfrnl 

UoJKkAI^ILhJII IdUUil Up-fc ^Ctr UUHcllitH 


2.7 


411190 


AA306342 


Hs.69171 


nmfpfn Irfnacp HJikn 9 


2.7 


434629 


AA789081 


Hs.4029 


flftnmacvnnnfipd *5«niencfl-41 


Z7 


407192 


AAS09200 






2.7 


434747 


AA837085 




ESTs 


Z7 


428171 


AA489323 


Hs.182825 


lUAJoUJikM y)l IKCU 1 UvKJ 


2.7 


426657 


NM.015865 Hs.171731 


solute canter family 1 4 (urea transport 


Z7 


447147 


AA910353 


Hs.75432 


ESTs Weaklv similar to T23482 hvoothefi 


Z7 


452955 


AW390282 


Hs.31130 




2.7 


407748 


AL079409 


Hs.38176 


KtAAQ606 orotein' SCN Orcadian OsdHat 


2.7 


445596 


R89543 


Hs.12942 


veside trafficking protein 


Z7 


452268 


NM.003512 Hs.28777 


H9A htetonft farniTv mp.mhpr L 


2.7 


446494 


AA463276 


Hs.288906 


WW Domain -Containlno Germ 


Z7 


438157 


AW137011 


Hs.49576 


ESTs 


Z7 


452260 


AM53208 


K&330994 


RARQ mpmhpr RAS nnMnpflA fatntlv 

1 vUJ J| IllOillUtJI IWw IMwUyCllD lOIIUIJ 


2.7 


409646 


AW451449 


Hs.57749 


ESTs 


Z7 


401866 








Z7 


400301 


X03635 


Hs.1657 




2.7 


453390 


AA862496 


Hs.28482 


ESTs 


Z7 


426514 


BE616633 


Hs.170195 


bone morpnoQenetic protein 7 (osteoQenlc 


2.7 


425294 


AF033827 


Hs.155553 


HNK-1 sulfotransferase 


2.7 


405387 






NM..Q22170*iHomo sapiens WilGams-Beuren 


Z7 


441266 


H15968 


Hs.293845 


Homosanlens done 1MAG&3502329 mRNA. 


Z7 


432388 


X15218 


Hs.2969 


v^sld owfln S3rcorn3 vi rsl oncoQ6fl0 hornol 


Z7 


450937 


R49131 


Hs.26267 


AT P"^8 pondsnl tntfirferon rsspOflSQ protsi 


Z7 


447210 


AF035269 


Hs.17752 


phospnaOdytserlne-spedfic phosphoTipas 


Z7 


445098 


AUQ50272 


Hs.12305 


OKFZP566B183 protein 


Z7 


451404 


AA460775 


Hs.6295 


ESTs. Weakry similar to T1 7248 hypothefi 


Z7 


409650 


T08490 


Hs.288969 


HSCARG protein 


Z7 


452707 


AJ093823 


Hs.45070 


ESTs 


Z7 


451134 


AA318315 


Hs.25939 


hypothetical protein FU22195 


Z7 


452826 


BE245286 


Hjl301636 


peroxisomal biogenesis factor 6 


Z7 


420818 


AW9S9635 


HsJ3032 


ESTs, WeaWy similar to 1207289A reverse 


Z7 


43S510 


ALQ8Q220 


Hs.6285 


DKF2P586P0123 protein 


Z7 


406527 


T64904 


Hs.163780 


ESTs 


Z7 


442315 


AA173992 


Hs.7956 


ESTs. Moderately similar to ZN91_HUMAN Z 


Z7 


421717 


AF230924 


Hs.107187 


divalent cation tolerant protein CUTA 


Z6 


425910 


AA830797 


Hs.184760 


CCAAT-box-birafing transcription factor 


Z6 
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440695 


AW088363 


Hs.246240 


ESTs 


i6 


416941 


BE000150 


Hs.48778 


niban protein 


16 


422717 


AI557623 


Hs.1 19475 


cold Inducible RNA-bindlng proteh 


IS 


426110 


NM.002913 Ks.166563 


replication factor C (activator 1)1(14 


16 


422027 




He ViniQ 


fatty acid amide hydrolase 


2.6 


401197 






ENSP00000229263*:KSPC21 3. 


16 


442607 






Homo sapiens cDNA: FU 22696 fis, clone H 


16 


410297 


AA14R7M 




lumican 


16 


447963 


AM MOM 


mo. i D33 vv 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


16 


433101 


HVWOlCJl f 


u c 1 moo 


Homo sapiens mRNA; cONA DKFZp566L203 (fr 


16 


437546 




ns. i / »W 


T-boxl 


16 


443725 




He Q7fk1 


growth arrest and DNA-damage-inductbte, 


16 


436213 




He 71470 
ns./ lite. 


hypothetical protein FU10774; WAA1709 


16 


408157 


AAU4/0O3 


nS.o/iWb 


ESTs 


16 


420805 


L10333 


He 00047 


relicuton 1 


16 


454144 


acnoruia 

DC£OV*t/0 


ns. I0403D 


hypothetical protein MGC3243 


16 


426761 


A|nic7nQ 

AIUI3/U? 




Homo sapiens mRNA; cONA DKFZp586l2022 (f 


16 


426695 


ArKCfUT 
MIJ339H/ 


He 1RQQQQ 


nurineraie rw»o!fif fiamDv A oroun 51 


16 


431725 


X65724 


nii.£0O3 


Nome disease (pseudoglioma) 


16 


425174 


UO/40U 


ITS. IM3/0 


KIAA0261 protein 


16 


410467 


APin^R 

rXr lUZOnu 


He RIO/il 

ns.oo3J 1 


dachshund (Drosoph3a) homolog 


16 


413435 


X51405 


He 7R3RA 
rK>. /330U 


carboxypeptidase E 


16 


428293 


RP5«?nQ44 


He IRARRR 


sotute carrier family 1 (neutral amino a 


16 


426180 


AH0Q7C7 


nS.lozo/4 


guanine nucleotide binding protein (G or 


16 


439680 


AW9ATM1 


ns. 30*101 


ESTs, Weakly similar to A35659 krueppel- 


16 


424395 


AAlftfAR? 

MMIOOvOa 


He 1 AR3RR 
rib. I1DOOO 


rnicrotiJ but e- associated protein 7 


16 


406670 


tA/7Qfi^0 
W/SOJt 


ns.«wui 


hypothetical protein MGC13170 


16 


413762 


AIAM4 4/170 

AW41 i4/y 


Us a AO 

HS.o4o 


FK506-btnding protein 4 (59kO) 


16 


429922 




nS.zzol 1 / 


H1 his tone family, member 0 


16 


449518 


QCJQCOC1 
OCJS343d 


K5.0U00I 


ESTs 


16 


445919 


T53519 


Uc 11 /COO 


hypothetical protein MGC14141 


16 


429343 


AiyUUUj 03 


Uc 1QQ4RH 


Homo sapienSi Similar to epsin 3, clone 


16 


438552 


A i^A^flon 

/\JZ«*30ZU 


Uc tftiA 


type I transmembrane receptor (seizure-r 


2.6 


431472 




He W^Q 


nucleotide binding protein 2 {E.coii Min 


16 


409299 


A Aruccsn 
AAIWDOOU 


ns.oolib 


smaD nuclear riconucteoprotein D2 polyp 


16 


434672 


AW294020 


Hs,1 17721 


ESTs 


16 


443015 


R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


16 


452576 


AB023177 


Hs.29900 


K1AA0960 protein 


16 


412843 


AF007555 


Hs.74624 


protein tyrosine phosphatase, receptor t 


16 


430375 


AW371048 


Hs.93758 


H4 his tone family, member H 


16 


408196 


AL034548 


Hs.43627 


SRY (sex determining region Yybox 22 


16 


424339 


BE257148 




endogiycan 


16 


41 6292 


AA179233 


Hs.42390 


nasopharyngeal carcinoma susceptibility - 


16 


451040 


AA324743 


Hs.40808 


ESTs 


16 


445636 


AW105401 




rihnenmal nmfpin 1 9Q 

IlLAJOUllUU piUlUlil LX3 


16 


419175 


AW270037 




KIAA0779 protein 


16 


422000 


M30599 


Hs.1 10637 


hnmpn hfi* A1H 


16 


441128 


AA570256 




ESTs, Weakly similar to T23273 hypo the ti 


16 


446237 


AW270515 


Hs.149596 


Homo sapiens, Similar to RIKEN cDNA 2310 


16 


431797 


BE169641 


Hs.270134 


hypothetical protein FLJ20280 


2.5 


406789 


AI041403 




rihne/imgl nrntnln 1 9Q 


15 


423217 


NM.000094 Hs.1640 


collagen, type VII, alpha 1 (epidermolys 


2.5 


423115 


AA421973 


Hs.169119 


ESTs Weakfv similar to T25731 hvootheti 


15 


420460 


AA262331 


Hs.48376 


Homo sarifans cionft HB-9 mRNA ^ftntiAncft 


2.5 


436165 


AI373544 


Hs.331328 


intermediate filament protein syncoiiin 


15 


430542 


AI557466 


Hs.119122 


ribosomat protein L13a 


15 


452827 


AI571835 


HS.5546B 


ESTs 


15 


431609 


AW792792 


Hs.264330 


N*acytsphin^osine arrdctohydrolase (acid c 


15 


431108 


AA991508 


Hs.105317 


ESTs 


15 


410310 


J02931 


Hs.62192 


coagulation factor III (thromboplastin. 


15 


430333 


S70114 


Hs.239489 


T1A1 cytotoxic granule-associated RNA-W 


15 


439864 


AI720078 


Hs.291997 


ESTs, Weakly similar to A47582 B-cetl gr 


15 


438821 


AA826425 


Hs.192375 


ESTs 


15 


421091 


W22821 




ribosomal protein L26 


15 


439414 


NM.001183 Hs.6551 


ATPasa. H trsnsoorflno h/sosomal fvacuo 


15 


400263 






Eos Control 


15 


451428 


AW0B3384 


Hs.11067 


ESTs, Highly similar to T46395 hypotheti 


15 


449051 


AW961400 


Hs.333526 


HPR9 fPCPntnr h/rminp Idnasn fo-prtvh? 
i ii-i v& icvC|jim ijtudiiic iuiioow ^v^ai^u*, 


15 


431615 


AW295859 


Hs.235860 


ESTs 


15 


433037 


NM.014158 H*279938 




15 


409504 


AA304961 


Hs.699 


pepSdytproiy) isomerase B (cydophHin 


15 


409330 


AK001231 


Hs.53940 


hypothetical protein FU 10369 


15 


43S299 


AKD00767 


Hsi111 


hypothetical protein FU 20729 


15 


450628 


AW382884 


Hs.204715 


ESTs 


15 


426793 


X89887 


Hs.172350 


WR {historic cell cyde regulation defec 


15 


431122 


A1267593 


Hs.250535 


Homo sapiens mRNA; cONA OKFZp434N2412 (f 


15 


407236 


W79485 


Hs.173980 


nuclear matrix protein NMP200 related to 


15 


421109 


L32832 


Hs.101842 


AT-binding transcription (actor 1 


15 


408770 


AW270608 


Hs.170195 


bone morphogeneflc protein 7 (osteogenic 


15 


416737 


AF154335 


Hs.79691 


UM domain protein 


15 


414669 


AA157291 


Hs.21479 


ubtnudetn 1 


15 


431222 


X56777 


Hs.273790 


zona pellucid a glycoprotein 3A (sperm re 


2.5 


403532 






NM_024S38:Homo sapiens rrypolheGcal prot 


15 



757 
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418849 


AI095485 


Hs.169341 


ESTs, Moderately similar to S65657 alpha 


2.5 


443804 


AL 135352 


Hs.255883 


ESTs. Weakly similar to 138022 hypothec 


25 


407887 


AA579668 


Hs.41072 


serine (or cysteine) proteinase inhibilo 


25 


414556 


AW975083 


Hs.343443 


ribosomal protein L36 


2.5 


446911 


N27605 


Ks.16492 


DKF2P564G2022 protein 


25 


435126 


A1333666 


Ks.42315 


plO-blrtding protein 


25 


421866 


M24470 


Hs.1435 


guanostne monophosphate reductase 


25 


426788 


U66615 


Hs.172280 


SWi/SNF related, matrix associated, acti 


25 


447050 


NM 016314 


Hs.17200 


STAM-Gke protein containing SH3 and IT A 


25 


408461 


AB037756 


Hs.45207 


hypothetical protein K1AA1335 


25 


421594 


R45689 


Hs.21889 


Homo sapiens cDNA FU 12978 fis, clone NT 


25 


443837 


AI9B462S 


Hs.9884 


spindle ode body protein 


25 


421867 


AA481078 


Hs.109045 


hypothetical protein FU 10498 


25 


435021 


AA922192 


Hs.73962 


ESTs 


25 


435750 


AB029012 


Hs.4990 


WAA1089 protein 


25 


435025 


T08990 


Hs.4742 


anchor attachment protein 1 (Gaalp, yeas 


25 


407255 


AA012992 


Hs.256301 


hypothetical protein MGC13170 


25 


425976 


C75094 


Hs.334514 


NG22 protein 


25 


449458 


AI805078 


Hs.208261 


ESTs 


2.5 


428013 


AF151020 


Hs.181444 


hypothetical protein 


25 


424369 


R87622 


Hs.26714 


K1AA1831 protein 


25 


431604 


AF175265 


Hs.264190 


vacuolar protein sorting 35 (yeast homo) 


25 


452779 


AA418775 


Hs.47234 


ESTs 


25 


433586 


T85301 


Hs.194397 


gb:yd78d06.5l Soares fetsS liver spleen 


2.5 


438682 


AA354489 


Hs.222103 


EBP50-PDZ interactor of 64 kD 


2.5 


427515 - 


T79526 


Hs.179516 


integral type 1 protein 


25 


418700 


AI963808 


Hs,86970 


ESTs. Moderately simitar to ALU5.HUMAN A 


25 



TABLE 67B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number 

415989 10194J 

429220 15103J 
412446 i 63467J 

433404 739*_1 

449625 249224.1 

453160 6028_5 

420218 191547J 

437124 59408J 



432586 6633 1 
438869 52134J 
450377 12109.1 



436063 
428342 



5483J 
6712.1 



424036 6226.1 



450203 
451752 
432363 



19009.1 
10408.5 
1234917J 



Accession 

BC013389 BC017398 AI023543 AA191424 AI267700 AI469633 AW958465 AW9S3397 AA172056 BE940298 BF909208 BF909980 BF095153 
BG285837 A1720344 BF541 715 AA355086 AA1 72235 
AW341473 AA4481 95 AW207206 Al 951 341 AA969259 

BC021735 AI669212 AL120184 AI769949 BE701002 BE184363 BE819031 BG702238 BF090049 BF963318 BF961912 BF943013 AA934514 
AA151245 BF960659 AA987907 Z41449 BF9080S9 BF908053 BF908049 BE699424 BF908060 BF982832 BF952020 BF963134 BI035538 
BF908052 BF908057 BF090026 BF943158 AI632924BF512340 BF952021 BF960776 BF943437 BF942847 A1768015 F09778 F04816 F02721 
AA1 02645 Al 633838 AA61 7929 BF947001 B1035448 BE935876 AW890837 AW898604 BF957405 BF963433 BG704815 
BC022980 BF224081 BG149908 AW672842 BE670687 AI702161 AW341832 BE222503 N71836 AI026061 AW953116 AW083132 AI979261 
AV725377 AI423298 AI640707 AW675518 AI03261 1 AI818044 AI299508 AI91 1386 AI27041 8 BE219257 BM141826 AA826491 Z251 59 AA587421 
N59447 Z39436 T32982 R54110BF11 5783 F09044 BF808433 

BI918168 AW779760 N48674 AI375997 AA235370 BG699146 AI913631 A1498402 AI016320 AA323193 R49021 D59344 BG986750 N45526 
BG986917T61382 R49391 R45432AI203107R35004F07491 R25094 R35360 

BC009612 NMJKB526 BI597616 AV761592 AV760377 AL601008 BI604131 BE645918 BG187760 BG181525 BG210634 BG192999 A1263307 
AA344186 AW952966 AA033609 AA037562 AA7221 83 R79452 H70775 BF674991 BE769437 BG007856 AA037483 AW572535 AI143991 
AA084581 AA033610 AV742510 AV735788 R08336 

AW958037 R42S57 AI337047 AA948360 A1638005 AA459950 AI62491 5 A1638047 Ai467856 AI521 826 AA860305 AI93231 5 AW003092 AW271756 

AW779380 AA609879 AI634791 AM93770 AI56521 1 Z41 145 AI627952 AA303734 BE349457 AW196765 AA256527 BE0B97Z7 

AL050013 B6939500 AW969191 AA769925 AI377973 A1625545 AA81 1365 AA521 1 14 N24705 A1379579 AA424899 AI684671 AAB29715 AI453010 

N35401 AA677452 AA504340 AI209149 AA883574 AI379062 A10B4455 AI280147 AA644327 BF432508 N27873 N47364 N34880 All 47024 T86860 

AI219716 AA960926 H25544 BI857123 AW960489 AA599099 AA765246 AA328537 BG434703 AA890373 AA424765 AI292318 AA829886 N95742 

AI218758 H25588 N36282 AA024987 N36687 BJ919187 N49471 AA889970 AW166152 AA468546 AI262504 AI452782 AA554458 AA807080 

AA724542AW102730AA909978AW118134AW827241 W56431 BM127381 BF436987AI0165Q9AW663972BM127686C15552 N53435 N51744 

T98800 N56960 BG108636 N49381 R49886 D61278 B1756612 AA508234 R49885 BF850422 

BC022881 AU1 50944 BG750783 AW754175 AW857737 AI91 1 659 AJ050036 AA554053 A1826259 AA568548 

AF075009 R63109 R63068 

AB033091 AL520743BE811813N53332N99716AL561910AA280655BE710392AW05100 AI873695 
BG700891 BI553517 R80518 BG779771 BG534451 AA479402 AW961580 BF061430 A1857643 A1768486 AW512118 AA479302 AW770384 
AW072470 AI041595 AI049699 AW592865 AA976261 BE879747 BG114119 AW1 83811 AA909938 BF571621 BF350794 BF351375 BE925699 
AL050294 BC010371 BF982270 AL042656 BF095732 AW812618 BF095731 BG212397 BF678765 BI038602 BG388664 AW675337 BG289398 
BE939598 AW805570 AA527097 AW1 50540 AI693720 AA743364 AI915793 N481 85 A1573107 AA043474 AI351 61 5 AI969490 AI910763 R50866 
A1699181 N73808 K08164 

AK000028 AA494483 AI298674 AA720773 AV761 529 AI884670 AI936202 AW294235 D61652 BF881 184 AV71 1384 N27154 AI926970 AW34970 
N40094 N28596 AA884747 AA512890 BG436593 

AK056315 AI01 5524 AA724079 B171 3619 A1377728 AW293882 AI928140 AI092404 AI085630 AA731340 BM469829 AW968804 AA425658 
AA769094 BF446026 AW1 1 8719 AI332765 AW500888 AW576556 AI859571 AW499664 AW614573 AW529495 AW505314 W74704 AI356361 
AI923640 AW070509 Al 521 500 AUM2095 AA609309 AA761 31 9 AI381489 H45700 AA761333 AW265424 AA909524 AA63531 1 AA649040 
AI392620 Z407Q8 AI985564 AW263513 AA913892 AI693486 AW263502 A1806164 AW291 137 BI061872 BI059498 AA1 34476 AW084888 AA036967 
AW370823 T55263 BI0O2756 AA48S664 BF827261 W74741 BF9S3166 

NM.033445 BC001 193 A1885781 BF794032 AA476620 AA810906 AA810905 AI291 244 AI885097 AI359708 AI335629 H97396 AI344589 AA300377 
AA457566 AW771833 BE465621 AI364068 AI364452 A16485C5 AI918342 AI928670 AAB86580 AL531029 AA886344 AI18641 9 8G329096 
BM045465 AL531028 BG437151 BE868021 AA179427 

AK055952 BG182168 BG220105 BG191569 BG188964 BG187388 BG220104 BG183714 BE645998 AI819354 AW974068 Al 393635 AI560846 
AIQ24796 AWQ20098 BW91127 AB93644 N74993 AW472959 BM478854 B1597437 H12165 B1458612 BE543192 
AB032997 A1141678AW978722BE467119AI761408BF727385AW237035 A1934521 BF436248AI479668Z40632AA832081 AW2S5901 
BF057835 BE465977 AI621 269 BE465983 BF756369 N74056 A1817896 AA716567 AA934774 H62600 H09497 BF943762 BE395335 BE883333 
AW97Q240 AA534489 AW970323 
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AK057805 AW162343 A1190479 AI093318 BE048820 Al 1 98397 AA654667 BE219303 Z39851 F02655 Z28734T16575 F10145 Z45266 AW572911 
AW964436 AW004030 A1532565 BE502530 BI792383 BF056928 AA449241 AI651825 AA805324 AI264863 AW19S918 AA948267 A1953735 
AJ263703 BF056387 AW594171 AI867447 AA319159 AI903440 AW9561 10 AI356013 A1867923 BG911908 D81 142 C1561 6 AL538697 Z25032 
Z43784 R13382 AV746924 AA449389 AA318815 BF384265 H17038 H 10064 FD4161 T87230 R40898 AW204071 BI819428 AA683393 AA683376 
243192 T74078 T05103 F12527 T77951 H101 18 H17037 BF855407 R19503 
AA196390 AA507837 M195468 

AK056626 A180089S BF939022 BE544718 AI954754 BE218177 BE348567 AI9S2406 AW293122 A1968798 A1457321 BE327228 BG913531 
AW939055 T30280 R54166 Z43366 BG819153 BF003119 BE646274 BF940881 R18246 R42185 
AK027321 W63676W63789BBW6412BF114614BE646183BM126230AW044233A195^ 

BM126251 AA789002 AA581966 AA809643 AW188870 BE705664 BE70S539 BE1 53177 BF084925 AL133779 AW981788 AA659693 AA347970 

AA295134 AA526037 AA449282 BG190454 H61476 T91398 N20018 T901 14 H75644 BE710736 BF687723 H28581 AA249370 BF726698 

BE841554 B1045099 T84525 AW1 29678 BG770826 

AW973834 AJ791 932 AI791855 AI732640 AA558833 AA559897 A1821 61 0 

AI973051 A1400921 AT796154 AW241817 AW290951 

NM.004892 AF047442 BE275338 BF724863 BI917206 BE276993 AL602308 AA306105 BM152505 BC001364 AW993471 AW993481 AW993283 
AW992919 AW992921 AW992980 AW992861 AW993220 B6573124 AA456385 AA234796 AI902726 AA354813 B1092644 BG778400 BI2600O1 
BG007325 A1267455 AA42&74 Al 160782 A14721B5 AA255500 AA434006 BG435520 AI3561 1 1 H00525AV749060BG944497 BG292031 
AA9021 53 C04925 AA9021B0 AA383100 AW073533 AA256706 AA150809 R65766 AW958448 BE090972 BF693195 AV738979 R65855 R801 36 
AA484677 

BE644758 Ai 082238 BF940027 A1201 079 AI436035 AW275956 A1085394 AI291655 AW070441 AM74134 AI268978 AI769279 AI567682 AA693941 
BF477668 AW664149 AA283782 BF509538 AW296868 AI268977 AI168133 BM352065 AI262769 BF941976 AI056920 AA481861 BF763697 
AU65B88BM352383 M427768AA385346AI186988AA931831M134972BF217480B 

BF384291 AL515057 AV702312 AA377395 AL544217 AI341000 AW193583 AI350789 AA888338 BF945380 AW879092 AA1 30839 T91 066 N92326 
AI004389 AA078832 AL572370 W04522 BE314003 AW960808 BM360872 AA3191 60 AA1 30778 AL514257 
AW182329 AA61 3792 T05304 AW858385 

BG107484 AA632009 AI432670 AI65666Q AI650884 AI521919 AI264653 AW150793 AW61 1894 AI917098 BI091245 AI651454 BF434B89 AI580286 
AI880735 BE301995 AI392959 AW613965 BM023628 AW515374 AI460102 BM023318 BE328188 AI952820 A1581 363 AA5S7165 AI695677 
AL562079 AI70O926 AW70561 BF063058 AW1 96387 AU1 32984 81064046 AI9701 57 R021 22 H 55924 AI521721 AA808206AA7 25223 AI766003 
AW339821 AA805951 AI287969 AW664827 

BC021085 AL527872 AL528296 AL557087 B1255090 AU 143499 AL560356 BG823170 BE736988 AU141388 AL580262 Bl 7641 73 AU 1 20299 
AU141755 AU141988 AU142941 AU141330 AU141784 BI770885 BI91 1394 BE901426 B1918039 BG760842 BE883026 BI254740 BE732690 
BG430761 BE792868 AW328267 AL135173 AA102674 BF726986 AL564735 BE155962 BE155979 BF741679 H67776 H59234 H89665 AW117774 
AW274435 L29008 NM 003104 U07361 BF002824 BI222949 AI458045 AW951537 BF196474 AI669049 AI042523 BE453928 AU1 60125 AU160580 
AW069877 AW015214 AI948718 BE219706 A1953605 BE217755 AI991382 AI245005 BF431 179 AI521843 BE048908 AI420449 AW275385 
AA336950 BE501521 BF740566 AA311404 AA384639 BI772535 BG473076 BE891298 BE246928 BM012986 BE242693 BE901342 BG746358 
BF374053 AL564430 AU143835 A1635707 BF1 95492 AI280559 BF741685 AA385257 BE247655 W94974 BE163702 A1025167 AI8271 1 8 N78641 
AL581093 AU158964AU158917 AI282516 AU146399AA713947 AI285028 AA101228AI338522 AA832316AI284986 AAB57926 BF372568 
AA570172 AI753825 AA171566 AU159257 BM194320 W93390 AW132101 AA550898 B1259678 AA522554 M55172 AW013929 AI826274 AI871237 
C75260 AA934846 AA555036 AA526579 AA526466 N80270 AI538347 AW615805 AU158875 AU158883 AU159396 AI016667 AU1 59238 AI28251 7 
AA406317 AI285043 N53050 AA969446 N57718 H93323 W93374 A1873751 H5601 1 AI936174 AA937830 AW438877 Al 800550 AW328268 
A1244886 AA948424 AA700604 AA832317 AA01 1 555 H97671 AA98871 1 AW 150529 N70983 H44325 AV657614 AA902238 AA64401 8 AA034050 
T71053 N71444 AA831158 T40892 AA706106 

AK001536 AK056135 BM474813 BE887303 AK022914 AW581996 AW812945 BE882302 AA134266 B1043873 AA019433 BI862088 BM468657 
AU128438 BE384458 AL353967 B1857117 BF686525 BI485223 BM460132 AU129877 B1222283 BG171592 BI043544 BG496295 BG750710 
BI256542 BG108520 AU150719 AW510354 AI554256 AL353968 AA191092 BF132635 
AF119847 AA437261 AA436987 A1132965 
AF150424 A1881896 AA570057 AV738855 

AI801777 BE677762 AW008210 AW009441 BE350994 BE207949 AI091475 AJ802774 A1827533 AI075363 AI659979 AA687855 AI0781 25 A1090285 
AA670058 AA602411 M683472 AI43605B AA612826 AI038932 BG057726 AI167355 BF449023 AI289476 AW074381 BF97291 2 AI991780 
AA889119 AI537472 239730 AI868953AI1 92337 BE812978 BE812939 AA1 15248 H99006 AI915784 F08973T16748 D20468 AA609899 BF081234 
AA115336 BE812876BE812972 BE812982 BE813006 BE813019 R43883 BE812981 

AK055109 BC019085 AA187684 BG656226 BM023227 AI93231 1 AW264381 AA398371 BM021483 AI432433 AI375777 A1 1 29580 AW262782 
AA134107 BM023515 AA977504 AI859222 AI348454 R69725 AA975268 8M021207 AL080074 AA129218 AW207842 N90581 AA771919 A1092259 
A1028416 AI074114 BG656536 BE501677 AW193419 AA917040 W90430 AI342984 AI37B957 A1038486 AW020068 BI491093 BF476021 R41226 
R69631 F04125 C02343 AA115589 R56480 Al 400988 R54266 R31422 

AK055915 BE867252 AI523348 AA765350 BF446858 243675 R19529 AL133837 BG389444 AW382942 BE702956 AA081961 BE835247 BE835308 
BE835295 AA376302 BE645790 AA375690 AA376592 AW954423 N85732 AA249770 AI754416 AA213816 BF592044 AA811729 AW514842 
A1633486 AI096810 AW183016 A1635738 N27524 BE645916 A1247842 AW991 230 AI754277 H1 6814 AI765892 BF1(^422 A1800361 T95129 
F1 1313 R97946 R93989 AA375242 BF109388 AI860939 AI680060 AW953899 BF971486 AI972337 AW953841 AA372437 AI216746 H11384 
R38484AA249043AA249732 

AW9691 51 AA532718 AA504784 AI791 194 AI821930 AI821485 F371 27 AA654208 F27974 

AK074291 AW293424 BE6761 35 AI8321 25 BE019146 BE465019 AT761 124 AA617778 AJ279232 AW575897 AI672039 F28618 BF924261 
AA722184 BF934174 BE004328 AV749301 BE880282 B1019798 BI019389 BF928776 AW813409 AV726604 AA077560 BE272975 BF949119 
AW814195 BE879126 AI697926 BF594155 BE2057B7 BF063513 N35828 AI948557 AI433839 AI379679 BG056182 A1589094 N23123 AA588805 
AW316581 AI080272 A1421980 AI493318 BF1 94830 NB7590 AA495993 N32996 AA699844 H96845 H96592 N28741 BI035539 BF747723 
BF171 066 W01350 H05495 AI243785 239622 AA887432 A1350659 R46102 
AA602964AA609200 

AW9 76 5 3 7 Al 033582 AA837085 AA745261 AA648395 

BC013939 BW9469Q BI491211 AI928393 AA843540 BG938644 AU185628 BI495842 AW173255 BM052709 AI743999 AI6901 44 AI922209 AI740907 
AW340368 AA928759 AW 1 18737 BF513970 AA707807 BF435295A1339463 AI373842AI433809 BE222392 AA602308 AA428261 A1460355 
AW662760AI888087AI342098AA722418 W78151 N64382BE221B48AW025901 AA452120 AI1 50479 AI016166AA779515 AA661791 BM474307 
BI911169BG575154AW953303 T33604D59141 AA385785AA148648BM461961 BG681168BI602483BE889592AW954311 BM052986 BI962893 
AI989299 242328 BRJ29504 T35668 BG402602AU 185770 A1023271 AA147719 AI434079 AI569000AJ276488AA992453 AA342821 AA648303 
AI349384AI051008AA926941 AA350894 AW071451 N22249 AJ784138 AA083847 N22258 AW440825 AA661 570 AA376687 AA659125 A1356299 
R70463 AJ383586 AA827189 BI494872 AW021094 BI494871 AA905500 AA460923 BI492041 AW028965 A162461 1 BG271 780 AM97723 H88862 
DS9858 N89979 AA658425 N81 154 062341 AI274437 N66697 H96993 AI370663 AA728850 H05232 R59374 H12223 AI935759 AI362553 D60006 
N29572 AI91 6833 N75273 AA148710 B1597117 BG740471 AA332671 AA333874 AA643052 AW020175 R70550 BG623469 AA452342 AW965441 
W19723R32966BE883841 R61 003 AJ91 0374 AI8652S2 R55325 A1468927 R34681 H96211 Z39807 BF954385 

NMJK5720 AF219137 AL534420 AL524055 AL537346 AL53B442 BG765888 AL530054 AL525377 BG474596 BG473144 BE251553 BG706099 
AL538039BG703131 BE255806BF805256F12128AL568773 81828686 BF7614ffl) AI204971 BG818818 BI199246 AL534816 BF529941 AA324163 
AL523285 BG914330 H07952 AL534815 BE769903 A1867802 BM310135 AL533702 BE254484 BF528852 BE867462 BE740130 AL134164 
AL5671 15 AL533701 AL524054 AL515904 AL523284 AL568203 AL53441 9 BF981 162 BE2571 48 AL561833 
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445636 8561 5 BF339388 AI345516 BG391657 BE708967 BG026034 BE261703 H56716 H65572 H93801 T48830 R96953 RSS989 R39707 B E 867593 AA0903 10 

AA090672 

419175 35068 1 AB018322BC012480B1524873AW665554AJ934469AI479916BF096179BF096162BF0961^ 

AA886998 AI570585 AI916688 A167881 1 AI693109 AI308135 AA669046 AA961064 AI018062 H80618 BE221942 R52609 AI915164 AA365626 
244671 81052776 BF882486 BG2B6184 AI589558 AA931663 AA534979 AI275392 AI273455 R52553 AA829920 H80652 AA360728 F10818 
AW953666 AW176773 H85527 AA765570 AA081927 BF093262 BG743753 AL037576 AA534314 BE814964 BE973713 N49493 BE006634 
BE005630 AW270037 AA234765 AI334004 BF057179 A1857450 AI34 1 191 AI434143 AI917449 AW517207 AA255424 AW008334 AAB47572 
AA99421 1 AAB61901 AA581873 AI580157 AI364363 AW242357 AW235291 N55645 AA319869 R3691 1 AA256551 AW044188 AI203159 N49403 
F02090 AI187299 A1609644 Z40516 AW952314 

441128 20932.1 8C014072BE328850Ai356567AI148171 AJ0221 65 BG1 49661 BF000671 AA233101 AA573721 AA447991 AW016855 AiOD5058 AA554071 
BF478215 AA9069O2 AW014761 BE905651 BE512923 BM047129 AA243852 AA232991 AA1 27550 AA1 27551 AA570256 A1473237 BF033706 
N9Q525 AW973623 AI359627 BG674574 BE903322 

406789 0_0 AIO41403 249148 

421091 24941 2 AK057700 BC015899 BE867108 AL526926 AL527436 BG913023 AJ884867 BE858461 AI885227 AI935218 BE645S96 A1922406 M778161 
BF345973 AW195853 AI687121 A1336147 AI091 364 A1769203 AW627859 BE677432 BF439954 BG820230 AA283686 AA812396 AI358396 
AA932022 BE222881 AW016109 BM352667 BF528099 BG818616 AI935916 AA912295 BG910887 AL568301 AL567278 8)522445 BI754384 
BG81 9375 BF835950 AA323718 AAB60955 AI089847 AL520776 AL526045 AL538507 A1567798 AL536876 AL574332 BF834531 BF3401 16 
BF835954 AL573997 AL581252 BF957667 AL580170 AW952324 AL569382 AW148695 AL559234 BG034270 BJ913839 AL581327 AL565842 
H43568 BF945188 AL559686 AL539326 T1S481 AW895092 AL582684 B1519896 

400263 18977 J Z11692 X51466 NM_O01961 M19997 BI224253 BG830478 AU122147 AU123437 BG1 13591 BG752624 BE8B6804 BI868669 BG337216 AW629935 
BM016525 AI560409 AL562866 AI909178 BF849556 AA371735 8F038841 BF727115 BC006547 BG757526 AL555664 BI261304 8G770095 
BI033488 BI517580 BG876486 B1011828 AI313235 BG831724 BF869852 BG998348 BI011834 BF888337 BF898627 BF092380 AW803215 F01241 
BF805719 BG876467 AW498536 BF988866 BG998849 AA248724 BG829202 BG756456 BG032392 BIB59287 BM016990 BG332359 BE933685 
BE166758 BM452445 A1937808 AW026128 N23684 AW006O41 AI337621 F331 1 1 BF344301 BG105450 BG387343 BF569547 BF154671 
BM007368 BF569385 BE772007 BI199487 BF761700 BI261519 BF944452 BF898506 AI038390 BM044934 AW381 142 BG743618 BE769206 
BE893973 BI015047 BF886479 BF761350 8E769769 BG7661 17 BF847365 BE397834 AW371 121 BF089125 BE082996 BF183193 BG180964 
BF089940 BI000274 BG255503 BG674499 BG774174B1015084 



TABLE 67C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier [OX] numbers. "Dunham I. et at' refers to the publicafion entitled The DNA 
sequence of human chromosome 22/ Dunham I. et Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


403047 


3540153 


Minus 


59793-59968 


401424 


8176894 


Plus 


24223-24428 


401451 


6634068 


Minus 


119926-121272 


401747 


9789672 


Minus 


118596-118816,119119-119244,119609-11976 


404632 


9796668 


Plus 


45096-45229 


403046 


3540153 


Minus 


55707-55859.56389.56511 


404922 


7341893 


Plus 


13248-13428 


403752 


7678857 


Plus 


33704-33828 


404210 


5006246 


Plus 


169926-170121 


401519 . 


6549315 


Plus 


157315-157950 


401785 


7249190 


Minus 


165776-165995,166189-166314,166408-16658 


401866 


801B106 


Plus 


73126-73623 


405387 


6587915 


Minus 


3769-3833,5708-5895 


401197 


9719705 


Plus 


176341-176452 


403532 


8076842 


Minus 


8175041901 



TABLE 68A: 995 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Table 68A lists about 995 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Aflymetrix/Eos Hu03 GeneChip 
array that showed some expression In human xenograft tumors and cell lines. The 995 genes were selected such that the ratio of 'average" prostate cancer to "average" normal adult 
tissues was greater than or equal to 2.5. The "average' prostate cancer level was set to the 85th percenfile amongst prostate cancers. The "average" normal adult tissue level was set 
to (he 85th percentile amongst non-malignant tissues. In order to remove genespecific background levels of non-specific hybridization, the 10th percentile value amongst non- 
malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Pkey: Unique Eos probeset Identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Untgene Title: Unigene gene title 



R1: 


Ratio of tumor to normal body tissue 




Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-fike and 


85.5 


426747 


AA535210 


Hs.171995 


kafl&reln 3, (prostate specific antigen 


68.6 


446057 


AM20227 


Hs.149358 


Trp-p8 transient receptor potential cat) 


65.6 
61.9 


400302 


N48056 


Hs.283946 


folate hydrolase (prostate-specific memb 


432441 


AW292425 


Hs.163484 


ESTs 


60.4 


419526 


A1821895 


Hs.193481 


ESTs 


45.9 


414569 


AF109298 


Ks.1 18258 


prostate cancer associated protein 1 


45.7 


424099 


AF071202 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 


45.2 


432240 


A1694767 


Hs.1 29179 


Homo sapiens cDNA FU 13581 fe, clone PL 


44.9 


400298 


AA032279 


Hs.61635 


six transmembrane epithelial antigen of 


40.9 


453370 


AI470523 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 


39.6 
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409731 AA125985 Hs.56145 thymosin, beta, identified In neuroblast 37.7 

400299 XD7730 Hs.171995 taflfoein 3, (prostate specific antigen 34.9 

400298 AA305627 Hs. 139336 ATP-Wndlng cassette, sub-family C (CFTR 31.7 

400292 AA250737 Hs.72472 BMP-R1B 31.4 

5 425075 AA505324 Hs.1852 add phosphatase, prostate 31.4 

409262 AK000631 Hs.52256 trypolheOcal protein FU2Q624 31.4 

428819 AL135623 Hs.193914 WAA0575 gene product 31.2 

409361 NM.005932 Hs.54416 sine ocufe homeobox (Drosophila) homclo 30.3 

, rt 407168 R45175 Hs.117183 ESTs 29.6 

10 400287 S39329 Hs.181350 kalftrein 2. prostatic 29.6 

415539 A1733881 Hs.72472 BMP-R1B 28.8 

432374 W88815 Hs.301885 Homo sapiens cDNA FIJ1 1346 Ms, clone PL 28.8 

433043 W57554 Hs.125019 lymphoid nudear protein (LAF-4) mRNA 28.1 

423073 BE252922 Hs.123119 MAD (mothers against decapentaptegtc. Dr 27.5 

15 403047 NMJK)5656*:Homo sapiens transmembrane pr 27.4 

444484 AK002126 Hs.11260 hypothetical protein FU11264 25.7 

428336 AA503115 Hs. 183752 nncrosemroproteln. beta- 25.0 

401424 NM_001172:Homo sapiens argtnase. type II 24.9 

407709 AA456135 Hs.23023 ESTs 24.7 

20 407122 H20276 Hs.31742 ESTs 24.6 

425628 NM.004476 Hs.283946 folate hydrolase (prostate-specific memb 23.2 

415989 AI267700 ESTs 23.1 

448290 AK002107 Hs.20843 Homo sapiens cDNA FU11245 fis, clone PL 22.7 

437052 AA861697 Hs.1 20591 ESTs 22.6 

25 411096 U80034 Hs.66583 mitochondrial intermediate peptidase 22.5 

451939 U80455 Hs.27311 single-minded (Drosophil a) homotog 2 21 2 

450096 A1682088 Hs.79375 hobcarboxylase synthetase (biotm-[prop 20.8 

419743 AW408762 Hs.5957 Homo sapiens done 24416 mRNA sequence 20.5 

427958 AA418000 Hs.98280 potassium intermediate/small conductance 20.5 

30 431548 AI834273 Hs.9711 novel protein 19.8 

434666 AF151103 Hs.112259 T celt receptor gamma locus 19.4 

426501 AW043782 Hs.293616 ESTs 19.3 

413597 AW302885 Hs.117183 ESTs 18.9 

„ 428852 NM_000346 Hs.2316 , SRY (sex determining region Y)-box 9 (ca 18.7 

35 435677 AA694142 Hs.293726 ESTs, Weakly similar to TSGA RAT TESTIS 18.0 

429220 AW207206 ESTs 17.9 

408826 AF216077 Hs.48376 Homo sapiens done HB-2 mRNA sequence 17.6 

418961 AW967646 Hs.23023 ESTs 17.3 

428898 AB033070 Hs. 194408 WAA1244 protein 17.3 

40 420757 X78592 Hs.99915 androgen receptor (dhydrotestosterone r 16.8 

, 428398 AI249368 Hs.98558 ESTs 16.3 

419078 M93119 Hs.89584 InsuBncrna-asscciated 1 15.8 

450382 AA39765B Hs.60257 Homo sapiens cDNARJ13598fc. clone PL 15.4 

._. 449625 NM.014253 odz (odd Oz/terwn, Drosophila) homdog 1 15.2 

45 401451 NM^004496*:Homo sapiens hepatocyte nude 14.9 

447033 AI357412 Hs.157601 ESTs 14.9 

452594 AU076405 Hs*29981 sotute carrier famDy 26 (sulfate transp 14.7 

448519 AW175665 Hs.278695 Homo sapiens prostein mRNA, complete cds 14.6 

418829 AA516531 Hs.55999 NK homeobox (Drosophila), family 3, A 14.5 

50 430376 AW292053 Hs.12532 crrromosome 1 open reading frame 21 14.4 

418846 AI820961 Hs.193465 ESTs 14.3 

429918 AW873986 Hs.119383 ESTs 14.2 

424012 AW368377 Hs.137569 tumor protein 63 kDa with strong homolog 14.1 

418278 AI088489 Hs.83937 hypothefical protein 14.1 

55 408000 L11690 Hs.198689 bullous pemphigoid antigen 1 (230/240kD) 13.9 

443822 AI087412 Hs.143611 ESTs, Weakly similar to 2004399A chromos 13.3 

432101 AI918950 Hs.123642 EphA3 133 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 13.1 

412446 A1768015 ESTs 12.7 

60 437718 A1927288 Hs.196779 ESTs 12.6 

433404 T32982 ESTs 12.6 

431089 BE041395 Hs. 1561 10 ESTs, Weakly similar to unknown protein 12.6 

427212 AW293849 Hs^8279 ESTs, WeaWy similar to ALU7 JiUMAN ALU S 12.5 

rr 429597 NM.003816 Hs.2442 a disintegrin and rrujtalloprotelnase doma 12.5 

65 444151 AW972917 Hs.128749 alpruHnethytacyl-CoA racemase 12.4 

428046 AW812795 Hs.337534 ESTs, Moderately similar to I38022 hypot 12.4 

453160 AI263307 H2B histone family, member L 12.2 

433927 AI557019 Hs.1 16467 small nuclear protein PRAC 111 

„ 431117 AF003522 Hs.250500 delta (Dn*opWla)-foe 1 12.1 

70 42480O AL035588 Hs.1 53203 MyoD family inhibitor 11.9 

427674 NM.003528 Hs.2178 H2B Wstonefan^.rnemberQ 113 

424692 AA429834 Hs.1 51 791 WAA0092 gene product 11.7 

415263 AA948033 Hs.130853 ESTs 11.3 

„ 416182 NMJXM354 Hs.79069 cydinG2 1U 

75 420218 AW958037 ribosornal protein L4 11.3 

411887 AW182924 Hs.128790 ESTs 11-2 

425154 NM.001851 Hs.154850 collagen, type IX, atpha 1 \\2 

430226 BE245562 Hs.2551 adrenergic, beta-2-, receptor, surface 11.2 

or . 450325 AI935962 HsJ1973 ESTs 11.1 

SO 439444 AI277652 Hs.54578 ESTs, WeaWy similar to 138022 hypothec 11.1 

440529 AW207640 Hs.16478 Homo sapiens cDNA FU21718 fis, done C 11.0 

457211 AW972565 Hs.32399 ESTs, WeaWy similar to S51797 vasodUat 11.0 

428600 AW863261 Hs.138860 hypothetical protein DKFZp434K1 421 11.0 

761 



WO 03/042661 



PCT/US02/36810 





422728 


AW937826 


Hs.103262 


ESTs, Weakly similar to ZN91 .HUMAN ZINC 


11.0 




417333 


AL157545 


Hs.173179 


bromodomain and PHD finger containing, 3 


10.7 




434423 


NM.006769 


Hs.3844 


UM domain only 4 


10.6 


5 


434170 


M62&509 


Hs.122329 


ESTs 


10.6 


432729 


AK000292 


Hs.130732 


hypothetical protein FU 20285 


10.5 




446100 


AW967109 


Hs.13804 


hypothec protetadJ462023.2 


10.4 




432715 


AA247152 


Hs.200483 


ESTs, Weakly similar to KIAA1074 protein 


10.4 




421896 


N62293 


Hs.45107 


ESTs 


10.3 


1 A 


401747 






Homo sapiens kerafin 17 (KRT17) 


10.1 


451640 


AA195601 


Hs,26771 


Human DMA sequence from clone 747H23 on 


10.1 




431542 


H63010 


Hs.5740 


ESTs 


10.0 




447397 


BE247676 


Hs.18442 


£•1 enzyme 


10.0 




433285 


AW975944 


Hs.237396 


ESTs 


10.0 


15 


431325 


AW026751 


Hs.5794 


ESTs, Weakly similar to 210926QA B cell 


9.9 


410889 


X91662 


Hs.66744 


twist (OrosophBa) homdog {acrocephatos 


9.8 




432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to 136022 hypotheti 


9.6 




434926 


BE543269 


Hs.50252 


mitochondrial ribosomai protein L32 


9.6 




442049 


AA310393 


Hs.190044 


ESTs 


9.6 


20 


443180 


R15875 


Hs.258576 


daudin12 


9.5 


406954 


M21305 




FGENES predicted novel secreted protein 


9.5 




426350 


NM.003245 


Hs.2022 


transglutaminase 3 (E polypeptide, prote 


9.5 




456660 


AA909249 


Hs.112282 


solute carrier f amfly 30 (zinc transport 


9.5 




448045 


AJ297436 


Hs.20166 


prostate stem cell antigen 


9.5 


25 


451684 


AF215751 


Hs.26813 


COA14 


9.5 


440594 


AW445167 


Hs.126036 


ESTs 


9.4 




432415 


T16971 


Hs.289014 


ESTs, Weakly similar to A43932 mucin 2 p 


9,3 




418601 


AA279490 


Hs.86368 


catmegin 


9.3 




433332 


AI367347 


Hs.44898 


Homo sapiens done TCCCTA00151 mRNAsequ 


9.2 


30 


437124 


AA554458 




WAA0666 protein 


9.2 


428728 


NMJ)16625 


Hs.191381 


hypothetical protein 


9.2 




43B869 


AF075009 




gbiHomo sapiens full length Insert cONA 
protein kinase C, mu 


9.1 




431992 


NMJW2742 


Hs.2891 


8.9 




413992 


W26276 


Hs.136075 


RNA.U2 small nuctear 


8.9 


35 


428342 


AI739168 




Homo sapiens cDNA FU 13458 fis, clone PL 


8.9 


418836 


AI655499 


Hs.161712 


ESTs 


8.8 




409799 


D11928 


Hs.76845 


phosphoserine phosphatase-fike 


8.8 




400294 


N95796 


Hs.276695 


Homo sapiens prostein mRNA, complete cds 


8.7 




434826 


AF 155661 


Hs.22265 


pyruvate dehydrogenase phosphatase 


8.7 


40 


418250 


U29926 


Hs.83918 


adenosine monophosphate deaminase (isofo 


8.7 


425905 


AB032959 


Hs.318584 


novel C3HC4 type Zinc finger (ring finge 


8.7 




418329 


AW247430 


Hs.84152 


cystathionins-beta-synthase 


8.7 




450203 


AF097994 




L^nurenlne/alpha^minoadipate aminotra 


8.7 




425465 


L18964 


Hs.1904 


protein kinase C, iota 


8.5 


45 


416239 


AL038450 


Hs.48948 


ESTs 


6.5 


436962 


AW377314 


Hs.5364 


DKFZP564I052 protein 


8.4 




450164 


A1239923 


Hs.63931 


ESTs 


8.4 




452744 


AI267652 


Hs.246107 


Homo sapiens mRNA; cONA OKFZp434E082 (fr 


8.4 




440774 


A1420611 


Hs.153934 


ESTs 


8.3 


50 


444922 


AI921750 


Hs.144871 


Homo sapiens cONA FU13752 fis, done PL 


8.3 


410870 


U81599 


Hs.66731 


homeo box B13 


8.3 




435047 


AA454985 


Hs.54973 


cadherin-Gke protein VR20 


8.2 




418564 


AA631143 


Hs.278695 


Homo sapiens prostein mRNA, complete cds 


8.2 




451027 


AW519204 


Hs.40808 


ESTs 


8.2 


55 


445873 


AA250970 


Hs.251946 


po!y(A)-binding protein, cytoplasmic 1-1 


8.2 


423349 


AF010258 


Hs.127428 


homeo box A9 


8.1 




421928 


AF013758 


Ks.109643 


pdyadenylate binding protein-interactin 


ai 




432586 


AA568548 




ESTs 


6.0 




426108 


AA622037 


Hs.166468 


programmed cefl death 5 


ao 


60 


417153 


X57010 


Hs.81343 


collagen, type tl, alpha 1 (primary oste 


ao 


416653 


M768553 


Hs.193145 


metaflolhbnetn IE (functional} 


7.9 




447476 


BE293466 


Hs.20880 


ESTs, Weakly similar to 138022 hypotheti 


7.9 




450377 


AB033091 




WAA1265 protein 


7.8 




429769 


NM.004917 


Hs.218366 


kaffikrein 4 {prostase, enamel matrix, p 


7.7 


65 


416815 


U41514 


Hs.80120 


UOP-^^tyt^rB-D^alactosamir^pdyp 


7.7 


433323 


AA805132 


Hs.159142 


ESTs 


7.7 




451952 


AL120173 


Hs.301663 


ESTs 


7.7 




452367 


U71207 


Hs.29279 


eyes absent (Drosophila) homoiog 2 


7.7 




421470 


R27496 


Hs.1378 


annexin A3 


7.7 


70 


420092 


AA814043 


Hs.88045 


ESTs 


7.6 


416111 


AA033813 


Hs.79018 


chromatin assembly factor 1 , suburtft A ( 


7.6 




440749 


W22335 


Hs.7392 


hypothetical protein MGC3199 


7.6 




441866 


BE464341 


Hs.21201 


nectin 3; DKFZP566B0846 protein 


7.6 




411019 


AW993097 


Hs.48617 


Homo sapiens cONA FU1 2540 Us, clone NT 


7.5 


75 


421566 


NM_000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


7.4 


451418 


BE387790 


Hs.26369 


hypothetical protein FU 20287 


7.4 




433517 


AW022133 


Hs.189838 


ESTs 


7.4 




415621 


AI548602 


Hs.55468 


ESTs 


7.4 




442592 


BE566623 


Hs. 29899 


ESTs, Weakly similar to GO 2075 transcrip 


7.3 


80 


452340 


NM_002202 


Hs.505 


ISL1 transcription factor, UM/homeodoma 


7.3 


450244 


AA007534 


Hs.125062 


ESTs 


7.2 




433293 


AF007835 


Hs.32417 


hypothetical protein MGC4309 


7.2 




425259 


AL049280 


Hs.155397 


Homo sapiens mRNA; cONA DKF2p564Kl43 (fr 


7.2 




434485 


A1623511 


Hs.1 18567 


ESTs 


7.2 
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442772 


AW5Q3680 


Hs.5957 


Homo sapiens done 24416 mRNA sequence 


7.1 


425211 


M18667 


Hs.1867 


progastncsin (pepsinogen C) 


7.1 


431958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


7.0 


408418 


AW963897 


Hs.44743 


KJAA1435 protein 


7.0 


436063 


AKD00028 




riboscma) protein S24 


7.0 


412350 


Al 659306 


Hs.73826 


protein tyrosine phosphatase, noiwecept 


6.9 


414664 


AA587775 


Hs.66295 


mulli-POZ-domalrH»nt3ining protein 


as 


428465 


AW970976 


Hs.293653 


ESTs 


&9 


436476 


AA326108 


Hs.33829 


bHLH protein OEC2 


6.8 


431585 


BE242803 


Hs.262823 


hypothetical protein FU 10326 


6.8 


434808 


AF155108 


Hs.256150 


Homo sapiens. Similar to RIKEN cONA 2810 


6.7 


432103 


T15803 


Hs.272458 


protein phosphatase 3 (formerly 28), cat 


6.7 


456088 


BE177320 


Hs.156148 


hypothetical protein FL) 13231 


6.6 


429084 


AJ001443 


Hs.195614 


splidng factor 3b, subunit 3, 130kD 


6.6 


422485 


BE514492 


Hs.1 17487 


gene near HD on 4p16.3 with homology to 


6.6 


410227 


AB009284 


Hs.61152 


exostoses (muItiple)-Qke 2 


6.6 


453439 


AI572438 


Hs.32976 


guanine nucleotide binding protein 4 


as 


410037 


AB020725 


Hs.58009 


KIAA0918 protein 


6.5 


420297 


A1628272 


Hs.88323 


ESTs. Weakly similar to ALU 1 .HUMAN ALU S 


6.4 


409648 


AW451449 


Hs.57749 


ESTs 


6.4 


448148 


NM.016578 


Hs^0509 


HBV pX associated protein-8 


6.4 


410268 


AA316181 


Hs.61635 


six transmembrane epithelial antigen of 


6.4 


422134 


AW179019 


Hs.112110 


mitochondrial ribosomsl protein L42 


6.3 


415068 


Z19448 


Hs.1 31887 


ESTs, Weakly similar to T24396 hypotheti 


6.3 


404632 






NM_022490:Homo sapiens hypothetical prot 


6.3 


412935 


BE267045 


Hs.75064 


tubultn-spectnc chaperone c 


6.3 


453308 


AW959731 


Hs.323099 


ESTs 


a3 


449048 


Z45051 


Hs.22920 


simitar to S68401 (cattle) gtucose indue 


as 


410762 


AF226053 


Hs.66170 


HSKM-B protein 


• a3 


436032 


AA150797 


Hs.109276 


iatexln protein 


as 


456508 


AA502764 


Hs.123469 


ESTs, Weakly similar to AF208855 1 BM-01 


a3 


413328 


Y15723 


Hs.75295 


guanytate cyclase 1, soluble, alpha 3 


6.2 


425071 


NMJH3989 


Hs.1 54424 


deiodinase, iodothyronine, type IS 


6.2 


424036 


AA770688 




H2A histone family, member L 


6.2 


416245 


M088767 


Hs.83883 


transmembrane, prostate androgen induced 


6.2 


450861 


A1523898 


Hs.17617 


ESTs 


6.1 


418821 


AA436002 


Hs.183161 


ESTs 


ai 


458809 


AW972512 


Hs.20985 


sin3-associated polypeptide, 30kD 


6.1 


432527 


AW975028 


Hs.102754 


ESTs 


6.1 


433730 


AK002135 


Hs.3542 


hypothetical protein FU 1 1 273 


6.1 


449845 


AW971183 


Hs.6019 


DnaJ (Ksp40) homolog, subfamily C, membe 


6.1 


420948 


AB016898 


Hs.100469 


myetoidilyrnphoid or mixed-lineage leukem 


ai 


425704 


U79293 


Hs.1 59264 


Human clone 23948 mRNA sequence 


ao 


456440 


AI095468 


Hs.135254 


Homo sapiens done 1 thrombospondin mRNA 


ao 


419647 


AA348947 


Hs.91816 


hypothetical protein 


ao 


455497 


AA1 12573 


Hs.278695 


Homo sapiens prostein mRNA, complete cds 


ao 


428593 


AW207440 


Hs.1 85973 


degenerative spermatocyte (homolog Droso 


19 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrogen regulated 


5.9 


446416 


AV658299 


Hs.1 63959 


ESTs 


5.9 


407819 


R42165 




ESTs 


5.9 


430387 


AW372884 


Hs.240770 


nuctear cap binding protein subunit 2, 2 


5.9 


410102 


AW248508 


Hs.279727 


ESTs; homolog ue of PEM-3 [Gona savignyi 


5.9 


429165 


AW009886 


Hs.1 18258 


prostate cancer associated protein 1 


5.9 


417873 


BE266659 


Hs^93659 


Homo sapiens, Simlar to RIKEN cONA A430 


5.8 


410240 


AL157424 


Hs.61289 


synaptojanln 2 


5.3 


421305 


BE397354 


Hs.324830 


diptheria toxin resistance protein requi 


5.8 


451406 


A1694320 


Hs.6295 


ESTs, Weakly similar to T17248 hypotheti 


ao 


416795 


A1497778 


Hs.20509 


HBV pX associated protein-8 


a7 


427701 


AA411101 


Hs^43886 


nuclear autoanSgenic sperm protein (his 


a7 


451468 


AW503398 


Hs.293563 


ESTs, Moderately similar to I38022 hypot 


a7 


433852 


AI378329 


Hs.1 26629 


ESTs 


a7 


432882 


NM.013257 


Hs.279696 


serurn/glucocorttcoid regulated kuiase-6 


5.6 


419440 


AB020689 


Hs.90419 


K1AA0882 protein 


ae 


435706 


W31254 


Hs.7045 


GU004 protein 


5.6 


431583 


AL042613 


Ks.262476 


S-adenosylmethionlne decarboxylase 1 


5.6 


442409 


BE208843 


Hs.1 29544 


hypothetical protein MGC1 5438 


5.6 


453171 


R76472 


Hs.65646 


ESTs 


5.6 


437252 


AI433833 


Hs.1 64159 


ESTs, Weakly similar to ALU INHUMAN ALU S 


5.6 


422083 


NM.001141 


Hs.1 11256 


arachMon atft IS-Upoxygenase, second typ 


5.6 


429638 


AI916662 


Hs.211577 


Idnecfin 1 (kinestn receptor) 


5.5 


403046 


AW627932 




NM_009656*:Homo sapiens transmembrane pr 


5.5 


447805 


Hs.302421 


gemjn4 


as 


443162 


T49951 


Hs.9029 


OKFZP434G032 protein 


5.4 


418555 


A1417215 


Hs^7159 


hypothetical protein FU 12577 


5.4 


437162 


AW005505 


Hs.5464 


thyroid hormone receptor coadivating pr 


a4 


419465 


AW500239 


Hs^1187 


Homo sapiens cDNA FU23068 fis, done L 


a4 


440146 


AW014231 


Hs.90790 


Homo sapiens cONA: FU 22930 fis, clone K 


a4 


427308 


026067 


Hs.174905 


WAA0033 protefn 


a4 


409151 


AA306105 




SEC22. veside trafficking protein (S. c 


5.4 


432840 


AK0014O3 


Hs.279521 


hypothetical protein FU20530 


a4 


423242 


AL039402 


Hs.125783 


DEME-6 protein 


a3 


453082 


H18835 


Hs.31608 


hypothetical protein FU20041 


a3 


441021 


AW578716 


Hs.7644 


H1 histone family, member 2 


5.3 
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415276 U8B666 Hs.78353 SFRS protein kinase 2 5.3 

400290 H18835 Hs.3l60B hypothetical protein FU20041 5l3 

432435 BE218886 Hs.282070 ESTs 5.3 

_ 417318 AW953937 Hs.240845 ESTs 5.3 

5 429467 NM.004477 Hs.203772 FSHD region gene 1 5.3 

416276 U41060 Hs.79136 U\M protein, estrogen regulated 5.3 

410076 T05387 Hs7991 ESTs 5.3 

420039 NMJJ04605 Hs.94581 sulfotransferase family, cytosoOc, 28, 5.2 

, A 423445 NNL014324 Hs.128749 aJpha-methylacyt-CoA racemase 5.2 

10 436420 AA443966 Hs.31595 ESTs 5.2 

414085 AA1 14016 Hs.75746 aldehyde dehydrogenase 1 family, member 5.1 

422048 NM.012445 Ks.288126 spondin 2. extracellular matrix protein 5.1 

427176 AW381569 Hs.40334 ESTs 5.1 

450832 AW970602 Hs.105421 ESTs 5.1 

15 448807 AI571940 Hs.7549 ESTs 5.1 

420568 F09247 Hs.247735 protocadherin alpha 10 5.1 

421685 AF189723 Hs.106778 ATPase, Ca transporting, type 2C. member 5.0 

427615 BE410107 Hs.179817 CGI-82 protein, PSDR1 5.0 

430598 AK001764 Hs.247112 hypothetical protem FU 10902 5.0 

20 452576 AB023177 Hs.29900 WM0960 protein 5.0 

431724 AA514535 Hs.283704 ESTs 5.0 

409757 NM_001898 Hs.123114 cystatinSN 5.0 

432242 AW022715 Hs.162160 ESTs, WeaHy similar to ALU4.HUMAN ALU S 5.0 

407103 AA424881 Hs.256301 hypothetical protein MGC1 3170 5.0 

25 421154 AA284333 Hs.287631 Homo sapiens cDNA FU14269 fis, done PL 5.0 

400301 X03635 Hs.1657 estrogen receptor 1 5.0 

426006 R49031 Hs.22627 ESTs 5.0 

452679 Z42387 Hs.83883 transmembrane, prostate androgen induced 4.9 

446880 A1811807 Hs.108646 Homo sapiens cDNA RJ14934 (is, done PL 4.9 

30 444108 R55784 Hs.140942 ESTs 4.9 

452259 AA317439 Hs.28707 signal sequence receptor, gamma (translo 4.9 

419083 A1479560 Hs.98613 Homo sapiens cDNA FU 12292 fis, done MA 4.9 

416854 H40164 Hs.80296 Purkinje cell protein 4 4.9 

422890 Z43784 ankyrin3.nodeofRanvler(ankyrinG) 4.9 

35 428330 L22524 Hs.2256 matrix metafloproteinase 7 (matrilysin, 4.9 

419168 A1336132 Hs.33718 Homo sapiens cDNA FU 12641 fis, done NT 4.9 

452017 AF109302 Hs.27495 prostate cancer associated protein 7 4.9 

456362 AW973003 Hs.17S909 hypothetical protein FU22995 4.9 

442501 AA315267 Hs.23128 ESTs 4.8 

40 439735 A1635386 Hs.142846 hypothetical protein 4.8 

451752 AB032997 KJAA1171 protein 4.8 

423583 AL122055 Hs.129836 WAA1028 protein 4.8 

446157 BE270828 Hs.131740 Homo sapiens cDNA: RJ22562 fis, clone H 4.8 

432363 AA534489 gb:nf76g11.s1 NCI_CGAP_Co3 Homo saptens 4.8 

45 433313 W20128 Hs.296039 ESTs 4.9 

408101 AW968504 Hs.123073 CDC2-related protein kinase 7 4.8 

453857 AL080235 Hs.35861 DKFZP586E1 621 protein 4.8 

451982 F13036 Hs.27373 Homo sapiens mRNA; cONA DKFZp56401763 (f 4.7 

447574 AF162666 Hs.18895 tousled* kinase 1 4.7 

50 429299 A1620463 Hs.347408 hypothetical protein MGC1 31 02 4.7 

420522 AW957137 Hs.98541 hypothetical protein 4.7 

417379 AA196390 gb:zp99b10.s1 Stratagene musde 937209 H 4.7 

411031 W37943 Hs.34892 WAA1323 protein 4.7 

431725 X65724 Hs.2839 Norrie disease (pseudogifoma) 4.7 

55 441224 AU076964 Hs.7753 calumenin 4.7 

407813 AL120247 Hs.40109 WAA0872 protein 4.7 

440074 AA863045 Hs.10669 ESTs, Weakly similar to T00050 hypothec 4.7 

430294 AI538226 Hs.32976 guanine nucleotide binding protein 4 4.7 

415082 AA160000 Hs.137396 ESTs, Weakly similar to JC5238 galactosy 4.7 

60 414922 D00723 gtydne cleavage system protein H (amino 4.7 

430945 U80669 Hs.55999 NK homeobox {Drosophiia), family 3. A 4.7 

438825 BE327427 Hs.79953 ESTs 4.7 

425174 087450 Hs.154978 WAA0251 protein 4.7 

420380 AA640891 Hs.102406 ESTs 4.7 

65 410193 AJ132592 Hs.59757 zinc finger protein 281 4.7 

440300 N39760 Hs.8859 Homo sapiens. Similar to RiKEN cDNA 5830 4.6 

436761 A1B17776 Hs.236557 ESTs 4.6 

410275 U85658 Hs.61796 transcription factor AP-2 gamma (activat 4.6 

404922 NM_003071:Homo sapiens SWl/SNF related, 4,6 

70 436556 AI364997 Hs.7572 ESTs 4.6 

429302 AU076674 Hs.198899 eukaryoflc translation Initiation factor 4.6 

447595 AW379130 Hs.18953 phosphc<Cesterase9A 4.6 

433006 BE242758 Hs. 190223 ESTs, Moderately similar to T29285 hypot 4.5 

407894 AJ278313 Hs.41143 phcsphdnositide^pedfic phosphoJipase 4.5 

75 418818 AA228899 Hs.1O1307 Homo sapiens HUTU protein mRNA, partial 4.5 

433556 W56321 Hs. 11 1460 caidumA^modunrHlependent protein kin 4.5 

443123 AA094538 Hs.272808 putative transcription regulafion nudea 4.5 

419991 AJO00O98 Hs.94210 eyes absent (Drosophiia) homotog 1 4.5 

_ . 419594 AA013051 Hs.91417 topoisomerase (DNA) 11 rinding protein 4.5 

80 453390 AA862496 Hs.28482 ESTs 4.5 

417061 AI675944 Hs.188691 Homo sapiens cDNA FU 12033 fis. done HE 4.5 

419239 AA468183 Hs.184598 Homo sapiens cDNA: FU23241 fis, clone C 4.5 

436873 N23874 Hs.50477 RAB27A, member RAS oncogene famBy 4.5 
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cysteine and gtycine-rich protein 2 3.9 

potassium in termed} aifi/smafl conductance 3.9 

glioblastoma overexpressed 3.9 

ESTs 3.9 

Homo sapiens cDNA FU23006 6s, done L 3.9 

Homo sapiens cDMA FU22696 fis, done H 3.9 

ESTs 3.9 

Homo sapiens cDNA FU12187 fis, done MA 3.9 

prostate epilhenum-spedfic Ets transcr 3.6 

hypothetical protein FU13117 18 

CO-86 protein 18 

ESTs 18 

CCAAT-box-binding transcription factor 18 

N-acetytneursminjc acid phosphate syntha 18 

transcription factor 6-fike 1 (mitochond 3.8 

mitochondria) ribosomai protein L9 18 

ESTs 3.8 

Homo sapiens cDNA FU 10674 fis, done NT 3.8 

Homo sapiens, done (MAGE:3507281 , mRNA, 18 

Irinesm family member 5C 3.8 

SRY (sex determining region Y)-box 4 3.8 

prostate differentiation factor 18 

ESTs, Weakly similar to KIAA1205 protein 18 

ESTs, Weakly similar to 138022 hypoM 18 

odd Oz/terwn homotog 2 (Drosophila, mous 18 

uncharacterized hematopoietic stern/proge 18 

ESTs 3.8 

Homo sapiens cDNA FU21086 fis, done C 18 

thtoredcxMce.32kD 3.8 

protein kinase Wke 2 18 

ESTs 18 

ESTs 18 

ESTs 17 

ESTs 17 

spastic ataxia of Chanevobt-Saguenay (s 17 

nucleolar protein 3 (apoptosls repressor 17 

ribosomai protein L32 17 

ESTs 3.7 

AJM-1 protein 17 

HSPC039 protein 17 

golgln-67 17 

ka!Ekrein11 17 

ESTs 3.7 

ENSP00000229263*;HSPC21 3. 17 

ESTs 17 

ptdotrophin (heparin binding growth fee 17 

ESTs 17 

angiotensin II, type I receptor-associat 17 

reficuton 1 17 

Target Exon 17 

DvWjinding protein IDAX fmWbibon of 17 

KIAA1073 protein 17 

NM-0221 70':Homo sapiens WflOams-eeuren 17 

Human DMA sequence from clone RP1-63M2 o 17 

v-skl avian sarcoma vkal oncogene homo! 17 

ESTs 3.7 

ESTs 17 

solute carrier family 37 (glycerot-3-pho 16 

lacrimal proline rich protein 3.6 

RALBP1 associated Eps domain containing 16 

neural proliferation, differentiation an 3.6 

xeroderma pigmentosum, complementation g 3.6 

acetyl-Coenzyme A carboxylase alpha 16 

KlAATJ606rjrotein;SCNGrr^taOsciM 16 
ESTs 16 
ESTs, WeaMy similar to SFR4 JWMAN SPUC 16 
cEhydro folate reductase 3.6 
ESTs, Weakly similar to ALU 1 .HUMAN ALU S 16 
E1A binding protein p300 16 
zinc-binding protein Rbcc728 16 
Homo sapiens dona 24670 mRNA sequence 16 
nuclear receptor subfamily 2, group F, m 16 
RAB9, member RAS oncogene farnSy 16 
putative G protein-coupled receptor 16 
WAA1181 protein 16 
WR (histcne cell cyde regulation defec 16 
Homer, neuronal immediate early gene, 2 3.6 
NMJN2753*:Homo sapiens mibgervactivate 16 
ESTs 16 
ESTs 3.6 
ESTs, WeaWy similar to 2109260A B cell 16 
hypothetical protein FU21 986 15 
ESTs, WeaMy similar to Af 151840 1 CQ-8 15 
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428695 AI355647 Hs. 189999 purinergic receptor (family A group 5) 3.5 

419517 AF052107 Hs.90797 Homo sapiens done 23620 mRNA sequence 3.5 

446494 AA463276 Hs.288905 WW Domain-Contaimng Gene 3.5 

417515 L24203 Hs.82237 ataxia-telangkdasia group D-assoctafed 3.5 

5 433891 AA613792 gbmo97h01s1 NO CGAP_Pr2 Homo sapiens 3.5 

427315 AA179949 Hs. 175553 Homo sapiens mRNA; cDNA OKFZp584N0763 (f 3.5 

445525 BE149866 Hs. 14831 Homo sapiens. Similar to zinc finger pro 3.5 

434672 AW294020 Hs. 11 7721 ESTs 3.5 

- 453288 AW583292 HSv274412 similar to yeast UpQ, variant A 3.5 

0 434970 AW272262 Hs.225767 ESTs 3.5 

417193 A1922189 Hs^88390 hypothetical protein FU 22795 15 

437617 AKJ26701 Hs.5716 KIAA0310 gene product 3.5 

445943 AW898533 Hs.181574 ESTs 15 

412628 AK72402 Hs.306051 hypothetical protein MGC2648 15 

.5 452167 N75238 Hs.13075 Homo saptens cDNA: RJ2301 3 fis, clone L 15 

410467 AF102546 Ks.63931 dachshund (Drosophfla) homolog 15 

440193 AW902312 Hs.7037 Homo sapiens done 24923 mRNA sequence 3.5 

451815 AW974911 Hs. 184793 Homo sapiens cDNA: RJ21880 6s, done H 15 

429170 NMJW1394 Hs.2359 dual specMy phosphatase 4 15 

10 410082 AA081594 Hs.158311 Musashi (Drosophlla) homolog 1 15 

436087 BE300296 Hs.5054 CG1-133 protein 3.5 

437898 W81260 Hs.43410 ESTs 15 

451131 A1267686 Hs.268012 fatry-add-Coenzyme A Dgase, long-chain 15 

% _ 439609 AW971945 Hs.293236 ESTs 15 

15 443555 N71710 Hs.21398 ESTs. Moderately simDar to A Chain A,H 15 

418610 AW245993 Hs.223394 hypothetical protein MGC2742 15 

409960 BE261944 hexddnasel 15 

425253 NMJXM197 Hs.155419 BCL2-interacfing kfller (apoptosis-induc 15 

404210 NM_005936:Homo sapiens myeloid/lymphoid 15 

50 442323 AW016669 Hs.29190 ESTs 15 

452707 A1093823 Hs.45070 ESTs 3.5 

421437 AW821252 Hs.104336 hypothetical protein 15 

426514 BE616633 Ks.170195 bone morphogenetic protein 7 (osteogenic 14 

450828 AW382884 Hs.204715 ESTs 14 

35 446825 BE266822 Hs.344097 filamin A, alpha (adSn-binding protein- 14 

421867 AA481078 Hs.109045 hypothefical protein FU10498 3.4 

434954 AF161455 Hs.284295 Homo sapiens HSPC337 mRNA, partial cds 14 

452908 AB001451 Hs.30965 neuronal She adaptor homolog 14 

429922 297630 Hs£26117 HI histone family, member 0 14 

40 448734 BE614070 Hs.326416 HoniosapierernRNA; cONA DKFZp564H1916 (f 14 

434293 NMJ04445 Hs.3796 EphB6 14 

414222 AL135173 sorbitol dehydrogenase 14 

421674 T10707 H&296355 hypothetical protein FU231 38 14 

438279 AA605166 Hs.154762 HIV-1 rev binding protein 2 14 

45 411076 A1222020 Hs. 182364 CocoaCrisp 14 

419749 X73608 Hs.93Q29 sparc/osteonecth, ewev and kazaJ-Eke d 3.4 

432302 AA345857 Hs.274307 WAA1442 protein 14 

407944 R34008 Hs.239727 desmocollin2 14 

431674 AA098901 Ks.301642 Gfroteln coupled receptor 3.4 

50 408291 AB023191 Hs.44131 WAA0974 protein 3.4 

431578 AB037759 Hs.261587 GCN2 eIF2atpha kinase 14 

440789 AB007857 Hs.7416 WAA0397 gene product 3.4 

434958 T99949 Hs.303428 Homo sapiens cONA RJ 14832 fis, clone OV 3.4 

452295 BE379936 Hs^8866 programmed cefl death 10 14 

55 417229 AA975096 Hs. 19522 hypothetical protein PR02849 14 

439593 BE073597 Hs.124863 ESTs 14 

407136 T64896 H&287420 Homo sapiens CONAFU11533 fis, done HE 14 

421044 AF061871 Hs.101302 Human DNA sequence from done RP1 -238D1 5 14 

„ A 427461 AA531527 Hs.332040 hypothetical protein MGC1 3010 3.3 

60 417315 AK380042 Hs.180450 ribosomal protein S24 13 

423392 AA195037 Hs. 169341 HTPAP protein 3.3 

412652 AI801777 ESTs 13 

415172 AF079529 Hs.78106 phosphodiesterase 8B 13 

re 418372 AA311833 Hs.B4318 replication protein A1 (70kD) 3.3 

65 • 447958 AW796524 Hs.68644 Homo sapiens ntosornal signal peptidase 3.3 

425010 T16837 Hs.4241 ESTs 13 

420460 AA262331 Hs.48376 Homo sapiens done HB-2 mRNA sequence 13 

443181 AI039201 Hs.283316 ESTs 13 

„ 442064 A1422887 Hs.88594 ESTs 13 

70 431663 NM-016569 Hs.267182 TBX^iso protein 13 

412926 A1879076 Hs.75061 macrophage myristoylated alanine-rich C - 13 

420818 AW969635 Hs.33032 ESTs, WeaVfy similar to 1207289A reverse 13 

408829 NNL006042 Hs.48384 heparan sulfate (c^jcosamme) 3-Osutfot 13 

_ 408681 AW953853 HS-292&33 ESTs, WeaWy similar to 138022 hypotheG 3.3 

75 436278 BE39S290 Hs.5097 synaptogyrin 2 13 

424560 AA158727 Hs. 150555 protein predicted by done 23733 13 

417215 BE253181 Hs.81687 ncMnetastatic cells 3. protein expresse 13 

418166 AI754416 Coc42 effector protein 3 13 

434629 AA789081 Hs.4029 gGoma-arnpOEed sequence-41 12 

80 427398 AW390020 Hs^0415 cfuomoscme21 open readr^ frame 11 12 

457489 AI 593815 Ks. 127 179 crypScgene 12 

434747 AA837085 ESTs 12 

413125 BE244589 Hs.75207 glyoxalase I 12 
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435561 AA35197B Hs.4943 hepatocellular amnorna associated pro! 3.2 

447210 AF035269 Hs.17752 phosphatidylseiine-spedfic phosphoCpas 12 

422522 AI023428 Hs.34549 ESTs, Wghly similar to S94541 1 done 4 3.2 

_ 423396 AI382555 Hs.1 27950 bromodomaiivcontaning 1 3.2 

5 401785 N^002275**tano sapiens kerafin 15 (KRT1 3.2 

441345 AW068579 Hs.7780 Homo sapiens mRNA; cONA DKFZp564A072 (fr 3.2 

451404 AA460775 Hs.6295 ESTs,WeaHy similar to T1 7248 hypotheQ 12 

422173 BE385828 Hs.250619 phorbofin-Eke protein MOS019 3.2 

_ 452099 BE612992 Hs.27931 hypothetical protein RJ1 0607 similar to 3.2 

10 452826 BE245266 Hs.301636 peroxisomal biogenesis factor 6 3.2 

419829 AJ924228 Ks.115185 ESTs > ModeraterysjntitartoPC4259ferri ' 12 

407688 W25317 Hs.37616 Human D9 spHce variant B mRNA, complete 3.2 

406214 NMJ)16371:Homo sapiens rr/droxysterotd (1 3.2 

. 416292 M179233 Hs.42390 nasopharyngeal carcinoma susceptibility 3.2 

15 438714 AA814859 Hs.160074 ESTs 3.2 

433213 AW665130 Hs.137190 ESTs 12 

412170 D16532 Hs.73729 very low density lipoprotein receptor 3.2 

409340 BE174629 Hs.321130 hypothetical protein MGC2771 3.2 

413142 M81740 Hs.75212 ornithine decarboxylase 1 12 

20 451338 AW612322 Hs.1 9131 transcription factor Dp-2 (E2F dimerizat 12 

437179 AA393508 serological defined colon cancer an tig 12 

418700 AI963808 Hs.86970 ESTs, Moderately similar to AUI5_HUMAN A 3.2 

438361 AA805666 Hs.146217 Homo sapiens cDNA: FU23077 fis, clone L 12 

450546 AA010200 Ks. 175551 ESTs 12 

25 409995 AW960597 Hs.129206 ESTs 12 

408739 W01556 Hs.44685 ESTs, Moderately skn3ar to 138022 hypot 12 

438821 AA826425 Hs.192375 ESTs 12 

403532 NMJE4638:Homo sapiens hypothetical prot 12 

452941 AU 10347 Hs.31074 M^fbglucwanilnesulfohvdrotase {sulfa 11 

30 414341 D80004 Hs.75909 K1AA01 82 protein 3.1 

423044 AA320829 Hs.97266 protocadherin 18 11 

429716 R25685 Hs.211933 collagen, type XIII. alpha 1 11 

444367 H54892 Hs.10974 hypothetical protein FU22390 11 

423551 AA327598 Hs.89633 ESTs 11 

35 453186 AKD01708 Hs.32271 hypothetical protein FU10846 11 

413550 W03011 Hs.306881 MSTP043 protein 11 

431933 AI187057 Hs.132554 ESTs 11 

439778 AU 09729 Hs.99364 putative transmembrane protein 11 

448044 AI458682 gb:tk13e01.x1 NCLCGAP_Lu24 Homo sapiens 11 

40 422017 NM.003877 Hs.1 10776 STAT Induced STAT lnhlbitor-2 11 

421254 AK001724 Hs.1 02950 cod protein gammacop 11 

424339 BE257148 endoglycan 11 

445109 AF039916 Hs.1 2330 ectonudeosidB triphosphate diphosphohyd 11 

425320 U29344 Hs.83190 fatty acid synthase 11 

45 426170 BE161065 Hs.167531 rr^tr^tonoy^Coenzyme A carboxylase 2 11 

408770 AW270608 Hs.170195 bone morphogenefic protein 7 (osteogenic 11 

442202 BE272862 Hs.1 05534 hypothetical protein FU22625 11 

417622 AW298163 Hs.82318 WAS protein family, member 3 11 

439864 AT720078 Hs.291997 ESTs, Weakly similar to A47582 B-cell gr 11 

50 433036 AA574091 Hs.105964 ESTs 11 

447050 NMJJ16314 Hs.17200 ■ STAhMte protein containing SH3 and ITA 11 

436299 AK000767 Hs.5111 rrvpcthefcal protein FU20729 11 

408380 AF123050 Hs.44532 (fiubiquiOn 11 

_ _ 407910 AA650274 Hs.41296 fibrcnecfin leucine rich transmembrane p 11 

55 445098 AL050272 Hs.12305 DKFZP566B183 protein 11 

431797 BE169641 Hs^70134 hypothetical protein RJ20280 11 

437296 AA350994 Hs^0281 WAA1700 11 

452627 A1122B43 Hs.1 843 19 ESTs, Weakly similar to WAA1006 protein 11 

_ 448913 AA194422 Hs.22564 myosin VI 11 

60 415079 R43179 Hs.22895 hypothetical protein FU23548 10 

408157 AA047685 Hs.62946 ESTs 10 

443337 Y076O4 Hs.9235 non-metastaBccells 4, protein expresse 10 

410310 J02931 Hs.62192 coagulation factor Eii (lhrcmboptas8n, 10 

437546 AW074836 Hs.173984 T-boxl 10 

65 451 134 AA318315 Hs.25999 hypothefjcai protein FU22195 10 

423932 T95633 Hs.189703 ESTs 3.0 

433495 AW373784 Hs.71 a1pha-2^ycoprotein 1,zmc 10 

453293 AA382267 Hs.10653 ESTs 10 

428171 AA489323 Hs.182825 ntosoma) protein 135 10 

70 428293 BE250944 Hs.1 83555 soWBcamerfamBy 1 (neutral amino a 10 

426657 NMJH5865 Hs.171731 solute carrier famfly 14 (urea transport 10 

406627 T64904 Hs.163780 ESTs 3.0 

418259 AA215404 ESTs 10 

426110 NM.002913 Hs.166563 repficafion factor C (activator 1) 1 (14 3.0 

75 411089 AA456454 cell division cycle 2-Hke 1 (PfTSLRE pr 10 

431416 AA532718 ESTs 10 

431631 AA548906 Hs.122244 ESTs 10 

409251 R10723 H3.2Q573 ESTs 10 

434974 AA778711 euxaryoQc translation initiation factor 10 

80 447147 AA910353 Hs.75432 ESTs, Weakly similar to T23482 hypotheS 3.0 

450937 R49131 Hs. 26267 ATP-dependant interferon response protei 3.0 

432728 NM.006979 Hs .2787 21 HUdasst) region expressed gene KE4 10 

433037 NMJJ14158 HsJ79938 HSPC067 protein 10 
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411598 


BE336654 


Hs.70937 


K3 hlstone famfly, member A 


3,0 




430268 


AK000737 


Hs.237480 


hypothetical protein RJ20730 


3.0 




443725 


AW245680 


Hs.9701 


growth arrest and DNA-damage-tndudble, 


3.0 


5 


431155 


AW971213 




gb£ST383301 MAGE resequences, MAGL Homo 


3.0 


407355 


AA846203 


Hs.193974 


ESTs. WeaWy similar to ALU1 ..HUMAN ALU S 


3.0 




438209 


AL120659 


Hs.6111 


aryi-hydrocarbon receptor nuclear trans! 


ao 




434392 


AW983709 


Hs.250824 


Homo sapiens cDNA: FU 23435 fis, clone H 


ao 




432810 


AA863400 


Hs.23054 


ESTs 


ao 


1 A 


430333 


S70114 


Hs.239489 


TIA1 cytotoxic granule-associated RNA-bi 


ao 


445596 


R89543 


Hs.12942 


vesicle trafficking protein 


ao 




406670 


W79632 


Hs.256301 


hypothetica! protein MGC13170 


ao 




452779 


AA418775 


Hs.47234 


ESTs 


ao 




439680 
408298 


AW245741 


Hs.58461 


ESTs, WeaWy similar to A35659 krueppe!* 


ao 


15 


AI745325 


Hs.271923 


Homo sapiens cDNA: RJ22785 fis, clone K 


ao 


424395 


AA165082 


Hs.146388 


rnicrotubuls-associated protein 7 


2.9 




423472 


AF041260 


Hs.129057 


breast carcinoma amplified sequence 1 


19 




425920 


AL049977 


Hs.162209 


daudin 8 


2.9 




409927 


T69981 




gb:yc19d03.r1 Stratagene lung (937210) H 


2.9 


20 


447881 


BE620886 




GCN1 (genera) control of amino-acid synt 


2.9 


439584 


AA838114 


Hs.221612 


ESTs 


29 




417601 


NM.014735 


Hs.82292 


WAA0215 gene product 


2.9 




452268 


NMJM3512 


Hs.28777 


H2A hlstone family, member L 


29 




422576 


BE548555 


Hs.1 18554 


CGt-83 protein 


29 


25 


438510 


AL080220 


Hs.6285 


DKFZP586P0123 protein 


29 


423217 


NMJXXJ094 


Hs.1640 


collagen, type VII, alpha 1 (epidermolys 


29 




427581 


NM_014788 


Hs.179703 


KIAA01 29 gene product 


2.9 




447963 


AI452973 


Hs.165900 


ESTs. WeaWy similar to ALU1.HUMAN ALU S 


2.9 




443015 


R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


19 


30 


429351 


AK001490 


Hsi00016 


hypothetica! protein FU10628 


2.9 


425294 


AF033827 


Hs.1 55553 


HNK-1 sulfotransferase 


19 




452955 


AW390282 


Hs.31130 


transmembrane 7 superfamlly member 2 


19 




416439 


M180363 


Hs.118769 


ESTs 


19 




442315 


AA173992 


Hs.7956 


ESTs, Moderately similar to ZN91JWMAN Z 


19 


35 


446911 


N27605 


Hs.16492 


DKFZP564G2022 protein 


19 


428801 


AW277121 


Hs.254881 


ESTs 


19 




430462 


AI584156 


Ks.105640 


Homo sapiens, clone IMAGE:4139775, mRNA, 


19 




420552 


AK000492 


Hs.98806 


hypothetica) protein 


19 




428160 


AI129767 


Hs.182874 


guanine nucleotide binding protein (G pr 


19 


40 


418222 


A1675881 


Hs.86538 


ESTs 


19 


433009 


AA761668 




gb;nz24c08.s1 NCLCGAP.GC81 Homo sapiens 


29 




432140 


AK000404 


Hs.272688 


hypothetical protein FU20397 


19 




434517 


AA635690 


Hs.337251 


hypothetical protein MGC2487 


29 




448718 


AA220235 


Ks.153959 


hypothetica! protein MGC15436 


29 


45 


408196 


AL034548 


Hs.43827 


SRY (sex determining region Y)-box 22 


19 


419849 
452256 


BE041436 


Hs.93379 


eukaryofjc translation inifiaSon factor 


19 




AK000933 


K&28661 


Homo sapiens cDNA FU10071 fis, clone HE 


19 




430375 


AW371048 


Hs.93758 


H4 hlstone family, member H 


19 




409650 


T08490 


Hs.288969 


HSCARG protein 


19 


^ A 

50 


413588 


AAS71014 


Hs.75432 


IMP (inosine monophosphate) dehydrogenas, 


29 


412719 


AW016610 




ESTs 


29 




438552 


AJ245820 


Hs.6314 


type I transmembrane receptor (seizure-r 


29 




439621 


AJ 200281 


Hs.123910 


ESTs, Highly similar to B34087 hypomefj 


29 




437050 


AA766420 




ESTs 


29 


55 


414256 


AW410035 


Hs.75862 


MAD (mothers against decapentaptegte, Dr 


29 


429343 


AK000785 


Hs.199480 


Homo sapiens, Similar to epsin 3, done 


29 




416941 


BE000150 


Hs.48778 


niban protein 


29 




421594 


R45689 


Hs.21889 


Homo sapiens cDNA FU 12978 fis, done NT 


29 




436213 


AA325512 


Hs.71472 


hypothetical protein FU10774; K1AA1709 


29 


60 


436823 


AW749B65 




ESTs. WeaWy similar to 138022 hypothefi 


28 


441266 


H15968 


Ks.293845 


Homo sapiens, done IMAGE:3502329, mRNA, 


28 




426761 


AI015709 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586l2022 (f 


28 




413746 


M133243 


Hs.171553 


ESTs. Moderatery similar to CLC3_HUMAN C 


2.8 




457733 


AW974812 




ESTs 


28 


65 


400860 






Target Exon 


28 


. 436165 


AJ373544 


H&331328 


Intermediate fJiament protein syncoiBn 


28 




433312 


A1241331 


Hs.131765 


ESTs, Moderately similar to 138937 DNA/R 


28 




431604 


AF175265 


Hs.264190 


vacuolar protein sorting 35 (yeast homo! 


28 




421717 


AF230924 


Hs.107187 


divalent caCon tolerant protein CUTA 


28 


70 


453912 


AL121031 




SW1/SNF related, matrix associated, acfi 


28 


435703 


AW630133 


Hs.83313 


GKD03 protein 


28 




431108 


AA991508 


Hs.105317 


ESTs 


26 




451040 


AA324743 


Hs.40808 


ESTs 


28 




443837 


A1984625 


Hs5884 


spindle pole body protein 


28 


75 


418198 


A1745649 


Hjl26549 


WAA1708 protein 


28 


450447 


AF212223 


Hsl25010 


hypothetical protein P15-2 


28 




409330 


AK001231 


Hs£3940 


hypothetical protein FU10369 


28 




413762 


AW411479 


Hs.848 


FK506-bindfog protein 4 (59(0) 


28 




433586 


T85301 


Hs.194397 


gb:ya7BdQ&s1 Scares fetal Over spleen 


28 


80 


424720 


M89907 


Hs.152292 


SWI/SNF rdated, matrix assodated, acfi 


28 


422027 


AL043100 


Hs.306319 


tatty acid amiae fryorouse 


28 




422717 


A1557623 


Hs.119475 


cold Inducible RNA-fcndmg protein 


28 




412843 


AF007555 


Hs.74624 


protein tyrosine phosphatase, receptor t 


28 




434237 


AF119908 


Hs£35516 


hypothetica! protein PR02955 


28 
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407255 AA012992 Hs^S6301 hypoflie&al prolan MGC1 3170 28 

431472 AK001023 Hs.255549 nucleotide binding protein 2 (ExoG 28 

423318 AW467064 Hs.5740 ESTs 2.8 

_ 454144 BE280478 Hs. 182695 hypothetical protein MGC3243 2.8 

5 452827 AI571835 Hs.55468 ESTs 2.8 

452040 AW973242 Ks.293690 ESTs, Weakly similar to 138022 hypotheti 2.8 

418830 BE513731 Hs.88959 hypotheDcal protein MGC481 6 2.8 

417517 AF001176 Hs.82238 POP4 (processing of precursor , S. cerev 2.8 

419286 AA236005 Hs. 128764 ESTs 2.8 

10 430287 AW182459 Hs.125759 ESTs, Weakly dmflar to IEU5JIUMAN LEUKE 28 

420911 U77413 Hs. 100293 CWinked fl-acetyighjcosarnine (GcNAc) tr 2.8 

428054 AI94B688 Hs.266619 ESTs 2.8 

409504 AA304961 Hs.699 pepQdyfprotyl bomerase B (cydopriiEn 18 

440080 AW051597 ESTs 2.8 

15 431609 AW792792 Hs.264330 N^cyisphingosine amidohydroiase (acid c 28 

447896 AI436124 H&294069 Homo sapiens cDNA FU 1 3384 fis, done PL 28 

431525 AA506556 Hs.6185 KIAA1557 protein 28 

442995 AA532511 Hs^88455 Homo sapiens cDMA: FU 23270 fis, done C 28 

433101 AW572317 Hs. 12082 Homo sapiens mRNA; cONA DKFZp566L203 (fr 28 

20 449052 AW0295O7 Hs.161102 ESTs 28 

418663 AK001100 Hs.41690 desmocdrm3 28 

442013 AA506476 Hs.10600 Human DMA sequence from done RP1 1453C1 28 

449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (oerb-b2, 28 

453038 AW292415 Hs.20509 HBVpX associated protein* 28 

25 419175 AW270037 WAA0779 protein 28 

426788 U66615 Hs.172280 SWl/SNF related, matrix associated, acti 28 

421091 W22821 rfoosornat protein 126 28 

413435 X51405 Hs.75360 carboxypeptidase E 28 

432651 AW973744 Hs.293100 ESTs 28 

30 452700 A1859390 Hs.288940 five-span transmembrane protein M83 28 

454148 AW732837 Hs.42390 nasopharyngeal carcinoma susceplibiHty 28 

419452 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 28 

425491 AA883316 Hs.255221 ESTs 28 

^ _. 414556 AW975063 Ks.343443 ribosomal protein L36 28 

35 435126 A1393666 Hs.42315 p104)mding protein 28 

435602 AF217515 Hs.283532 uncharacterized bone marrow protein BM03 27 

441128 AA570256 ESTs,WeaWysiinilartoT23273rrypotheti 27 

440340 AW895503 Hs. 125276 ESTs 27 

410339 AI916499 Hs.298258 ESTs 27 

40 430542 A1557486 Hs.119122 ribosomai protein L1 3a 27 

406789 AI041403 ribosomai protein 129 27 

424130 AL050136 Hs. 140945 Homo sapiens mRNA; cDNA DKFZp586L141 (fr 27 

449518 BE395253 Hs.30861 ESTs 27 

, _. 410099 AA081630 WAA0036 gene produri 27 

45 422000 M30599 Hs.1 10837 homeoboxAlO 27 

407110 AA018042 Hs.252085 Prader-WaiiyAngetman syrr*ome-5 27 

415443 T07353 Hs.7948 ESTs 27 

432745 AI821926 gb3it78f05j(5 NCLCGAP_Pf3 Homo sapiens 27 

409299 AAD45650 Hs.53125 small nuclear nborructeoproteln 02 poiyp 27 

50 424369 R87622 Hs^6714 WAA1831 protein 27 

417831 H16423 Hs.82685 C047 antigen (Rh-related antigen, integr 27 

424723 BE409813 Hs.152337 protein arginlne r^rnethYltransferase 3(h 27 

451428 AW083384 Hs.11067 ESTs,Hightys&niIartoT46395hypotheQ 2.7 

400263 Eos Control 27 

55 435750 AB029012 Hs.4990 WAA1G89 protein 27 

443804 AL13S352 Hs355883 ESTs, Weakly similar to 138022 hypotheti 27 

421709 AA159394 Hs.1 07056 CEM protein 27 

421077 AK000061 Hs.101590 hypothetical protein 27 

416359 AL042210 Hs.1 6493 hypothetical protein 0KFZp762N2316; KIAA 27 

60 442104 L20971 Hs.188 phosphodiesterase 4B, cAMP-spedfic (dun 27 

411605 AW006831 ESTs 27 

437133 AB018319 Ks.5460 WAA0776 protein 27 

407236 W79485 Hs.1 73980 nuclear matrix protein NMP200 related to 27 

, r 432125 AW972667 Hs.183006 Homo sapiens cDNA FU 12300 fis, done MA 27 

65 427657 AV652249 Hs.1 80107 polymerase (DMA directed), beta 27 

445636 AW105401 ribosomai protein L29 27 

443801 AW206942 Hs.253594 intron otohorhinophalangea! syndro 27 

420223 N27807 rtaomal protein L4 27 

427515 T79526 Hs.1 79516 integral type I protein 27 

70 447673 AW23987 Hs.1 82285 ESTs 27 

411960 R77776 Hs.18103 ESTs 27 

433212 BE218049 Hs.121820 ESTs 27 

448244 BE613416 Hs.336425 Homo sapiens, done MGC:17295, mRNA, com 27 

„ 441551 AA318224 Hs^98141 ESTs 2.7 

75 421883 X55C79 Hs.1437 grucosidase, alpha; ackl (Pompe disease, 27 

411562 AL050201 Hs.70769 hypothetical protein DKFZp586E1923 27 

434851 AA806164 Hs.1 16502 ESTs 27 

442875 BE623003 Hs33825 Homo sapiens done TCCCTA0Q142 mRNA sequ 27 

ork 421109 L32832 Hs.1 01842 AT-binding transcripfion factor 1 27 

CO 432450 AI990739 Hs.1 58184 ORF 27 

426174 AA547959 Ks.115838 ESTs 27 

445919 T53519 Hs.334692 hypothsCcai protein MGC14141 27 

423115 AA421973 Hs.169119 ESTs, WeaWy simflar to T25731 hypotheti 27 
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431222 X56777 Hs.273790 zona petludda glycoprotein 3A(spenn re 27 

451558 NM_001089 Hs.26830 ATP-oinding cassette, sub-family A (ABC1 2.7 

431615 AW295B59 Hs.235860 ESTs 27 

418649 A1096485 Hs.169341 ESTs, Moderately sfcfflar to SS5S57 alpha 27 

5 431122 AI267593 Hs-250535 Homo sapiens mRNA; cONA DKFZp434N2412 (f 27 

438682 AA354489 Hs^22103 EBP50-POZtnteractorof 64kD 27 

424954 NM_000546 Hs.1846 tumor protein p53 (L^raumeni syndrome) 27 

458082 AW978811 Hs^14451 ESTs, Weakly similar to ALU1_HUMAN ALU S 27 

427087 BE073913 Hs.173515 uncharacterized hypothalamus protein HTO 2.7 

10 412524 AA417813 Hs.44208 hypoflwfical protein FU 231 53 27 

435176 AA744875 Hs.189413 ESTs 27 

419390 AI701162 Hs.90207 hypothetical protein MGC1 1138 27 

428013 AF151020 Hs.181444 hypothetical protein 27 

441094 U33819 Hs.7647 MYOassodated zinc finger protein (pun 27 

15 451356 AA748418 Hs.164577 ESTs 27 

408461 AB037756 Hs.45207 hypothetical protein WAA1335 27 

418516 NML006218 Hs£5701 phcsphoinosi0de-3*inase, catalytic d 2.7 

423476 AL035633 Human DNA sequence from done RP5-1046G1 27 

444034 AL161957 Hs. 10177 pleckstrin homology domain interacting p 27 

20 404913 NM_024408-:rtemo sapiens Notch (Drosophil 27 

407378 AA299264 Hs.57776 ESTs, Moderately similar to 138022 hypot 2.7 

441054 AA913591 ESTs 2.6 

428500 A1815395 Hs.184641 fatly add desaturase 2 2.6 

430512 AF182294 Hs.241578 US snRNA-assodated Sm4Ike protein LSm8 2.6 

25 433301 AW296280 Hs.152016 Homo sapiens cONA: FU221 40 lis, clone H 2.6 

444381 BE387335 Hs2B3713 ESTs, Weakly similar to S64054 hypolheti 2.6 

443666 AI458179 Hs.41528 ESTs, Weakry similar to neuronal thread 2.6 

427378 BE515037 Hs.177556 melanoma antigen, family D, 1 2.6 

439414 NM.001183 Hs.6551 ATPase, H transporting, lysosomal {vacuo 2.6 

30 421866 M24470 Hs.1435 guanosine monophosphate reductase 2.6 

421605 6E440108 Ks.106127 RNA polymerase 1 16 kDa subunit 2.6 

426797 AW936258 Hs.342849 ADP-ribosylation factor-like 5 2.6 

410935 BE067395 Ks.66881 dynein, cytoplasmic, intermediate polype 2.6 

407887 AA579668 Hs.41072 serine (or cysteine) proteinase inhibito 2.6 

35 430341 NMJJ05348 Hs.239631 golfli transport complex 1 (90 kDa subuni 2.6 

424956 AW198103 Hs.18724 ESTs, Weakty similar to granule cell mar 2.6 

452748 AB011128 Hs.30512 Homo sapiens mRNA tor K1AA0556 protein, 26 

408212 AA297567 Hs.43728 hypothetical protein 2.6 

407B93 BE408359 Hs.43621 Homo sapiens, Similar to hypothetical pr 2.6 

40 425976 C75094 Hs.334514 NG22 protein 2.6 

448424 AW009892 Hs.31924 ESTs 2.6 

420164 AW339037 Hs24908 ESTs 2.6 

443444 AW952619 Hs.17235 Homo sapiens done TCCOA00176 mRNA sequ 26 

412774 AA120865 Hsi3136 ESTs 26 

45 445922 AI337316 Hs.147998 ESTs 26 

429925 NM-000786 cytcohrome P450, 51 (lanosterol 14-alpha 26 

419222 AD001528 Ks.89718 spermine synthase 26 

417327 NMJKW922 Hs.81964 SEC24 (S. cerevteiae) related gene famil 26 

420337 AW295840 Hs. 14555 Homo sapiens cONA: FU21513 fis, done C 26 

50 408743 AL110246 Hs.47367 KIAA1785 protein 26 

434011 AW953437 Hs.5486 clone FLB5214 26 

434976 A1953821 ESTs 26 

417454 NNL000202 Hs.172458 Wuronate 2-sulfetase (Hunter syndrome) 26 

448789 BE539108 Hs^2051 hypc^BcalprotetoMGC15548 26 

55 441049 W88920 Hs.29341 hypothefical protein FU22376 26 

437271 AL137445 Hsi8846 Homo sapiens mRNA; cONA DKFZp5660134 (fr 26 

425017 AL119305 Hs.26409 ESTs 26 

440333 AI378424 Hs.288761 hypothelical protein FU21749 26 

434551 BE387162 Hs.280858 ESTs, Highly similar to A35661 ONAexds 26 

60 413132 NM.006823 Hs.75209 protein kinase (cAMP-dapendent. catalyfi 26 

409619 AK001015 Hs.55220 BCU-assodated athanogene 2 26 

444550 BE250716 Hs.87614 ESTs 26 

446258 AI283476 rK263478 ESTs 26 

_ 440191 AI990417 tubulin, beta 5 26 

65 453187 AI161383 Hs34549 ESTs, Highly simlar to S94541 1 clone 4 26 

412133 U83460 Hs.73614 solute carrier family 31 (copper transpo 26 

426841 A1052358 Hs.193726 ESTs 26 

435937 AA830893 Hs.1 19769 ESTs 26 

425960 AW410646 Hs.1 64649 Ivypotheficai protein OKFZp434H247 26 

70 456161 BE264645 Hs.282093 tiypdheficai protein FU21 91 8 26 

456236 AF045229 Hs.82280 regulator of G-protein signalling 10 26 

435025 T08990 Hs.4742 anchor attachment protein 1 (Gaalp, yeas 26 

416751 T48130 Hs.5897 Homo sapiens mRNA; cONA DKFZp586P1622 (f 26 

458946 AA009716 Hs.42311 ESTs 26 

75 424762 AL1 19442 Hs.1 83684 eukaryofic translation Matlon factor 26 

449458 A1805078 Hs^08261 ESTs 26 

432409 AA806538 Hs.1 30732 K1AA1575 protein 26 

425184 BE278288 Hs.1 55048 Lutheran blood group {Auberger b anfigen 26 

442336 AW34095S Hs.7572 ESTs 26 

80 445622 AF106697 Hs. 12971 (htoredoxin reductase beta 26 

440409 AW294316 ESTs 26 

444610 AM 74783 gfcHA2501 Human fetd Over cDNACbrary 26 

425883 U43604 Hs.1 59901 Human unidentified mRNA, partial sequenc 26 
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438972 AA284679 Hs.25640 daudin3 2.6 

426803 BE313114 Hs.297C6 ESTs 16 

431183 NM_006B55 Hs.25Q696 KDEL (lys-Asp-Qu-Leu) eratoptasmic refic 2.6 

457638 A1792670 Hs.144405 ESTs 2.6 

442821 BE391929 Hs.8752 transmembrane protein 4 2.5 

435522 N84214 Hs.9774 symwial sarcoma traislocaSon gene one 2.5 

406764 AA429825 Ks.343443 rfcosomal protein L36 2.5 

422032 AA476966 Hs.11 0857 polymerase (RNA) 111 (DNA directed] pofy 2.5 

409125 R17268 Hs.343567 axonai transport of synaptic vesicles 2.5 

416801 X98834 Hs.79971 sat (DrosophteHite 2 2.5 

446091 AW022192 ESTs 2.5 

419848 AA251242 Ks.103238 ESTs 2.5 

415015 NMJW2109 Hs.77798 NsfidyWRNA synthetase 2.5 

431838 AF178532 Hs.271411 beta-site APP-deavmg enzyme 2 2.5 

414987 AA524394 Ks.294022 hypothetical protein FU1 4950 2.5 

447087 AW403870 Hs.301872 hypothetical protein MGC4840 2.5 

456050 R79445 Ks.76230 rtbosomaJ protein S10 15 

426178 AA371409 Hs. 105664 hypotheScal protein FU22294 25 

414528 AA148950 Hs.1 68836 ESTs 15 

426494 AL1 19528 Hs.170098 K1AA0372 gene product 15 

407656 AW747986 Hs.37443 Homo sapiens mRNA; cONA OKFZp434B2119 (f 25 

437330 AL353944 Hs.50115 Homo sapiens mRNA;cDNAOKF2p761J111 2 (f 15 

449139 BE268315 HsJ23111 phenytalanine-tRNA synthetase-like 15 

448153 Y10805 Hs.20521 HMT1 (hnRNP methyitransferase. S. cerevi 25 

417386 AL037228 Hs.82043 D1 23 gene product 25 

409420 Z15008 Ks 54451 brninin, gamma 2 (rucein (100kD), kaJlnl 15 

408796 AA688292 Hs.170345 ESTs 15 

407627 A1419020 Hs.62620 chromosome 6 open reading frame 1 15 

421535 AB002359 Hs. 105478 phosphonTwsytfom^g^nanrofine syntha 15 

415862 R51034 Hs.144513 ESTs 15 

426418 M90464 Hs.169825 collagen, type IV. alpha 5 (Alport syndr 15 

436557 W15573 Hs.5027 ESTs, Weakly similar to A475B2 &cel) gr 15 

413517 N76712 Ks.44829 ESTs, Weakly similar to 138022 hypolheti 15 

449500 AW956345 Hs.12926 ESTs 15 

457090 AL080243 Hs.180920 nbosomal protein S9 15 

445389 NM.016831 Hs.1 2592 period (Drosophila) homolog 3 15 

410006 AW732308 Hs.57783 eukaryolic translation initiation factor 15 

452906 BE207039 H&306123 MAGEF1 protein 15 

450341 N90956 Hs.17230 hypothetical protein FU22087 15 

419430 A1878942 Hs.90336 ATPase, H transporting, lysosomal (vacuo 15 

440099 AL080058 Hs.6909 DKFZP564G202 protein 25 

452518 AA280722 Hs.24758 ESTs, Weakly similar to 138022 hypotheti 15 

431843 AA516420 ESTs, Weakly similar to 138022 hypotheti 15 

424795 AW102850 Hs.153177 nbosomal protein S28 15 

448019 AW947164 Hs.1 95641 ESTs, Moderately similar to 138022 hypot 15 

450704 H85157 Hs.40696 ESTs 15 

400262 Eos Control 15 

435266 AK001942 Hs.4863 hypothetical protein DKFZp566A1524 15 

418039 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypotheti 15 

429491 NMJ012111 Hs.204041 chromosome 14 open reading frame 3 15 

456844 A1264155 Hs.1 52981 COP-dacylglycefol synthase (phosphaSda 15 

413781 J05272 Hs*50 IMP ftnostne monophosphate) dehydrogenas 15 

437450 AL390154 Hs.26954 Homo sapiens mRNA; cDNA OKFZp762G123 (fir 15 

424005 AB033041 Hs.1 37507 vang (van gogh. DrosophOaHike 2 15 

429332 AF030403 Hs.199263 Ste-20 related kinase 15 

438572 BE267017 Hs.6315 acetylserotorrin 0^e%ltransferase^ike 15 

425692 090041 Hs.155956 N-acetyltransferase 1 (arytamine N-acety 15 

431188 W05656 Ks.169755 ESTs 15 

408803 AF248953 Hs.24049 golgi autoanfigen, gotgjn subfarrfly a, 2 15 

414482 S57498 Hs.76252 endotheEn receptor type A 15 

437762 T78028 Hs.1 54679 synaptotagmln I 15 

425722 AI659076 Hs.97031 hypotheScal protein MGC13047 15 

433681 A1004377 Hs^00360 Homo saptens cONAFU13027fis. done NT 15 

451585 AK001171 H&326422 hypothetical protein MGC4549 15 

424959 NVL005781 Hs.1 53937 activated p21cdc42Hs kinase 15 

407151 H25836 Hs.301527 ESTs, Moderately simHar to unknown [Us 15 

446054 AB014537 Hs. 13604 WAA0637 gene product 15 

452556 H78517 Hs.33905 ESTs 15 

421026 AL047332 Hs.1 01 067 GCN5 (general control of ardrtr>add synt 15 

407687 AK002011 Hs.37558 hypothefical protein RJ1 1149 15 

433683 AI817723 Hs^2678 hypothefical protein FU21832 15 
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Pkey CAT Number Accession 

415989 10194 1 BC01 3389 BC01 7398 AI023543M1 91 424 At2877ffl)Al4696^ 

BG285837 AI720344 BF541715 AA355086 AA172236 
429220 15103J AW341473AA448195AW207206AI951341 AA969259 
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449625 249224.1 BI918168 AW779760 N48674 AI37S997 AA235370 BG699146 A1913631 AI498402 A1016320 AA323193 R49021 DS9344 BG98675Q N45526 
BG986917 T61382 R49391 R45432 A1203107 R35004 F07491 R25094 R35360 

412446 63467.1 BC021735 AI 66921 2 AL120184 A/769949 BE701002 BE1 84363 BE619031 8G702238 6F090049 BF963318 8F981912 6F943013 AA934514 
AA151245 BF960659 AA9S7907 Z41449 BF908059 BF908053 BF908049 BE699424 BF9080SO BF962832 BF952020 BF963134 BI035538 
BF908052 BF908057 8F090026 BF943158 AI632924 BF512340 BF952021 BF960776 BF943437 8F942847 AI768015 F09778 F04816 F02721 
AA10264S AI633838 AA617929 BF947001 BI035448 BE935876 AW890837 AW898604 BF957405 BF963433 BG70481 5 

433404 7392_1 BC022980 BF224081 BG149908 AW672842 BE670687 A17021 61 AW341B32 BE222503 N71 836 AI026061 AW9531 16 AW083132 A1979261 

AV725377 AI423298 A1640707 AW675518 AI032611 AI818044 AI299508 AI91 1386 A1270418 BE219257 BM141826 AA826491 Z25159 AA587421 
N59447 239436 T32982 R54110 BF115783 F09044 BFBC8433 

453160 6028.5 BC009612 NWL003526 BI597616 AV761592 AV760377 AL601008 BI604131 BE645918 BG187760 BG181525 BG210634 BG192999 AI263307 

AA344186 AW952966 AA033609 AA037562 AA722183 R79452 H70775 BF674991 BE769437 BG007855 AA0374B3 AW572535 AM43991 
AA084581 AA033610AV742510AV735788 R08336 

42021B 191547J AW958037 R42557 A1337047 AA948360 AI638005AA459950AI624915 AI638047 AM67856A1521826AA860305AI932315AW003092AW271756 
AW779380 AA609879 AJ634791 AI493770AJ565211 Z411 45 AI527952AA303734BE343457AW1 96765 AA256527 BE089727 

437124 59408J AL050013 8G939500 AW969191 AA769925 AJ377973 A1625545AA81 1365 AA521 114 N24705A1379579 AA424899 A1684671 AA829715 AI453010 
N35401 AA677452 AA504340 AI209149 AA883574 AI379062 AI084455 A1280147 AA644327 BF432508 N27873 N47364 N34880 AI147024 T86860 
A121 9716 AA96C926 H25544 BI857123 AW960489 AA599099 AA765246 AA328537 BG434703 AA890373 AA424765 AI292318 AA829886 N95742 
AI218758 H25588 N36282 AA024987 N36687 61919187 N49471 AA889970 AW166152 AA468546 AI262504 A1452782 AA554458 AA807080 
AA724542 A W1 02730 AA909978 AW1 18134 AW827241 W56431 BM127381 8F436987 AI016509 AW663972 BM127686 C15552 N63435 N51744 
T98800 N56980 BG108638 N49381 R49886 D61278 BI756612 AA508234 R49885 BF850422 

438869 52134.1 AF075009 R63109 R63058 

428342 6712 J AK056315 AI015524 AA724079 BI713619 AI377728 AW293682 AI928140 AI092404 AI085630 AA731340 BM469629 AW968804 AM25658 

AA769094 BF446026 AW1 18719 AI332765 AW5008B8 AW576556 A1859571 AW499B64 AW61 4573 AW629495 AW505314 W74704 AI356361 
AI923640 AW070509 A1521500 AL042Q95 AA609309 AA761319 AI381 489 H45700 AA761333 AW265424 AA909524 AA63531 1 AA649040 
AI392620 240708 A1985554 AW263513 AA913892 AI693486 AW263502 A1606164 AW291 137 B1061 872 BI059498 AA134476 AW084888 AA036967 
AW370823 T55263 BI002756 AA489664 BF827261 W74741 BF963166 

450203 19009J AKD55952 BG182168 BG220105 BG191569 BG188964 BG1 87388 BG220104 BG183714 BE645998 AI819354 AW974068 A1393635 AI580846 
AI024796 AWO20098 BI491 127 A1393644 N74993 AW472959 BM478854 BI597437 H12165 B1458612 BE543192 

432586 6633J BC022881 AU150944 BG750783 AW754175 AW857737 A191 1659 AI05OO36 AA554053 AI826259 AA568548 

450377 12109.1 AB033091 AL520743 BE81 1813 N53332 N99716 AL561910 AA280655 BE710392 AV705100 AW29397B AW444556 AA281459 AI679751 AJ873695 
BG700891 B1553517 R80518 BG779771 BG534451 AA479402AW961580 BF061430 A1B57643AI768488AW51211 8 AA479302 AW770384 
AW072470 A1041596 AI049699 AW592865 AA976261 BE879747 BG114119 AW183811 AA909938 BF571621 BF350794 BF351375 BE925699 
AL050294BC010371 BF982270AL042656BF095732AW812618BF095731 BG212397BF678765B1038602BG388664 AW675337 BG289398 
BE939598 AW805570 AA527097 AW1 50540 AI 693720 AA743364 AI915793 N48185 AI573107 AA043474 AI35161 5 A1969490 AI910763 R50866 
AI699181N73808H08164 

436063 5483J AK000028 AA494483 AI298674 AA720773 AV761529 AI884670 AI936202 AW294235 D61652 BF881 184 AV71 1384 N27154 AI926970 AV734970 

N40094 N28596 AA884747 AA512890 BG436593 
424036 6226.1 NMJJ33445 BC001 1 93 AI885781 BF794032 AA476620 AA810906 AA8 10905 AJ291 244 AI885097 AI359708 A1335629 H97395AI344589AA300377 

AA457566 AW771833 BE465621 A1364088 AI 364452 A1648505 A1918342 AI928670 AA886580 AL531 029 AA886344 A] 1 86419 BG329096 

BM045465 AL531028 BG437151 BE868021 AA179427 
407819 7393.2 AK056626 AI800896 BF939022 BE644718 AI954754 BE218177 BE348567 AI962406 AW293122 A1968798 AI457321 8E327228 BG913531 

AW939055 T30280 R54166 Z43366 BG819153 BF003119 BE646274 BF940881 R18246 R42185 
409151 4123.1 N^(XM892AF(W7442BE275338Bm4863Bl917206BE276993Al602308AA306^ AW993481 AW993283 

AW99291 9 AW992921 AW992980 AW992861 AW993220 BG573124 AA456385 AA234796 AI902726 AA354813 BI092644 BG778400 BI260O01 

BG007325 AI267455 AA426574 AI160782 AI472186 AA255500 AA434006 BG435520 AI3561 1 1 H00525 AV749Q60 BG944497 BG292031 

AA902153 C04925 AA902160 AA383100 AW073533 AA256706 AA 150809 R65766 AW958448 BE090972 BF693195 AV738979 R65855 R80138 

AA484677 

422890 61426.1 AK057805 AW162343 A!1 90479 A1093318 BE048820 AJ198397 M654667 BE219303 Z33851 F02655Z28734T16575F10145Z45266AW572911 
AW964436 AW0O4O3O AI632565 BE502530 BT792383 BF056928 AA449241 AI651825 AA805324 A1264863 AW19691 8 AA948267 A) 953735 
A1263703 BF056387 AW594171 AI867447 AA319159 A1903440 AW9561 10 AI366013 A1867923 BG91 1906 081142 C15616 AL538697 Z25032 
Z43784 R13382 AV746924 AA4493S9 AA318815 BF364265 H17038 H10064 F04161 T87230 R40898 AW204071 BI819428 AA683393AA683376 
Z43192T74078T05103F12527 T77951 H10118H17037 BF855407 R19603 

451752 10408.5 AB032997AJ141678AW978722BE467119Ai761408BF727385AW237035AI934521 BF436248 AI479568Z40632AA832081 AW295901 

BF057835 BE465977 AI 621 269 BE465983 8F756369 N74056 AI817896 AA716567 AA934774 K62600 H09497 BF943762 8E395335 BE883333 

432363 1234917.1 AW970240 AA534469 AW970323 

417379 1610005.1 AA1 96390 AA507837AA1 96468 

414922 1563.2 BG107484 AA632009 AI432670 AI656660 AI650884 AI521919 AI 264653 AW150793 AW611894 A1917098 BI091245 AI651454 BF434889 AI580286 

A1880735 BE301995 AJ392959 AW613965 BM023628 AW515374 AK60102 BM023318 BE328188 A19S2820 AI581363 AA557165 A1695677 
AL562079 AI700925 AI470561 BF063058 AW196387 AU132984 BI064046 A1970157 R02122 H55924 AJ521721 AA808206 AA725223 AI766003 
AW339821 AAB05951 AI287869AW564827 

419733 761*_3 AKD27321 WB3676 W63789BE046412BF114614BE646183BM126230 AW044233 A1951970 AW663548 AI1 39947 AA51 4302 AA846232 

BM126251 AA789002 AA581966 AA809643 AW168870 BE706664 BE706539 BE153177 BF084925 AL133779 AW961788 AA659693 AA347970 
AA295134 AA526037 M449282 BG190454 H61476 T91396 N20018 T901 14 H75644 BE710736 BF687723 H28581 AA249370 BF726698 
BE841554 BI045099 T84625 AW12987B BG770825 

432675 1237917.1 AW973834 AI791932 AI791855 AT732640 AA558833 AA559897 AI821610 

410297 2990J BO013939 B1494690 B149121 1 AJ928393 AA843540 BG938644 AU 185628 B1495842 AW173255 BM052709 AI743999 AI690144 AI922209 A1740907 

AW340368 AA928759 AW1 18737 BF513970 AA707807 BF435295 AI339463 AI373642 AJ433809 BE222392 AA602308 AA428261 AI460355 
AW662760 AI888087 AI342098 AA72241 8 W78151 N64382BE221848AW025901 AA45212) AJ150479 AI016166 AA779515 AA661791 BM474307 
Bi91 1 169 BG575154 AW953303 T33604 D59141 AA385785AA148848 BM461981 BG681 1 58 BI602483 BE889592 AW95431 1 BM052986 81962893 
AI989299 Z42328 BF029504 T35668 BG402602 AU185770 A1023271 AA1 4771 9 AI 434079 AI 569000 AI 276488 AA992453 AA342821 AA64S303 
AI349364 A1051O08 AA926941 AA350894 AW071451 N22249 AI7841 38 AA083847 N22258 AW440825 AA661570 AA376687 AA659125 A1356299 
R70463 A138358S AA827189 B1494872 AW021094 B1494871 AA905500 AA460923 BI492041 AW028965A1624611 BG271780A1497723 H88862 
D59858 N89979 AA658425 N81 154 D62341 A1274437 N66697 H96993 A1370663 AA728850 H05232 R59374 H12223 AI935759 A1362553 D60006 
N29572 Af916833 M75273 M148710 BI5971 17 BG740471 AA332671 AA333874 AA643052 AW020175 R70550 BG623469 AA452342 AW965441 
W19723 R32966 BE883841 R61003 AI910374 AI865262 R55325 A! 468927 R34681 H96211 Z39807 BF954386 

447620 687223.1 AI973051 AJ400921 AF796154AW241817AW290951 

407192 2200202J AA602964 AA6O92O0 

434194 62680.1 AF119847 AA437»1 AA436967 AJ132965 

448663 16112J BE968763 AA659765 AI961656 AI520918AA761743AA281477 NS6431 BE463652 AA28 1 329 AW272944 AA058687 BM1 45087 AA0455 16 

AW341820 AA112515 AA258766 AI886639 AA71 41 33 AA768245 AA035533 A1630459 T20165 AW971268 BE966269 AA522722 
429163 1238297 J AW974271 AA592975 AM47312 AA884766 
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434747 
437179 



448044 
424339 



431416 
434974 



431155 
409927 
447881 



433009 
412719 



437050 
436823 
457733 
453912 



440080 
419175 



452541 1 
647290.1 
39576J 



414222 18695J 



412652 18858.2 



418166 18858J 



117643.1 
12239.1 



1111791 1 
50559.1 



418259 133853 J 



411089 5597.6 



12091S.1 
77302.1 



1235742.1 
2333801.1 
44623J 



2142268.1 
1634.2 



1240141.1 
WH1660.153 
119160.1 
32562.3 



517737.1 

J 



AK001536 AK056135 BM474813 BE887303 AK022914 AW581996 AW812945 BE882302 M134266 B1043673 AA019433 B1862088 BM468657 
AU128438 BE384458 AL353967 BIB57117 BF686525 BI465223 BM460132 AU129877 B1222283 BG171592 BI043544 BG496295 BG750710 
BI256542 BG108520 AU 150719 AW510354 A15542S6 AU53968 AA191092 BF132635 
AF150424 Al 861896 AA570057 AV738855 
AW182329 AA613792 T05304 AW858385 

BE644758 AI082238 BF940027 A1201079 A1436035 AW275966 AJ085394 AI291655 AW070441 A1474134AI268978AI769279AI567682AA693941 
BF477668 AW664149 AA283782 BF509538 AW2968S8 AI268977 A) 168 133 BM352065 AI262769 BFS41976 Al 056920 AM8 1861 BTO697 
AL565888 8M352383 AA4277S8 AA385346 AI186988 AA931 831 M134972 BF217480 BF1 1 1012 AA908246 AA319849 AA318136 AL514271 
BF364291 AL515057 AV70231 2 AA377395 AL544217 AI341000 AW193583 AI350789 AA888338 BF945380 AW879092 AA 130839 T91066 N92326 
AI004389 AA078832 AL572370 W04622 BE314003 AW980808 BM360872 AA3191 60 AA1 30778 AL514257 

BC021085 AL527872 AL526296 AL557087 B125509Q AU143499 AL560356 BG823170 BE736988 AU141 388 AL58Q262 BI764173 AU120299 
AU141755 AU141988 AU142941 AU141330 AU141784 BI770885 BI91 1394 BE901426 BI918039 B6760842 BE883026 B1254740 BE732690 
BG430761 BE792868 AW328267 AL135173 AA1 02674 BF726986 AL564735 8E155962 BE155979 BF74 1679 H67776 H59234 H89665 AW1 17774 
AW274435 129008 NMJJQ3104 U07361 BF002824 BI222949 AI458045 AW951537 BF195474 A1669049 AI042523 BE463928 AU160125 AU 160580 
AW069877 AW015214 A1946718 BE219706 AI953605 BE217755 AI991382 AI245005 BF431 179 AI521 B43 BE048908 AJ420449 AW275385 
AA336950 BE501521 BF740558 AA311404 AA384639 BI772535 BG473076 BE891298 BE246928 BM012988 BE242693 BE901342 BG746358 
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TABLE 68C 



Pkey: 
Ref: 



Unique number corresponding to an Eos probeset 

Sequence source. Trie 7 digit numbers in this column ere Genbank Identifier (GJ) numbers. 'Dunham I. ei al." refers to the pubication entitled The DNA 
sequence of human chromosome 22/ Dunham letal, Nature (1999) 402:489-495. 
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Strand: trioleates DNA strand from which exons were predicted. 
NLposiOon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposiSofl 


403047 


3540153 


Minus 


59793-S9968 


401424 


8176894 


Rus 


24223-24428 


401451 


6634068 


Minus 


119926-121272 


401747 


9789672 


Minus 


118596-118816,119119.119244,119609-11976 


404632 


9796668 


Rus 


4509645229 


403046 


3540153 


Minus 


55707-55859,55369-56511 


404922 


7341893 


Rus 


13248-13428 


401519 


6649315 


Plus 


157315-157950 


401197 


9719705 


Rus 


176341-176452 


401866 


6018106 


Rus 


73126-73623 


405387 


6587915 


Minus 


3769-3833.5708-5895 


403752 


7678857 


Plus 


33704-33828 


404210 


5006246 


Rus 


169926-170121 


401785 


7249190 


Minus 


165776-165996,166189-166314,166408-16656 


406214 


7342036 


Rus 


86320-86523 


403532 


8076842 


Minus 


8175041901 


400860 


9757499 


Minus 


151830-152104,152649-152744 


404913 


7341740 


Rus 


97717-97976 



Table 69A Dsts about 200 genes that are upregutated in human umbilical vein endothelial ceils (HUVEC) compared to normal body tissues. These genes were selected from 59680 
pro besets on the Eos/Affymetrix Hu03 Genechfp array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (A!}, a normalized 
value reflecting the relative level of mRNA expression. 

Table 70A lists about 148 genes that are deregulated using the mean of vessels (veins and arteries) compared to the mean of HUVEC. These genes were selected from 59680 
probesets on the EosfAflymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting relative level of mRNA expression. 

TABLE 69A: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unlgene Title: Unigene gene title 

R1: Ratio of the mean of HUVEC ATs to the mean of the normal body tissue ATs 



Pkey ExAccn UnigenelD Unigene Tilie R1 

424806 AA382523 Hs. 105689 MSTP031 protein 1.51 

418036 Z37976 Hs.83337 latent transforming growth factor beta b 1.94 

444381 BE387335 Hs^83713 ESTs, Weakly similar to S64054 hypotheQ 1.60 

442802 AL133035 Hs.8728 hypothetical protein DKFZp434G171 1.73 

417944 AU077196 Hs.82985 collagen, type V, alpha 2 1.48 

410480 R97457 Hs.63984 cadherin 13, H-cadherin (heart) 1.91 

452516 AA058630 Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 1.59 

425139 AW630488 Hs.25338 protease, serine. 23 < 1.63 

402463 NMJ>14624:Komo sapiens S100cakAim*ind 1.50 

423798 AF047033 Hs. 132904 solute carrier family 4. sodium Hcarbon 1.31 

412568 AI878826 Hs.74034 caveoSn l.caveobe protein, 22kD 1.53 

419948 AB041035 Hs.93847 NMJ)16931:Komo sapiens NADPH oxidase 4 ( 2.06 

447526 AL0487S3 Hs.303649 small inducible cytokine A2 (monocyte ch 2.10 

446500 U78093 Hs.15154 susN-repeat-ccnlaWng protein, Xchrom 1.67 

441457 AW996651 Hs.43838 ESTs 1.30 

408296 AL1174S2 Hs.44155 DKFZP586G1517 protein 1.61 

418994 AA296520 Hs.89546 seiectin E (endothelial adhesion molecul 1.87 

413795 AL040178 Hs. 142003 ESTs 1.64 

441689 A1123705 Hs.289068 ESTs 1.44 

412567 AI750979 Hs.74034 Homo sapiens clone 24651 mRNA sequence 1.75 

424432 AB037821 Hs. 146858 protocadhedn 10 1.65 

432673 AB028859 Hs.278605 DnaJ (Hsp40) homotog, subfamily B, membe 1.31 

418683 U90908 Hs.87241 hypothetical protein from clones 23549 a 1.42 

453085 AW954243 WAA0251 protein 1.47 

438887 R68857 Hs.265499 ESTs 1.49 

436729 BE621807 transmembrane 4 superfamlly member 1 1.91 

400494 ENSP00000238OT:CIG30 (Fragment). 1.34 

442506 BE566411 ESTs 154 

425023 AW956889 Hs. 154210 endothelial differerrfiaSon, sphmgotipi 155 

414476 AA301867 Hs.76224 EGF-containing ftjuMke extracelluta 2.65 • 

452933 AW391423 Hs^88555 Homo sapiens cDNA: FU 22425 fis, clone H 1.62 

422389 AF240535 Hs.115697 protocadherin 12 1.38 

417124 BE122762 Hs^5338 ESTs 2.13 

433681 AI004377 Hs.200360 Homo sapiens cDNAFU 13027 fis. done NT 1.31 

419983 W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1 624 (f 1.54 

414776 AA1 55598 Hs.2 12839 hypothetical protein FU 14195; WAA1 71 4 1.51 

450534 A1570189 Hs.25132 K1AA0470 gene product 1.43 

449618 AI076459 Hs.15978 K1AA1272 protein 1.42 

446098 AW072215 H&208470 ESTs 1.53 

413281 AA861271 Hs.222024 tramcriplton factor BMAL2 1.40 

448674 W31178 Hs. 154140 ovary-specific acidic protein 1.47 

407061 X97748 gb:ftsapiens PTX3 gene promotor region. 1.33 
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434846 
408570 
436772 
453789 
424259 
441020 
401234 
420174 
445684 
418693 



412330 
412646 
434523 
433376 
422099 
449394 
445119 
438142 
437478 
422809 
453459 
442711 
428428 



417640 
407252 
418825 
417805 



415714 
446984 
421778 
421218 
452298 
434596 
412351 
433374 
429113 
423099 
435705 



AW295389 
AL046406 
AW975688 
AA628517 
AK001776 
W79283 

A1824144 

AK001696 

A1750878 

AUD77012 

NM_005100 

NM.006825 

AA703709 

AI249361 

AA156022 

AA0O4368 

AF035121 

T90309 

AL390172 

AK001379 

BE047032 

AF151073 

AL037544 

X06256 

D30857 

AA659037 

AA228881 

U38545 

NM_000020 

NM.002290 

AB020722 

M428000 

NM.000499 

AI039243 

T59538 

AL135960 

AI821409 

D28235 

NM-002B37 

AA782114 

AA319798 



453983 
406506 



Hs.119768 ESTs 

Hs.103483 WAA1798 protein 

rretatoihbneiri 1E (funcfonal) 

Hs.118502 ESTs 

Hs.143954 hypothetical protein FU1Q914 

Hs.35962 ESTs 

mitogen- activated protein kinase 8 inter 

Hs. 199749 ESTs 

Hs.13109 Ran binding protein 11 

Hs. 87409 thrombospondjn 1 

Hs.288582 ESTs. WeaWy similar to ubiquitous TPR m 

Hs.788 A kinase (PRKA) anchor protein (gravin) 

Hs .74368 transmembrane protein (63kD), endoplasmi 

Hs.23410 transtocase of inner mitochondria! membr 

Hs .74122 caspase 4, apoptoste-related cysteine pr 

Hs.111518 hypoMca) protein 

Hs.18160 Homo sapiens cDNA RJ1 1 550 fis. clone HE 

Hs.1 2337 kinase insert domain receptor (a type It 

Hs.269551 ESTs 

Hs.317432 branched chain aminotransferase 1, cytos 

Hs.121028 hypothetical protein FU10549 

Hs.257789 ESTs 

Hs.8545 hypothetical protein 

Hs. 184298 cycfin-dependent kinase 7 (homotog of Xe 

Hs.149609 integrln, alpha 5 (fibronedin receptor, 

Hs.82353 protein C receptor, endothelial (EPCR) 

Hs.163780 (AB075832) KIAA1952 protein [Homo saplen 

Hs.22394 hypothetical protein FU10893 

Hs.82587 pncsphdipase 01 , phophattdytcholine-spe 

Hs.172670 activtn A receptor type (Hike 1 

Hs.78672 larntnin, alpha 4 

Hs.1 6714 Rho guanine exchange factor (GEF) 15 

Hs.283072 acfin related protein 2/3 complex, subun 

HsJ2912 cytochrome P4S), subfamiry 1 (aromatic c 

Hs.278585 NM-024756; Homo sapiens hypothetical pro 

gb:yb65g12.s1 Stratagene ovary (937217) 

Hs.73828 T-cefl acuta lymphocytic leukemia 1 

Hs.304471 EST 

Hs.198384 prostaglaniftn^ndoperoxide synthase 2 (p 

Hs, 1 23641 protein tyrosine phosphatase, receptor t 

H&28043 ESTs 

HsJ298581 eukaryotic translation elongation factor 

Hs.16450 ESTs 

Target Exon 



424503 


NMQ2205 


Hs.149609 


mtegrin, alpha 5 (fibronecfin receptor, 


452924 


AW580939 


Hs.97199 


complement component C1q receptor 


426696 


AW363332 


Hs.171844 


NM.006505 Homo sapiens poBovirus recept 


418941 


AA452970 


Hs.239527 


E1B-55kDa-associ3ted protein 5 


450152 


AI138635 


Hs.22968 


mtron of VEGFR 


437269 


AA334384 


Hs.149420 


ESTs 


445279 


R41900 


Hs.22245 


ESTs 


409509 


AL036923 


Hs.322710 


ESTs 


410240 


AL157424 


Ks.61289 


synaptojanin 2 


424711 


NMJJ05795 


Hs.152175 


calcitonin receptor-tike 


449318 


AW236021 


Hs.78531 


Homo sapiens, Similar to RIKEN cDNA 5730 


407862 


BE548267 


H&337986 


Homo sapiens cOMA FU 10934 Gs, clone OV 


410057 


R66634 


Hs.268107 


mufiknerin 


446727 


AB011095 


Hs.16032 


KIAA0523 protein 


412564 


X83703 




cardiac ankyrin repeat protein 


414786 


A1246482 


Hs.243010 


Homo sapiens ras ho mo log gene family, me 


410276 


AI554545 


Hs.71832 


angiopotetin-2 


406627 


T64904 


Hs.163780 


ESTs 


405025 






Bora morphogenefc protein 6 


422648 


086983 


Hs.1 18833 


Melanoma associated gene 


452852 


AK001972 


Hs.30822 


hypothetical protein FU11110 


427912 


AL022310 


Hs.181097 


tumor necrosis factor (iigand) superfami 


432004 


BE018302 


Hs^894 


placental growth factor, vascular endoth 


430129 


BE301708 


KsJ233955 


hypothetical protein FU2O401 


429276 


AF056085 


Hs.198612 


Q protein-coupled receptor 51 


407704 


8E315072 


Hs.78768 


maSgnant cell expression-enhanced gene/ 


448694 


AA478756 


Hs.194477 


E3 ubiquifin Ugase SMURF2 


434398 


AA121098 


Hs.3838 


serum-indutfble kinase 


416851 


AW963951 


Hs.85618 


ESTs 


412420 


AU335668 


Hs.73853 


bone morphogeneOc protein 2 


416179 


R19015 


Hs.79067 


MAO (mothers against decapemapiegtc, Or 


450689 


A1369275 


Hs.243010 


Homo sapfens cONA FU14445 fb, done HE 


416658 


U03272 


Hs.79432 


CbrBSn 2 (congenital ccntractural era 


425235 


AA353113 


Hs.112497 


Homo sapiens cDHk. FU22743 6s, clone H 


428206 


AB020643 


Hs.183006 


WAA0838 protein 


412755 


8E144306 


Hs.179891 


ESTs, Weakly similar b P4HAJtUMAN PROLY 


456977 


AKD0Q252 


Hs.169758 


hypothetical protein FU20245 


456977 


AK000252 


Hs.1 69758 


hypotheQcal protein FU20245 


427897 


NM_017413 


Hs.303084 


apefin; peptide Bgand for APJ receptor 



1.29 

1.47 

1.B1 

1.41 

1.35 

1.62 

1.48 

1.75 

1.49 

1.89 

1.77 

2.02 

1.64 

1.26 

1.49 

1.56 

1.54 

1.40 

1.45 

1.44 

1.42 

1.51 

1.42 

1.32 

2.67 

2.47 

1.76 

1.40 

1.35 

1.55 

1.90 

1.71 

1.88 

1.75 

1.66 

1.52 

1.61 

1.45 

2.14 

1.50 

1.53 

1.52 

1.94 

1.97 

2.32 

2.36 

1.61 

1.41 

1.70 

1.60 

1.59 

152 

1.61 

1.96 

1.81 

151 

1.85 

1.67 

1.51 

1.71 

1.91 

1.76 

1.69 

133 

1.78 

1.79 

Z02 

1.93 

1.65 

1.48 

2.01 

1.83 

1.65 

1.69 

1.71 

1.86 

1.65 

1.83 

1.77 

100 

1.61 

1.61 

1.94 
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425234 
444409 
421340 
445701 
444009 
409220 
416940 
418922 
431548 
434927 
434558 
438183 
428438 
453365 
418058 
439410 
453467 
420209 
417933 
436420 
414430 
412140 
442104 
425397 
418506 
413745 
405121 
425811 
415765 
431089 
421937 



416815 
418203 
408243 
410)54 
407891 
407891 
436032 
439382 
419172 
442006 
447560 
432128 
417426 
434217 
439265 
442923 
433001 
418741 
442573 
422544 
449722 
413794 
422603 
406964 
448231 



429663 
418738 
418185 
417308 
407975 
420256 



431728 



413132 
451979 
415138 
414577 
444330 
422424 
400289 
418007 



A W1 52225 

AI792140 

RJ7783 

AF055581 

AI380792 

BE243323 

N75620 

AW956580 

AI834273 

H46612 

AW264102 

BE263252 

NM_001955 

AA035211 

AW161552 

AA632012 

AI535997 

AA256444 

X02308 

AA443966 

A1346201 

AA219691 

L20971 

J04088 

AA084248 

AW247252 

AL039104 

NM.005424 

BE041395 

A1878857 

AI493591 

U41514 

X54942 

Y00787 

NMJJ02318 

AA486620 

AA486620 

AA150797 

BE247684 

AW338625 

AW9751B3 

AF065214 

AA127221 

NM.002291 

AW014795 

AI134430 

AW248322 

AF217513 

H83265 

H93366 

AB018259 

BE280074 

AF234532 

BE242587 

M21305 

AI701916 

AW381666 

M68874 

AW388633 

AW958272 

H60720 

XB9428 

U84722 

NR.007351 

NM.005402 

NM.008823 

F0S972 

CI 8356 

AI 056548 

Ai 597655 

AI186431 

X07820 

M13509 



Hs.165909 

Hs.49265 

Hs.1369 

Hs.13131 

Hs.135104 

Hs.51233 

Hs.43157 



Hs.9711 

Hs.293815 

Hs39168 

Hs.6101 

Hs.2271 

Hs.17404 

Hs.83381 

Hs.188746 

Hs.30089 

Hs. 125485 

Hs.82962 

Ks.31595 

Hs.76118 

Hs.73625 

Hs.188 

Hs.156346 

Hs.85339 

Hs.75514 

Hs.159557 

Hs.78824 

Hs. 1561 10 

Hs.109706 

Hs.78146 

Hs.80120 

Hs.83758 

Hs.624 

Hs.83354 

Hs.41135 

Hs.41135 

Hs.109276 

Hs.103070 

Hs.22120 

HS.1885B 

Hs.296502 

Hs.82124 

Ks.23349 

Hs.6906 

Hs.95835 

Hs.279905 

Hs.8881 

Hs.7567 

Hs.118140 

Hs.23960 

Hs.61638 

Hs.1 18651 

Hs.202509 

Hs.4950O 

Hs.211587 

Hs.6682 

Hs.347326 

Hs.81892 

Hs.41716 

Hs.76206 

Hs.268107 

Hs388757 

Hs.75209 

Hsi7372 

H&295944 

Hs.72116 

Hs.49265 

Hs^96638 

Hs^258 

Hs.83169 



ESTs, Weakly similar to O8022 hypothec* 
ESTs 

decay accelerating factor for complement 

lymphocyte adaptor protein 

ESTs 

tumor necrosis factor receptor superfami 

ESTs 

ESTs 

novel protein 

Homo sapiens HSPC285 mRNA, partial cds 
ESTs 

hypothetical protein MGC3178 
enaothdin 1 

SOX7 SRY (sex determining region Y)-box 
guanine nucleotide binding protein 11 
ESTs 
ESTs 

hypothetical protein RJ12604; WAA1682 

mymfdytate synthetase 

ESTs 

ubiquifin carboxyt-terminal esterase L1 
RAB6 interacting, Idnesin-Gke (rabkines 
phosphodiesterase 4B, cAMP-specific (dun 
topoisomerase (DMA) li alpha (1 70kO) 
G protein-coupled receptor 39 
nucleoside phcsphorytase 
rratogen-adivated protein kinase 8 Inter 
karyopherin alpha 2 (RAG cohort 1, impor 
tyrosine kinase with immunoglobulin and 
ESTs, Weakly similar to unknown protein 
hematologics) and neurological expressed 
plateJet/endothelial ceD adhesion molec 
UDP4^tyl-alpha^aIactosamIne:polyp 
COC28 protein kinase 2 
interteuWn 8 
lysyi oxidase-like 2 
endomucin-2 
endomucin-2 
latexin protein 
ESTs 

ESTssirnilar to TRANSMEMBRANE 4 SUPERF 
ESTs, WeaWy similar to S72482 hypothefi 
phosphoBpase A2, group IVC (cytosoBc, 
ESTs 

temWn, beta 1 
ESTs 

Homo sapiens cDNA: FU23197 fis, clone R 
ESTs, WeaWy similar to unnamed protein 
clone HQ0310PR00310p1 
ESTs, Weakly similar to S41044 chrornosom 
branched chain aminotransferase 1, cytos 
WAA0716 gene product 
cycfin B1 
myosin X 

hematopolGffcally expressed homeobox 
gb:Human alpha satellite and satellite 3 
ESTs 

K1AA0746 protein 

phosphoQpase A2, group IVA (cytosoBc, 
sotute carrier famiiy 7, (cationic amino 
hterce0utaratheskximotecule2 
WAA0101 gene product 
endothelial cetkpecific molecule 1 
cadherin 5, type 2, V&cadherto (vascula 
NM_002425:Homo sapiens matrix metaitopro 
mulfimerin 

v-ral simian leukemia viral oncogene horn 



endolheBaJ tyrosine kinase (Elk) (BMX), 
Bssue factor pathway Inhibitor 2 
hypothetical protein FU20992 similar to 
ESTs 

prostate differentiation factor 

matrix metaJloproteinase 10 (strametysin 

matrix metaQoproteinase 1 (intersGSal 



1,74 
1.68 
1.74 
Z18 
1.90 
2.16 
2.03 
Z09 
1.81 
2.14 
1.93 
2.05 
2.46 
Z20 
Z26 
1.93 
Z39 
1.84 
1.70 
1.97 
1.93 
1.80 
2.03 
1.58 
181 
1.99 
2.99 
1.76 
2.13 
2.71 
1.65 
139 
1.77 
1.67 
Z09 
2.42 
Z34 
Z34 
224 
Z07 
ZOO 
Z33 
Z11 
Z27 
Z11 
Z03 
Z08 
1.57 
1.60 
Z37 
Z55 
Z09 
1.76 
Z01 
Z1S 

aos 

Z27 
1.43 
Z67 
Z18 
Z60 
1.86 
Z92 
Z92 
Z59 
Z72 
Z25 

I. 76 
137 
3.11 
111 
108 
Z96 
4.84 

II. 91 



TABLE 69B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Ptey 



CAT Number Accession 



778 



WO 03/042661 



PCT/US02/36810 



453085 10017.1 



436729 



6624.1 



442506 29197J 



436772 
434596 
412564 



442006 



1239464.1 

14701.1 

18571J 



1239046.1 



BC017338 BG716430 BG501286 BI458528 A1582223 N98532 AI338138 A1273442 AW102617 AA831 177 AA745642 AA412583 AA35S375 
BG547492 AW954243 BI765546 BG057641 AI192435 AI338935 AI31 2651 AI708679 AJ191 125 Ai 206832 AA676899 AI078010 A18B8718 AA452830 
BF445542 N69930 AA715017 BF446713 BE046852 AW771909 AA907729 AI143749 AI761290 AA890233 BF925759 A178371 3 AI767267 AA81 4538 
W56778 M91B481 BG743526BE645242AI025328AI298436AI290445 H27710AI475034BG740023AI090348 AI340003BI602481 W38495 
AI183314 AI927418 BG397181 AA878310W19369 W56507 005751 AW3 80750 AW380770 AW380790 BF930729 H28425 AA037326 AA375805 
X75684 AL573167 A1445461 AJ453743 AI983655 AI564644 AA9771B0 A16941 1 1 AI591358 AW071625 AI678712 AI720939 AI927769 BE439796 
AI963432 AA292956 AW192593 AI865B38 AI696905 AI424384 AI161 31 2 Ai91 1921 AI597801 BI494959 AI240988 AI492554 AW262737 BE044033 
AW008570AW629505 BI494958 AA088439 AA706057 BF222820 BFS93608 BE501957 AA524526 BE044134 AW572531 AW015724 BE349186 
AA043217 BE219784 AI799814 AA1 29575 AI671727 AI470033 BE645195 AW779725 AA903050 AA1 47228 AA4O4570 A1075878 W381 61 AI972739 
AW673152 AA723200 C06123 BP057147 AA627686 AA157944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 AI990827 
AW275048 AA182640 AA478328 A1298935 AW085158 AW471421 AW103470 AW300456 AW191997 A1823466 AA962397 AA136658 A1251817 
AW3391 04 AA72472J AA41 1 100 AA191 349 AA757735 AA037696 AI76951 6 AW772283 AA010631 At692846 A1061065 H80983 R79933 AI950693 
AI245632 AI349390 AA148284 AI798502 AA487893 A1621320 AW194272 C06365 AA953883 BE858936 AI918523 AI872628 A1927217 AI453453 
A1 189356 AW338678 A1261359 AI500576 BF477735 AKH2569 AI972899 AI985583 Z28771 AI363829 A1693030 AA603586 BE773488 AW339301 
BE773489 BE773462 BE773495 A1650338 BE773499 AI745717 BE81 1475 BE81 1470 BE811464 BE81 1418 BE811415 BE811400 BE81139B 
BE811388BE811352BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE811350 BE811337 BF593847 BG055071 
AW675302 BF003068 AA719173 BE81 1348 AI582462 A1686240 BE773500 AI244845 AI565439 A191 8453 AI472527 AI446740 AA035576 AA1 91414 
AW6741 45 C05782 A) 589264 D57558 AI468237 AI432033 AA989662 R21752 BF002457 AA98B297 AL574095 AL576200 AL571074 AL574525 
AL578810 BG498381 AB28364 BE879732 AA479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE327120 AA129574 
AA13B645 BFB43900 AW806193 AA502832 AA649494 AL568520 AL547960 BE706937 BE811360 BE77349B BE811401 BE773484 BE81 1437 
BE81 1380 BE811399 BF997171 BF757734 BE926037 AI377595 C061 1 1 AW088968 BE81 1404 BE81 1472 AI865912 A1925607 AI871950 AI09351 0 
BE905927 BE811435 AA191387 AW772000 BE811453 8E614379 BF844522 BI044896 AI744233 AW984527 C17504 BF843883 AI246307 
BE773483 AI567995 W60075 BF941 183 AI738844 BE81 1458 BE773481 AG62930 AA948565 BE7Q6942 BE156360 T65026 AW242958 AW1 97954 
BE905184 AA722206 AI344943 AI348877 AI334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW806183 AA043216 BG482896 
AA182734 AA877242 AW372926 H27252 R38114 BF851858 BE156214 AA190427 T91762 AA035067 AA837326 T10930 BF906587 BI755027 
BG5Q6731 BC008442 BC010166 AL550134 AL553096 AL548700 AL550751 AL547978 AL545286 AL540643 AIM 18627 AL601379 BI259821 
BG741786 81868522 AU1 35866 BI552770 BI259210 BI256520 BI255569 BG485098 BI258228 BG498501 BM044512 AU133984 AL556586 
BE745111 BI222633AU133917BG288151 B1260715 BI550550 BG500773 BI551761 BG707601 BI81 8593 BF691383 BG721 129 BG541 578 
BE9Q6666 BG751098 B1224135 BG400746 BG478065 BE790438 AW080238 AU137549 BG429B96 BE 3924 85 AW9 61 685 BG721056 BE908365 
BE546656 BG541235 AW583735 BG528290 BI260895 AW651691 BM048974 BM043B05 BG142185 AA315188 A1446615 C06300 BG497644 
AA088544 AI815987 BG528631 BE619182 AW239185 AW062910 AW062902 AA347236 F11933 AA488005 AA301631 AA376800 D56120 
AA343532 AA308636 F00242 AA376086 AA316968 AA343799 BI870221 BE910282 BG538748 AW950564 AV732879 D16854 AA192519 BF922148 
AA216013 BG624091 BE544387 BG507008 AW176446 BF790033 BE088925 BE088854 AA921353 R21800 AA01 1222 T97525 
NM002540 L40377 BG227962 AI925897 W60100 BG980023 AA853496 AA360401 AW956168 W61361 AI074846 AA373798 BG1 93973 BG499418 
BF816267 AI675932 BG611734 BG777123 AW449485 BF912016 BE56641 1 BM068728 AW976385 AL121 194 AI767324 BM054718 AW366882 
AA156151 Z25109C05177AW975688AA731063N57084 
AF147374 T59538 T59589 T59598 T59542 

X83703 NM 014391 BC018667 AU139209 BE924924 BE924899 BE924977 BE924985 BE924947 BF229055 BE924917 BE924938 BE812621 
BE924937 AA969184 BE924898 BF22S066 BF229054 BF081 177 BE924934 BF229040 BE925037 BE92501 1 BG943776 AA488072 AA486364 
Z36249 AW455806 AW582753 BF081 131 BE924910 BF229058 BF081172 BE924956 AW378093 BF081 132 BF081 168 BE812736 BE812738 
BF081165 C04160 C04483 BF229048 
AW975183 AA973583 AI365103 AI699495 A1301787 



TABLE 69C 

Pkey: Unique number corresponding to an Eos pro beset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank identifier (Gl) numbers. •Dunham, et at." refers to the publication entitled The DNA sequence of 
human chromosome 22* Dunham, et al (1999) Mature 402:489-495. 
Strand: Indicates DMA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


402463 


9796896 


Minus 


88184952 


400494 


9714719 


Rus 


169845-170272 


401234 


993642 


Plus 


120173-120337 


406506 


7711374 


Minus 


68434077 


405025 


7107727 


Plus 


105267.105343,106184-106294.106387.10653 


405121 


8102330 


Minus 


3581636004,36587-36684 


400666 


8118496 


Plus 


17982-18115,20297-20455 



TABLE 70A 



Pkey: Unique Eos prcbeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UrrigenelD: Unigene number 
Unigene Tlue: Unigene gene fifle 



R1: 



Ratio of the mean of the vessel Al's to the mean of the HUVECATs 



Pkey 


ExAccn 


UnigenelD 


428928 


BE409838 


Hs.194657 


439180 


AJ393742 


Hs.199067 


412636 


NM.004415 




426158 


NMJ0O1982 


Hs.199067 
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406826 AW516005 Hs.84293 C074 antigen Cmvariant po»ypep6de of m 202 

453023 AWQ28733 Hs.31439 serine protease inhibitor. Kunite type, 2.03 

413391 AI223328 Hs.75335 glycine amMnotransferase (L-arglninsg 2.12 

406824 AW515961 Ks.84298 C074 antigen (invariant polypeptide of m 2.15 

J 416299 AA279530 Hs.83968 integrin, beta 2 (antigen C0 18 (p95), ly 2.17 

437536 X91221 Hs.144465 ESTs 221 

442402 NM.000954 Hs.8272 praaagtandin 02 synthase (21 kO, brain) 2.26 

446945 AI193115 Hs.16611 toror protein D52-0ke 1 2.27 

422639 AI929377 Hs.173724 creatine kinase, brain 2.29 

10 427451 AI690916 Hs.178137 transducer of ERBB2, 1 2.31 

454042 H22570 hypothetical protein FU20093 2.37 

418283 S79895 Hs.83942 cathepsm K (pyenodysostosis) 2.38 

424247 X14008 Ks.234734 rysozyme (renal amyloidosis) 239 

422103 AA984330 Hs.111676 protein kinase H11; small stress protein 2.41 

15 415994 NMJJ02923 Hs.7B944 regulate of C^m^n staling 2 24k 243 

427393 AB029018 Hs. 177635 WAA1 095 protein 147 

414217 AI309298 Ms.279898 Homo sapiens cDNA: FU23165 fis, clone L 149 

414063 H26904 Hs.75736 apoOpopmtein D 250 

426716 NMJW6379 Hs.171921 sema domain. Immunoglobulin domain (tg), 251 

20 414803 X03100 Hs.914 Human mRNA for SB dassll Nstocornpaiibi 251 

410532 T53088 Hs. 155376 hemoglobin, beta 256 

431369 BE184455 Hs.251754 secretory leukocyte protease inhibitor ( 258 

437952 063209 Hs.5944 solute carrjer family 11 (protorncoupied 258 

453655 AW960427 H*342874 transforming growth factor, beta recepto 259 

25 453655 AW960427 Hs.342874 transforming growth tactor . beta recepto 259 

453855 AW960427 Hs^42874 transfbrmtog growth factor, beta recepto 259 

427980 AA418305 Hs.303205 EST 259 

442695 AC002425 Hs.8603 p8 protein (canddate of metastasis 1) 262 

0 _ 425751 T19239 Hs.1940 crystaflm, alpha B 263 

30 414194 BE175494 Hs.75811 N-acylsphingosine amidohydrolase (add c 263 

443907 AU076484 Hs.9963 TYRO protein tyroshe kinase binding pro 266 

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 267 

410677 NU.003278 Hs.65424 tetranectin (plasrnirrogen-binding protein 267 

452685 AI634651 Hs.30250 v-maf musculoaponeurotic fibrosarcoma (a 267 

35 425869 AA524547 Hs.160318 FXYD dornain-containbg Ion transport reg 273 

406543 N77976 Hs.347939 hemoglobin, alpha 2 273 

453767 AB011792 Hs.35094 extracellular matrix protein 2, female o 274 

424614 X54486 Hs.151242 serine (or cysteine) proteinase Inhfoto 280 

444195 AB002351 Hs.10587 WAA0353 protein 281 

40 417233 W25005 Hs24395 smafl inducible cytokine subfamily B (Cy 286 

442498 U54617 Hs.8364 Homo sapiens pyruvate dehydrogenase idna 292 

412810 M21574 Hs.74615 platelet-derived growth factor receptor, 293 

440820 AL031846 cdakophUin 4 294 

429123 AB011099 Hs.196647 KIAA0527 protein 297 

45 411529 AA430348 Hs.317596 Homo sapiens cDNA FU 12927 fis, done NT 3.01 

413305 NM.000426 Hs.323511 Homo sapiens cDNA: FU23176 fis, clone L 3.01 

425622 AW360B47 Hs.16578 ESTs 3.01 

420195 N44348 Hs.251385 Homo sapiens cONA FU11 177 fis, done PL 104 

446584 U53445 Hs. 15432 downregulated in ovarian cancer 1 3.04 

50 407869 AI827976 Hs.24391 hypothetical protein RJ13612 3.06 

423915 AF039018 Hs.135281 aipha^nin-2-assccrated UM protein 107 

410036 R57171 Hs.57975 calsequestrin 2 (cardiac muscle) 3.10 

412047 AA934589 Hs.49696 ESTs 112 

414840 R27319 Hs23823 halry/enhancer^spOt related with YRP 3.14 

55 424651 A1493206 ESTs 3.17 

443932 AW888222 Hs.9973 tensin . 3.18 

420859 AW468397 Hs.100000 S100 caidum-binding protein A8 (caigran 120 

434025 AF114264 Hs2 16381 Homo sapiens done HH409 unknown mRNA 320 

421834 BE543205 Hs288771 DKFZPS86A0522 protein 320 
60 423961 D13666 Hs.136348 perfosfjn(OSF-2os) 324 
447384 A1377221 Hs.40528 ESTs 124 
425383 D83407 Hs.155007 Down syndrome critical region gene 1-Ck 325 
447023 AA356764 Hs.17109 Integral membrane protein 2A 125 
„ 420136 AW801090 Hs.195851 acfin, alpha 2 smooth musde, aorta 3.25 
65 406972 M32053 gb:Human H19 RNAgene, complete cds. 126 
410132 NbL003480 Hs.300946 MicrofibrS-assodated giycoprotem-2 127 
430998 AF128847 Hs.204038 iruloiethytamine rimethyitransferase 3.28 
456898 NMJXM928 Hs. 155597 D component of comptement (adipsin) 129 
443623 AA345519 Hs.9641 c»rr$ement component 1, q subcomponent, 133 
70 410614 AI091195 Hs.65029 growth arreskjpedfte 1 135 
451529 A1917901 Hs208641 ESTs 135 
430310 U60115 Hs239069 four and a half UM domains 1 137 
424897 D63216 Hs. 153684 frizzled-retated protein 137 
__ 427111 AA351026 Hs. 173594 serine (or cysteine) proteinase inhibito 138 
75 443604 C03577 Hs.9615 myosin regutatcry tight chain 2 , smooth 141 
421913 A1934365 Hs. 109439 osteoglydn (osteoinductive factor, mime 143 
451331 AK002039 Hs 25 1385 Homo sapiens cDNARJU 177 fis, done PL 144 
416908 AA333990 Hs.80424 coagulatkm factor XM, A1 porypepQde 146 
407938 AA905097 Hs.85050 phospWamban 148 
OU 418005 A1188220 Ks.83164 collagen, type XV, alpha 1 151 
452877 AE50789 Hs.32478 ESTs 152 
419577 L36531 Hs.91296 tntegrm. alpha 8 152 
404277 NM.0191 1 1 *:Homo sapiens major rristocompa 154 
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llO.tXklUOV 


nmtftin ohosuhatase 1 reaulaiorv finhib 


419936 


AUQ76772 


Hs.1279 


complemenl component 1, r subcomponent 


415165 


AW8876C4 


Hs 78065 


cofnpternont oornponflnl 7 


414176 


BE140638 


Hs.75794 


endothe&ai diRerenOaEon, tysophospha 


452114 


N22687 


Hs.8236 


ESTs 


406850 


Atfi943nn 

SUQZtOUU 


no. 1 f iJiO 


ml Ian An tvn^ 1 flflnhd 1 
UJiklvjvll, ij^K t, annua i 


428411 


AW291464 


Hs 10338 

CIO. 1 vyvU 


ESTs 


412802 


U41518 


Re 74602 
no. i**yut 


aouaoonn 1 fchannet-fbrmino tnteoraj or 


406849 


AA454809 


Hs 172928 

no. I » fcjtu 


ooJlaaen tvoe 1 aloha 1 


•tdOVDt 




Hs.89463 


nnf necfi irn l#rrnp f^iiiiiiCrancA osKninV'acu 


429500 
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hflxahrachion ftenasein C cvtetactin) 




MTU 1 (300 


Me?1%fi 


corrafpH frfejlpd-felated oroteki 2 

oouoicu u lAfcic^n qpiwj pivwai & 


401A14 


1 1 


He tm&n 

no. lUOOco 


thfombospondin 2 


427373 


AB007972 


Hs. 130760 


myosin phosphatase, target subunit 2 


414290 


A1568801 


Hs.71721 


ESTs 


415274 


AF001548 


Hs.78344 


myosin, heavy polypeptide 11 , smooth mus 


417130 


AWZ76858 


Hs.81256 


S100 calcium-binding protein A4 (calcium 


416784 


AA334592 


Hs.79914 


lumican 


453355 


AW295374 


Ks.31412 


Homo sapiens cDNA RJ11422 fis, clone HE 


408221 


AA912183 


Hs.47447 


ESTs 


430223 


NMJ02514 


Hs.235935 


nephroblastoma overexpressed gene 


416585 


X54162 


Hs.79386 


leiomodin 1 (smooth muscle) 


422737 


M26939 


Hs.119571 


collagen, type 111, alpha 1 (Ehlers-Oanl 


426406 


AI742501 


Hs.1 69756 


complement component 1 , s subcomponent 


448429 


017408 


Hi21223 


catponin 1, basic, smooth muscle 


413624 


BE177019 


Hs.75445 


SPARC4ike1(mast9,hevin) 
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TABLE 70B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 
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Pkey 



CAT Number Accession 



M77830 NM.004415 AF139065 BG681115 BG740377 BI712964 BGQ00658 AA12B470 61438324 ^^^^J^^f^^ ocali 
AW370813 J05211 BG698865 BG740734BG680618BG739778 81765807 BM353403 BM353248 AW1 77784 AW205789 AW951 576 AW848592 
BE182164 BF149266 BE940187 BI060445 BW60444BF350983 BE720095 BE720069 BE715154 BE082584 8^)82576 BE004W7 AA85^16 
BI039774 BE713818 BE713548 AW170253 BE1 60433 B1039775 AW886475 BM462504 BE931734 BF149264 AA340777 BF381183 BGK21^7 
AU127260 AW364859 BF993352 BG223489 BE819009 BF381184 BE715956 R58704 AA852212 AW366566 BI090358 BF087707 BE819046 
BE819005 AA377127 BE073467 BE819069 BEB19048 BI036306 BG990973 BI040954 BF919911 AU140155 AI951766 A1434518 A^04674 
BF752969 BE837009 BE925826 BF149265 AW995615 BE814264 BI039782 AU140407 BE144243 BE709863 BF985642 BE001923 BF933510 
AW265328B(^19BE182iaAV^175AVm7688BE8182o0AW177933BF673 

BE082507 BE082514 BE082505 BF873693 AW068840 AW847678 BF804153 AW365157 BE813930 BE002030 AW365153 BE184941 BF749421 
BE^^62BE184933BF842254BE698470BE931048BF99^ 

AA131128 AA099891 W39488 C04715 BF096124 BEB65341 AW799304 AU603116 BE149760 BE705967 BE705966 BE705968AW848723 
AS9^6817AW376697BG00^7BF751115BE69^ 

AW997139 BE865474 BE185187 BE156621 BE715089 BE713297 BE713298 BE179915 AW799309 BF872345 BF088676 ^»9AW52599 
BG005197 BF350086 BE715196 BE715155 BF752396 BF093817 BFB31 190 BF752409 BE006561 BG9S9922 ! BFM4W3 1 WW748 BRm583 
AW377699AW607238 BBJ82S19 AW377700 BF349467 AJ1 90590 AI554403AJ392926 AU158477 BMSW IAU1S919 A176W16 i 
AJ439101 AA451923 AI340326 A1590975 B1791553AI7009S3 Al 142882 AA039975 AA946936AA644381 BM314884 ^702424/0417612 
AW9(^AI22(^AJ3C4772AI2703« 

AJ078449 AJ819984 AI858282 BI468588 A1860584 AI025932 AA026047 AA703232 AA658154 AA51 5500 AW192085 AA918281 T7786 AIK7207 

H44405 AJ910434 BF082513 AJ494069 A1270027 A1635878 AA128330 BG681425 BE705078 R20904 BG680059 BBmfl ^764409 
AV745530 B1762798 BG287391 AW798780 BE706045 BE926470 AW799118 BF087996 BE002273 AW879451 AI571075 B^86AV721320 
aS^C03378N84767AA131077H^^ 

BE067737 BG319486 AA247685 AW798883 AW103521 BF989173 AW860878 BE939707 BE185750 BE714064 BE713W BE71WM > BE713763 
bS50164 BE713810 AW365151 BG955489 BE005272 BF915937 AW36514B AI905927 BF992780 AW853812 BG954443 81770853 BG679406 
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BG740832 BG581087 8G698430 AA455100 T87267 BE696209 BE696210 B1089483 BE005273 BEB72225 AW391912 BE92551 5 BG677012 
BG74197OAA026480BE705999BG677157 BE009090BG681378BE712291 BG961498 BG578984 BI040941 AA337270 AW384371 AW847442 
B1058859 BE81366S W95048 W25458 AW177786 AA025851 BE931733 BF154837 BG949393 BE714441 AW996245 BE711801 AI284090 
BE0S4323 BE719390BE940148 BG991212 BF375714 BF349522 BG996267 T46793 B!013292 BE001925 AW365156 AW365154 AW606553 
BF763109 BE931637 BE167181 BE713879 BF354008 BF678726 H90899 AW365145 W38382 AI498487 

454042 30254 1 AJ420458 A1018523 AA708886 BF949533 AL1 19553 BF9A5950 AI081 305 AA041432 AI921013 AI584910 AI654847 AW874199 A1206120 

AW241428 R43035 T66767 AW103715 W28478 BF953052 H45926 BF807568 AW903943 BE170143 B1040435 BF931989 BI600000 AV722350 
W27787 H45331 BI549761 R53955BI549855BG991583B1491075AW020049 AW1 29293 H45263 AA410309AA340613 R42410AA707199 
AI431587 BE858679 AW292267 AI421678 AA041 195 BE466753 AI243913 AI358894 AW1 37298 AI366468 N64350 AA779107 AW025969 R49056 
AA34701 1 R55722 AW771 106 F04969 Z38381 F01659 H 17396 B1493714 AI880103 AW771447 A1202561 AA788851 AI494436 BF8561 14 H22570 

440820 3091 1 AW964381 BG007409 BM314056 AA465642 T30B61 T331 1 1 Z42834 C04542 BF948152 BF944325 8F858895 AA935284 AI267360 N64249 

W67500 F07962 AA322394 BI489987 BE644965 BM313782 AA91Q364 AI809246 AA836750 BF1 1 5228 AA829730 N27413 BM141766 AI742325 
AA455261 AA938708 AI420241 AI130039 BF222341 BF941301 AA771807 Z41034 BF447457 BF447426 AW515347 T03874 F023S0 F02302 
N34898 AI67B566 AA807824 AA948556 AW204638 BM142045 BF446775 AV648364 AI801 368 AA971739 A1017351 AA760722 AI460007 A1458383 
AI694152AA226536BE467282 N47808BE348825BEB30581 BE830583N53009 N59351 N41056 W67501 AA016246 H43293 F21282 R59121 
R56280 

424651 46029 5 BG435302 BM083687 AA904035 AA488889 W76175 AA761 874 H28767 AA9 10081 AA837086 Al 521 825 BG98S378 AJ478562 AA743152 AA746092 
H88853 BG9B6375 AA635644 AI493206 AA669979 BE245127 BG986430 BG986529 BF665873 BG030157 BG622575 AA766495 

TABLE 70C 

Pkey: Unique number corresponding to an Eos probesel 

Ref: Sequence source.Trte 7 digit numbers In fob column are Genbank Identifier (Gl) nurnbers.T)unham, et at" refers to the publication entitled The DNA sequence of 
human chromosome 22" Dunham, et al (1999) Msg 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
N Lposition: Indicates nucleotide posifions of predicted exons. 



Pkey 
404277 



Ref Strand NLposiBon 
1834458 Minus 91665-91946 



TABLE 71A: 774 GENES UP-REGULATED IN EWiNG SARCOMA COMPARED TO NORMAL ADULT TISSUES 

Table 1 A fists 774 genes up-regulated in Ewing Sarcoma compared to normal adult tissues. These were selected from 35371 probesets on the ArTymetrix/Ecs Hu03 GeneChlp array 
such that the ratio of •average" Ewlng Sarcoma to ■average" normal adult tissues was greater than or equal to Z\ . The "average" Ewing Sarcoma level was set to the 75th percentile 
amongst numerous Ewing sarcomas. The "average" normal adult tissue level was set to the 85th percentile amongst various non-malignant tissues. In order to remove gene-specific 
background levels of non-specific hybridization, the 10th percentile value amongst the norwnaOgnant tissues was subtracted from both the numerator and the denominator before the 
ratio was evaluated. 

Pkey: Unique Eos probesel identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnlgenelD: Unlgene number 
Unigene Title: Unigene gene title 
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104659 


AW969769 


Hs.105201 


ESTs 


7a3 


101447 


M21305 




gb:Human alpha satellite and satellite 3 


64.7 


105782 


H09748 


Hs.57987 


B-ceil auiymphoma 1 1B (zinc ringer pro 


50.5 


115881 


NMJ05756 


Hs.184942 


GproteirKXMjpted receptor 64 


48.8 


121792 


AW969726 


Hs.98381 


ESTs, Weakly similar to serine protease 


46.7 


101104 


AW862258 


Hs.169266 


neuropeptide Y receptor Y1 


46.6 


110278 


AF061573 


Hs.19492 


protocadherin 8 


46.1 


126645 


AA316181 


Hs.61635 


six transmembrane epithelial antigen of 


4J.4 


116752 


AL008583 


Hs.91622 


neuronal pentrasdn receptor 


40.9 


119717 


AA918317 


Hs.57987 


B-cell CHVtymphorna 1 1 B (zinc finger pro 


39.6 


104691 


U29690 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 


36.9 


110728 


AA737106 


Hs.32250 


ESTs, Moderately similar to 178885 serin 


355 


121362 


AF050147 


Hs.97832 


chortdrornodt&i t precursor 


34.7 


131291 


NMJ04350 


Hs.1 70019 


runt-related transcription factor 3 


33.0 


101083 


D54745 


Hs50247 


choJecystokmin 


31.7 


121619 


AA528339 


Hs.178062 


ESTs, Weakly similar to phcsphafidytserl 


28.7 


122651 


AW975398 


Hs^93836 


ESTs 


28.0 


100299 


D49493 


Hs^171 


growth differentiation factor 10 


265 


129977 


N(4_000399 


Hs.1395 


early growth response 2 (Krox-20 {Drosop 


26.2 


123619 


AA602964 




gtato97c02*1 NCLCGAP_Pr2 Homo sapiens 


26.1 


124006 


A1147155 


Hs.279727 


ESTs; homologue of PEM-3 [Ciona savignyi 


23.1 


116301 


AW969706 


Hs^93332 


ESTs 


225 


121231 


AA814948 


Hs.96343 


ESTs, WeaWy sim3ar to AWCJWMAN IU1 


22.3 


106533 


AL134708 


Hs.145998 


ESTs 


213 


109166 


AA219691 


Hs.73625 


RAB6 Interacfing, kmeshvEke (rabxines 


215 


131313 


R96290 


Hs.75874 


ribosomal protein L44 


205 


116790 


AW161357 




microtubutoassociatBd protein tau 


18.7 


105316 


AI671245 


Hs^4835 


hypothetical protein FU14594 


18.2 


102123 


NMJJ01809 


Hs.1594 


centromere protein A (17kD) 


175 


126218 


AL049801 


Hs.1 3849 


Novel human gene mapping to chomosorne 13 


175 


119791 


AA554907 


Hs.58291 


ESTs 


16.7 


113003 


AW292315 


Hs.7215 


ESTs 


165 


102836 


U94320 


Hs.158330 


neuropeptide Y receptor Y5 


165 


126799 


AW753865 


Hs.74376 


ouactomedln related ER bcaTr/ed prota' 


165 


105298 


BE387790 


Hs^6369 


hypothetical protein FU 20287 


155 


107160 


AA314490 


Hs.27669 


WAA1 563 protein 


145 


115313 


AA808001 


Hs.184411 


albumin 


14.6 


123308 


C14187 


Hs.103538 


ESTs 


14.2 
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126077 M78772 Hs^ 10836 ESTs H.O 

100898 AA884407 Hs. 21 1595 protein tyrosine phosphatase, non-recept 119 

106821 NM.006S94 Hs^8298 adaptor-related protein complex 4, beta 13.9 

110288 H40665 Hs.31564 ESTs 13.8 

5 106498 AI221919 hypotheocalprotefo FU 10582 13.8 

112134 R41823 HsJ413 ESTs,-calsyntenhv2 13.7 

115265 AA334274 Hs.18368 DKFZP564B0769 protein 13.4 

128639 AW582962 Hs, 102897 CGM7 protein 13.2 

, . 125698 AF078847 Hs.191356 general transcription factor IIH, polype 1Z8 

10 105200 AA328102 Hs.24641 cytoskeletan associated protein 2 113 

119456 AA248897 Hs.48784 ESTs 12^ 

135155 AI207958 Hs. 166556 Homo sapiens. SimSar to TEA domain fami 12.1 

106614 AA648459 Hs.335951 hypothetical protein AF301222 12.0 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 110 

15 132968 AF234532 Hs.61638 myosin X 11.7 

132725 NM 006276 Hs.184167 splicing factor, argmiriersenne-rich 7 11.7 

111353 W20090 Hs.6616 ESTs 11.4 

123289 AA495904 Hs.103316 ESTs 11.3 

104968 AI249502 Hs.29669 ESTs 11.1 

20 123532 AA608733 6 b:ae56f06.s1 Stratagene lung carcinoma 11.0 

104173 AA084273 Hs.76561 ESTs, WeaWy similar to S47072 finger pr 10.7 

125556 AB033084 Hs.236463 K1M1 238 protein 10.7 

127240 AJ005683 Hs.86998 nuclear factor of activated T-ceUs 5, 1 10.6 

123049 8E047680 Hs^11869 dlckkopf (Xenopus laevis) homcbg 2 10.6 

25 132315 AF091086 Hs.44563 hypothefical protein 10.5 

134321 BES38082 Ks.8172 ESTs, Moderately similar to A46010 X-Un 10.5 

128609 NML003616 Hs.102456 survival of motor neuron protein interac 10.4 

131945 NM.002916 Hs.35120 rer^rcafcnfar^C(aclnrator1)4(37 10.4 

110730 N67655 Hs.26411 ESTs 10.3 

30 119186 A1979147 Hs.101265 hypotheljca! protein FU22593 10.3 

115909 AW872527 Hs.59761 ESTs, Weakly similar to DAP1 .HUMAN DEATH 103 

• 121916 AW117207 Hs.98523 ESTs 10.2 

131402 Y08890 Hs.1 13503 karyopherin (importin) beta 3 10.1 

101879 AA176374 Hs.243886 nuctear autoanOgenic sperm protein (his 10.0 

35 133181 X91662 Hs.66744 twist (Drosophiia) homclog (acrocephalos 10.0 

131307 NMJXJ0025 Hs.2549 adrenergic bet*3-, receptor 10.0 

113849 AA457211 Hs.8858 bromodomain adjacent to zinc finger doma 9.9 

103446 X98834 Hs.79971 sal (DrosophBaHike 2 9.7 

105091 AA148859 Ks.179909 hypothetical protein FU22995 9.6 

40 106146 W07288 Hs.118392 ESTs, Weakly slrnlar to LNHUER IgE Fc re 9.5 

127003 AW816515 Hs.173540 ATPase. aassV.typelOD 9,5 

115414 AA662240 Hsl83099 AF15q1 4 protein 9.4 

117829 BE160952 Hs.247117 ESTs^KtoderaterysirreiartoALUFJIUMANl 9.4 

. _ 126098 M79088 gfcEST01236 Subtracted Hippocampus, Stra 93 

45 121910 AI204600 Hs.96978 hypothetical protein MGC10764 &2 

130625 AF176012 Hs360720 J domain containing protein 1 9.2 

129755 R42216 Hs.7759 Homo sapiens done 24538 mRNA sequence 9.1 

100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 9.0 

111109 AI940675 Hs.20914 hypothetical protein FU23056 9.0 

50 120217 AA862257 Hs.66035 ESTs 9.0 

112891 T039Z7 Hs.293147 ESTs, Moderately similar to A46010 X-Iln 8.9 

131601 NMJXJ7115 Hs.29352 tumor necrosis factor, atpha^nduced pro 8.8 

126262 AK85153 Hs.143764 ESTs, WeaWy similar to unknown (H.sapie a8 

113903 AI368034 Hs.36679 ESTs. Weakly sim3ar to A4 601 0 X-Tmked 8.7 

55 105253 AW997484 Hs.5003 WAA0456 protein 8.6 

102581 AU077228 Hs.77256 enhancer of zeste (Drosophita) homotog 2 a6. 

105172 AA187554 Hs30049S mitoctumdrial sotuta carrier a6 

100380 D82343 Hs.74376 neuroblastoma (nerve tissue} protein 8.6 

129229 AF013758 Hs.1 09543 polyadenytate binding protein-interacfin a5 

60 102725 AB026187 Hs.1 591 56 protocadherin 11 a5 

130298 AI347487 Hs.1 32781 ciass I cytokine receptor 8.4 

132294 ABQ23191 Hs.44131 WAA0974 protein a2 

118644 AA443241 ribosomal protein L44 8,2 

106575 AW970602 Hs.105421 ESTs a2 

65 133142 AW952412 Hs.65874 ESTs, WeaWy simitar to A40348 Sav/Sex- ai 

119499 AI918906 Hs.55080 ESTs 8.1 

100248 NM.015156 Hs.78398 WAA0071 protein 8.1 

114837 BE244930 Hs.166895 ESTs a0 

107098 AI823593 Hs57688 ESTs ao 

70 123960 AW082862 Hs.287733 hypothetical protein FU23189 7.9 

105280 AA894638 Hs.1 4600 ESTs 7.9 . 

115147 AA745781 H&38399 hypothefical protein MGC2454 7.9 

134570 U66615 Hs.1 72280 SVWSNF related, matrix associated, acfi 7.9 

133063 AI654133 Hs.30212 thyroid receptor interacting protein 15 7.8 

75 105730 ' AW377314 Hs£364 DKFZP564J052 protein 7.8 

126426 AA125984 gbzn27h0Sj1 Stratagene neuroepithdium 7.8 

107254 Z45998 Hs.22543 Hc^&ar^n^NA;cDNADKFZp761l1912(f 7.8 

134367 AA339449 Hs.82285 phosphoribosyfetycta^ formyttransfer 7.8 

119082 AF252297 H&91546 cytochrome P450 retinoid metabofizing pr 7.7 

80 125400 AL110151 Hs.128797 OKFZP586O0824 protein 7.7 

129413 AW377610 Hs.1 1123 OKF2PS64G092 protein 7.7 

116766 A1608657 Hs.95097 ESTs 7.7 

129075 BE250162 Hs33765 dSrydrofolate reductase 7.6 
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105143 AI368836 Hs.24808 ESTs, Weakly similar to 138022 hypothec 7.6 

132438 AW363887 Hs.82916 ctaperonin containing TCP1. subuti!t6A{ 7.6 

118036 A1471862 Hs.196008 Homo saplenscDNAFU 117236s. done HE 7.6 

_ 131170 NMJJ14253 odz (odd Oz/ter>m, Drosophila) homdog 1 7.S 

5 104548 R39398 Hs.91559 ESTs 7.5 

131495 AA812434 SMC2 (structural maintenance of chromoso 7.5 

132543 BE568452 Hs.344037 protein regulator of cytokinesis 1 7.4 

124040 U23752 HsJ2964 SRY(sexdeterminingregtonY}^11 7.4 

_ 127695 AA714731 Hs.291457 ESTs, Weakly similar to heterogeneous ri 7.3 

10 132559 AF119848 Hs.270863 hypothetical protein PRO1580 7.3 

101050 AU077324 Hs.1832 neuropeptide Y 7.2 

111823 R35252 Hs.130558 ESTs, WeaWy similar to 2109260AB cell 12 

116401 AW893940 Hs.59898 ESTs 7.1 

105127 AA045S48 Hs.301957 nudix (nudeoside diphosphate linked moi 7.1 

15 128478 AA708205 Hs.100343 ESTs 7.1 

110456 H52348 Hs.36636 ESTs 7.1 

118846 AW299598 Hs.50895 homeoboxC4 7.1 

120934 AA226198 gb:nc26aQ7.s1 NCLCGAPJM Homo sapiens 7.1 

^ 128132 AA225632 gbw08a07j1 NCLCGAP_Pr1 Homo sapiens 7.0 

20 127431 AW771958 Hs. 175437 ESTs, Moderately similar to PC4259 ferri 7.0 

106157 W37943 Hs.34892 K1AA1323 protein 6.9 

117852 AW877787 Hs.136102 WAA0853 protein 6.9 

106213 N45018 Hsi769 hypothetical protein DKFZp761J17121 6.9 

118013 A1674126 Hs.94031 ESTs 6.9 

25 120147 AI917116 hemoglobin, beta 6.8 

118267 N34905 Hs.44653 Homo sapiens cDMA: FU22669 fis. clone H 6.8 

132581 AK000631 Hs.52256 hypothetical protein FU 20624 6.8 

120325 AA195651 - AP-2 beta iranscripfon factor 6.8 

133276 AW978439 Hs.69504 ESTs 6.8 

30 127742 AW293496 Hs.180138 ESTs 6.7 

127664 AA806164 Hs.116502 ESTs 6.7 

108778 AF133123 Hs.90847 general transcription factor lilC, polyp 6.6 

121553 AA412488 Hs.48820 TATA box binding protein (TBP)-associate 6.6 

107059 BE614410 Hs.23044 RAD51 (S. cerevlslae] homolog (E coB Re 6.6 

35 133658 AA319146 Hs.75426 secretogranln H (chromogranin C) 6.6 

115197 R18856 ESTs 6.5 

112268 W39609 Hs.22003 solute carrier family 6 (neuratransmitte 6.5 

103346 X87613 Hs.5464 thyroid hormone receptor coactivating pr 6.4 

126967 AA205976 gb:zq48a10.r1 Stratagene hNT neuron (937 6.4 

40 118499 N67274 Hs,50141 EST 6.4 

105301 AW352357 Hs.7457 MAGE1 protein 6.3 

113617 AI869372 Hs.17207 Homo sapiens cONA FIJ1 1922 fe, clone HE 6.3 

127968 AA830201 Hs.124347 ESTs 6.3 

134719 AA852985 Hs.89232 chromooox homolog 5 (Drosophila HP1 alph 6.3 

45 110748 AW581992 Hs.301434 WAA1387 protein 6.3 

115844 A1373062 Hs.332938 hypothetical protein MGC5370 6.3 

121885 AA934883 Hs.98467 ESTs, Highly similar to AF257737 1 cilia 6.2 

102398 U42359 gb:Human N33 protein form 1 (N33) gene, 6.2 

130882 AA497044 Hs^0887 hypothetical protein FU10392 6.2 

50 132256 AJ078645 Hs.431 murine leukemia vlra! (bmi-1) oncogene h 6.2 

106383 AA447453 Hs57860 Homo sapiens mRNA; cDNA DKFZp586M0723 (f 6.2 

135186 U73328 Hs.172648 distal-less homeobox 4 6.2 

105510 Z42047 Hs.283978 Homo sapiens PR02751 mRNA, complete cds 6.2 

109261 BE466639 Hs.61779 Homo sapiens cDNA FU 13591 lis, done PI 6.1 

55 114357 R41677 Hs.6107 Homo sapiens cDNA FU 14839 fc, clone OV 6.1 

128501 AL133572 Hs.199009 protemccfltaintog(^C<tonwin2 6.1 

109747 A1223001 Hs.22969 ESTs, Weakly similar to Z14LHUMAN ZINC 6.1 

128361 AW172570 Hs.14600 ESTs 6.1 

115773 AW445044 Hs^8207 Human D NA sequence to done RP4-530J 1 5 6.0 

60 104689 AA420450 Hs392911 PtekophSto 6.0 

113575 AW138168 Hs. 15671 ESTs, Weakly similar to KBF3J1UMAN NUCLE 6.0 

121830 AW394055 HsS8427 ESTs, Weakly simflar to 138022 hypotheti 5.9 

119298 NWL001241 Hs.155478 cycEnT2 5.9 

_ 109841 H01052 gb:yj32h01.s1 Scares placenta Nb2HP Homo 5.9 

65 115622 A1088691 Hs^Oj3414 Homo sapiens mRNA; cDNA DKFZp584D0472 (f 5.8 

122969 AW821252 Hs.104336 hypothetical protein 5.8 

109872 R65841 Hs^8653 ESTs 5.8 

114208 AL049465 Hs.7859 ESTs 5.8 

113494 T91451 Hs.B6538 ESTs 5.8 

70 127684 AA668631 Hs. 159971 WAA0379 protein 5.8 

129300 W94197 Hs.110165 ribosomal protein 126 homotog 5.8 

127489 AA650250 Hs.272076 ESTs 5.8 

133479 WD1556 Hs.238797 ESTs, Moderately slm2ar to 138022 hypot 5.8 

105909 AA195191 hypothetical protein FU20729 5.8 

75 101255 BE385864 Hs.149894 mitochondrial translatonal initiation f 5.8 

134676 W28051 Hs.87819 Homo sapiens, done MGC'2492, mRNA, comp 5.7 

130895 AA641767 Hs^1015 hypotheScal protein DKFZp564L0864slmil 5.7 

126395 A1468004 Hs.278956 hypotheScal protein FU 12929 5.7 

_ _ 130723 BE247676 Hs.18442 E-1 enzyme &7 

80 107230 A1034467 Hs.34650 ESTs 5.7 

102745 AW753865 Hs.74376 olfactomedin related ER localized protei S£ 

108699 AA121514 Hs.70832 ESTs 5.6 

128080 F12310 gb:HSC38D041 raxmaflzed infant brain cON 5.6 
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117357 N24829 gb.7x98h1Zs1 Scares melanocyte 2NbHM Ho 5.6 

120827 AA382525 Hs.132967 Human EST done 122887 mariner transpose 5.6 

123494 AW17S019 Hs.112110 miiDchondrfal ribosomal protein U2 5.5 

100438 AA013051 H&91417 lopobomerase (DNAJ II binding protem 5.5 

5 126165 AI741816 Hs.125897 ESTs 5.5 

125934 AA193325 Hs.32846 rtypofoetical protem FU21 901 5,5 

124370 A1243499 Hs.170652 ESTs 5.5 

106135 ' AL117474 Hs.41181 Homo sapiens mRNA; cONA DKFZp727C1 91 (fr 5.5 

„ 106668 R49390 Hs.254129 KIAM678 5.4 

10 132967 AA316181 Hs.61635 six transmembrane epithet antigen of 5.4 

109606 Z43371 Hs.7012 ESTs 5.4 

115816 BE042915 Hs.22572 Homo sapiens cONA RJ13675 6s, clone PL 5.4 

125166 AA610620 Hs.181165 major histoarnipaSillty complex, class 5.4 

131185 BE280074 Hs.23960 cydinBI. 5.4 

15 111227 T06701 Hs.12268 ESTs 5.4 

105516 AK001269 Hs.30738 hypothetical protein FU 10407 5.4 

103409 NM.004454 Hs.43697 ets variant gene 5 (ets-retated molecule 5.4 

116008 T79153 Hs.48589 zinc finger protein 228 5.3 

106394 Z42993 Hs.25320 Homo sapiens clone 25142 mRNA sequence 5.3 

20 115121 A1634549 Hs.325422 ESTs 5.3 

125464 N71807 gb:yz29d09.r1 Scaresjmi!tip!e_sderosls_ 5.3 

115596 AW298597 Hs.61884 Homo sapiens, clone IMAGE:4298026, mRNA, 5.3 

119040 R02394 Hs.269436 ESTs. Moderately similar to PC4259 fern 5.3 

128040 AW500486 Hs.180610 spfidng factor proOne/^utamine rich ( 5.3 

25 110151 H18835 Hs.31608 hypothefical protein FU20041 5.3 

105539 AB040884 Hs. 109894 K1AA1451 protein 5.2 

130567 AA383092 Hs.1608 replication protein A3 (14kD) 5.2 

111077 N41367 Hs.173002 ESTs, WeaWy similar to (38022 hypothefi 5.2 

„ 120830 AI568170 Hs.96886 ESTs 5.2 

30 106516 AL137311 Hs.234074 r^sapiensmRNA;cDNADKFZp761G02121 ( 5.2 

103316 X83301 Hs.324728 SMA5 5.2 

116129 AF189011 Hs.49163 putative ribonuclease III 5.2 

131253 R71802 Hs.24853 ESTs 5.2 

116680 AW902848 Hs.273829 ESTs 5.2 

35 123949 AA621665 Hs.208957 EST 5.2 

105511 AB037829 Hs.3862 regulator of nonsense transcripts 2; DKF 5.2 

125847 AW161885 Hs.269745 ESTs 5.1 

106301 AA069728 Hs.184582 ribosomal protein L24 5.1 

110799 A1089660 Hs.323401 dpy-30-like protein 5.1 

40 104899 AA054726 Hs385574 ESTs 5.1 

125972 A1927475 Hs.35406 ESTs, Highly slmBar to unnamed protein 5.1 

107869 AW975998 Hs.58595 ESTs, Weakly similar to I38022 hypotheS 5.1 

121309 AA293834 Hs.97312 ESTs 5.1 

A . 125321 T86652 Hs.178294 ESTs 5.1 

45 102627 AL021918 Hs.158174 zinc finger protein 184 (KruppeHike) 5-1 

104446 AF084555 Hs.7351 cydic AMP phosphoprotefn, 19kO 5.0 

126020 H79863 Hs.114243 ESTs 5.0 

116814 H50834 Hs.77899 gb7p88a10s1Soares fetal Ever spleen 5.0 

130622 AI582291 Hs.16846 ESTs, Weakly similar to O4HU01 debrisoqu 5.0 

50 110818 AL157503 Hs.27552 Homo sapiens mRNA; cDMA 0KFZp586N2424 (f 4.9 

127765 AA971146 Hs. 129074 ESTs, Moderately similar to M£G1 MOUSE M 4.9 

108768 AF181721 Hs.61345 RU2S 4.9 

120484 AA253170 Hs.96473 EST 4.9 

114518 AW163267 Hs.106469 suppressor of varl (S.cerevistae) ma 4.9 

55 129012 R81936 ribosomal protein L44 4.9 

124973 AI476066 Hs. 102243 ESTs, Weakly similar to 178885 serine/th 4.9 

124648 AA249086 Hs. 125034 hypothetical protein FU13340 4.9 

126904 AA948033 Hs.130853 ESTs 4.9 

, rt 131526 AI005169 Hs28274 Homo sapiens cONA: FU22049fis, done H 4.9 

60 130637 AA356764 Hs.17109 Integral membrane protein 2A 4.8 

126769 AA083456 gb2n09g08rt Stratagene hNT neuran (937 4.8 

126086 H75681 gb.-yr77g01.r1 Soares fetal Ever spleen 4.8 

100169 AL037228 Hs.82043 D123 gene product 4.8 

_ 130262 D63216 Hs. 153684 rnzzteoVelated protein 4.8 

65 109260 AW978515 Hs.131915 WAA0B63 protefn 4.8 

103120 BE410731 Hs.74050 to War lymphoma variant translocatio 48 

111099 R15337 Hs.21958 Homo sapiens mRNA; cONA OKFZp547D086 (fr 4.8 

126142 H86261 Hs.40568 ESTs 4.8 

126802 AW805510 Hs.97056 hypothetical protein FU21634 4.8 

70 105848 AW954064 Hs^4951 ESTs 4.7 

127987 AI022103 Hs.124511 ESTs 4.7 

129706 AA443241 ribosomal protem L44 4.7 

129598 N30436 Hs.11556 Homo sapiens cDNAFU12566fis, clone NT 4.7 

126629 AL096739 Hs.107260 trypothefical protein OKFZp586H0623 4.7 

75 111348 AA034922 Hs5585 ESTs 4.7 

100448 AF234887 Hs.57652 cadherin, EGF LAG seven-pass G-type rece 4.7 

127542 AA703684 Hs.179833 ESTs, Moderately similar to ALU5_HUMAN A 4.7 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 4.7 

123778 AW352149 Hs. 102314 ESTs 4.7 

80 126521 AI475110 Hs.203933 ESTs 4.7 

110343 AW136703 Hs. 17268 ESTs 4.6 

102963 X02404 Hs^74534 catetonirwelaied polypeptide, beta 4.6 

127207 AA377165 Hs.44833 ESTs 4.6 
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113974 AW9B9756 Hs.34145 ESTs, Weakly similar to B49647 GTP-binrJi 4.6 

133098 AW629223 Hs.64794 zinc finger protein 183 (RING finger, C3 4.6 

109920 H05840 Hs.111323 ESTs 4.6 

1034B7 AA7436C3 Hs. 1721 08 nudeoporin 88kO 4.6 

5 125353 AB033043 Hs.149377 hypoWca) protein OXFZp761L0424 4.6 

100893 8E245294 Hs.160789 S164 protein 4.6 

135047 AL134197 Hs.93597 cydin-dependent kinase 5, regulatory su 4.6 

111148 AB020690 Hs.7782 paraneoplastic antigen MA2 4.6 

, _ 107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7JHUMAN A 4.6 

10 108480 AL133092 Hs.68055 hypotheflcal protein DKF2p434l0428 4.5 

132742 AA025480 Hs^92812 ESTs, Weakly similar to T33468 hypotheti 4.5 

104912 AA813192 Hs.200596 K1AA0547 gene product 4.5 

110223 H19836 Hs.31697 ESTs 4.5 

, _. 113047 AI571940 Hs.7549 ESTs 4.5 

15 101031 J05070 Hs.151738 matrix metaDoproteinase 9 (gelatinase B 4.5 

113002 BE243513 Hs.7212 hypolhefical protein PP1044 4.5 

105304 AW134924 Hs.190325 ESTs 4.5 

112386 AA831785 Hs. 171 914 Homo sapiens cDNA FU 14209 fis, clone NT 4.5 

106489 AM52054 Hs.119338 ESTs 4.5 

20 122792 AW188551 Hs.99519 hypotheficat protein FU14007 4.5 

112651 AA595802 Hs.33410 ESTs,WeaMysirnHartoT17279hypotheU 4.5 

111346 AW970976 Hs.293653 ESTs 4.5 

123596 AA421130 Hs.112640 EST 4.5 

125447 A1582222 Hs.128686 ESTs 4.5 

25 106178 AL049935 Hs.301763 WAA0554 protein 4.4 

102250 AI249361 Hs.74122 caspase 4, apoptosls-related cysteine pr 4.4 

105868 AA378780 Hs.334842 tubulin alpha 1 4.4 

127496 AI031650 ESTs 4.4 

127315 AF116622 gb:Homo sapiens clone FLB4217 mRNA seque 4.4 

30 110193 A1004874 Hs.310764 Homo sapiens mRNA; cDNA DKFZp434M062 (fir 4.4 

105012 AF098158 Hs.9329 chromosorne 20 open reading frame 1 4.4 

129445 W52452 Hs.29797 ribosomai protein L10 4.4 

114721 D61939 Hs.103822 ESTs 4.4 

. c 120922 AA481003 Hs.97128 ESTs 4.3 

35 115167 AA749209 Hs.43728 hypothetical protein 43 

102407 AW602154 Hs.82143 E74-lke factor 2 (ets domain transcript 43 

106111 AW875398 Hs,6451 PRO0659 protein 4.3 

123829 AF251237 Hs.112208 XAGE-1 protein 4.3 

103616 NMJ02647 Hs.32971 prrasphdr^itide-^kinase, class 3 4.3 

40 100269 NM.001949 Hs.1189 E2Ftranscripfion factor 3 4.3 

112728 R91913 Hs.272104 ESTs, Moderately similar to ALU1_HUMAN A 4.3 

135098 AW274526 Hs.277721 membrane component, chromosome 17, surfa 4.3 

106034 AW952005 Hs.14928 hypothetical protein FU12903 4.3 

A _ 133571 BE515037 Hs.177556 melanoma antigen, family D t 1 4.3 

45 106246 AL036917 Hs.288821 KIAA1 638 protein 4.3 

125724 AL360190 Hs.3 18501 Homo sapiens mRNA fiitl length insert cDN 4.3 

132206 AA425204 HsJ334721 hypothetical protein FU13391 4.3 

130227 BE397151 Hs.153003 seiirasrthreonine kinase 16 4.3 

129650 AF109298 Hs.118258 prostate cancer associated protein 1 4.3 

50 130382 NMJ03450 Hs.155204 zinc finger protein 174 4.3 

106073 AL157441 Hs.17834 downstream neighbor of SON 4.2 

105403 AI473827 Hs.31793 ESTs 4.2 

118138 AA374756 Hs.93560 Homo sapiens mRNA for K1AA1771 protein, 4.2 

134126 NM.0O3747 Hs.131814 tankyrase,TRF1-Weracfing arikyrawela 4.2 

55 115041 AA252457 Hs.86543 ESTs.MoleralelysirrflartoTO^hypol 4.2 

106012 A1240665 ESTs 4.2 

116732 AW152225 Hs.165909 ESTs, Weakly slmlar to 138022 hypothec 4.2 

130832 AW836008 Hs30082 zinc finger protein 3 (A8-51) 4.2 

125960 AI754693 Hs.145968 ESTs 4.2 

60 133916 AL039185 Hs.77558 thyroid hormone receptor interacted 7 4.2 

10S232 AB037828 Hs.15370 WAA1407 protein 4.2 

129228 U40714 Hs.239307 tyrosyMRNA synthetase 4.2 

109929 AA773187 Hs.294027 ESTs 4.2 

„ 123729 AL039779 Hs^78672 msmbrane component ctarcorne 1 1 , surfa 4.2 

65 101266 135645 Hs.73984 EphA4 4.2 

132389 AA31Q393 Hs. 190044 ESTs 4.2 

124320 H95749 Hs.102342 EST 4.2 

103023 AW500470 Hs.1 17950 multifunctional polypeptide similar to S 4.1 

116133 AW449597 Hs313652 EST, Weakly simitar to I38022 hypothetic 4.1 

70 129001 AA443323 Hs.107812 BPOZ protein 4.1 

119271 AI081118 Hs.65328 Fanconi anemia, complementation group F 41 

114767 AI859865 Hs.154443 rrrfrtlchromosome maintenance deficient {S. 4.1 

126107 H75477 Hs.93361 ESTs 4.1 

„ 115333 BE241812 Hs.87850 protein tyrosine phosphatase, nornecept 4.1 

75 100571 L14561 Hs.78546 ATPase, Ca~ transporting, plasma membra 4.1 

134104 L3S253 Hs.79107 maogen-ecGvated protein kinase 14 4.1 

120150 BE005771 Hs.1 53746 hypothetical protein FU22490 4.1 

134470" X54942 Hs.83758 COC28 protein kinase 2 4.1 

129948 AI537162 Hs^63988 ESTs 4.1 

80 106882 M149537 Hs.26994 hypothetical protein FU20477 4.1 

126199 AW088276 Hs.1 25829 ESTs 4.1 

112727 T91029 Hs, 15069 ESTs 4.1 

118872 AI039201 Hs.283316 ESTs 4.1 
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131263 


AU077002 


Hs.24950 




134417 


NSL006416 


Hs.82921 




113609 


T93263 


Hs.16875 




127153 


A1732303 


Hs.186518 


5 


124105 


H11484 


Hs.79133 




100031 








106897 


AF039023 


Hs.167496 




128659 


AW630087 


Hs.103315 


10 


133928 


AW403084 


Hs.7766 


126965 


AI470523 


Hs.139336 




103100 


MMJJ05574 


Hs.184585 




119005 


AL038511 


Ks.125316 




118751 


N74210 


Hs.50454 


15 


128797 


NH.002975 


Hs.105927 


111387 


AI244489 


Hs.285724 




118676 


N45312 


Hs.46506 




107348 


AW973753 


Hs. 182426 




120528 


AI92351 1 


Hs.1 04413 


20 


115652 


BE093589 


Hs.38178 


102935 


BE561850 


Hs.80506 




115399 


AF151534 


Hs.92023 




133615 


M62843 


Hs.75236 




118129 


N57493 




25 


126522 


W31912 


Hs.21168 


131965 


W79283 


Hs-35962 




102507 


U52154 


Hs.193044 




126029 


AA704253 


Hs.169359 




107168 


W57578 


Hs.237955 


30 


105808 


AI133161 


Hs.286131 


134087 


U51166 


Hs.173824 




133195 


AI434760 


Hs.279949 




112996 


BE276112 


Hs.7165 




129428 


AA256906 


Hs.1 11364 


35 


118967 


AI5S8670 


Hs.216756 


127335 


AA830210 


Hs~>14263 




106636 


AW958037 


Hs.286 




133529 


W45623 


Hs.74571 




102266 


U29725 


Hs.3080 


40 


125905 


AI678638 


Hs.6456 


103080 


AU077231 


Hs.82932 




131971 


BE567100 


Hs.154938 




129703 


BE388665 


Hs.1 79999 




128895 


AW467000 


Hs.106985 


45 


110924 


AW058463 


Hs.1 2940 


112921 


R91095 


Ks.4276 




120820 


AA347417 


Hs.96869 




105459 


AA769081 


Hs.4029 




114253 


BE149855 


Hs.14831 


50 


105476 


AL117352 


Hs.77196 


122682 


AA984531 


Hs.159293 




130580 


N32388 


Hs.334370 




130174 


M29551 


Hs.151531 




115577 


AA393167 


Hs.41294 


55 


129785 


H19006 


Ks. 184780 


126127 


N95428 






109793 


F13088 


Hs.8040 




103175 


X69089 


Hs.79227 




100154 


H60720 


H3.81892 


60 


106140 


AB006624 


Hs.14912 


129052 


BE275031 


Hs.158210 




103319 


XB3492 


H&82359 




106319 


W22335 


Hs.7392 




102391 


AA296874 


Hs.77494 


65 


127262 


AA828125 




126872 


AW450979 






107315 


AA316241 


Hs.90691 




113119 


T47910 






126962 


R12014 


Hs.20976 


70 


116203 


AW137166 


Hs.87306 


123377 


AW969163 


Hs371297 




122798 


AW36628S 


Hs.145696 




107780 


AA018927 


HSJ69213 




132733 


AB020631 


Hs.123654 


75 


120649 


AA687322 


Hs.192843 


131163 


AA099524 


Hs.23754 




126708 


AW962593 


Hs.135260 




117417 


AI241733 


Ks.43871 




106272 


AW959731 


H&323099 


80 


110834 


AW273860 


Ks.5759 


123663 


AL039482 


Hs.142517 




124250 


AA350256 






127038 


AA233014 


Hs.105965 




107711 


W96141 


Hs.220687 



regulator of G-protein signalling 5 4.0 

solute carrier family 35 (CMP^afcaci 4.0 

EST*. Weakly similar to S23650 retrcviru 4.0 

hypothefcal protein MGC2599 similar to 4.0 

ESTs 4.0 

AFFX control. OapX-M 4.0 

RAN binding protein 6 4.0 

trinucleotide repeat containing 1 4.0 

ubiquiSivconjugafing enzyme E2E 1 (homo 4.0 

ATP^rKDng(assetta,sub4arnilyC(CrrR 4.0 

UM domain only 2 (rhomboM© 1) 4.0 

ESTs, WeaWy similar to S33990 finger pr 4.0 

ESTs 4.0 

stem cell growth factor; ryrnphocyte seer 4.0 

ESTs 4.0 

ESTs 4.0 

ribosomai protein S2 4.0 

ESTs 4.0 

hypothetical protein FU 23468 4.0 

small nuclear ribonucteoprotein pcJypept 4.0 

core histone macroH2A2.2 4.0 

ELAV (embryonic lethal, abnormal vision, 4.0 

gb;yy54c08.s1 Soares_mutOple_sderosis_ 4.0 

gbzc76d03.s1 Pancreatic Islet Homo sapi 4.0 

ESTs 3.9 

potasshjmimi^tHectifying channels 3.9 

ESTs 3.9 

RAB7, member RAS oncogene family 3.9 

CGH01 protein 3.9 

thyrnine-ONAglycosylase 3.9 

K1AA1007 protein 3.9 

zinc finger protein 259 3.9 

ESTs, Weakly similar to ubiquitous TPR m 19 

ESTs 3.9 

ESTs, Moderately similar to ALUI.HUMAN A 3.9 

ribosomai protein L4 3.9 

ADP-ribosytalion factor 1 3.9 

mitogen-activaied protein kinase 7 3.9 

chaperonin containing TCPl.subunit 2 (b 3.9 

cycBn D1 (PRAD1: parathyroid adenomatos 3.9 

hypothetical protein MDS025 3.9 

Homo sapiens, clone !MAGE:3457003, mRNA 3.9 

ESTs 3.9 

zinc-fingers and homeoboxes 1 3.9 

K1AA1701 protein 3.9 

EST 3.8 

glioma-amplified sequence-41 3.8 

Homo sapiens, Similar to zinc finger pro 3.8 

Human DMA sequence from clone RP5-876B1 0 3.8 

ESTs 3.8 

uncharacterized hypothalamus protein HBE 3.8 

protein phosphatase 3 (formerly 2B), cat 3.8 

ESTs 3.8 

ESTs 3.8 

gb:zb80dQ9.s1 Soares_se nescentj broblas 3.8 

heparan sulfate (glucosamine) 3-O-sutfot 3.8 

rnyornesin (M-protein) 2(155kD) 3.8 

K1AA0101 gene product 3.8 

K1AA02B8 protein 3.8 

hypothetical protein MGC2555 3.8 

tumor necrosis factor receptor superfam) 3.8 

hypothetical protein MGC3199 3.8 

deoxyguanostne kinase 3.8 

gbx>d71a09.s1 NO CGAP_Ov2 Homo sapiens 3.7 

gb:UWt.BI3^a-12^Ul.s1 NQ_CGAP.Su 3.7 

nudeophosrrWrarieorJasmin 3 3.7 

gfcyblBblUI Stratagene fetal spleen {9 3.7 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 

spBctng factor (CC1.3) 3.7 

ESTs 317 

PCFIIphomolog 3.7 

leucine zipper protein FKSG14 3.7 

ESTs 3.7 

ESTs 17 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 

Homo sapiens mRNA; cDNA DKFZp434P0810 (f 3.7 

EST.WeaWyslnilarto210926QABcelg 17 

ESTs 3.7 

ESTs 3.7 
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130850 AB040922 Hs.20237 DKFZPS66C1 34 protein 3.7 

119126 R45175 Hs.117183 ESTs 3.7 

125486 R08234 Hs.180461 ESTs 3.6 

_ 105150 AA631977 Hs. 155995 WAA0643 protein 16 

5 103163 AU077018 Hs.3235 keratin 4 16 

104495 AW9756B7 HsJ292979 ESTs 16 

107599 AW664072 Hs.60136 ESTs 3.6 

113577 AB00699 Hs.111334 PRO0470 protein 3.6 

102681 Y08B30 Hs.1 13503 karyopherin (hnportin) beta 3 16 

10 109411 R98881 Hs.109655 sex comb on midleg (Drcsophila^ke 1 3.6 

127563 AB035898 Hs.150587 WnesWike protein 2 16 

131112 H15302 Hs.168950 Homo sapiens mRNA; cONA DKFZp566A1046 (f 16 

114118 F01598 Hs.175930 ESTs 16 

103937 AA934063 Hs.13836 ESTs,WeaWysimlartol38022hypotheO 3.6 

15 125174 W51835 Hs231082 EST 16 

104799 AA029703 gb2e95rt0&s1 SoaresJetaLhearOfoHH19W 16 

128952 ALQ43463 Hs.6755 RaP2 interacting protein 8 16 

135191 X16866 Hs.333497 cytochrome P450, subfamily IID (debrtsoq 16 

124367 A1683183 Hs59348 distaHess homeo box 5 16 

20 113560 T91015 ESTs 16 

119232 AI655226 Hs.117659 ESTs, WeaHy similar to T46481 hypotheti 16 

113988 W87538 Hs.36473 ESTs, Weakly similar to JC5238 galactosy 16 

115173 BE612940 Hs.8B252 ESTs 16 

126600 AA699949 Hs.191385 ESTs 3.6 

25 127256 A1738610 Hs.267967 ESTs, Moderately similar to ALU8_HUMAN A 16 

123419 T66087 Hs. 11 2482 Homo sapiens unknown mRNA sequence 3.6 

102025 U04045 Hs.78934 mutS (E ccfi) homolog 2 (colon cancer, 3.6 

100661 BE623001 Hs. 132748 Homo sapiens ribosomal protein L39 mRNA, 3.6 

132339 D80030 Hs.45127 chofKrroiun sulfate proteoglycan 5 (neur 16 

30 115691 AW190215 Hs.62348 hypothetical protein FU 11 753 16 

134853 BE268326 Hs.90280 5^noimdazole-4-ranboxamide rioonucle 16 

134075 NM:012201 Hs.78979 Gotgi apparatus protein 1 16 

128468 T23625 Hs.150580 putative translation initiation factor 16 

„ 127229 AA316181 Hs.61635 six transmembrane epithelial antigen of 16 

35 108451 AA079195 gb:zm92h12.s1 Stratagene ovarian cancer 15 

133421 AF134160 Hs.7327 claudinl 15 

135332 AW393883 Hs.98968 hypothefica! protein FU23058 15 

132520 AA257992 Hs.50651 Janus kinase 1 (a protein tyrosine klnas 15 

125562 A1494372 Hs.98968 hypothefcal protein FU23G58 15 

40 126996 BE161065 Hs.167531 rneihytcrotcroyt-Coer^ 15 

125558 R59305 gbryh 1 6c lO.ii Soares infant brain 1HIBH 15 

117265 AA451966 RAB94ke protein 15 

130215 BE301883 Hs.152707 gDobiastoma ampnfied sequence 15 

109482 AI0O2238 Hs.11482 splicing factor, argmine/serine-rich 11 15 

45 133726 AK03188 Hs.252716 cxysterol-hinding protetn-refated protei 15 

132317 BE262438 Hs.44592 b«ta-1 ,4 mannosyl transferase 15 

117613 AW029507 Hs.161102 ESTs 15 

100944 L07518 mudn 6. gastrfc 15 

105226 R58958 . Hs.26608 hypothetical protein MGC15880 15 

50 125032 T74884 gb.7c5Bd02.s1 Stratagene fiver (937224) 15 

123720 AA609734 Hs.112755 EST 15 

128846 AA730767 H&285753 SCGWikeprotefn 15 

116443 AW962196 Hs.339808 LBP protein 32 15 

_ _ 128770 AB015982 Hs.143460 protein kinase C, nu 15 

55 106918 AA076049 > Hil274415 Homo sapiens cDNA FU10229 fis. dene HE 15 

131244 AI638429 Hs.24763 RAN binding protein 1 15 

128765 AF073310 Hs.143648 insulin receptor substrate 2 15 

111223 AA852773 Hs.334838 K1AA1 866 protein 15 

rf . 104857 AI920902 Hs. 19058 ESTs, Moderately stmflar to S65657 alpha 15 

60 105395 AI580880 Hs.268149 putative rnethyltransferase 15 

1335B2 BE391579 Hs.75087 Fas-acfivated serine/mreonrne kinase 15 

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunogbbul 15 

105367 AA504747 Hs.136102 K1AA0853 protein 15 

_ 103392 X94563 gb:Rsaptens dbi/acbp gene exon 1 & 2. 15 

65 101086 AA382524 Hs.250959 Nstafinl 15 

133423 T84084 Hs. 196008 Homo saptens cDNA FU1 1723 fis, done HE 15 

117714 N45226 Hs.46495 EST 15 

110986 AL133117 Hs.81376 Homo saptens mRNA; cDNA DKFZp586L1 121 (f 15 

^ 114096 AF060219 Hs^7007 chromosome condensation 1-Cke 14 

70 117147 AW901347 Hs.38S2 hypothetical protein FU23342 14 

133347 BE257758 Hs.71475 acid duster protein 33 14 

132456 AB011084 Hs.48924 K1AA0512 gena product; ALEX2 14 

133765 M62194 Hs.75929 cadhenn11.1ype2.QB«adherui (osteob 14 

109734 AI927212 HsJ734 ESTs 14 

75 132786 BE083422 Hs.56851 hypothe&al protein MGC2668 14 

106685 AU076617 Hs.16251 deavage and pciyadenytauon specific fa 3.4 

105593 AA279341 Hs.174151 aktehyde oxidase 1 14 

131381 M92642 Hs^6208 coOagen, type XVI, alpha 1 14 

108687 BE544475 Hs.54347 ESTs 3.4 

80 113115 AI141426 Hs.8705 ESTs 3.4 

107234 AA972447 H*288833 Homo sapiens mRNA; cDMA DKFZp434K087 (fr 14 

123110 AA486256 Hs. 1935 10 EST 3.4 

131019 W28614 chorionic sanataTumimotropin hormone 1 (p 14 
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129079 


AK000157 


Ks.108502 


hypothetical protein FU20150 


3.4 


111122 


N63753 


Hs.16492 


0XFZP554G2022 protein 


3.4 


103934 


BE278111 


Ha.134200 


0KFZP564C186 protein 


3.4 


128671 


AI885045 


Hs.211586 


phosprninosi&le-^kinase, regulatory su 


3.4 


123258 


AA490929 


Hs.105274 


ESTs. Weakly similar b RMS1.HUMAN REGUL 


3.4 


128826 


Z40313 


Hs. 106330 


Homo sapiens clone IMAGE:23371, mRNA seq 


3.4 


120149 


AA227609 


Hs.94834 


ESTs 


3.4 


128817 


BE395776 


Hs.168640 


ankylosis, progressive (mouse) horoolog 


3.4 


110679 


AA004798 


Hs.108311 


ESTs, Weakly simflar to T00351 hypotheQ 


3.4 


122365 


AA813546 


Hs.99034 


GTP-brntfing protein Rho7 


3.4 


134455 


BE378152 


Hs*3530 


hypothetical protein 


3.4 


115506 


AB037756 


Hs.45207 


hypothetical protein KIAA1335 


14 


134267 


AM 74596 


Hs.196209 


RAE1 (RNA export 1, S.pombe) homdog 


3.4 


106691 


AA443164 


Hs^3259 


hypothetical protein FU13433 


14 


105169 


BE245294 


Hs.180789 


S164 protein 


3.4 


120120 


BE547267 


Hs.59791 


hypothetical protein MGC13183 


3.4 


126638 


AA649257 


Hs.86998 


ESTs 


14 


128531 


H03721 


Hs.63236 


ribosomal protein S15a 


13 


106650 


AL049951 


Hs.22370 


Homo sapiens mRNA; cONA DKFZp564O0122 (f 


13 


108859 


AL121500 




ESTs 


13 


112276 


R53442 


Hs.26038 


ESTs, Weakly simflar to 138022 hypotheQ 


13 


125693 


H23989 


Hs.169743 


Homo sapiens clone 25121 neuronal olfact 


3.3 


113474 


R50752 


Hs.23856 


hypothetical protein MGCS297 


13 


132994 


AA112748 


Hs.279905 


clone HQ0310PRO0310p1 


3.3 


124691 


R05835 


Hs.110153 


ESTs 


3.3 


112511 


AW970420 




dynacun2(p50) 


3.3 


132985 


AL045579 


Hs.62113 


MAA0717 protein 


13 


125743 


H17151 


Hs.7416 


gb:ym37a05,r1 Soares infant brain 1MB H 


13 


133363 


AI866286 


Hs.71962 


ESTs, Weakly similar to B36298 proline-r 


13 


107908 


AF087999 




ESTs 


13 


105312 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


13 


110256 


H63947 


Hs.237955 


RAB7, member RAS oncogene tamfly 


13 


132073 


AA295052 


Hs.38516 


Homo sapiens, done MGG1S887, mRNA, com 


13 


107090 


AW809208 


Hs.183297 


DKFZP566F2124 protein 


3.3 


105463 


AA825974 


Ks.32646 


hypothetical protein FU21901 


13 


109592 


A1198059 


Hs.26370 


ESTs 


3.3 


113649 


N94768 


Hs.16400 


ESTs, WeaWy similar to K1AA1435 protein 


13 


103898 


AA248884 




gb:k3517.seq.F Human fetal heart, Lambda 


13 


116439 


AA251594 


Hs.43913 


PIBF1 gene product 


13 


129535 


AA397972 


Hs.1 12603 


cWmerin (crumaerin) 1 


13 


113283 


T66813 


Hs.12947 


EST 


3.3 


128992 


H04150 


Hs.1 07708 


ESTs 


13 


133160 


N54968 


Hs.66309 


hypothetical protein MGC11061 


13 


134076 


AF086215 


Hs.78980 


gb:Homo sapiens full length insert cONA 


13 


128301 


U90552 


Hs.284283 


butyrophilin, subfamily 3, member A1 


. 13 


127728 


AW404061 




protein kinase C, beta 1 


13 


126516 


R95872 


Hs.117572 


chemokine binding protein 2 


13 


127506 


T61039 


Hs.76067 


ribosomal protein LlOa 


13 


104769 


AA025887 


Hs.293343 


hypothetical protein MGC1 1266 


13 


126666 


AA648886 


Hs.151999 


ESTs 


13 


130453 


U80735 


Hs.173854 


PAX transcription activafon domain inte 


13 


107131 


AW961605 


Hs.21145 


hypothetical protein RG083M0^2 


3.3 


130422 


AW160614 


Hs.180034 


cleavage stimulation factor, 3" pre-RNA, 


13 


121292 


AA401807 




0tav65f1U1 SoaresJotaUetus_Nb2HFB_ 


13 


123284 


AA488988 


Hs^93796 


ESTs 


13 


130734 


AW137091 


Hs.18624 


WAA1052 protein 


13 


105400 


AF198620 


Hs.10283 


RNA binding motif protein 8A 


3.3 


105014 


AA121123 


HsJ69267 


ESTs, WeaWy similar to AF161361 1 HSPC0 


3.3 


114988 


AA251089 




gb3S04f05^1 NCLCGAPJ3C81 Homo sapiens 


3.3 


132867 


AF226667 


Hs.58553 


CTP synthase II 


3.3 


124169 


BE079334 


Hs.271630 


ESTs 


13 


114652 


A1521936 


Hs.107149 


novel protein similar to archaeal, yeast 


13 


113876 


AI799751 


Hs.5635 


ESTs 


13 


111520 


AI985369 


Hs.301134 


ESTs 


13 


121748 


BE536911 


Hs.234545 


hypothetical protein NUF2R 


13 


106834 


AL044182 


Hs.28070 


WAA0753 gene product 


13 


128869 


AA768242 


Hs.60618 


hypothetical protein 


3.3 


129619 


AA209534 


Hs.284243 


tetraspan NET-6 protein 


3.3 


126770 


A! 292320 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


13 


116734 


AW900992 


HS.9379S 


DKFZP586D2223 protein 


13 


100253 


038024 


Hs.157425 


double homeobox, 2 


13 


130999 


AA326683 


Hs.21992 


GkeVorlrMjtogofrnousevar^tporyade 


13 


114092 


H81213 


Hs.14825 


ESTs. WeaWy similar to KIAA1503 protein 


13 


129284 


AA318224 


Hs.298141 


ESTs 


13 


119127 


AA708035 


Ks.12248 


ESTs 


13 


123910 


AA621262 


Hs.179923 


ESTs, WeaWy simflar to S65657 alpha-1 C- 


13 


132654 


A1740461 


Hs.54542 


ESTs 


13 


118397 


BE139479 


Hs.161492 


ESTs 


13 


107003 


AW138437 


Hs.24790 


WAA1573 protein 


13 


126735 


M69113 


Ks.226795 


gtutathiona S- transferase pi 


12 


130847 


AI672483 


Ks. 20220 


lipase protein 


12 


101166 


AA020956 


Hs.179881 


core-binding, factor, beta subunit 


12 


113626 


T94318 


Hs.17359 


ESTs, Moderatety stmOar to RL44.HUMAN 6 


12 
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121782 


AW452957 


Hs.63348 




119863 


AA081218 


Hs£8608 




108103 


BE620779 


Hs.12094 




123808 


AA620552 




5 


1337S1 


AF041430 


Hs.75922 




110561 


AA379597 


Hs.5199 




11539 


BE251328 


Hs.73291 




117457 


N29682 


Hs.44071 


10 


128862 


BE2S0742 


Hs.106673 


129673 


038552 


Hs.1191 




105154 


AA307279 


Hs.35947 




126722 


N66148 


Hs.11 125 




109966 


H09103 


Hs.30897 


15 


129315 


NMJJ14563 


Hs.174038 


109517 


A1631874 


Hs.155140 




105252 


AB039670 


Hs.9728 




101754 


$70114 


Hs.239489 




110384 


H45282 


Hs^68798 


20 


134118 


BE336680 


Hs.1 82877 


134869 


AL157518 


Hs.90421 




100780 


BE561958 


Hs.293441 




125728 


AW954565 


Hs.57987 




129794 


AF161399 




25 


129056 


A1769958 


Hs.108336 


107129 


AC004770 


Hs.4756 




103038 


AA926960 






130553 


AF082649 


Hs^52587 




125394 


BE178502 


Hs.173772 


30 


132305 


AI806090 


Hs.44344 


131136 


AB033099 


Hs^3413 




102983 


BE387202 


Hs.11 8638 




117639 


AA377165 


Hs.44833 




127076 


AI422951 


Hs.146162 


35 


126153 


H85692 


Hs.40730 


132676 


N92589 


Hs.261038 




104946 


AW242407 


Hs.73848 




122110 


AM 23000 


Hs.301240 




127705 


AJ003322 




40 


109442 


AW296134 


Hs.86999 


100474 


NM.000699 


Hs.300280 




115536 


AK001468 


Hs.62180 




119651 


AW663858 


Hs.333513 




127211 


AA305520 


Hs.108812 


45 


134964 


AI803516 


Hs.272891 


105551 


AW005822 


Hs, 25292 




119750 


A1538880 


Hs.94812 




125590 


R23858 


Hs.1 43375 




104590 


AW373062 




50 


110724 


AW016783 


Hs.30799 


116429 


AF191018 


Hs.279923 




133915 


AA815092 


Hs.77554 




128721 


AW403911 


Hs.266175 




128538 


R44214 


Hs.101189 


55 


129179 


AWS69025 


Hs.109154 


108793 


AA129395 


Hs.71139 




108807 


AI652236 


Hs.49376 




133461 


NM_000762 


Hs.334345 




132571 


AW674699 


H&5169 


60 


134937 


AI251449 


Hs.171939 


125756 


BE174587 


Hs.289721 




112369 


AW966243 


Hs.4243 




127002 


AL353940 


Hs34979 




128179 


AW293689 


Hs.127116 


65 


117121 


H95044 


Hs.321386 


126556 


AF255503 


Hs.1 12227 




128403 


AI9080Q6 


H&295362 




127930 


AA809672 


Hs.1 23304 




114250 


AI914699 


Hs.13297 


70 


108828 


AK001693 


H&273344 


105225 


AA211777 






117997 


N52090 


Hs.47420 




104558 


R56678 


Hs.88959 




124777 


R41933 


Hs.140237 
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106035 


N35568 


Hs.5245 


127521 


AW297206 


Hs.164018 




120215 


AF109219 


Hs.108787 




119403 


AL117554 


Hs.1 19908 




105024 


AA126311 


Hs^879 


80 


123485 


A1308876 


Hs.103849 


109416 


8E268388 


Hsi6945 




132490 


NNL00129C 


\ Hs.4980 




115348 


AA281562 


Hs.766 




117297 


AW779829 





Homo sapiens, done MGG 15203, mRNA, com 


3.2 


Homo sapiens cDNA FU14206 fis, done NT 


3.2 


mitochondrial ribosomal protein L30 


3.2 


gb:ae58g1 1 js1 Stratagene hing carcinoma 


3.2 


brain protein 13 


3.2 


HSPC1 50 protein similar to ubiquiiin-con 


3.2 


hypotheBca) protein FU10881 


3.2 


ESTs, Weakly stmBar to ALU5_HUMAN ALU S 


3.2 


eukaryoDc translation initiation factor 


3.2 


K1AA0073 protein 


3.2 


methyl-CpG binding domain protein 4 


12 


HSPC033 protein 


3.2 


EST 


12 


spondyloepiphyseal dysplasia late 


3.2 


casein kinase 2, alpha 1 polypeptide 


3.2 


ALEX1 protein 


3.2 


TIA1 cytotoxic granule-associated RNArbi 


3.2 


ESTs 


3.2 


WAA0116 protein 


3.2 


PR02463 protein 


3.2 


trnmunogtobuSn heavy constant mu 


3.2 


B-cell CLUlymphoma 11B (zinc finger pro 


3.2 


hypothetical protein RJ 13433 


3.2 


ESTs, WeaJdy simitar to ALUEJ4UMAN !!!! 


12 


flap structure-specific endonudease 1 


3.2 


CDC28 protein kinase 1 


3.2 


pituitary tumor-transforming 1 


3.2 


ESTs, WeaMy similar to 178885 serineAh 


12 


hypothetical protein FU20S34 


3.2 


WAA1273 protein 


12 


non-metastalfc cells 1, protein (NM23A) 


12 


ESTs 


12 


ESTs 


12 


ESTs 


12 


ESTs, Weakly similar to 138022 hypothetl 


3.2 


cardnoembryontc antigen-related cell ad 


12 


melanocorfin 1 receptor (alpha melanocyt 


12 


gbAI003322 Selected chromosome 21 cONA 


12 


ESTs. Weakly simter to S65657 alpha-1 C- 


12 


amylase, alpha 2A; pancreatic 


12 


anHfin (DrosophQa Scraps homolog), act 


12 


small Inducible cytokine subfamily E, me 


12 


hypothetical protein FU 22004 


12 


Wppocatein-fike protein 4 


11 


ribonudease HI, large subunfl 


11 


ESTs 


11 


Homo sapiens, done IMAGE:3840937, mRNA, 


11 


nudear receptor subfamily 1, group 1, m 


11 


Homo sapiens cDMA FU 13471 lis, done PL 


11 


putative nucleotide binding protein, est 


11 


Homo sapiens cDNA FU 14367 hs, clone TH 


11 


phosphoprotem associated with GEMs 


11 


ESTs 


11 


ESTs 


11 


ESTs 


11 


hypothetical protein FU20644 


11 


cytochrome P450, subfamily llA (phenobar 


11 


suppressor of G2 allele of SKP1 , S. cere 


11 


ESTs 


11 


growth arrest specific transcript 5 


11 


hypothetical protein FU12650 


11 


hypothetical protein DKFZp761P1010 


11 


ESTs 


11 


EST 


11 


membrane-asscciated nucleic add binding 


11 


Homo sapiens cONA FU 14459 hs, done HE 


11 


ESTs 


11 


ESTs 


11 


DKFZP564O0463 protein 


11 


gfczn57dOZs1 Stratagene muscle 937209 H 


11 


EST 


11 


hypothetical protein MGC4816 


11 


ESTs, Weakly similar to ALU1JWMAN ALU S 


11 


rrypottetfca! protein FU20643 


11 


ESTs 


11 


phosphatidyfinositoi glycan, class N 


11 


nucleolar protein NOP5/NOP58 


11 


ESTs 


11 


hypothetical protein DKFZp761D112 


11 


ESTs. Weakly slmJtar to A46010 X-Cnked 


11 


UM domain binding 2 


11 


ESTs 


11 


gkhn88a05.x1 NCLCGAPJ<id11 Homosapten 


11 
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112501 AA972447 Hs^88833 Homo salens mRNA; cONA OKFZp434K087 (fr 3.1 

107532 AA443473 Hs.173684 Homo sapiens mRNA; cDNA DKFZp762G207 (fr 3.1 

130753 AA205223 Hs.189 phosphodiestease 4C> cAMP-sperific (dun 3.1 

_ 132425 N87549 Hs.125287 zfric finger protein ZNFUttfke protein 3.1 

5 115188 AK000219 Hs.88367 hypothetical protein FU2021 2 3.1 

129707 AW572317 Hs.12082 Homo sapiens mRNA: cDNA OKFZp566L203 (fr 11 

106474 BE383668 Hs.42484 hypothetical protein RJ 1061 8 3.1 

106472 AI207162 Hs.3815 stathnM^protein RB3 10 

102687 NMJJ07019 Hs.93002 ubtqwCn carrier protein E2-C 3.0 

10 117602 N35020 Hs.44685 C3HC«ke zmc firmer protein 3,0 

105522 W19789 Hs.336635 Homo sapiens, clone IMAG£:4179482, mRNA 10 

133650 D84294 Hs.118174 tetratricopepfide repeat domain 3 10 

134624 AF035119 Hs.8700 deleted in Over cancer 1 10 

112435 NM.013255 Hs.288791 muskeEn l.totracetMarnieflTalorconta 10 

15 134654 AKD01741 Hs*739 hypothetical protein FU10B79 10 

106651 AA460421 Hs.30875 ESTs 3.0 

109597 AA989362 Hs.293780 ESTs 3.0 

132342 AW162758 Hs.341729 ESTs, Weakly similar to ALU5.HUMAN ALUS 10 

„ 106057 BE614474 Hs.289074 F-box only protein 22 10 

20 106378 AA824298 Hs.21331 hypothetical prolein FU10036 10 

128660 AA011597 Hs.1773S8 ESTs 3.0 

115467 A1366784 Hs.48820 TATA box Knding protein (TBP)-associate 10 

128604 AI879099 Hs.1 02397 dOT-3 for gonadotropm inducWe transc 10 

125219 A1804331 Hs.99423 ATP-dependentRNAheficase 10 

25 126698 AI221147 Hs.145088 ESTs, Weakly simitar to T1 5936 hypotheti 10 

133966 BE280478 Hs.182695 hypothetical protein MGC3243 10 

119155 R61715 Hs.310598 ESTs, Moderately similar to ALU1 ..HUMAN A 10 

131586 AA460352 Hs.26966 K1AA1 171 protein 10 

100237 030715 Human PAP (pancreatlUs-essociated prota 10 

30 105515 T24968 Hs.23038 HSPO071 prateto 10 

123073 AA485061 Hs.105652 ESTs 10 

111375 H56499 Hs.252692 ESTs, Weakly similar to 138022 hypotheti 3.0 

130724 AK001507 Hs.44143 Homo sapiens clone FLB6914 PR01821 mRNA, 10 

129928 A038993 Hs.1 34535 ESTs 3.0 

35 118922 AW206193 hypothetical protein DKFZp761B2423 10 

131902 AA180145 Hs.34348 Homo sapiens mRNA; cDNA DKFZp434P0235 (f 10 

125165 W45350 gbzc81h08.s1 Pancreatic Isiet Homo sapi 3.0 

122219 AA436002 Hs.183161 ESTs 3.0 

132195 BE018717 Hs.42124 ESTs 3.0 

40 102298 AA382169 Hs.54483 N-royc (and STAT) interactor 3.0 

103286 D38616 Hs.54941 phosphorylase kinase, aipha 2 (liver) 10 

130168 AK001389 Hs.1 5144 hypothetical prolein OKFZp564O043 10 

126997 AI377150 Hs.150914 ESTs 10 

128902 AAD36637 Hs.107052 ESTs 3.0 

45 117186 H98988 Hs.42612 ESTs, Weakly simflar to ALU1 JiUMAN ALU S 10 

126096 F08208 Hs.283844 sintotoralmcanyjxylatecarrier-B 10 

10S71 1 BE390125 Hs.143187 hypothefca! protein 10 

114046 BE018658 Hs.141003 Homo sapiens cDNA: FU21691 fe, clone C 10 

135029 H58818 hydroxysteraid (17-oeta) dehydrogenase 7 10 

50 112003 AW978731 Hs.301824 hypothefical protein PR01 331 10 

106735 R77698 HsJ37778 ESTs 3.0 

126628 N49776 Hs.170994 hypothetical protein MGC10946 10 

133350 A1499220 Hs.71573 hypothetical protein FU 10074 10 

110312 BE256986 Hs.11896 hypothetical protein FU12089 10 

55 125659 T57693 Hs.87929 Homo sapiens cDNA RJ 13707 fis, done PL 10 

129601 AB032964 Hs.1 15726 WAA1 138 protein 10 

123423 AA598484 gb:as38f04.s1 Gessler Wilms tumor Homos 10 

128695 NM.003478 Hs.101299 cuffinS 3.0 

123470 AW303285 HumanDNA sequence from clone RP1 1-1 10H4 10 

60 109252 BE440157 Hs.85944 ESTs 10 

101445 M21259 gbfluman Atu repeats in the region ff to 10 

102126 AW950870 Hs.78961 protein phosphatase 1 . regulatory finhib 10 

116475 AA733050 Hs. 33461 2 small nuctear rOxxwcleoprotein pdypept 10 

_ 106573 AA223447 Hs.1 2835 A kinase (PRKA) anchor protein 7 10 

65 103106 W27172 Hs.1857 phosphodiesterase 6G. c^P-specrfic, rod 10 

130755 BE293520 Hs.18910 prostate cancer overexpressed gene 1 10 

112246 R51321 Hs.25780 Homo sapiens cDNA FU12252 fe. clone MA 10 

126887 H20832 gb:ym48dQls1 Soares infant brain 1NIB H 10 

117960 AA310417 Hs.47044 ESTs 10 

70 133626 AW836130 Hs.75277 hypothetical protein RJ 13910 10 

113179 BE622021 Hs.152571 ESTs, Highly similar to IGWIrnRNA-bind 10 

109968 H09232 Hs.264B4 HiRA^nterading protein 3 3.0 

125770 AA143045 Hs.81665 \M<tt Hardy-Zuckerman 4 feSne sarcoma v Z1 
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AA0791 95 AA084955 AA1 26308 AA084956 ^„ nnn „ 
AF087999 N29296 AI928858 AA846757 N20229 AA026894 080634 AW388923 AW388802 AW388957 AW571771 AW388839 
AI686662 AW389079 AA602099 AW134567 AW51 7843 AI682674 A1474874 AA374167 
AA6029S4 AA609200 

AA812434 AI831542 AA766261 AI769894 N63376 AA21 4392 AA1 35833 AW60501 7 AA1 35965 AW450072 H04812 AA446459 
AW439151 AA426273 A1061423 AA598549 Z40087 A1753216 AA743075 AAB54268 AA458920 AA843893 AI566516 AI864957 
A1566634 AW971760 AA21 1796 AA430089 
M21259 

L07518AW815215 

M451966 NM.016370AB036693 AL139228 R58124 AI634847 AL1 19333 W07356AI334284 H29050AI192685AA652438 
AW172843 W19794 N21460 AI743862 AW130622 AA991348 A1204553 AA992664 N80848 AA699329 A1824676 R26624 R49853 
AW779829 N22494A1248120 

AL121500 AW291763 A1732432 AA129708 AA133309 AI733750 AI225224 
W45350 W45406 

AW161357A1879062A1928938AW161097AW161167BE314465AA351715R)7096M17^ 

AA476718 AW772454 AI807703 R44253 AA976667 A1985186 AI650254 H38942 R84829 AA018724 AA001000 H85934 AA01 9126 
H85609 AA017000 AA339355 AW950556 051397 AA213981 BE548002 A1056359 AA00 1560 AW9521 13 AA317769 A1857477 
AI857475 AW249771 AW162661 H38943AA018628 R85885 AI984613 AI934765 A1796172 AW157488 A1929191 R85523 051221 
D53851 H85610 AI749674 F21582 AA323145 AA019127 AA687444 T06745 A1699293 H29532 AA214029 AA223656 NM.016834 
X14474 R19697 H09695 R17455 R13812 R19056 A1681231 AI590200 R37671 AA861828 AI990023 AI935669 AW005821 AA324581 
H17335 R37659 R42802 R46242 R60936 R59731 H28993 AA479907 R44570 A1890696 AA308884 AA507078 R41 274 A1365507 
T16348 AI560453 F03259 F04722T16312 AA016081 AW073061 BE314824 W28930 R44098 R51045 
AA926960 AA926959 W76521 W24270 W21526 AA0371 72 BE267636 H83186 AA469909 N86396 AA001348 8E535735 AA081745 
BE566245 AA082435 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R98127 T55938 BE279271 AW960304 T29812 
AA476873BE2973B7AA292753AA177048NM 001826 X54941 BE314366AA908783A1719075BE270172BE269819AA689955 
AI204630 W25243 AK35150 AA872039 W72395 T99630 AW22691 H98460 N31428 BE25591 6 H03265 AI857576 AA776920 
AA910644AM59522AA293140AW514667W^ 

AW024595 AW069734 AI828996 AA282997 AA876046 AW61 3002 AA527373 AW972459 A1831 360 AA621 337 AA100926 AA77241 8 
AA594628 AI033892 W95098 AW34317 AA398727 AI085031 K95210 AM59432 AI041437 AA932124 AA627684 AA935829 
AHXM827 A142351 3 A1094597 H42079 R54703 AI630359 AA617681 AA978045 AA643280 W44561 A1991988 AI537692 AI090262 
AA740817AI312104AI911822AA416871 Al 1 85409 AA1 29784 AA701623 AI075239 All 39549 AA633648AI339996A1336880 
AA399239 AI07B708 AI085351 A1362835AI346618 AI146955 A1989380 AI348243 N92892AA765850AI494230AI278887 AA962596 

AI492600 W80435 AA001 979 R97424 A1129015 N24127 AA157451 AA235549 AA459292 AA0371 14 AA129785 A149421 1 

AW059601 AW886710 R92790N59755AI361128AW589407 H47725 H97534 H48076 H48450T99631 AW300758 H03431 R76789 

AA954344 H77576 R96823 A1457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701624 N74173 R54704 H79520 

H72923 H03266 BE261919 AA769633AA480310 AA507454 AA91 0586 A1203723 AW1 04725 W256 11 W25071 T88980 H03513 

T77589 R99156 W95095 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA283672 

H75681 H70975 

M79088N88221 

N71B07AA203399 

N95428 W24040 AW751366 K81987 

R59305 R19748 

AA125984 AA127189 AA065075 AA070377 AA100017 AA079891 AA1 13255 AA075168 AA082764 AA083380 N84829 AA084752 

AA44&41 raS WraraSw AM57612 R63254 R28645 R27616 H01310 R78329 R76046 R76055 AA369734 N57914 H94864 
Al953638 R31669AW769278 R82398M131925R21776R79(»1An^ N63734 ^1S^^ 
AA3271 33 AI805980 AA005377 AW611716 T86946 AA369083 R67250 H00240 R32578 W86279 R80248 R23734 H00977 R21732 
R78932 AA368675 R167B0 R26523 H40161 W93671 R99823 R77956 H51601 R98785 R0973B H78317 H53537 H81056 T53780 
H12600 R82345 AA653499 AW953717 R63207 R33888 H54071 K66993 T48748 AI702300 R30775 H68995 AI014957 M369082 
AF075351 A1110886 AI742050 AW954245 AI768458 C19062 AI393674 C18911 C18029 AA708613 A1168432 R67389 M168453 
K00188 R21233 H03055 R53822 AA367558 R75872 W04151 AI220869 A1090290 AA368730 R94434 H81153 H70950 AA367783 
H81514 H53536 C16968 C17797 C17677 C17064 AA082581 C17044 D63217 C18791 D78839 D63290 D78838 N91085 H54070 
R80360 H78318 H40199 R33887 R02529 H94918 R00900 T87029 AA131924 R99891 AAD04267 D78764 D7B757 HSIBOOJ C18615 
K60972 R98784 W86323 R09737 C17492 D78811 R67494 D78775 AA368244 H 12650 D79043 AW957062 R82759 C15002 H01715 
AA828125 AA834883 AA330555 
AF1 16622 A1114507 AA640834 AA377999 
AA248884 

AA083456 AA1 27026 AA084934 AA1 20972 AA085317 
A19171 16 AA633698 AI580389 Z39483 

AVwSS^99190BE174210AW57aM1 H01811 W40166 R67 100 A! 923886 AVV952164 AA6284^ AW8^ 
AW898616 AA709126 AW898628 AW898544 AA947932 AW898625 AW898622 AI27612S AI185720 AW510698 AA987230 T52522 
BE467708 AW243400 AW043642 A12B8245 AI186932 D52654 D55017 D52715 052477 D53933 054679 AI298739 AI146984 
A1922204 N98343BE174213 AAB45571 AI813854AI214518 AI635262AI139455 AI707807 AI698085 AW884528 AI024768 
A1004723 AW087420 AI565133 N94984 A1268939 AW513280 AI061126 AI435818 AI859106 AI360506 A1024767 AA513019 
AA757598 X56196 AA902959 AI334784 A1860794 AA010207 AW890091 AW513771 AI951391 A1337671 T52499 AA890205 
W6409C» S AA4^7 AA358688 AI961767 A1866295 AA780994 AK38591 3 BE174196 AA029094 AW5921 59 T55581 N79072 
A161 1201 AA910812 AI220713 AW149306 AT758412 AA045713 R79750 N76096 

^M5W79M1^^^S^6KAW^81 AA984358 AA492073 BE168945 AAB09054 AW238038 BE011212 BE011359 

BE011387 BE011368 BE011362 BE011215 BE011365 BE011363 

H20832Z40184 

AA225632 A1820970 AI820952 AA226472 AI732140 A1732059 AA226307 AA225500 
AA205976AA205930 

B^^W761796 AI185049 R94606 A1343947 AI565814 A»341735 AI749190 AVTO A^45^^ 
AVWB752A175(X>35A]624837AJ^^ 
AA^AW304422 R68273 AV^615C01651 AI090327 A!5382M 

^Sra8 AW182752T84462 R11149 AI7S7627 BE218556 BE217986 BE222M7 AI^B^^« 
AJ7671 36 A1268928 AA090761 AA329290 AI700593 AA460766 BE222710 8E550651 R1 1 148 A1979278 AW235819 AW874095 
AW196492 
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100237 
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123423 
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8113.1 
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genbanKJT47910 
genbanKJ\A029703 
genbankJ91015 
39565.1 



314220.1 
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entrezj)30715 

genbanKJ^A251089 

genbank u AA598484 



AC40865T53681 N77468 H51833 AA147247 R7S732C18450 R73999 AI095755 T49904 H03868AA411580 R33395 M410586 
T48869 063292 R31981 HI 2498 H02668 AA035018 R75957 A1803329 R27528 R36203 AI809932 AI808765 R78948 AA41 1449 
AA976929 AI378760 AI378620 T48870 R73906 R75632 H03612 AA909684 N50595 H0258O H12839 N58781 M742532 AI360919 
H^B£208298R68568A1350463R31935AVV069127AA411621 R25671 R36105H12451 H03869H51263AA035486 R25109 
R25110AA147933 
AJ003322AJ003324 

AW404061 R09654 T67160 N 50566 N53259 

R81936 D78695 D78806 D78780 C17009 AA004406 AA122102 R70625 AA148932 H59583 H81 146 H80378 H49863 H21 182 
H46534 H51478 H20702 H46515 H40200 H50046 H23647 H50439 H18383 H21846 H23849 H40182 R21924 H0 1290 H01283 
H42464 W31947 W39660 AA046219 H03539 HOI 741 R22008 AA04391 1 AA1 56838 078832 R36616 R66039AI492481 M088608 
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H13647 H01193 N74660 AA156601 H03455 R66040 R81937 R92416 H89486 R36617 R55795 A1088338 AJ373324 H66992 R96235 
AJ494132R16678AA088178AA705356AA982143AA148933 R09231 AI160937R70525 T46980A1200046 H02301 AA367587 
R35968T97106 078703 N78072 D78668 D53268 R28197 AA085579 R63766 R92415 W00998 R8&766 R67875 R27583 R09343 
HI 3646 R27682 T89007 

AI221919 Z19967 AA348780 AW964077 AW1S6028 BE540193 N94800 AA452368 M99604 AI341345 AW298800 AA724961 
AA931158 A1741227 AI806660 AI982626 D81263 053937 D52496 AA974487 AW043854 N50483 Z39997 A1492961 A1361526 
F04002 AA452141 T23551 A14726S5 Al 193667 AI341984 N92658 T32870 R52664 N50428 AW089291 AI934175 A1423737 060665 

^56AVra0U^ 3 ^AW9880O2 X93079 AJ002788 R51324 A1381600 080031 AW500520 AW593740 RJ9382 AA810597 

AM43241 ^3784^67255^047 AI457612 R63254 R28645 R27616 H01310 R78329 R76046 R76055 AA369734 N57914 H94864 
AB53638 R31669 AW769276 R82398 AA131 925 R21776 R79031 AI129553 N70340 AI2761 16 AA709381 N63734 AI342605 
AA327133 A1805980 AA005377 AW611716 T86946 AA369083 R67250 H00240 R32578 W86279 R80248 R23734 H00977 R21732 
R78932 AA36B675 R16780 R26523 H40161 W93671 R99823 R77956 H61601 R98785 R09738 H78317 H53537 H81056 T53780 
H12600 R82345 AA653499AW953717 R63207 R33888 H54071 H66993T48748AJ702300 R30775 H68996 AW14957 AW69082 
AF075351 AI110886 A1742050 AW954245 AI768458 C19082 A1393674 C18911 CI 8029 M708813 Al 168432 R67389 All 684 53 
H00188 R21233H03055 R53822AA367558 R75872 W04151 AI220869AI090290AA368730 R94434 H81153H70950AA367783 
H81514H53536 C16968 C17797 C17677 C17064 AA082581 C17044 D63217 C18791 D78839 063290 D78838 N91085 H54070 
R80360 H78318 H401 99 R33887 R02529 H9491B R00900 T87029 AA131924 R99891 AA004267 D78764 078757 H61600 C1861 5 
H60972 R98784 W86323 R09737 C17492 D7881 1 R67494 078775 AA368244 H12650 D79043 AW957062 R8Z759 C190O2 H01715 
VV28614V^435AI983043AA364395AW572472AW190386AI129278A1913081AW 

AI828466 AW572486 N52583 N89687 AW075567 AI571047 A1887479 AI559469 A1685802 AA805256 A1458777 AA974369 AI866929 
AI886032 AI823925 A1823566 AW198135 AI287510 AI565910 AA765775 A1866019 A1263697 AI355825 R42668 AA894603 
AW105585 AI824555 AW339175 020479 



AA235370 AW779760 N48674 AI375997 R45432 059344 AI203107 F07491 R35360 R25094 AI913631 A1498402 T61382 AI016320 

SSs AW117M6 AA888929 BE174517 270704 AL042292 H54958 AA310063 AW499643 AW501380 AV^1^AW501293 

AW501596 AW630168 AL041326 AA780690 L40517 U21556 AKJ93182 AW062487 AA883387 AA931302 BE252601 BE258381 

AA827330 AL043930 AA356337 AW962830 AW393814 Z70703 AW389484 T62231 AW780O49 AL043931 BE149744 AW963292 

BE439669BB>91737BE091657AA484004AI6^ 

AA362576A1289927AA382592 

AA608733 

T74884 

AA620552 

U423S9 

N57493 

AWTO193 AW137594 AI953685 AI919143 A1424371 AW007638 AW025681 AI954289 ^^(^679^^1 
AI623213AI652310 AI631411 A1565384 A1566734AA1 49597 AI5381 72 A1040831 AI770021 AI914287AA279311 W73001 AW93117 
A1693374 AH)2007 AI990645 N29752 R94559 AA806475 AA806828 R90998 R94558 AW3001 12 W24097 AW003016 N91920 
T47910 
AA029703 

AKB3097 AI859509 T27070 AA326718 AA024743 W23922 AA479593 AI765668 W23908 U92986 AA081632 N50578 AA065245 
AW365046AW365014AA981091 AA150231 H53426AA234651 N50522T79343T87399 N91858 H41179AW009453AA024744 

F03857 H88369 AI174488 AI002696 H53427 T87293 N91869 W35270 AA453723 AA453705 ^^PJ^^llfJ^ 

W23592 AI342074 D31158AA833756AI991896 AA447791 AI864125 AW377239 AA295365 AI687970 R34076 F07607 AA521310 

AW499865 R94584 R21283 

AA350256H68126 

M21305 

N24B29 

X94553 

H58818 

AA211777 

M195191 AA1 90578 AI532307 AA195227 AA743633 A1823408 AJ832203 AI6531 14 AA205307 AW021913 W6OT980 W674198 

^67^M41^AA707350 AA135157 AW34721 AI151036 AI038305 R52643 AA780141 AW207645 D19891 AI474370 

AA401739 N22905 N70378 

D30715 

AA251089 

AA598484 



Table 72A shows the Seq ID No, Ptey. ExAccn, UnigenelD. and Unlgene 718a far ail of the sequences m Table 73. 

80 Rcey. Unique Eos probesel identifier number 

ExAccn: Exemplar Accessbn number, Genbank accession number 

UnigenelD: Unlgene number 
Unlgene TUte: Uragene gene title 
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SeqlD No: 


SeqlD number correlation 


Pkey* 


ExAocn 


UnigeneiD 


103080 


AU077231 


Ks.82932 


410102 


AW248508 


Hsl79727 


101104 


AW862258 


Hs.1 69266 


447761 


AF061S73 


Hs.19492 


428163 


AW969726 


Hs.98381 


439221 


AA737105 


Hs.32250 


121619 


AA528339 


Hs.178052 


104659 


AW969769 


Hs.105201 


105782 


H09748 


Hs.57987 


129977 


NMJD00399 


Hs.1395 


100299 


D49493 


Hs.2171 


116301 


AW969706 


Hs.293332 


106533 


AL134708 


Hs.145998 


131313 


R96290 


Hs.75874 


105316 


AI671245 


Hs.24835 


113003 


AW292315 


Hs.7215 


102836 


U94320 


Hs.158330 


102745 


AW753855 


Hs.74376 


123308 


C14187 


Hs.1 57208 


120147 


AI917116 




123049 


BE047680 


Hs.211869 


119082 


AF252297 


Hs.91546 


105301 


AW352357 


Hs.7457 


128478 


AA708205 


Hs.100343 


106111 


AW875398 


Hs.6451 


131307 


NMJXW025 


H3.2549 


120830 


A1568170 


Ks.96886 


127664 


AA806164 


Hs.116502 


102725 


AB026187 


Hs.159156 


132520 


AA257992 


Hs£0651 


130637 


AA356764 


Hs.17109 


117602 


N35020 


Hs.44685 


128797 


NM.002975 


Hs.1 05927 


129703 


BE388665 


Ks.179999 


125770 


AA143045 


Hs.81665 


414761 


AU077228 


Hs.77256 


420462 


AF050147 


Hs.97932 


410268 


AA316181 


Hs.61635 


104691 


U29690 


Hs.37744 


416836 


D54745 


Hs.80247 



Unigene Title 

cydin D1 (PRAD1: parathyroid adenomatos 
ESTs; homotague of PEM-3 [Ciona savignyi 
neuropeptide Y receptor Y1 
protocadherin 8 

ESTs. Weakly similar to serine protease 
ESTs, Moderately similar to I78885 serin 
ESTs. Weakly similar to phosphaWyberi 
ESTs 

B^CU7rympr«)ma11B(zinc finger pro 
early growth response 2 (Krax-20 (Drosop 
growth drfferenSaSon factor 10 
ESTs 
ESTs 

ribosomal protein L44 
hypothetical protemFU 14594 
ESTs 

.neuropeptide Y receptor Y5 
olfactomedin related ER localized protei 
ESTs 

hemoQlobjn, beta 

(flckkopf (Xenopus laevis) homotog 2 
cytochrome P450 retinoid metabolizing pr 
MAGE1 protein 
ESTs 

PRO0659 protein 
adrenergic, beta-3-, receptor 
ESTs 
ESTs 

protocadherin 11 

Janus kinase 1 (a protein tyrosine Unas 

integral membrane protein 2A 

C3HC4-ttke zinc finger protein 

stem cell growth factor, tymphccyte seer 

Homo sapiens, done IMAGE:34 57003, mRNA 

v-kit Hardy-Zuckerman 4 feline sarcoma v 

enhancer of zeste (Drosophila) homotog 2 

chondrornodulln I precursor 

six transmembrane epithelial antigen of 

Homo sapiens beta-1 adrenergic receptor 

cholecystoWnin 



SeqIDNo 

SeqlD No B1&B2 
SeqlD No B3&84 
SeqlD No B5&B6 
SeqlD No B7&B8 
SeqlD No B9&B10 
SeqlD No B1UB12 
SeqlD No B13.B14.&B15 
SeqlD No B16 
SeqlD No B17&B1B 
SeqlD No B19&B20 
SeqlD No B21&B22 
SeqlD No B23&B24 
SeqlD No B25-827 
Seq ID NoB28&829 
SeqlD No B30&B31 
Seq!DNoB32 
SeqlDNoB33&B34 
SeqlD No B35-B40 
SeqlDNoB41&B42 
SeqtONoB43 
SeqlDNoB44&B45 
SeqiDNoB46&B47 
SeqlDNoB48&B49 
SeqlD No B50B53 
SeqlD No B54&B55 
SeqlDNoB56&B57 
SeqlDNoB58&B59 
SeqlD NoBSO 
Seq(DNoB61&B62 
SeqIDNoB63&B64 
Seq(DNoB65&B66 
SeqiDNoB67&B68 
Seq!DNoB69&B70 
SeqlDNoB71&B72 
SeqlDNoB73&B74 
SeqlD No B75&B76 
SeqlDNoB77&B78 
SeqtDNoB79&B80 
SeqlDNoB8UB82 
Seq!DNoB83&B84 



Table 72B 

Pkey. Unique Eos probeset identifier number 

CAT number Gene cluster number 

Accession: Gen bank accession numbers 

Pkey CAT Number Accession 

131307 3138J NWU000025 X72861 X70811 M29932 X70812 S53291 AW01 5148 AW581 776 AI276134 

131313 93372.1 RS6290 H02411 C18327 AA367588 AA367557 H89632 C17954 AA568860 AI752983 AA699451 H04260 ^128118^193364^^3^ 

AA029995 T40536 AA368746 AI189909 BE047384 AA747591 R22855 A1032539 A1151343 AA148534 H63941 T^/M^HTO 
AI200363 R79731 AA702947 A1400076 AI088494 C17938 AA599478 H02962 R77665 C17370 R65618 H7371 1 R58545 D79189 AW265710 
R77664TS2101 AW953745 AW953739 D79107 AA029105 AW953738 AA456487 R67000 M156623 AA368336 H63662 

116301 52669_2 AW969706AA456258AA491881 BE501 639 D52113 AW969710 A1591236 AI379869 AW968997AA040053AIB07206 AW663917 AA454645 

103080 17092.1 AUN7231 ^2^^4C^X59798 M64349 NM.001758 AA226808 M73554 BE409154 AA160096 BE384352 AA160820 BE382880 

BE261734 AA113821 BE407745 AA1 56380 BE390287 BE390020M1 00854 AA1 271 52 AW794066AW357101 AW^^AB4TO77 
W05266 A1824103AI499061 AA642944 A1042556 AA906539 W60380AI571777AL1 35581 M112340N7M59AA592929^0^^90 
AA126942 AI023701 A1873252 AA156319 AI190622 W60289 A1274886 R81309 AA100801 AA227161 AI568929 AA160503 AI074344 A1344561 

AI674613 AA642544 AW503909 AA128851 W39350 N40420 AA113072 BE 1681 16 AI620604 AI298125 BE075272 N40078 BEO75109 BE0B0779 
AI918938 BE1681 17 BE087369 AW995539 BE080949 BE080727 BE075271 BE075108 BE080955 BE089655 BE081115 AW750304 H66084 
A1146884 BE075154 AW992247 AI186525 AI752230 AW263140 W03329 N26056 AA948080 ^}^^ A ^^\^^l iAi 
AA148042 AI740837 BE082728 AA1 49570 W44495 BE089351 AA375044 N26775 H27771 AA064705 BE091204 R89337 N32676 N27141 
BE164704 H98049 W67603 AA425549W31090 AA80741 1 BE 173280 BE000178 T09020 W23852 AA062709 BE167894 AAD78515 R97329 
BE541980 N42086 AA102307 AA1 13772 BE276181 H20622 W44436 W67604 W46412 AW7711 13 AI7Q0678 AA502628 AA133137 BE274186 
BE396090 BE613371 BE612645 W46650 W95203 W92651 A1087288 R76299 AW604781 N55320 AI912334 AA403248 AW169156 W4970 
AW298822 AW080962 AI073747 W24123 AA577596 H21715 H27925 H26436 AI288304 AA1 48043 AA204678 BE047090 W48631 AA9C8347 
AA599485 AI276505 AI953979 AA563710 H25674 H51747 AA425389 AA516104 AI095335 T77237 AA151696 T92084 A1689037 A1K4162 
W49709AW514883M1(W676AI366087AA069474W 

A1346589 AA487700 A1081104 AA613344 A1377520 A128491 1 A131 1390 AA622062 AI055890 AI6SQ881 ^«7AMW090AWffl|8 
A1073353 W46300 AA062689 /U755078 AI753397 AI633564 AI273471 A1339890 AA699584 AA983722 A1079S58 A1752231 AA076tf1AA1 13245 



AW275281 A1280632 T29597 W48728 AW954336 W38317 W94768 A1084717 W4S567 A1245645 AW302501 N72201 AW51(^V^9132 
AA207064 A1143740 AW440672 AAB32154 A1290286 AB50704 AI271377 AA025369 AI864756 T77451 H97348 AA852165 A1932951 N98526 
AA487486 R92970 AA934071 AI080448 AA063257 C05786 N99899 R42969 AA887065 AA662686 AA533833 AA662304 H51748BE^444 
AI382164 AJ814595 BE537043A1 168307 BE408935 AA453606 R89428 AA936S27 AA936890 AW369618 AW264602 R18074M4741M 
AW372354 AI094358 R37210 AA948510 AA226909 8E172527 A1086652 BE408324 AW292848 A1768962 BE540703 BE409478 AA331692 
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AA257992 AA317029 AA362097 D17042 H48100 AW838107 AW838106 AW838110 AW838231 AW610241 AW838442 AA045061 AW384991 
BE000988 AA1 31806 BE 180577 AW838269 AL039831 AI754380 C06051 AA131737 AW838239 A1767455 AJ480134 AA61 0312 AW8381 90 
AA379252 AW838282 AA484027 AJ423413 AI685064 BE328307 A1241 BS7 AW838281 AW8382B4 A! 671 267 A1814928 AA828367 A1476306 
AI270180 AA622362 AI076497 AI635919 AI444994 AM 94028 AI669159 AI928131 AA448853 AI350143 AW079289 AW457807 AA480442 
AA522935 BE180570 AW380C87 AW380109 A1081015 AI690818 Al 589485 A1698510 AA64201 9 AA714366 AI580430 AA985S27 AI740475 
C21398 AA257993 A1302393 A168901B AW770194 AW753750 A10791S4 BE550338 AA559851 T18108 AI864822 AI932827 AA04S095 
AA045062 AI954225 AA768569 AA709308 AW958363 
N35020 

AB026187 NMJH4522 U79247 F13304 AA224524 F10902 R39431 

AW753865 F07644 F1 1 280 F06355 RJ8136 U79299 F07459 F08750 F12419 F12842 F08488 F08585 F 13403 F05921 F05512 F1 1683 F05416 
F12841 F1281Q F05418 FI^SO R24551 F06276 H11803 H10237 H11542 H22894 R59563 H19351 H18722 F11237 F0S507 H23123 R61595 
F07795 F06201 F12289 F07107 T78113 F08734 F13344 F05760 F11784 F08780 F05835 R20S88 F07739 F11497 R17410 H17414 R2044O 
T66090 R25292T66236T78766 T79908 R2S286 R60071 C14761 AW905192 AA331914 AW965291 H09000 F05212 F08313 F05825 H06399 
H15135H11378 R66424 R61541AA235405AI205041 R595S4H23124H18638 R373S9AI571275 R41780T66174T66159R54102W07657 
H10794 R52337 R42890 H14354 R40952 T17391 H1206B R56797 R51000 R60011 R37617 R39434 H29286 T16403 H09129 R49114 H15161 
H11729 R39160 H10468 H24454 H11432 R54433 H08768 H24042 H09511 H17281 H15292 R37849 AI991955 R52815 R42850T17390T16438 
T17388 T16285 R48963 R50782 R44247 R46729 R59558 R40B96 R43575 R44420 T17403 R37754 R46637 R51039 R40513 T23785 F10433 
T16350 H10191 R43688 T23543 H08591 R44351 R37575 R49508 F04379 F04056 F05067 F02010 F09158 F10036 F08900 F02559 F09914 
F09434 F05034 H17415 F02644 H10650 F10451 H14589 F10444T16440 R44517 R46643 F03993 F10443 PD2170 F09343 F10414 F04794 
F11 122 F04997 F03894 AW005932 F02767 F10941 F01777 R45260 F03386 R45261 T65009 F10990 F08942 F01673 H232S3 F04533 F02082 
F01669 F03689 H09568 R40763 H06400 R50958 N8Q570 A1082296 All 98226 T23763 AA81 4486 AW01 5823 A1886272 H06514 AA51 5969 
F10049 AI984267 A1886279 AA558516 F02071 AI783843 AI804884 F01 468 AA730438 AF035301 R38389 R41440 R59468 R14437 T89036 
R435B0 H10821 R52850 R5S722 R14559 T09299 H10467 R25829 F13546 R56796 R51 1 51 D82657 T32952 T66257 H06569 F12430 T74326 
H08489 R34430 R13083 H23364 R21 134 T77098 AW9571 57 H24150 H1 1403 R52336 F12830 T77173 T74281 T77663 T74736 H10649 H12067 
R20092 T74735 R54530R17552R59557 R54529H17280T75171 H14694 T74975H15291 R1 8574 R54 1 57 R1 3446 H15221 R35120R35O31 
R21277 R19632 R35820 R50899 R13923 R36771 R26485 R33061 R20100 H24426 H09184 H15958 R19253 H15196 K09510 R36032 R19691 
H14402H14885T17068 

AF252297 N27888 F13415 R34552 R14377 AA323962 AW61 1514 AW614097 AW195487 AI026040 A1097663 AA012834 AW969885 AA988198 
R49069 AW873673 R37774 F08840 AA484668 AW873664 AA170805 AA170825 AA58S275 AA585280 AA012833 
U94320MMJM6174U66275 

AA143045 AW271802 A1017093 AA450183 W93694 A1885044 Al 890068 AW264522 AW073559 AI355325 AI051302 A1567686 AI143554 
AI457891 AW468784 A1567693 AW884460 AI424892AA401 298 AW244098A1 122936 N24824 AA552932AA873164 A1308810 M26098 H81852 
N20793 AW263947 AI696267 AA449422 AW877967 
AI917116 AA633698 AI580389 239483 

AW292315AI090408 AW517041 AI92S695 A1092060 AA716741 N98235 T23629 

AW969769 AA484137 A1291362 N64753 AL133784 AA007153 BE046923 D59990 C15166 D59991 D60194 AA648108 N76160 D61027 060630 
U29690 AF272830 AA708166 AA010748 R23839 H59572 R23383 A1625747 AA011176 AW665324 H02674 KS9573 AA757931 H12502 R66851 
AA702310AA748431 AA749352 C01 127 A1003679 T96352 

AW352357 AAB52419 T80568 AA332510 AW 149659 AA341609 AA809550 AI183690 AA730875 T31719 AW965361 AA779709 T05461 
AA564806 AI142536 AI553857 053384 D53747 D54101 D551 12 AW071935 A1082447 AI29091 1 AW168872 AA1 99743 AI091646 AI246702 
AC71662 AW1 17296 AI032138 AI741489 A1538705 AA233541 AI682685 AW136268 AI689641 AA599916 AI174443 A!432175 AI1 31328 
AW136277 AA852418 AA243035 T03902 C15651 052660 H081Q8 AAB44525 AI420648 AI282445 AW068986 AW497637 AA779042 AA298564 
BE262936 

AK71245 081062 AI085879 Al 1 93252 AI870535 N23132 H97661 H18098 AA552173 AA233924 AA503379 H19618 060732 F04762 H1 1343 
AW59Q283 

AA806164 AA678564 AW978234 AW023951 AA651 859 Al 699785 AA91 3686 AW974569 Al 68 1287 

AW875398 A1064861 NNL014138 AF090943BE071887 BE547232 BE622599 A1345700 AA284315 AA147700 AW376340 BE220170 BE348475 
AA025037 AA057284AA025038T16193AW130196 AW594516AI989991 AI139091 AW675747AA830435AI924977AI670778 AI051887 
AA912574 AW130213 AI828335 AI570587 A1332760 AI3131 1 5 AA804653 AA477920 AA147645 AI916912AW768839AA715098AA292723 
AA732421 T33122 AI631560 AI698686 A11 85386 R43829 AA421638 AA961400 C00503 AA604025 AA554026 AA5S3885 BE172244 R24040 
AA708205H15953R20851 L13829 Ai934508 AJ95Q163 H92893 AI267243 N66614 H28983 R42031 H06198 AI468628 H92779 L13828 T34896 
H09748 R61554 AA350215 R40551 

AI568170 AA731212 AW294848 AW466878 AA568908 AI281 804 Al 923633 AA347885 AI718854 AA714940 

NMJ02975 AFQ20044 AB009244 BE620464 AA541683 AW000991 AW089850 AW664612 AI638676 A1627854 A1885823 A1884548 AI554122 
AI499682 AI570905 AI376008 AI871 364 AI074230 AI383719 AA9G5016 AA977446 AA844732 T78922 AW244007 AI924747 AI968230 AJ824575 
A1699224 AI569490 AW379856 AW292314 AI016874 AW292037 

AL134708AW206610AW665919AW594227AI917093AI498361 AA635141 AA453786 D54178 D53224 AI804341 053111 M78885AL1 34707 
AA453903 AI278054 A1334339 AI380496 AW235939 060373 D50993 081 246 
AAS28339 A1O17210 AA41691 1 

BE047680AF177395NM_014421 AB033208 AB035181 W55979 AI093 1 0S Al B63392 AA71 5452 AA300480 AW02231 3 AA6481 03 AA720544 
D61856 AA481794 AA737112 AW316551 AA714937 AW974378 AI370632 Al 184289 A1333962 AA714945 

BE388665 AA740614 AI394542 AA484451 AW070675 AA877654 A1360981 AI475799 BE349560 AL037351 N29437 AI246453 AW083444 
AA844441 Al 150871 AI440477 AJ889061 AA682422 AA680O53 AA74021 2 A1858728 AA490589 AW243692 AI249450 AI304448 AA534421 
AI500317 AI273337 AW027729 AI979132 C06120 A188C916 AW591923 A1803174 AI559264 BE250500 AW675522 AA883284 T71033 AI346838 
A1066754 AI197913AA931979 AA427599 AI304730 AI285159 A1273325 A1038450 AI033760 M74972 AA136517 A1304446 A1262652 AA401348 
AW514617 AI3S1795 AW514389 N3Q307 N93439 AA1 90775 BE349902 AI342114 AI186831 AA039628 H74242 AJ914825 A1955761 AI343851 
AA693856 T71096 AI498539 AA778995 AA084278 A1274678 AI022673 AW339621 AA704929 AW089735 AAD41252 AA161337 A! 185333 
AW189397 AI16Q990 AI082392 AI935848 All 44404 A1141633 AA447987 AA464241 AI342103 AW026118 AA834025 AW664459 AA603777 
AA630752 AA1 15450 N49944 AW 166071 AW769795 AA991877 AA873609 AW083441 H991 30 AA91 0538 AA4861 99 AW083358AA1 51767 
AI161193 N54945 AA180193 AI423477 AI188387 AI608747 N58900 AI038449 AA136852 AA565913 AA180502 H61741 AA701 125 AI127678 
AA278541 R76523 AA666187 K85083 F24755 R77729 AA135137 BE250395 R76239 AW794625 AW794404 K61957 AA308197 AW996031 
AA345524 N52726 AA354360 AA179785 N57375 R16447 AA 160545 W21481 

D49493 NM.004962 D49492 AA948479 AA993843 AA017558 AA992557 AA912897 Al 1 85363 AA01 8254 NS3253 AA297544 T19187 R53287 
AA069337 AA297489 N62906 AW023929 AA018253 AA017615 W25110 R52085 R52633 W16549 H88916 H89134 AA069338 
NMJXXB99 AA018140 AF139463 J04076 AW390694 AW361756 AL079830 AI190587 AA731059 AA742743 Al 63591 5 AA878209 AI262612 
AC64201 AA446300 AA715016 AA446027 AI274101 AA593330 AA593338 AW015388 A1681724 AI347984 AL045275 AA970069 AA018188 
AW014396 AW769445 AI588869 AW338296 AI801548 AI367702 AI888410 
C14187 AA49621 1 D80456 AA484051 D80455 AA528446 
AW8622S8 107615 

AA356764 NMJXJ4867 AF038953 W16B28 R50284 A1929589 R01594 AA010474 W37917 K6421 2 M78175 R5751 1 AA329322 W77834 
AA169869 AA285240 AA356670 T93146 AA325779 AA357064 AA328151 AA308939 AA306801 N31 260 AA306202AA31 2530 BE242703 
AA354371 N42259 ALG21786 H50955 H38038 AI655965 AI566732 AI953952 AW972335 AI077300 AI948482 AI760966 AA648335 W72197 
N32815 AA768073AW338181 AW001 742 AI819896AI01 5666 N58011 AA677584AJ 183750 AW105514 W37875 AW301 679 AA775257 
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410102 117657J 
10 410268 11897 J 



414761 14860.1 



15 
20 
25 



416836 16233J 
420462 19379J 
428183 287965J 



439221 



.1 



30 447761 7355J 



AA730357 AI128690 AA630149 AW241 515 AI147003 AI1 83499 N90923 AB68582 AA354145 AI147297 BE55Q502 AT085777 AI709C87 
AA773947 AA469192 Al 199289 AI1 27085 H89232 N30719 AI02B165 AW029469 AI70931 4 AA780834 AA778057 H891 25 AI654762 T93061 
N59622 AI380625 AA719982 AI248242 AA233925 AW262512 AW438666 AA01 0378 AA484010 AW438790 N22875 AA483718 A] 185407 
AA736720 AA664S87 AA648672 AA503406 AA5081 23 AA5031 40 H46638 N7581 2 AA730938 D60546 AA483039 AA507705 AA480448 
AI768654 M658955 M730803AA330312AW(»127BE172031AW961052AA339070M^ 

AW118733 AA484130 AA730783 AA730510 D82577 062828 AI469294 AI801343 N53447 AA740226 AI831983 AA516146 D79799 A1758212 
AA48404Q H38039 A1038772 Al 0394 1 0 AA52B387 M730576 AA527231 AA748111 052946 U38439 AA721708 H22130 AA559229 H51620 
AA283355 AA730970 R01595 N72450 AA469123 AA579389 AA480477 AA283306 AA347822 AA328013 

AW248508 AI278454 AI389343 AW291235 AI343946 AI347189 M970152 AI279668 AI971894 AA664359 AI191039 AA084598 M081641 
AA316181 BE439545 AC005053 AF 186249 AW386101 AC004969 AA730199 AA032221 AI6861 39 Al 1 67942 AA809228AI1 84070 A1394674 
AW969977 AA032279 AW079284 AA513174 AA888312 AI453179 AA483363 AA528432 AA579511 

AU077228 U61 145 AL044258 AA206185 A1044975 NM_004456 X95653 AW890905 AA332806 AW965453 BE299226 U52965 BE275009 
AA428252 AA769212 AA312778 AA810901 AA46131 5 AI969286 AI953668 AW296248 AI291422 AA704747 Z44716 BE541 235 AW608806 
AA588583 AW403788 AW408386 AA053521 T87571 AA581 163 BE183241 T97906 R89149 AI808196 BE618521 AI970278 AI652680 AW130143 
A1970272 AW197254 BE550449 AI628403 M010143 AW005107 AA251815 AA805836 N53459 AA055309 AA643678 AI345616 AI345618 
A1635728 AI871286 A1018437 Al 2625 14 H75287 AKB3155 AI203388 AA205802 AI475265 AW594186 AI696059 AI333312 AW264710 AA876764 
AA864892 AA173156 AI292066 T79837 W93995 AA506427 AA815135 AI376342 AA010144 A1310421 AJ 223350 AA970576 N33926 T89364 
AI090913 N55098 AA101673 AA626665 AA251840 N68857 A1890053 H05930 AI474199 AI436166 AA628810 N64452 AI205538 AA055310 
BE244495 AI365179 T99536 A1203088 Z40549 AW139858 AA599627 R91847 AA483681 AW068982 H51746 AA091201 H 13030 AA459732 
T99430AI051764R45991 

054745 AU077189 NM.000729 R71188 H87563 R80701 AW956319 AA376289 R82442 H46452 R22481 R64323 AW043690 L00354 AW005571 
H19813 AI745597 AI769202 AI205982 AW205B41 R64324 AI201975 H46993 C18067 R71189 N29729 AI799838 A1091209 H87455 R82443 
AW970995AW072136N40841 H13148 A1219052 N30191 T2B007AI817428AA503602 

AF050147 NM.007015 ABC06000 AB005999 BE2181 14 A1703147 AI985891 BE222034 BE549774 BE327618 AI651459 BE549987 AI657194 
BE222414 BE046984 AW470357 AW339338 C02526 AW590220 AI298082 AW296998 AI333135 AI867816 AA730514 BE34851 3 AA405500 
AW136179 AA327834 BE327350 BE327366 

AW969726 AA424028 AI266636 AI271571 AA423988 AI268634 AA702780 AA452037 AI424866 AA742931 AA483361 

AA737106 N35765 N35779 AF086045 N30152 AW374028 AI478237 AA923562 AI382236 AA846668 AI627389 AI371709 H98083 H95983 

H26576 H95984 AA745021 AA639180 H26575 

AF061573 NML002590 AA780138 AW135620 AI197884 AA483348 R05468 AW969724 H29216 
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Table 73 i Sequences 

Seq ID NO: Bl DMA sequence 

Nucleic Acid Accession #: NM_0530S6.1 

Coding sequence: 210.. 1097 

1 11 21 31 41 51 

i i i i r i 

CACAOGGACT ACAGGGGAGT TTTGTTGAAG TTGCAAAGTC CTGGAGCCTC CAGAGGGCTG 60 

TCGGCGCAGT AGCAGCGAGC AGCAGAGTCC GCACGCTCCG GCGAGGGGCA GAAGAGCGCG 120 

AGGGAGCGCG GGGCAGCAGA AGCGAGAGCC GAGCGCGGAC CCAGCCAGGA CCCACAGCCC 180 

TCCCCAGCTG CCCAGGAAGA GCCCCAGCCA TGGAACACCA GCTCCTGTGC TGCGAAGTGG 240 

AAACCATCCG CCGCGCGTAC CCCGATGCCA ACCTCCTCAA CGACCGGGTG CTGCGGGCCA 300 

TGCTGAAGGC GGAGGAGACC TGCGCGCCCT CGGTGTCCTA CTTCAAATGT GTGCAGAAGG 360 

AGGTCCTGCC GTCCATGOGG AAGATCGTCG CCACCTGGAT GCTGGAGGTC TGCGAGGAAC 420 

AGAAGTGCGA GGAGGAGGTC TTCCCGCTGG CCATGAACTA CCTGGACCGC TTCCTGTCGC 480 

TGGAGCCCGT GAAAAAGAGC CGCCTGCAGC TGCTGGGGGC CACTTGCATG TTCGTGGCCT 540 

CTAAGATGAA GGAGACCATC CCCCTGACGG CCGAGAAGCT GTGCATCTAC ACCGACAACT 600 

CCATCCGGCC CGAGGAGCTG CTGCAAATGG AGCTGCTCCT GGTGAACAAG CTCAAGTGGA 660 

ACCTGGCCGC AATGACCCCG CACGATTTCA TTGAACACTT CCTCTCCAAA ATGCCAGAGG 720 

CGGAGGAGAA CAAACAGATC ATCCGCAAAC ACGCGCAGAC CTTCGTTGCC CTCTGTGCCA 780 

CAGATGTGAA GTTCATTTCC AATCCGCCCT CCATGGTGGC AGCGGGGAGC GTGGTGGCCG 840 

CAGTGCAAGG CCTGAACCTG AGGAGCCCCA ACAACTTCCT GTCCTACTAC CGCCTCACAC 900 

GCTTCCTCTC CAGAGTGATC AAGTGTGACC CAGACTGCCT CCGGGCCTGC CAGGAGCAGA 960 

TCGAAGCCCT GCTGGAGTCA AGOCTGCGCC AGGCCCAGCA GAACATGGAC CCCAAGGCGG 1020 

CCGAGGAGGA GGAAGAGGAG GAGGAGGAGO TGGACCTGGC TTGCACACCC ACOGAOGTGC 1080 

GGGACGTGGA CATCTGAGGG CGCCAGGCAG GCGGGCGCCA CCGCCACCCG CAGCGAGGGC 1140 

GGAGCCGGGC CCAGGTGCTC CCCTGACAGT CCCTCCTGTC CGGAGCATTT TGATACCAGA 1200 

AGGGAAAGCT TCATTCTCCT TGTTGTTGGT TGTTTTTTCC TTTGCTCTTT CCCCCTTCCA 1260 

TCTCTGACTT AAGCAAAAGA AAAAGATTAC CCAAAAACTG TCTTTAAAAG AGAGAGAGAG 1320 

AAAAAAAAAA TAGTATTTGC ATAACCCTGA GCGGTGGGGG AGGAGGGTTG TGCTACAGAT 1380 

GATAGAGGAT TTTATACCCC AATAATCAAC TCGTTTTTAT ATTAATGTAC TTGTTTCTCT 1440 

GTTGTAAGAA TAGGCATTAA CACAAAGGAG GCGTCTCGGG AGAGGATTAG GTTCCATCCT 1500 

TTACGTGTTT AAAAAAAAGC ATAAAAACAT TTTAAAAACA TAGAAAAATT CAGCAAACCA 1560 

TTTTTAAAGT AGAAGAGGGT TTTAGGTAGA AAAACATATT CTTGTGCTTT TCCTGATAAA 1620 

GCACAGCTGT AGTGGGGTTC TAGGCATCTC TGTACTTTGC TTGCTCATAT GCATGTAGTC 1680 

ACTTTATAAG TCATTGTATG TTATTATATT CCGTAGGTAG ATGTGTAACC rCTTCACCTT 1740 

ATTCATGGCT GAAGTCACCT CTTGGTTACA GTAGCGTAGC GTGGCCGTGT GCATGTCCTT 1800 

TGCGCCTGTG ACCACCACCC CAACAAACCA TCCAGTGACA AACCATCCAG TGGAGGTTTG 1860 

TCGGGCACCA GCCAGCGTAG CAGGGTCGGG AAAGGCCACC TGTCCCACTC CTACGATACG 1920 

CTACTATAAA GAGAAGACGA AATAGTGACA TAATATATTC TATTTTTATA CTCTTCCTAT 1980 

TTTTGTAGTG ACCTGTTTAT GAGATGCTGG TTTTCTACCC AACGGCCCTG CAGCCAGCTC 2040 

ACGTCCAGGT TCAACCCACA GCTACTTGGT TTGTGTTCTT CTTCATATTC TAAAACCATT 2100 

CCATTTCCAA GCACTTTCAG TCCAATAGGT GTAGGAAATA GCGCTGTTTT TGTTGTGTGT 2160 

GCAGGGAGGG CAGTTTTCTA ATGGAATGGT TTGGGAATAT CCATGTACTT GTTTGCAAGC 2220 

AGGACTTTGA GGCAAGTGTG GGCCACTGTG GTGGCAGTGG AGGTGGGGTG TTTGGGAGGC 2280 

TGCGTGCCAG TCAAGAAGAA AAAGGTTTGC ATTCTCACAT TGCCAGGATG ATAAGTTCCT 2340 

TTCCTTTTCT TTAAAGAAGT TGAAGTTTAG GAATCCTTTG GTGCCAACTG GTGTTTGAAA 2400 

GTAGGGACCT CAGAGGTTTA CCTAGAGAAC AGGTGGTTTT TAAGGGTTAT CTTAGATGTT 2460 

TCACACCGGA AGGTTTTTAA ACACTAAAAT ATATAATTTA TAGTTAAGGC TAAAAAGTAT 2520 

ATTTATTGCA GAGGATGTTC ATAAGGCCAG TATGATTTAT AAATGCAATC TCCCCTTGAT 2580 

TTAAACACAC AGATACACAC ACACACACAC ACACACACAC AAACCTTCTG CCTTTGATGT 2640 

TACAGATTTA ATACAGTTTA TTTTTAAAGA TAGATCCTTT TATAGGTGAG AAAAAAACAA 2700 

TCTGGAAGAA AAAAACCACA CAAAGACATT GATTCAGCCT GTTTGGOGTT TCCCAGAGTC 2760 

ATCTGATTGG ACAGGCATGG GTGCAAGGAA AATTAGGGTA CTCAACCTAA GTTCGGTTCC 2820 

GATGAATTCT TATCCCCTGC CCCTTCCTTT AAAAAACTTA GTGACAAAAT AGACAATTTG 2880 

CACATCTTGG CTATGTAATT CTTGTAATTT TTATTTAGGA AGTGTTGAAG GGAGGTGGCA 2940 

AGAGTGTGGA GGCTGACGTG TGAGGGAGGA CAGGCGGGAG GAGGTGTGAG GAGGAGGCTC 3000 

COGAGGGGAA GGGGCGGTGC CCACACCGGG GACAGGCCGC AGCTCCATTT TCTTATTGCG 3060 

CTGCTACCGT TGACTTCCAG GCACGGTTTG GAAATATTCA CATOGCTTCT GTGTATCTCT 3120 

TTCACATTGT TTGCTGCTAT TGGAGGATCA GTTTTTTGTT TTACAATGTC ATATACTGOC 3180 

ATGTACTAGT TTTAGTTTTC TCTTAGAACA TTGTATTACA GATGCCTTTT TT6TAGTTTT 3240 

TTTTTTTTTT ATGTGATCAA TTTTGACTTA ATGTGATTAC TGCTCTATTC CAAAAAGGTT 3300 

GCTGTTTCAC AATACCTCAT GCTTCACTTA GOCATGGTGG ACCCAGCGGG CAGGTTCTGC 3360 

CTGCTTTGGC GGGCAGACAC GCGGGCGOGA TCCCACACAG GCTGGGGGGG GCCGGCCCCG 3420 

AGGCOGCGTG CGTGAGAACC GCGCOGGTGT CCCCAGAGAC CAGGCTGTGT CCCTCTTCTC 3480 

TTCCCTGCGC CTGTGATGCT GGGCACTTCA TCTGATCGGG GGCGTAGCAT CATAGTAGTT 3540 

TTTACAGCTG TGTTATTCTT TGCGTGTAGC TATGGAAGTT GCATAATTAT TATTATTATT 3600 

ATTATAACAA GTGTGTCTTA CGTGCCACCA CGGCGTTGTA CCTGTAGGAC TCTCATTCGG 3660 

GATGATTGGA ATAGCTTCTG GAATTTGTTC AAGTTTTGGG TATGTTTAAT CTGTTATGTA 3720 

CTAGTGTTCT GTTTGTTATT GTTTTGTTAA TTACACCATA ATGCTAATTT AAAGAGACTC 3780 

CAAATCTCAA TGAAGCCAGC TCACAGTGCT GTGTGCCCCG GTCACCTAGC AAGCTGCOGA 3840 

ACCAAAAGAA TTTGCACCCC GCTGCGGGCC CACGTGGTTG GGGCCCTGCC CTGGCAGGGT 3900 

CATCCTGTGC TOGGAGGCCA TCTCGGGCAC AGGCCCACCC CGCCCCACCC CTCCAGAACA 3960 

CGGCTCACGC TTACCTCAAC CATCCTGGCT GOGGCGTCTG TCTGAACCAC GCGGGGGCCT 4020 

TGAGGGACGC TTTGTCTGTC GTGATGGGGC AAGGGGACAA GTCCTGGATG TTGTGTGTAT 4080 

CGAGAGGCCA AAGGCTGGTG GCAAGTGCAC GGGGCACAGC GGAGTCTGTC CTGTGACGCG 4140 

CAAGTCTGAG GGTCTGGGCG GCGGGCGGCT GGGTCTGTGC ATTTCTGGTT GCACCGCGGC 4200 

GCTTCCCAGC ACCAACATGT AACCGGCATG TTTCCAGCAG AAGACAAAAA GACAAACATG 4260 
AAAGTCTAGA AATAAAACTG GTAAAACCOC AAAAAAAAAA AAAAAA 

Seq id NO: B2 Protein sequence 
Protein Accession 8: NP_444284.l 

1 11 21 31 41 51 
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MEHQLLCCEV ETIRRAYPDA NLLNDRVLRA MLKAEBTCAP SVSYFKCVQK EVLPSWRKIV 
ATWMLEVCEE QKCEEEVFPL AMNYLDRFLS LEPVKKSRLQ LLGATCMFVA SKMKETI PLT 
AEKLCIYTDN SIRPEELLQM ELLLVNKLKW N1AAMTPHDF IEHFXjSKMPB AEENKQIIRK 
KAQTFVALCA TDVKFISNPP SMVAAGSWA AVQGLNLRSP NNFliSYYRLT RFLSRVIKCD 
PDCLRACQEQ IEALLESSLR QAQQNMDPKA AEEEEEEBEE VDLACTPTDV RDVDI 

Seq ID NO: B3 DNA sequence 

Nucleic Acid Accession «: XMJJ44166 

Coding sequence: 1..1576 



60 
120 
180 
240 



11 



21 



31 
I 



41 



SI 



CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAQAA ATGGGGGCTT 
TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGAOGAGCG 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CTCCCGCCCC 
CAGGGCGGGC GAGGAGGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 
CGCAGCGCAG ACGCCCCAGC CXXXCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC COGCAACAAG TCAGGCGCCG CCTTTGGTGT 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCX3 GTGCGCACTG GCAAGATCCT 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 
GAACAGCCTG GGCTGCATOG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 
QAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 
COGAATATTC TCCTAAGCCC CGTGCCCCAT GCCTCCGGGG CCCACTCCAC TGGGCCCACC 
CTGGACCTGT TTTCCACTAA AGCCTTTTGG AAAGCGGTGA TTTGAGGGGC AAGGTGCTTA 
GAGATACTCG CTCGCTGGGG AAGGGGGGAG GGAGGCAGTG GTGGCTGGAG GGTGCGCCAC 
TTTCAGAGCC TCTGGTCACC CTGTCCTGGA AAGATTGGGA GGGGGCCAGA CTGAAAATTT 
TACTAGAGTT ACAACTCTGA TACCTCAACA CACCCTTAAA TCTGGAAGCA GCTAAGAGAA 
ACTTTTGTTT TGCCAGAGGT GGCCACTAAG GCATTCTGAC GCCCTCTGCC CACCTCCCCC 
GCTGTGTGTC ACTCCACCCC TTCTTCCGAG GAGGGGGTGG GTAAAAGGGA GAGGGAGAAT 
TACCACCTGT ATCTAGAGGT GCTCTTTGCA ATCCCTAAGC CCTCTGGTCC TGACCTCCGA 
CCTCCCAGCT CTGTCTTGTT CCTTGTCTTT GTCTTTCTTC CCTTCCCCCT GCCCCTGCCC 
CTACCAGCCC AGCTTTGGGG ACACCATCCT TCTGGGGAGA AGTAGGGGGA GGAATATTTG 
GATGGTCCCT CCATTCCTCT TCAGGCATCT GGAGGCCCTC TCCCCCACTC CTCCAAAGAA 
ACATCTCAAA TTATTGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGAGGGGACT 
AATACTGGGG TAAAGGGTCA AACCCCCACC TTCATCACTA TGGGCATTAT ATTTAGGGAG 
TAGTTCTTGG GCTGGATTTT CTGGTTGTGG AAGTGGGGGC GCCAGAGTAG TGTGTCTGCT 
ATTTAAAGGA GCAGGAAAGG GOGTGAGGCA GGAGGAGAGA CTGGTGGAGG GAAGAGCTGC 
TCCTCCCATG CAGTGCCCGA CTCCCTGCAC CCCTCTCAAC CTGACCTGAA CCTTTATTGA 
ATCCTTATTA GCTTGAATCC TTATTAGCTT GAATCCTCCA TGCAAATCAT GGAGTCTGTG 
TCCCACCTGA TGTGGTTGAG GAGAAGCCAG GTCTTCAAAG AGGGGTCAGC CTGGGGCAAA 
GCAGGACTGG GGGGAGGTGG GCAGCAGGGC CTATTCTGAG AATCACATAT TGTTACAGGC 
CTTGCACCCC CTTTGCTGCT TCCCTCCTGC TCATTTGGGG CTGCCACCAG CTCTCCACCC 
TCCTGGTTCC GCTGGCOGGG CCAAGAGAGG ATGGAGGGAT GGGAGTCCCA GGAGATCCTT 
GTAAATAGTG GGGTGGGACT GTTCTGAGTG ATCACCCGAG CACTTAAAGC TCCAGAGTCC 
CATTCTTCCT GGATGGAGCA GGTGGAGGTG CAGAGGGGAT TTCCTCCTCT CCTTCCTCCT 
GTCGAGAATT AACACCTCTC CACAGCCTTC CCCTCCAGAA CACCAGCCAG GGAGGGGTGG 
GGAAGGAGGT CACAGCCAAG AAAACTGCCC TGTGACGACT TCCCTCCTTC CCGCCTATGT 
GAGCCATCCT GAGATGTCTG TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTGCCG 
GGGGGCAGGT GGGGAGGGGG GTGGGAAGAA GGGATGTCTG TCTGTCGTCC CCCTCCCCCT 
C7CCACTCTT TACCCACAAA GGCAGAAGAC TGTTACACTA GGGGGCTCAG CAAATTCAAT 
CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACACGAAT AGTGAGAGAC 
AAAATAGAGG AGAGAAAGAG AGCATQAGAG GGAGCGAGAC AGGCGACCAA CACAGAGGAG 
AGAAAACAAA AATAGCAAAA AAAAAAAAAA AAAGCAGTTC TTTATAATTT AATATTCTAT 
TTTAATAAAG GCGTTTATTA CCATATAAAT GTAGCAAAGA ACCTGGGCTA ATATGAA 

Seq ID NO: B4 Protein sequence 
Protein Accession #: XP 044166.2 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



1 
I 

FCFAMPSLW 
RAGEDGGGGG 
ALYKBAELRL 
PMVTGRREDV 
LWGPKGATI 
EYNNENDFLA 
GEQGGDFGYG 



11 
I 

SGIMERNGGF 
GGAPAQPTAP 
KGSSNTTECV 
ATARREXISA 
KRIQQQTNTY 
GSPDAAIDSR 
GYLPPGYGVG 



21 
1 

GSLGCFGGSA 
PQPAPPPPPA 
PVPTSEHVAE 
AEHPSMIRAS 
IITPSRDRDP 
YSDAWRVHQP 
KQDVYYGVAE 



31 
I 

KDRGLLEDER 
APPAAPTTAP 
IVGRQGCKIK 
RNKSGAAFGV 
VPBITGAPGN 
GCKPLSTFRQ 
TSPPLWAGQB 



41 
I 

ALQLALDQLC 
AAOTPQPPTA 
ALRAKTNTYI 
APALPGQVTI 
VERARBEIET 
NSLGCIGECG 
NATPTSVLFS 



51 
I 

LLGLGEPPAP 60 

PKGASDAKLC 120 

KTPVRGEEPV 180 

RVRVPYRWG 240 

HIAVRTGKIL 300 

VDSGFBAPRL 360 

SASSSSSSSA 420 
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KARAGPPGAH RSPATSAGPB LAGLPRRPPG EPLQGPSKLG GGGLRSPGGG RDCMVCFBSE 
VTAALVPOQH NLFCMECAVR ICERTDPECP VCHITATQAI RIPS 

Seq ID NO: B5 DNA sequence 

Nucleic Acid Accession #: NMJJ00909.1 

Coding sequence: 209- . 1363 
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I 

CATTCCCACC 
GTTCAGTTCA 
AATAAGAATA 
ATAATCTATA 
TCATTCAGTC 
TGATTGTCAT 
CATTCTTGGT 
GAGAAATGTT 
CATGTGTCTC 
GATGTGTAAG 
GGTTCTCATT 
TAATAGACAT 
GCCTTTCCTG 
GTACAAAGAC 
TACCACTCTC 
CTTCAAGATA 
TAAGTACAGG 
ATTTGCAGTC 
GATCATTGCT 
ATCCACTTGT 
GCAGTTCTTC 
CATGTCCACG 
ATTTAAAAAA 
GGTCCCGGAT 
TCTCCCAAGG 
TTACTGCTTT 
TCTTCTGGAA 
AATATAAAGA 
GCTGACTTCA 
TTAGATTAGA 
TAGTGTGTTA 
GAAGTCATTC 
TTTTTTCACC 
TTAAAAATGA 
ACAGGAATGA 
AGAGCATTTT 
TATATTTATT 
TTGTCAAGCT 
ACAAATATCG 
CATCTTTCAA 
AGGGAAAAAT 
GTGTGACTTG 
GTTAATGTGC 
AAACTATATT 
TGTTTGATTT 
GGAGTCTCTT 



11 
I 

CTTCCTTCTT 
AGGGAATGAA 
AGCTGAACAG 
ACAACCAAAC 
CACTCTAATT 
CTGCCCTTGG 
GTCTCTGGAA 
ACCAACATCC 
CCCTTTACAT 
TTGAATCCTT 
GCTGTGGAAC 
GCTTATGTAG 
ATCTACCAAG 
AAATACGTGT 
CTCTTGGTGC 
TATATACGCC 
TCCAGTGAAA 
TGCTGGCTCC 
ACCTGCAACC 
GTCAACCCCA 
TTCAACTTTT 
ATGCACACAG 
ATCAACAACA 
GACATCTGTT 
AATGGGGTTG 
TGTTGTAGTT 
ATAGTTTTGA 
CTTTTATACT 
GAAGTACCTG 
TTAGATTGTC 
CAATAGTAAC 
AGAAGTGGTT 
TTAAGGGAGG 
ATAAAAAGAC 
AGAGAGAAAG 
AGAGTAATTA 
TGAATTGATG 
TCTGGTCTAA 
TTTTCCATAC 
AGGAAGTAAC 
ACACAAAAAC 
TGGOGTCTTA 
CTAATTTCAT 
TTAAAGAACA 
TAAAAGGGCG 
TCAGTTTCAT 



21 

I 

TAATAAGCAG 
GAATTCAGAA 
TTGACCTGCT 
CAATCAAAAT 
TCTCAGAGAA 
CGATGATATT 
ACCTGGCCTT 
TGATTGTGAA 
TTGTCTACAC 
TTGTGCAATG 
GACATCAGCT 
GTATTGCTGT 
TAATGACTGA 
GCTTTGATCA 
TGCAGTATTT 
TAAAAAGGAG 
CCAAAAGAAT 
CTCTTACCAT 
ACAATCTGTT 
TATTTTATGG 
GTGATTTCCG 
ATGTTTCCAA 
ATGATGATAA 
TAAAAACAAG 
AAATCATTTG 
GTCATAATTA 
CCAGACATCT 
GTACTTATTG 
CCATCCAATA 
AACAGATTGG 
AGTATGCAAA 
TGAGGTTTCT 
CTTTCATTTC 
ATACTTCTCA 
CAGCTCCCCA 
ATTTAATAAA 
GTCAAGAGAT 
TATGTACTCG 
AGCAGTGCCT 
ACCAAGGTAC 
TGCAGATACT 
TAAATAATGC 
GTATCTTGTA 
AGACATACTT 
GACATTTTAT 
TTTTTCTCAT 



31 

I 

GAGC6AAAAA 
TAATTTTGGT 
TTGAAGAAAC 
GAATTCAACA 
GAATGCCCAG 
TACCTTAGCT 
GATCATAATC 
CCTTTCCTTC 
ATTAATGGAC 
TGTTTCAATC 
GATAATCAAC 
GATTTGGGTC 
TGAGCOGTTC 
ATTTCCATCG 
TGGTCCACTT 
AAACAACATG 
CAATATCATG 
CTTTAACACT 
ATTCCTGCTC 
GTTCCTGAAC 
GTCTCGGGAT 
AACTTCTTTG 
TGAAAAAATC 
CACAACCTGC 
AAAATGACTA 
CATTTGGAAC 
TTGAAGTGCT 
GAATGAAATT 
CGGTCATTAG 
GCCATCCTTA 
AGCAGCATTC 
GTTTTTTGGT 
CTCCCGACTG 
GCTGCAAATA 
ACTTCAAAAC 
GTAAATTAGT 
TTTCCATTTT 
AAAGACTTTC 
ATATAGTGAC 
AATGTTAAAG 
TCATATAGCC 
ACT6TAAAGA 
ATCATGATTG 
CAATGTATTA 
TAAAATCAAT 
CCCATGACTT 



41 

I 

GACAAATTCC 
AAATGGATTC 
ATACTGTCCA 
TTATTTTCCC 
CTTCTGGCTT 
CTTGCTTATG 
ATCTTGAAAC 
TCAGACTTGC 
CACTGGGTCT 
ACTGTGTCCA 
CCTCGAGGGT 
CTTGCTGTGG 
CAAAATGTAA 
GACTCTCATA 
TGTTTTATAT 
ATGGACAAGA 
CTGCTCTCCA 
GTGTTTGATT 
TGCCACCTCA 
AAAAACTTCC 
GATGATTATG 
AAGCAAGCAA 
TGAAACTACT 
AACATACTTT 
AGATTTTCTT 
AAAAGGTGTG 
TTTTGTGAAT 
TCTTTAAAGT 
ATTGGGTCAT 
CTTTATGATA 
AGGAGCCGAA 
GGTTTTTGTT 
ATTGTCACTT 
TTATGGAGAA 
CATTTTGGTA 
ATTGCTGCAA 
TTTTACAGAC 
CGCTTACAAT 
TGATTTTAAC 
GAATATTCAC 
CATTTTAACT 
TTACTGAATA 
AGCCTCAGAA 
TACAGATAAA 
ATTGTTTTTG 
CCCTCGGATG 



51 
I 

AAAGAGGATT 
CAATATCGGG 
TTTGTCTAAA 
AGGTTGAAAA 
TTGAAAATGA 
GAGCTGTGAT 
AAAAGGAGAT 
TTGTTGCCAT 
TTGGTGAGGC 
TTTTCTCTCT 
GGAGACCAAA 
CTTCTTCTTT 
CACTTGATGC 
GGTTGTCTTA 
TTATTTGCTA 
TGAGAGACAA 
TTGTGGTAGC 
GGAATCATCA 
CAGCAATGAT 
AGAGAGACTT 
AAACAATAGC 
GCCCAGTGGC 
TATAGCCTAT 
GATTACCTGT 
GTCTTGCTTT 
GGCTTTGGGG 
TTATGCATAT 
ATTACGATGC 
CTTGATTAGA 
GGCATCATTT 
AGATAGTCTT 
TGTTTTTTTT 
AAATCAAAAT 
TTGGGCACCC 
CCTGACAACA 
ATAGCTAAAT 
TGTTCAGTGT 
TTGTAGAAAC 
TTTCAATGTC 
TTTACCTAGC 
TGTATAAACT 
GTTGTGTCAT 
TCATTTGGAG 
GTATTACATG 
CTTTTTCTGA 
GT 



Seq ZD NO: B6 Protein sequence 
Protein Accession ft: NP 000900.1 



MNSTLFSQVE 
LIIIILKQKB 
CVSITVSIFS 
DEPFQNVTLD 
RNNMMDKMRD 
LPIiLCHLTAM 
KTSLKQASPV 



11 

1 

NHSVHSNPSE 
MRNVTNILIV 
LVLIAVERHQ 
AYKDKYVCFD 
NKYRSSETKR 
ISTCVNPIFY 
AFKKINNNDD 



21 
I 

KNAQLLAFEN 
NLSFSDLLVA 
LIINPRGWRP 
QFPSDSHRLS 
INIMLLSIW 
GPLNKNFQRD 
NEKI 



31 
I 

DDCHLPLAMI 
IMCLPFTFVY 
NNRBAYVGIA 
YTTLLLVLQY 
APAVCHLPLT 
LQPFFNFCDF 



Seq ID NO: B7 DNA sequence 

Nucleic Acid Accession ft: NM_002590.2 

Coding sequence: 204.. 3416 



I 

GCTTCTGTAA 
CTGGAGGCTC 
TTCOGAGAGC 
GGATCGCACT 
TCCCCTTGCA 
GATACAGCAC 
TGCATATGAA 
TGCTCCGGGT 
AGCGGCTGTG 



11 
I 

GAGACGGAGA 
GGAGCTGCTA 
CTGAGACTGA 
TGAGGCTGGA 
GCTCTTCAGC 
CTTCGAGGAG 
AGTATCGGGT 
GCGCGAAGGC 
TGGCCAGGCC 



21 
I 

GGCGCAGAGT 
CCCGCAGACT 
CGCCCGACCT 
GGCATGAGTC 
CTCTGCTGGG 
GATGCCCCCG 
GACACAAGCT 
GACGGGCAGC 
CCGCAGTGCG 



31 
I 

GAGGGCGGGT 
TCTCCCGCAC 
GGAAACCAGA 
CTGTGAGGCG 
TGCTCTCAGT 
GCACGGTCAT 
TOCGCCTGAT 
TGAOOGTCGG 
TGCTGGCCTT 



41 

I 

FTLALAYGAV 
TLMDHWVFGE 
VIWVLAVASS 
FGPLCPIFIC 
IFNTVFDWNH 
RSRDDDYETI 



41 
I 

CCGCGOGTCC 
AGGGCTCGCA 
GAAGACTTCC 
TTGGGGCAGC 
GGCCCAGAGC 
CGGGACCCTG 
GAAGCAATTC 
GGACGCCGGC 
OGATGTGGTC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



51 
I 

IZLGVSGNLA 
AMCKLNPFVQ 
LPFLIYQVMT 
YPKIYIRLKR 
QIIATCNHNL 
AMSTMHTDVS 



51 
I 

TCAGAGCCCG 
AAGAGCGTGA 
TTAGCCTTTC 
CCCTGCCTTT 
AAAACAGTCC 
GCCGAGGACC 
AACAGCTCTC 
CTGGACCGCG 
AGCTTCTCGC 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 



799 



WO 03/042661 



AGGAGCAGTT COGGCTGGTG CACGTGGAGG TAGAGQTQAG GGACQTCAAC GACCACGCGC 600 
CGCGCTTCCC CAGGGCCCAG ATCCCGGTAG AOGTGTCCGA GGGTG066CA GTGGGCACGC 660 
GCATCCCCTT GGAGGTGCCG GTGGACGAGG ACGTGGGOQC CAACGGGCTG CAGACCGTGC 720 
GCCTGGCCGA GCCGCACAGC CCCTTTCGCG TGGAGCTGCA GACGCGAGCG GACGGCGCTC 780 
AGTGCGCAGA CCTGGTGCTG CTGCAGGAGC TGGACCGGGA GAGCCAGGCC GCCTACAGCC 840 
TGGAGCTGGT GGCCCAGGAC GGCGGCCGCC CGCOGCGCTC CGCCACGGCT GCCCTCAGCG 900 
TGCGCGTCCT GGATGCGAAT GACCACAGCC CGGCCTTCCC GCAGGGCGCC GTGGCCGAAG 960 
TGGAGCTGGC GGAAGACGCG CCCGTGGGCT CCCTGCTTCT CGACCTGGAC GCAGCCGACC 1020 
CCGACGAGGG ACCTAACGGC GACGTGGTGT TCGCATTTGG CGCCCGCACC CCGCCGGAGG 1080 
CGCGCCGCCT CTTTCGGCTT GACCCGCGAT CAGGCCGCCT CACCCTGGCC GGGCCCGTGG 1140 
ACTACGAGCG TCAGGACACC TACGAGCTGG ACGTGCGGGC GCAGGACCGC GGACCCGGGC 1200 
CCCGCGCTGC CACCTGCAAG GTCATCGTGC GCATCCGAGA CGTCAATGAC AACGCACCCG 1260 
ACATCGCCAT CACCCCGCTG GCCGCCCCAG GCGCGCOGGC AACCTCACCC TTCGCCGCTG 1320 
CCGCCGCCGC CGCTGCACTC GGGGGAGCGG ACGCTAGCTC GOCGGCGGGA GCCGGGACGC 1380 
CGGAGGCTGG TGCCACTTCG CTGGTGCOGG AGGGGGCGGC GCGCGAGAGC CTGGTGGCCC 1440 
TGGTCAGCAC CTCGGACAGG GACTCGGGCG CCAACGGGCA AGTGOGCTGC GCCCTCTATG 1500 
GGCACGAGCA CTTCCX3GCTG CAGCCGGCCT ACGCGGGCAG CTACCTGGTG GTGACCGCGG 1560 
CGTCGCTGGA CCGCGAACGC ATCGCCGAGT ACAACTTGAC GCTGGTGGCC GAGGATCGCG 1620 
GCGCGCCCCC GCTGCGCACA GTGCGGCCCT ACACGGTGCG TGTGGGCGAC GAGAACGACA 1680 
ACGCGCCGCT CTTCACGOGG CCGGTCTATG AGGTGTCGGT GCGCGAGAAC AACCCGCCAG 1740 
GCGCCTACCT GGCCACGGTG GCCGCCCGCG ACCGGGACCT GGGCCGCAAC GGCCAGGTCA 1800 
CCTACCGGCT GCTGGAGGCC GAGGTGGGCC GCGCCGGGGG CGCCGTGTCC ACTTATGTCT 1860 
CGGTGGACCC AGCTACCGGA GCCATCTACG CGCTGCGCAG CTTCGACTAT GAGACGCTGC 1920 
GCCAACTCGA CGTTCGCATC CAAGCTAGCG ACGGCGGCTC CCCTCAGCTT TCCAGCAGCG 1980 
CCCTAGTGCA AGTGCGCGTG CTGGACCAGA ACGACCATGC GCCAGTCCTG GTGCACCCGG 2040 
CGCCAGCCAA TGGCTCCCTA GAAGTGGCGG TGCCTGGGCG CACCGCAAAG GACACGGTTG 2100 
TGGCCCGTGT GCAGGCCCGG GATGCAGACG AGGGAGCCAA CGGGGAGCTG GCGTTCGAGC 2160 
TGCAGCAGCA GGAGCCGCGC GAAGCCTTCG CCATCGGCCG CCGCACGGGG GAGATACTGC 2220 
TCACCGGCGA CCTCTCGCAG GAGCCACCCG GTOGCGTGTT CAGGGCGCTC CTGGTCATAT 2280 
CCGACGGCGG CCGTCCCCCG CTCACCACCA CCGCAACTGT CAGCTTCGTG GTAACAGCAG 2340 
GGGGCGGGCG TGGGCCGGCT GCGCCTGCCA GTGCAGGAAG CCCGGAGCGT TCCCGCCCGC 2400 
CTGGCTCTCG GCTOGGGGTG TCCGGGTGGG TGCTGCAATG GGACACGCCG CTGATCGTCA 2460 
TCATCGTGCT GGCCGGGAGC TGCACGCTGC TGCTGGCOGC CATCATCGCC ATCGCCACCA 2520 
CCTGCAACCG CCGCAAGAAG GAGGTGCGCA AAGGGGGGGC CCTCCGGGAA GAGCGGCCCG 2580 
GGGCGGCGGG CGGCGGAGCC TCGGCTCCCG GCTCCCOGGA GGAGGCCGCC CGGGGAGCCG 2640 
GGCCCAGGCC CAACATGTTC GACGTGCTCA CCTTCCCTGG CACCGGCAAA GCGCCCTTTG 2700 
GCAGCCCCGC GGOGGACGCG CCTCCGCCTG CGGTOGCCGC GGCCGAAGTG CCGGGCTCAG 2760 
AGGGCGGCAG CGCCACTGGG GAAAGCGCCT GTCACTTCGA GGGGCAGCAG CGGCTCCGCG 2820 
GCGCGCACGC CGAGCCCTAC GGTGCCTCCC CGGGTTTTGG AAAGGAGCCG GCGCCCCCTG 2880 
TGGCGGTGTG GAAAGGACAC TCCTTCAACA CCATTTCTGG CAGAGAAGCA GAGAAGTTCA 2940 
GCGGCAAAGA CAGCGGTAAA GGGGACAGTG ATTTCAACGA CAGCGATTCC GACATCAGCG 3000 
GGGACGCTCT GAAAAAGGAT CTCATCAACC ACATGCAGAG TGGACTGTGG GCGTGCACCG 3060 
CTGAGTGTAA GATCCTGGGC CACTCTGACC GCTGCTGGAG CCCATCCTGC AGCGGGCCCA 3120 
ACGCACATCC ATCGCCTCAC CCACCAGCCC AGATGTCAAC CTTCTGTAAG AGCACGTCAC 3180 
TGCCTCGGGA TCCTCTGCGC AGGGACAATT ACTACCAGGC CCAGCTGCCC AAGACAGTGG 3240 
GGCTGCAGAG CGTCTATGAG AAAGTACTGC ACAGAGACTA TGACAGGACA GTCACTCTGC 3300 
TCTCCCCTCC CCGTCCAGGG AGGCTCCCAG ACCTGCAGGA GATTGGAGTA CCCCTCTACC 3360 
AGTCCCCTCC TGGCAGGTAC CTGTCCCCGA AGAAGGGAGC CAATGAAAAT GTGTAATCCC 3420 
ATGCTGCATG TCTTCACACA TATACAGGTC ACTCCGAGAA GCCCCT AAGT TATTGACCGG 3480 
TTTCAGTGTT GTATATATAA ATATGCAAGA TGTGCCTTAC AATGAAGTTG TTGGAAGCTA 3540 
TTTCCAATCA CATTGGTACT GTTTGGTATT TGGAAACAAA AATGTAGTTA ATGTAATTTT 3600 
TATGAAATGT GTGCAGTATT TAATTTTTCT TATGCTATTG ACTTTGATTT CAATTGCGGC 3660 
TTGCCATTTT CTTAGTGTTT TTAACCTGTA CATTGTGTAA TGTAATGTTT GTATATAATG 3720 
AAATTTTGTT ATATTTTTAT ATAATAAAAG CTAAAGTGGA AGTTATTGCC AAAGGAACTG 3780 
TCTGTAAGAC AAAAAACAAA ACATGTTGGA ATTACTTAAT TGAAATTTAT CTTTCACCTG 3840 
AAACAACCTA GTGTTTGAGA AATTTTGCCT TGCCAAGTAT AACTTGTATA TCTTGACTCT 3900 
GTGGTAGATT TCAAGTTCAA TGTTATTTAA TTACATTTGG TTTTCCGTAA ACCGTGTCAC 3960 
TTATAAGCAC AGTAATAAAA GATTTGTCAT GTGTTTGAAG AAAAAAAAAA AAAAAAAAAA 4020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO j B8 Protein sequence , 
Protein Acceesion #: NP_002S81.2 

1 11 21 31 41 51 

I I I I I 1 

MSPVRRVJGSP CLFPLQLFSL CWVLSVAQSK TVRYSTFEED APGTVIGTLA SDLHMKVSGD 60 

TSPRUiKQFN SSLLRVREGD GQLTVGDAGL DRERLCGQAP QCVLAPDWS FSQEQFRLVH 120 

VEVEVRDVND HAPRPPRAQI PVEVSEGAAV GTRIPLEVPV DEDVGANGLQ TVRLAEFHSP 180 

FRVELQTRAD GAQCADLVLL QBLDRESQAA YSLELVAQDG GRPPRSATAA LSVRVLDAND 240 

HSPAFPQGAV AEVELAEDAP VGSLLU5LDA ADPDEGPNGD WPAPGARTP PEARRLPRLD 300 

PRSGRLTLAG PVDYERQDTY BLDVRAQDRG PGPRAATCKV IVRIRBVNDN APDIAITPLA 360 

APGAPATSPF AAAAAAAALG GADASSPAGA GTPEAGATSL VPEGAARESL VALVSTSDRD 420 

SGANGQVRCA LYGHEHFRLQ PAYAGSYLW TAASLDRERI AEYNLTLVAE DRGAPPLRTV 480 

RPYTVRVGDB NDNAPLFTRP VYEVSVRENN PPGAYIATVA ARDRDLGRNG QVTYRLLEAE 540 

VGRAGGAVST YVSVDPATGA IYALR6FDYE TLRQLDVRIQ ASDGGSPQLS 8SALVQVRVL 600 

DQNDHAPVLV HPAPANGSLB VAVPGRTAKD TWARVQARD ADEGANGBLA PELQQQEPRE 660 

APAIGRRTGE ILLTGDLSQE PPGRVFRALL VISDGGRPPI* TTTATVSFW TAGGGRGPAA 720 

PASAGSPERS RPPGSRLGVS GSVLQHDTPL IVIIVLAGSC TLLLAAIIAI ATTCNRRKKE 780 

VRKGGALREE RPGAAGGGAS APGSPEBAAR GAGPRPNMPD VLTPPGTGKA PFGSPAADAP 840 

PPAVAAAEVP GSEGGSATGE SACHFEGQQR IiRGAHAEPYG ASPGFGKEPA PPVAVWKGHS 900 

FNTISGREAE KFSGKDSGKG DSDFNDSDSD ISGDALKKDL IKHMQSGLWA CTAECKILGH 960 

SDRCWSPSCS GPNAHPSPHP PAQMSTFCKS TSLPRDPLRR DNYYQAQLPK TVGLQSVYEK 1020 
VLHRDYDRTV TLLSPPRPGR LPDLQEIGVP LYQSPPGRYL SPKKGANEKV 



800 



WO 03/042661 



PCT/US02/36810 
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Seq ID NO: 89 dna sequence 

Nucleic Acid Accession #: AL121939.12 

Coding sequence: 185. .1426 

1 11 21 31 41 51 

I I I I I I 

AGCGGAGTST CAGTGGCGAG CTCCGGGTGC TGTGGCCOGG CCTTGGCGGG GCGGCCTCCO 60 

GCTCAGGCTG GCTGAGAGGC TCCCAGCTGC AGCGTCCCCG CCCGCCTCCT CGGGAGCTCT 120 

GATCTCAGCT GACAGTGCCC TCGGGGACCA AACAAGCCTG GCAGGACAAA ATTAGAAGAT 180 

CAAAATGGAA AATATGCTGC TTTGGTTGAT ATTTTTCACC CCTGGGTGGA CCCTCATTGA 240 

TGGATCTGAA ATGGAATGGG ATTTTATGTG GCACTTGAGA AAGGTACCCC GGATTGTCAG 300 

TGAAAGGACT TTCCATCTCA CCAGCCCCGC ATTTGAGGCA GATGCTAAGA TGATGGTAAA 360 

TACAGTGTGT GGCATCGAAT GCCAGAAAGA ACTCCCAACT CCCAGCCTTT CTGAATTGGA 420 

GGATTATCTT TCCTATGAGA CTGTCTTTGA GAATGGCACC CGAACCTTAA CCAGGGTGAA 480 

AGTTCAAGAT TTGGTTCTTG AGCCGACTCA AAATATCACC ACAAAGGGAG TATCTGTTAG 540 

GAGAAAGAGA CAGGTGTATG GCACCGACAG CAGGTTCAGC ATCTTGGACA AAAGGTTCTT 600 

AACCAATTTC CCTTTCAGCA CAGCTGTGAA GCTTTCCACG GGCTGTAGTG GCATTCTCAT 660 

TTCCCCTCAG CATGTTCTAA CTGCTGCCCA CTGTGTTCAT GATGGAAAGG ACTATGTCAA 720 

AGGGAGTAAA AAGCTAAGGG TAGGGTTGTT GAAGATGAGG AATAAAAGTG GAGGCAAGAA 780 

ACGTCGAGGT TCTAAGAGGA GCAGGAGAGA AGCTAGTGGT GGTGACCAAA GAGAGGGTAC 840 

CAGAGAGCAT CTGCAGGAGA GAGCGAAGGG TGGGAGAAGA AGAAAAAAAT CTGGCOGGGG 900 

TCAGAGGATT GCCGAAGGGA GGCCTTCCTT TCAGTGGACC CGGGTCAAGA ATACCCACAT 960 

TCCGAAGGGC TGGGCACGAG GAGGCATGGG GGACGCTACC TTGGACTATG ACTATGCTCT 1020 

TCTGGAGCTG AAGCGTGCTC ACAAAAAGAA ATACATGGAA CTTGGAATCA GCCCAACGAT 1080 

CAAGAAAATG CCTGGTGGAA TGATCCACTT CTCAGGATTT GATAAOGATA GGGCTGATCA 1140 

GTTGGTCTAT OGGTTTTGCA GTGTGTCYGA CGAATCCAAT GATCTCCTTT ACCAATAYTG 1200 

CGATGCTGAG TCGGGCTCCA CCGGTTOGGG GGTCTATCTG CGTCTGAAAG ATCCAGACAA 1260 

AAAGAATTGG AAGCGCAAAA TCATTGCGGT CTACTCAGGG CACCAGTGGG TGGATGTCCA 1320 

CGGGGTTCAG AAGGACTACA ACGTTGCTGT TCGCATCACT CCCCTAAAAT ACGCCCAGAT 1380 

TTGCCTCTGG ATTCACGGGA ACGATGCCAA TTGTGCTTAC GGCTAACAGA GACCTGAAAC 1440 

AGGGCGGTGT ATCATCTAAA TCACAGAGAA AACCAGCTCT GCTTACCGTA GTGAGATCAC 1500 

TTCATAGGTT ATGCCTGGAC TTGAACTCTG TCAATAGCAT TTCWACATTT TTCAAAATCA 1560 

GGAGATTTTC GTCCATTTAA AAAATGTATA GGTGCAGATA TTGAAACTAG GTGGGCACTT 1620 

CAATGCCAAG TATATACTCT TCTTTACATG GTGATGAGTT TCATTTGTAG AAAAATTTTG 1680 

TTGCCTTCTT AAAAATTAGA CACACTTTAA ACCTTCAAAC AGGTATTATA AATAACATGT 1740 

GACTCCTTAA TGGACTTATT CTCAGGGTCC TACTCTAAGA AGAATCTAAT AGGATGCTGG 1800 

TTGTGTATTA AATGTGAAAT YGCATAGATA AAGGTAGATG GTAAAGCAAT TAGTATCAGA 1860 

ATAGAGACAG AAAGTTACAA CACAGTTTGT ACTACTCTGA GATGGAYCCA T TCAG CTCAT 1920 

GCCCTCAATG TTTATATTGT GTTATCTGTT GGGTCTGGGA CATTTAGTTT AGTTTTTTTG 1980 

AAGAATTACA AATCAGAAGA AAAAGCAAGC ATTATAAACA AAACTAATAA CTGTTTTACT 2040 

GCTTTAAGAA ATAACAATTA CAATGTGTAT TATTTAAAAA TGGGAGAAAT AGTTTGTTCT 2100 

ATGAAATAAA CCTAGTTTAG AAATAGGGAA GCTGAGACAT TTTAAGATCT CAAGTTTTTA 2160 

TTTAACTAAT ACTCAAAATA TGGACTTTTC ATGTATGCAT AGGGAAGACA CTTCACAAAT 2220 

TATGAATGAT CATGTGTTGA AAGCCACATT ATTTTATGCT ATACATTCTA TGTATGAGGT 2280 

GCTACATTTT TAGGACAAAG AATTCTGTAA TCTTTTTCAA GAAAGAGTCT TTT TCTCC TT 2340 

GACAAAATCC AGCTTTTGTA TGAGGACTAT AGGGTGAATT CTCTGATTAG TAATTTTAGA 2400 

TATGTCCTTT CCTAAAAATG AATAAAATTT ATGAATATGA CTTAAAAAAA AAARWCGACG 2460 

CGGCCGCGAA TTTAGTAGTA GTAGTCGACC CGGGAATTCC GGACCGGTAC CTGCAGGCGT 2520 
ACCAGC 

Seq ID NO* B10 Protein sequence 
Protein Accession 8: CAC35071.1 



1 11 

I I 
MENMLLWLIF FTPGWTLIDG 
VCGIECQKEli PTPSLSELED 
KRQVYGTDSR PSILDKRPLT 
SKKLRVGLLK MRNKSGGKKR 
RIAEGRPSFQ • WTRVKNTHI P 
KMPGGMIHFS GFDNDRADQL 
NWKRKIIAVY SGHQWVDVHG 



21 31 

I I 
SEMEWDFMWH LRKVPRIVSE 
YLSYETVFEN GTRTLTRVKV 
NFPPSTAVKL STGCSGILI8 
RGSKRSRREA SGGDQREGTR 
KGWARGGMGD ATLDYDYALL 
VYRFCSVSDE SNDLLYQYCD 
VQKDYNVAVR ITPLKYAQIC 



41 
I 

RTPHLTSPAP 
QDLVLEPTQN 
PQHVLTAAHC 
EHLRERAKGG 
ELKRAHKKKY 
ABSGSTGSGV 
LWIHGNDANC 



51 

I 

EADAKMMVNT 
ITTKGVSVRR 
VHDGKDYVKG 
RRRKKSGRGQ 
MELGISPTIK 
YLRLKDPDKK 
AYG 



60 
120 
180 
240 
300 
360 



Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession ft* NM_002035.1 

Coding sequence: 108.. 1106 



1 
I 

AGGCGCCCGC 
CCGGCGGCCC 
CTGCOGCCTT 
CCAAGCCCCT 
GGAAGTGCAT 
ATGAGGATAA 
AGGTGGTGCT 
TAAAACAAGC 
CAGTGTCAGG 
ATTACCTGGG 
TGGGCAGGAT 
CCTACTCTGC 
AGCCATATAA 
CCGAAGAAAA 
GCAAACCAGA 



11 
I 

CCGCCGCGCG 
OCGCCCGGCC 
CCTCGTGGCC 
CGCCCTGCCC 
TGCTATCGAG 
GCTGCTGCAG 
TTGCATATCA 
ACAGGAGAAA 
AAAATTTGAA 
CAGCGTGTAC 
CGTGTTTGTG 
ATCCAAGTTT 
TGTCTACATC 
CAGAACAAAG 
ACAGGTGGCC 



21 
I 

TGATTCTCGC 
GCTCCGGGCC 
TTCGTGCTGC 
GGGGCGCATG 
TGCTATAAAC 
GCAAAGAAAG 
GTTGATGTAT 
CTGGGTCCAG 
GATCTTGAAG 
CCCAGCCGGG 
TCCTCCCAGG 
GCCATAAGGG 
ACAGTTGCTT 
CCTTTGGAGA 
AAACAAATTG 



31 
I 

CTCGCCGCAG 
CCTGGCCCCG 
TGCTGTACAT 
TGGTGGTTAC 
AAGGAGCTTT 
AAATTGAAAT 
CTCAAGACTA 
TGGACATGCT 
TTAGTACCTT 
CCGTGATCAC 
CAGGACAGTT 
GATTGGCAGA 
ACCCACCAGA 
CTCGACTTAT 
TTAAAGATGC 



41 

I 

CCCAGCCCTG 
CGGAGCGATG 
GGTGTCTCCG 
AGGAGGTTCC 
TATAACTCTG 
GCACTCTATT 
TAACCAAGTA 
GGTAAATTGT 
TGAAAGGTTA 
CACCATGAAG 
GGGATTATTC 
AGCTTTGCAG 
CACAGACACA 
TTCAGAGACC 
CATACAAGGA 



51 
1 

CGCGCCTTGC 
CTGCTGCTGG 
CTCATCAGCC 
AGTGGCATCG 
GTTGCACGAA 
AATGACAAAC 
GAGAATGTCA 
GCAGGAATGG 
ATGAGCATCA 
GAGCGCCGGG 
GGTTTCACAG 
ATGGAGGTGA 
CCTGGCTTTG 
ACATCTGTGT 
AATTTCAACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 



801 



WO 03/042661 



PCT/US02/36810 



GTTCCCTTGG CTCAGATGGG TACATGCTCT CGGCCCTGAC CTGTGGGATG GCTCCAGTAA 960 

CTTCTATTAC TGAGGGGCTC CAGCAGGTGG TCACCATGGG CCTTTTCCGC ACTATTGCTT 1020 

TGTTTTACCT TGGAAGTTTT GACAGCATAG TTOGTCGCTG CATGATGCAG AGAGAAAAAT 1080 

CTGAAAATGC A6ACAAAACT GCCTAATCTT CTTACCCCTT GGAAGAA6AC TGTTTCCAAA 1140 

5 TAATTTGAAC AGCTTGCTGC TAAATGGGAC CCAATTTTTG GCCTATAGAC ACTTATGTAT 1200 

TGTTTTCGAA TACGTCAGAT TGGACCAGTG CTCTTCAGGA ATGTGGCTGC AAGCAAGGGG 1260 

CTAGAAGTTC ACCTCCTGAC AGTATTATTA ATACTATGCA AATATGGAAT AGGAGACCAT 1320 

TTGATTTTCT AGGCTTTGTG GTAGAGAGGT GAAGGTATGA GAATTAATAG OGTGTGAA CA 1380 

AAGTAAAGAA CAGGATTCCA GAATGATCAT TAAATTTGTT TCTATTTATT CTTTTTTGCC 1440 

10 CCCCTAGAGA TTAAGTCCAG AAATGTACTT TCTGGCACAT AAAGAAATCT TGAGGACTTT 1500 

GTTTAAACCT TCCATAAAAA AACAATTTTC GGTTTCTCGG GTTCTCTCTC TCTGTCTCTC 15 SO 

TGTCTCTCTG TCTCTCTGTC TCTCTGTCTC TCTCTCTCTC TCTCTTTCTT TCTTTGTGTA 1620 

TTTTATTCAA GATGAGTTGG ACCCATTGCC AGTGAGTCTG AATGTCACTG ACA GCCCTG T 16B0 

GTTGTGCTCA GGACTCACTC TGCTGCTGGT GGAAACTCAT GGCTTCTCTC TCTCTT TGAT 1740 

15 CCCATAAAGC TACGAGGGGG ACGGGAGAGG GCAGTGCAAT GGGAAGTAAA GAGATATTTT 1800 

CCAGTAGGAA AAGCAATGCT TTCTTGTCTT TAGACTCAAA TGCTTAGGGA ACGTTTCATT I860 

TCTCATTCAT GGGGAAAGGC AGCCTCCTTA AATGTTTTCT GAAGAGCGGT AAAATCTAGA 1920 

AGCTTAAGAA TTTACAGTTC CTTCAATAAC CATGATGACC TGAAGTTCAC CTATCCCATT 1980 

TTAGCATCTA CTTGTTTTTC CCATCTCTTC CTTTCCAATT TTGCTTATAC TGCTGTAATA 2040 

20 TTTTTGTAAA AAAAAAAAAA AAGGAAAAAA AAGACCAGCT AAAATTTTCG ACTTGACTTT 2100 

TTAACTTAAC TCATGAATTA ATTAAAGCAA ATGAAAAAAT TAAAAAGTGT GACTTTTTCT 2160 

CGGAGCATAT ATGTAGCTTT TAGGAAAGGC TGATGATGGT ATAAAGTTTG CTCATTAAGA 2220 
AAAAAAGACA AGGCTGATTT TGAAGAGAGT TGCTTTTGAA ATAAAATGAT CA 

25 Seq ID NO: B12 Protein sequence 
Protein Accession 8: NP_002026.1 

1 11 21 31 41 51 

30 MLLLAAAPLV AFVLLLYMVS PLISPKPLAL PGAHWVTGG SSGIGKCIAI ECYKQGAPIT 
LVARNEDKLL QAKKEIEMHS INDKQWLCI SVDVSQDYNQ VENVIKQAQE KLGPVDMLVN 
CAGMAVSGKP EDLEVSTFER LMSINYLGSV YPSRAVITTM KERRVGRIVF VSSQAGQLGL 
FGFTAYSASK FAIRGLAEAL QMEVKFYNVY ITVAYPPDTD TPGFAEENRT KPLETRLISE 
TTSVCKPEQV AKQZVKDAIQ GNFNSSLGSD GYMLSALTCG MAPVTSITEG LQQWTMGLF 

35 RTIALFYLGS FDSIVRRCKM QREKSENADK TA 

Seq ID NO: B13 DNA sequence 
Nucleic Acid Accession 8: CAT Cluster 

40 1 11 21 31 41 51 

I I I I I I 

CTTGGGATGC ATTATATATT GCATTATATT TGCCGGTAAA ACTCGGTTAA TTTTTAAAAT 
CGGCAAAATA TTGGTGCCTT TCCCGAAATT TGGTCCCGGG CTCCCTTATA GGATAATTGG 
TTTGGATTGG TTAAGTCCAA TTATTAAATG CTGCGGTTTC AAATTCCCAG CTGGAAGGAC 

45 CACCCATTTA AAAACTTCAG AAGGCAGGAT CCTGCTCAAT TTATAAGGCT TTGGAAAATT 
CCAGGCATTG GTTTGACATA TTTCCAGAGC TCAAATCTGC AGTGTTCCAC ATGCACATAC 
AAGATCCAGA GTCTCATGTT AAAATCACTT ACATACCCAG AAAGACCACC ACTTTGCAGG 
TATAATATTG TACTTAAAGA CAGAGAGGAA GTGTTTCTTA ATCCAAACAC ATGTACACCA 
AAGAACACAT AAGATGCCTT CTTCCATCAA ATGCACTTGC TTGTGAATTA ATGGACTTGT 

50 AAATGAAACA ATGCAATCAG TCTTTTATAA TGCACTGTTC AATTTGAGAT TCAAGTATTT 
CTATTTCTTG GAAAAAATTT TAAGAATCAA AAATAAAGAA AATAAAAAGT GCATACAGTT 
AAACATTCCA AAAAAAAAAA AA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



55 Seq ID NO: B14 DNA sequence 

Nucleic Acid Accession ft: XM_086767.1 
Coding sequence: 276.. 611 

1 11 21 31 41 51 

60 | | j I I I 

CTTGTTCATG GCATCTTTAG AAACAAACTG CAATTTTATT TCATTTCCTT GTCGTTCATA 60 

CAAAGATTAC AAGACTAGCT TATGTGTGGA CTGTGACTGT TTTAAGGAAA AATCATGTCC 120 

TCGGCTGGGT TATCAAGCCA AGCTATTTAA AGGTGTTTTA AAAGAAAGGA TGGAAGGAAG 180 

ACCTCTTAGG ACCACTGTGT TTTTGGATAC AAGTGGTACA TATCCATTCT GTAG CCTATT 240 

65 ATTTTGTTCT CAGTATAATT GTTCCAGATA AAACTATGAT GGATGGCTCG TTTTCATTTA 300 

AATTATTAAA TCAGCTTGGA ATGATTGAAG AGCCAAGGCT TTATGAAAAG AACAAACCAT 360 

TTTATAAACT TCAAGAAGTC AAGATTCTTG CTCAATTTTA TAATGACTTT GTAAATATTT 420 

CAAGCATTGG TTTGACATAT TTCCAGAGCT CAAATCTGCA GTGTTCCACA TGCACATACA 480 

AGATCCAGAG ACTCATGTTA AAATCACTTA CATACCCAGA AAGACCACCA CTTTGCAGGT 540 

70 ATAATATTGT ACTTAAAGAC AGAGAGGAAG TGTTTCTTAA TCCAAACACA TGTACACCAA 600 

AGAACACATA AGATGCCTTC TTCCATCAAA TGCACTTGCT TGTGAATTAA TGGACTTGTA 660 

AATGAAACAA TGCAATCAGT CTTTTATAAT GCACTGTTCA ATTTGAGATT CAAGTATTTC 720 

TATTTCTTGG AAAAAATTTT AAGAATCAAA AATAAAGAAA ATAAAAAATG CATACAGTTA 780 
AACATTCC 



75 



Seq ID NO: B15 Protein sequence 
Protein Accession #: XP_086767.1 



ort 1 11 21 31 41 51 

80 | | | I I I 

MMDGSFSFKL LNQbGMIEBP RLYEKNKPFY KLQEVKILAQ FYNDPVNISS IGLTYFQSSN 
liQCSTCTYKI QRLMLKSLTY PERPPLCRYN IVLKDREEVF LNPNTCTPKN T 

Seq ID NO: B16 DNA sequence 

802 



WO 03/042661 
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Nucleic Acid Accession #: CAT cluster 

1 ll 21 31 41 51 

c I 1 1 ! I I 

J viTrrnrrr rrrmTm ttaaatgcaa aggagcaaca aaatttattg actgaattaa 60 

ACACAACAGT AAAATGGCAG TGTTGTAATT TCATTTTCAG ATGTTTGAAT GGAACAAGAA 120 

AAGTGCTATT AGCCCAAGCT TCTTACATTC ATTAAAAGAG TGACTATCAA AAACAGCAAC 180 

ATGCACAATG GTACATATGC ACAAAATGGA ATTATATCAA CAAATATACA AAATACCCAA 240 

AATAAAATAT TTACAGGTTT AAAAATATAA ACATTGATTC CTCTATCCCA TTAAAC CATT 300 

10 GGAGTGGAGA AAGGAGGAAA GACCCTATTG CTATTTAGAA TCCTTTTTAA AACAAGTTTT 360 

TAAAACATAG AATTAGTTCT AGGAGACAAT TTTTGATGTT TTTCAGGGGT TTAACATTCT 420 

ATTATAAAAA TAACATCTAT AAACCTACTA ACAATTTTCC TCCTGTGCAC AAAAATAATA 480 

CTGCCAAAAC CTGTCCTCAA AGACATGCCT GACTTTCAGG AAAGCTAATT ATGGAAATGG 540 

AGTTTCTCGT TTGGGTTATC TTTGTTACTA TTTTCAAATA ACCAGCAACT CCCTATATTA 600 

15 CACTGAGATA CTTTATATAA ATAACGTGGG CGAAACCTGA AGTTCACAAT GAGCCTGCTA 660 

GGTAGCTGGT GTCAAGTACA AATGATAGGA ATTGACTTTG CCAACAACAT CAAAAGCATT 720 

TTCCCTGATA TTCCTGATAG ACCTACCACT ATCAGATCCT CCATATTCAA TAAGATTTAT 780 

CCTGGAAGCA ATGAAAATGT TAAATATTAC TTTGCTAGAG TTTCTCCTCC TTTATTTAGA 640 

AATAAATGTG TAGTGGGGAC CAGTGGTTGT AATGTAGATA CTTGAGAAGT TTCATTGATT 900 
CCTTCAGAOC CATGCCG 



20 
25 



Seq ID NO: B17 DNA sequence 

Nucleic Acid Accession #: NM_022898.1 

Coding sequence: 268. .2739 



1 11 21 31 41 51 

I I I I I I 

TTTTCTTGCT TTTCTTCCCT TTTTTTTCTT TTTGCAAACA AAACAAAAAA CAGCATAGAA 60 

GAAAGAGCAA AATAAAGAAG AAGAAGAGGA GGAAGAGAGG GAAAGAGAGG AAGGGAAAAA 120 

30 AAACACCAAC CCGGGCAGAG GAGGAGGTGC GGOGGCGGCG GCGGCGGCGG CAGCGGCGGC 180 

AGCGGCGCGG CGGCGGCTCG GACCCCCTCC CCCGGCTCCC CCCATCAGTG CAGCTCTCCG 240 

GGCGATGCCA GAATAGATGC CGGGGCAATG TCCCGCCGCA AACAGGGCAA CCCGCAGCAC 300 

TTGTCCCAGA GGGAGCTCAT CACCCCAGAG GCTGACCATG TGGAGGCCGC CATCCTCGAA 360 

GAAGACGAGG GTCTGGAGAT AGAGGAGCCA AGTGGCCTGG GGCTGATGGT GGGTGGCCCC 420 

35 GACCCTGACC TGCTCACCTG TGGCCAGTGT CAAATGAACT TCCCCTTGGG GGACATCCTG 480 

GTTTTTATAG AGCACAAAAG GAAGCAGTGT GGCGGCAGCT TGGGTGCCTG CTATGACAAG 540 

GCCCTGGACA AGGACAGCCC GCCACCCTCC TCACGCTCCG AGCTCAGGAA AGTGTCCGAG 600 

CCGGTGGAGA TCGGGATCCA AGTCACCCCC GACGAAGATG ACCACCTGCT CTCACCCACG 660 

AAAGGCATCT GTCCCAAGCA GGAGAACATT GCAGGTAAAG ATGAGCCTTC CAGCTACATT 720 

40 TGCACAACAT GCAAGCAGCC CTTCAACAGC GCGTGGTTCC TGCTGCAGCA CGCGCAGAAC 780 

ACGCACGGCT TCCGCATCTA CCTGGAGCCC GGGCCGGCCA GCAGCTCGCT CACGCCGCGG 840 

CTCACCATCC CGCCGCCGCT CGGGCCGGAG GCCGTGGOGC AGTCCCCGCT CATGAATTTC 900 

CTGGGCGACA GCAACCCCTT CAACCTGCTG CGCATGACGG GCCCCATCCT GCGGGACCAC 960 

CCGGGCTTCG GCGAGGGCCG CCTGCCGGGC ACGCCGCCTC TCTTCAGTCC CCCGCCGCGC 1020 

45 CACCACCTGG ACCCGCACOG CCTCAGTGCC GAGGAGATGG GGCTCGTCGC CCAGCACCCC 1080 

AGTGCCTTCG ACCGAGTCAT GOGCCTGAAC CCCATGGCCA TCGACTCGCC CGCCATGGAC 1140 

TTCTCGCGGC GGCTCCGCGA GCTGGCGGGC AACAGCTCCA CGCCGCCGCC CGTGTCCCCG 1200 

GGCCGCGGCA ACCCTATGCA CCGGCTCCTG AACCCCTPCC AGCCCAGCCC CAAGTCCCCG 1260 

' TTCCTGAGCA CGCCGCCGCT GCOGCCCATG CCCCCTGGCG GCACGCCGCC CCCGCAGCCG 1320 

50 CCAGCCAAGA GCAAGTCGTG CGAGTTCTGC GGCAAGACCT TCAAGTTCCA GAGCAATCTC 1380 

ATCGTGCACC GGCGCAGTCA CACGGGCGAG AAGCCCTACA AGTGCCAGCT GTGCGACCAC 1440 

GCGTGCTCGC AGGCCAGCAA GCTCAAGCGC CACATGAAGA CGCACATGCA CAAGGCCGGC 1500 

TCGCTGGCCG GCCGCTCCGA CGACGGGCTC TCGGCCGCCA GCTCCCCCGA GCCCGGCACC 1560 

AGCGAGCTGG CGGGCGAGGG OCTCAAGGCG GCCGACGGTG ACTTCCGCCA CCACGAGAGC 1620 

55 GACCCGTCGC TGGGOCACGA GOCGGAGGAG GAGGACGAGG AGGAGGAGGA GGAGGAGGAG 1680 

GAGCTGCTAC TGGAGAACGA GAGCOGGCCC GAGTCGAGCT TCAGCATGGA CTCGGAGCTG 1740 

AGCCGCAACC GCGAGAACGG CGGTGGTGGG GTGCCCGGGG TCCCGGGCGC GGGGGGCGGC 1800 

GCGGCCAAGG CGCTGGCTGA CGAGAAGGCG CTGGTGCTGG GCAAGGTCAT GGAGAACGTG 1860 

GGCCTAGGCG CACTGCCGCA GTACGGCGAG CTCCTGGCCG ACAAGCAGAA GCGCGGCGCC 1920 

60 TTCCTGAAGC GTGCGGCGGG CGGCGGGGAC GCGGGCGACG ACGACGACGC GGGCGGCTGC 1980 

GGGGACGCGG GCGCGGGCGG CGCGGTCAAC GGGCGCGGGG GCGGCTTCGC GCCAGGCACC 2040 

GAGCCCTTCC CCGGGCTCTT CCCGCGCAAG CCCGCGCCGC TGCCCAGCCC CGGGCTCAAC 2100 

AGCGCCGCCA AGCGCATCAA GGTGGAGAAG GACCTGGAGC TGCCGCCCGC CGCGCTCATC 2160 

CCGTCCGAGA ACGTGTACTC GCAGTGGCTG GTGGGCTACG CGGCGTCGCG GCACTTCATG 2220 

65 AAGGACCCCT TCCTGGGCTT CACGGACGCA CGACAGTCGC CCTTCGCCAC GTCGTCCGAG 2280 

CACTCGTCCG AGAACGGCAG CCTGCGCTTC TCCACGCCGC COGGGGACCT GCTGGACGGC 2340 

GGCCTCTCGG GCCGCAGCGG CACGGCCAGC GGAGGCAGCA CCCCGCACCT GGGCGGCCCG 2400 

GGCCCCGGGC GGCCCAGCTC CAAGGAGGGC CGCCGCAGCG ACACGTGGGA GTACTGCGGC 2460 

AAGGTGTTCA AGAACTGCAG CAACTTGACG GTGCACCGGC GGAGCCACAC CGGCGAGCGG 2S20 

70 CCTTACAAGT GCGAGCTGTG CAACTACGCG TGCGCGCAGA GCAGCAAGCT CACGCGCCAC 2580 

ATGAAGACGC ACGGGCAGAT CGGCAAGGAG GTGTACOGCT GCGACATCTG CCAGATGCCC 2640 

TTCAGCGTCT ACAGCAOCCT GGAGAAACAC ATGAAAAAGT GGCACGGCGA GCACTTGCTG 2700 

ACTAACGACG TCAAAATCGA GCAGGCCGAG AGGAGCTAAG CGCGCGGGCC CCGGCGCCCC 2760 

GCACCTGTAC AGTGGAACCG TTGCCAACCG AGAGAATGCT GACCTGACTT GCCTCCGTGT 2820 

75 CACCGCCACC CCGCACCCOG CGTGTCCCCG GGGCCCAGGG GAGGCGGCAC TCCAACCTAA 2880 

CCTGTGTCTG CGAAGTCCTA TGGAAACCCG AGGGTTGATT AAGGCAGTAC AAATTGTGGA 2940 

GCCTTTTAAC TGTGCAATAA TTTCTGTATT TATTGGGTTT TGTAATTTTT TTGGCATGTG 3000 

CAGGTACTTT TTATTATTAT r r ri TLT fl T T TGAATTCCTT TAAGAGATTT TGTTGGGTAT 3060 

CCATCCCTTC § mW L Vm TTTTAACCCG GTAGTAGCCT GAGCAATGAC TCGCA AGCAA 3120 

80 TGTTAGAGGG GAAGCATATC TTTTAAATTA TAATTTGGGG GGAGGGGTGG TGCTGCTTTT 3180 

TTGAAATTTA AGCTAAGCAT GTGTAATTTC TTGTGAAGAA GCCAACACTC AAATGACTTT 3240 

TAAAGTTGTT TACTTTTTCA TTCCTTCCTT TTTTTTGTCC TGAAATAAAA AGTGGCATGC 3300 

AGTTTTTTTT TTAATTATTT TTTAATTTTT TTTTTGGTTT TTOTTTTTGG GGTGGGGGGT 3360 

GTGGATGTAC AGCGGATAAC AATCTTTCAA GTCGTAGCAC TTTGTTTCAG AACTGGAATG 3420 

803 
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GAGATGTAGC ACTCATGTCG TCCCGAGTCA AGCGGCCTTT TCTGTGTTGA TTTCGGCTTT 3480 
CATATTACAT AAGGGAAACC TTGAGTGGTG GTGCTGGGGG AGGCACCCCA CAGACTCAGC 3540 
GCCGCCAGAG ATAGGGTTTT TGGAGGGCTC CTCTGGGAAA TGGCCCGACA GCATTCTGAG 3600 
GTTGTGCATG ACCAGCAGAT ACTATCCTGT TGGTGTGCCC TGGGGTGCCA TGGCTGCTAT 3660 
TCGCTGTAGA TTAGGCTACA TAAAATGGGC TGAGGGTACC TTTTTGGGGA GATGG GGTG G 3720 
CCTGCAGTGA CACAGAAAGG AAGAAACTAG OGGTGTTCTT TTAGGCGTTT TCTGGCTTGA 3780 
OGGCTTCTCT CTTTTTTTAA ATCACCCCCA CCACATAAAT CTCAAATCCT ATGTTGCTAC 3840 
AAGGGGTCAT CCATCATTTC CCAAGCAGAC GAATGCCCTA ATTAATTGAA GTTAGTGTTC 3900 
TCTCATTTAA TGCACACTGA TGATATTGTA GGGATGGGTG GGGTGGGGAT CTTGCAAATT 3960 
TCTATTCTCT TTTACTGAAA AAGCAGGGGA TGAGTTCCAT CAGAAGGTGC CCAGCGCTAC 4020 
TTCCCAGGTT TTTATTTTTT TTTTCCTATC TCATTAGGTT GGAAGGTACT AAAT ATTGAA 4080 
CTGTTAAGAT TAGACATTTG AATTCTGTTG ACCCGCACTT TAAAGCTTTT GTTTGCATTT 4140 
AAATTAAATG GCTTCTAAAC AAGAAATTGC AGCATATTCT TCTCTTTGGC CCAGAGGTGG 4200 
GTTAAACTGT AAGGGACAGC TGAGATTGAG TGTCAGTATT GCTAAGCGTG GCATTCACAA 4260 
TACTGGCACT ATAAAGAACA AAATAAAATA ATAATTTATA GGACAGTTTT TCTACTGCCA 4320 
TTCAATTTGA TGTGAGTGCC TTGAAAACTG ATCTTCCTAT TTGAGTCTCT TGAGACAAAT 43 BO 
GCAAAACTTT TTTTTTGAAA TGAAAAGACT TTTTAAAAAA GTAAAACAAG AAAAGTACAT 4440 
TCTTTAGAAA CTAACAAAGC CACATTTACT TTAAGTAAAA AAAAAAAAAA TTCTGGTTGA 4S00 
AGATAGAGGA TATGAAATGC CATAAGACCC AATCAAATGA AGAAATAAAC CCAGCACAAC 4560 
CTTGGACATC CATTAGCTGA ATTATCCTCA GCCCCTTTTG TTTTTGGGAC AACGCTGCTT 4620 
AGATATGGAG TGGAGGTGAT TTACTGCTGA ATTAAAACTC AAGTGACACA AGTTACAAGT 4680 
TGATATCGTT GAATGAAAAG CAAAACAAAA ACAATTCAGG AACAACGGCT AATTTTTTCT 4740 
AAAGTTAAAT TTAGTGCACT CTGTCTTAAA AATACGTTTA CAGTATTGGG TACATACAAG 4800 
GGTAAAAAAA AAATTGTGTG TATGTGTGTT GGAGCGATCT TTTTTTTTCA AAGTTTGCTT 4860 
AATAGGTTAT ACAAAAATGC CACAGTGGCC GCGTGTATAT TGTTTTCTTT TGGTGACGGG 4920 
GTTTTAGTAT ATATTATATA TATTAAAATT TCTTGATTAC TGTAAAAGTG GACCAGTATT 4980 
TGTAATAATC GAGAATGCCT GGGCATTTTA CAAAACAAGA AAAAAAATAC CCTTTTCTTT 5040 
TCCTTGAAAA TGTTGCAGTA AAATTTAAAT GGTGGGTCTA TAAATTTGTT C TTGTT ACAG 5100 
TAACTGTAAA GTCGGAGTTT TAGTAAATTT TTTTCTGCCT TGGGTGTTGA ATTTTTATTT 5160 
CAAAAAAAAT GTATAGAAAC TTGTATTTGG GGATTCAAAG GGGATTGCTA CACCATGTAG 5220 
AAAAAGTATG TAGAAAAAAA GTGCTTAATA TTGTTATTGC TTTGCAGAAA AAAAAAAAAT 5280 
CACATTTCTG ACCTGTACTT ATTTTTCTCT TCCCGCCTCC CTCTGGAATG GATATATTGG 5340 
TTGGTTCATA TGATGTAGGC ACTTGCTGTA TTTTTACTGG AGCTCGTAAT TTTTTAACTG 5400 
TAAGCTTGTC CTTTTAAAGG GATTTAATGT ACCTTTTTGT TAGTGAATTT GGAAATAAAA 5460 
AGAAAAAAAA AACAAAAACA AACAGGCTGC CATAATATAT TTTTTTAATT TGGCAGGATA 5520 
AAATATTGCA AAAAAAACAC ATTTGTATGT TAAGTCCTAT TGTACAGGAG AAAAAGGGTT 5580 
GTTTGACAAC CTTTGAGAAA AAGAAACAAA AGGAAGTAGT TAAATGCTTT GGTTCACAAA 5640 
TCATTTAGTT GTATATATTT TTTGTCGGAA TTGGCCTACA CAGAGAACCG TTCGTGTTGG 5700 
GCTTCTCTCT GAACGCCCCG AACCTTGCAT CAAGGCTCCT TGGTGTGGCC ACAGCAGACC 5760 
AGATGGGAAA TTATTTGTGT TGAGTGGAAA AAAATCAGTT TTTGTAAAGA TGTCAGTAAC 5820 
ATTCCACATC GTCCTCCCTT TCTCTAAGAG GCCATCTCTA AGATGTCAGA TGTAGAGGAG 5880 
AGAGAGCGAG AGAACATCTT CCTTCTCTAC CATCACTCCT GTGGCGGTCA CCACCACCAC 5940 
CTCTCCCGCC CTTACCAGCA GAAAGCAATG CAAACTGAGC TGCTTTAGTC CTTGAGAAAT 6000 
TGTGAAACAA ACACAAATAT CATAAAAGGA GCTGGTGATT CAGCTGGGTC CAGGTGAAGT 6060 
GACCTGCTGT TGAGACCGGT ACAAATTGGA TTTCAGGAAG GAGACTCCAT CACAGCCAGG 6120 
ACCTTTCGTG CCATGGAGAG TGTTGGCCTC TTGTCTTTCT TCCCTGCTTT G CTGC TTTGC 6180 
TCTCTGAAAC CTACATTCCG TCAGTTTCCG AATGCGAGGG CCTGGGATGA ATTTGGTGCC 6240 
TTTCCATATC TCGTTCTCTC TCCTTCCCCT GCGTTTCCTC TCCATCCTTC ATCCTCCATT 6300 
GGTCCTTTTT TTTTCTTTCA TTTTTTATTT AATTTCTTTT CTTCCTGTCT GTTCCTCCCC 6360 
TAATCCTCTA TTTTATTTTT ATTTTTTGTA AAGCCAAGTA GCTTTAAGAT AAAGTGGTGG 6420 
TCTTTTGGAT GAGGGAATAA TGCATTTTTA AATAAAATAC CAATATCAGG AAGCCATTTT 6480 
TTATTTCAGG AAATGTAAGA AACCATTATT TCAGGTTATG AAAGTATAAC CAAGCATCCT 6540 
TTTGGGCAAT TCCTTACCAA ATGCAGAAGC TTTTCTGTTC GATGCACTCT TTCCTCCTTG 6600 
CCACTTACCT TTGCAAAGTT AAAAAAAAGG GGGGAGGGAA TGGGAGAGAA AGCTGAGATT 6660 
TCAGTTTCCT ACTGCAGTTT CCTACCTGCA GATCCAGGGG CTGCTGTTGC CTTTGGATGC 6720 
CCCACTGAGG TCCTAGAGTG CCTCCAGGGT GGTCTTCCTG TAGTCATAAC AGCTAGCCAG 6780 
TGCTCACCAG CTTACCAGAT TGCCAGGACT AAGCCATCCC AAAGCACAAG CATTGTGTGT 6840 
CTCTGTGACT GCAGAGAAGA GAGAATTTTG CTTCTGTTTT GTGTTTAAAA AACCAACACG 6900 
GAAGCAGATG ATCCCGAGAG AGAGGCCTCT AGCATGGGTG ACCCAGCGGA CCTCAGGCCG 6960 
GTTTCCGCAC TGCCACAACT TTGTTCAAAG TTGCCCCCAA TTGGAACCTG CCACTTGGCA 7020 
TTAGAGGGTC TTTCATGGGG AGAGAAGGAG ACTGAATTAC TCTAAGCAAA ATGTGAAAAG 7080 
TAAGGAAATC AGCCTTTCAT CCCGGTCCTA AGTAACCGTC AGCCGAAGGT CTCGTGGAAC 7140 
ACAGGCAAAC COGTGATTTT GGTGCTCCTT GTAACTCAGC CCTGCAAAGC AAAGTCCCAT 7200 
TGATTTAAGT TGTTTGCATT TGTACTGGCA AGGCAAAATA TTTTTATTAC CTTTTCTATT 7260 
ACTTATTGTA TGAGCTTTTG TTGTTTACTT GGAGGTTTTG TCTTTTACTA CAAGTT TGGA 7320 
ACTATTTATT ATTGCTTGGT ATTTGTGCTC TGTTTAAGAA ACAGGCACTT TTTTTTATTA 7380 
TGGATAAAAT GTTGAGATGA CAGGAGGTCA TTTCAATATG GCTTAGTAAA ATATTTATTG 7440 
TTCCTTTATT CTCTGTACAA GATTTTGGGC CTCTTTTTTT CCTTAATGTC ACAATGTTGA 7500 
GTTCAGCATG TGTCTGCCAT TTCATTTGTA CGCTTGTTCA AAACCAAGTT TGTTCTGGTT 7560 
TCAAGTTATA AAAATAAATT GGACATTTAA CTTGATCTCC AAA 

Seq ID NO: B18 Protein sequence 
Protein Accession ft: NPJ375049.1 

1 11 21 31 41 51 

I I I I I I 

MSKRKQGNPQ HLSQREL1TP EADHVEAAIL EEDEGLEIKE PSGLGLMVGG PDPDLLTCGQ 60 

CQMNFPLGDI LVFIEHKRKQ CGGSLGACYD KALDKDSPPP SSRSELRKVS EPVEIGIQVT 120 

PDEDDHLLSP TKGICPKQEN IAGKDEPSSY ICTTCKQPFN SAWFIiLQHAQ NTHGFRIYLE 180 

PGPASSSLTP RLTIPPPLGP EAVAQSPLMN PLGDSNPPKL LRMTGPILRD HPGFGEGRLP 240 

GTPPLFSPPP RHHLDPHRLS AEBMGLVAQH PSAFDRVMRL NPMAIDSPAM DPSRRLRELA 300 

GNSSTPPPVS PGRGNPMHRL LNPFQPSPKS PPLSTPPLPP MPPGGTPPPQ PPAKSKSCEP 360 

CGKTPKPQSN LIVHRRSHTG BKPYKOQLCD HACSGASKLK RHKKTHMHKA GSLAGRSDDG 420 

LSAASSPEPG TSBLAGEGLK AADGDFRHHB SDPSLGHBPB EEDEESEBES EELLLENESR 480 
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PESSFSMDSE LSRNRENGGG GVPGVPGAGG GAAKALADHK ALVLGKVMEN VGLGALPQYG 540 

ELLADKQKRG APLKRAAGGG DAGDDDDAGG CGDAGAGGAV NGRGGGFAPG TEPFPGLFPR 600 

KPAPLPSPGL NSAAKRIKVE KDLELPPAAL IPSENVYSQW LVGYAASRHF MKDPFLGPTD 660 

ARQSPPATSS EHSSENGSUR FSTPPGDLLD GGLSGRSGTA SGGSTPHLGG PGPGRPSSKE 720 

5 GRRSDTCBYC GKVPKNCSNL TVHRRSHTGE RPYKCELCNY ACAQSSKLTR HMKTHGQIGK 780 
EVYRCDICQM PFSVYSTLEK HMKKWHGEHL LTNDVKIEQA ERS 

Seq ID NO: B19 DNA sequence 
Nucleic Acid Accession ft: NM_000399.2 
10 Coding sequence: 339.. 1769 

1 11 21 31 41 SI 

I I I I I I 

TAACTGAGCG AGGAGCAATT GATTAATAGC TCGGCGAGGG GACTCACTGA CTGTTATAAT 60 

15 AACACTACAC CAGCAACTCC TGGCTTCCCA GCAGCCGGAA CACAGACAGG AGAGAGTCAG 120 

TGGCAAATAG ACATTTTTCT TATTTCTTAA AAAACAGCAA CTTGTTTGCT ACTTTTATTT 180 

CTGTTGATTT TTTTTTCTTG GTGTGTGTGG TGGTTGTTTT TAAGTGTGGA GGGCAAAAGG 240 

AGATACCATC CCAGGCTCAG TCCAACCCCT CTCCAAAACG GCTTTTCTGA CACTCCAGGT 300 

AGCGAGGGAG TTGGGTCTCC AGGTTGTGCG AGGAGCAAAT GATGACCGCC AAGGCCGTAG 360 

20 ACAAAATCCC AGTAACTCTC AGTGGTTTTG TGCACCAGCT GTCTGACAAC ATCTACCOGG 420 

TGGAGGACCT CGCCGCCACG TCGGTGACCA TCTTTCCCAA TGCCGAACTG GGAGGCCCCT 480 

TTGACCAGAT GAACGGAGTG GCCGGAGATG GCATGATCAA CATTGACATG ACTGGAGAGA 540 

AGAGGTCGTT GGATCTCCCA TATCCCAGCA GCTTTGCTCC CGTCTCTGCA CCTAGAAACC 600 

AGACCTTCAC TTACATGGGC AAGTTCTCCA TTGACCCTCA GTACCCTGGT GCCAGCTGCT 660 

25 ACCCAGAAGG CATAATCAAT ATTGTGAGTG CAGGCATCTT GCAAGGGGTC ACTTCCCCAG 720 

CTTCAACCAC AGCCTCATCC AGCGTCACCT CTGCCTCCCC CAACCCACTG GCCACAGGAC 780 

CCCTGGGTGT GTGCACCATG TCCCAGACCC AGCCTGACCT GGACCACCTG TACTCTCCGC 840 

CACCGCCTCC TCCTCCTTAT TCTGGCTGTG CAGGAGACCT CTACCAGGAC CCTTCTGOGT 900 

TCCTGTCAGC AGCCACCACC TCCACCTCTT CCTCTCTGGC CTACCCACCA CCTCCTTCCT 960 

30 ATCCATCCCC CAAGCCAGCC ACGGACCCAG GTCTCTTCCC AATGATCCCA GACTATCCTG 1020 

GATTCTTTCC ATCTCAGTGC CAGAGAGACC TACATGGTAC AGCTGGCCCA GACCGTAAGC 1080 

CCTTTCCCTG CCCACTGGAC ACCCTGCGGG TGCCCCCTCC ACTCACTCCA CTCTCTACAA 1140 

TCCGTAACTT TACCCTGGGG GGCCCCAGTG CTGGGGTGAC CGGACCAGGG GCCAGTGGAG 1200 

GCAGCGAGGG ACCCCGGCTG CCTGGTAGCA GCTCAGCAGC AGCAGCAGCC GCCGCCGCOG 1260 

35 CCGCCTATAA CCCACACCAC CTGCCACTGC GGCCCATTCT GAGGCCTOGC AAGTACCCCA 1320 

ACAGACCCAG CAAGACGCCG GTGCACGAGA GGCCCTACCC GTGCCCAGCA GAAGGCTGCG 1380 

ACCGGCGGTT CTCCCGCTCT GACGAGCTGA CACGGCACAT CCGAATCCAC ACTGGGCATA 1440 

AGCCCTTCCA GTGTCGGATC TGCATGCGCA ACTTCAGCCG CAGTGACCAC CTCACCACCC 1500 

ATATCCGCAC CCACACCGGT GAGAAGCCCT TCGCCTGTGA CTACTGTGGC CGAAAGTTTG 1560 

40 CCCGGAGTGA TGAGAGGAAG CGCCACACCA AGATCCACCT GAGACAGAAA GAGCGGAAAA 1620 

GCAGTGCCCC CTCTGCATCG GTGCCAGCCC CCTCTACAGC CTCCTGCTCT GGGGGCGTGC 1680 

AGCCTGGGGG TACCCTGTGC AGCAGTAACA GCAGCAGTCT TGGCGGAGGG CCGCTCGCCC 1740 

CTTGCTCCTC TCGGACCCGG ACACCTTGAG ATGAGACTCA GGCTGATACA CCAGCTCCCA 1800 

AAGGTCCOGG AGGCCCTTTG TCCACTGGAG CTGCACAACA AACACTACCA CCCTTTCCTG 1860 

45 TCCCTCTCTC CCTTTGTTGG GCAAAGGGCT TTGGTGGAGC TAGCACTGCC CCCTTTCCAC 1920 

CTAGAAGCAG GTTCTTCCTA AAACTTAGCC CATTCTAGTC TCTCTTAGGT GAGTTGACTA 1980 

TCAACCCAAG GCAAAGGGGA GGCTCAGAAG GAGGTGGTGT GGGGATCCCC TGGCCAAGAG 2040 

GGCTGAGGTC TGACCCTGCT TTAAAGGGTT GTTTGACTAG GTTTTGCTAC CCCACTTCCC 2100 

CTTATTTTGA CCCATCACAG GTTTTTGACC CTGGATGTCA GAGTTGATCT AAGACGTTTT 2160 

50 CTACAATAGG TTGGGAGATG CTGATCCCTT CAAGTGGGGA CAGCAAAAAG ACAAGCAAAA 2220 

CTGATGTGCA CTTTATGGCT TGGGACTGAT TTGGGGGACA TTGTACAGTG AGTGAAGTAT 2280 

AGCCTTTATG CCACACTCTG TGGCCCTAAA ATGGTGAATC AGAGCATATC TAGTTGTCTC 2340 

AACCCTTGAA GCAATATGTA TTATATACTC AGAGAACAGA AGTGCAATGT GATGGGAGGA 2400 

ACGTAGCAAT ATCTGCTCCT TTTCGAGTTG TTTGAGAAAT GTAGGCTATT TTTTCAGTGT 2460 

55 ATATCCACTC AGATTTTGTG TATTTTTGAT GTACCCACAC TGTTCTCTAA ATTCTGAATC 2520 

TTTGGGAAAA AATGTAAAGC ATTTATGATC TCAGAGGTTA ACTTATTTAA GGGGGATGTA 2580 

CATATTCTCT GAAACTAGGA TGCATGCAAT TGTGTTGGAA GTGTCCTTGG TCGCCTTGTG 2640 

TGATGTAGAC AAATGTTACA AGGCTGCATG TAAATGGGTT GCCTTATTAT GGAGAAAAAA 2700 

ATCACTCCCT GAGTTTAGTA TGGCTGTATA TTTATGCCTA TTAATATTTG GAATTTTTTT 2760 

60 TAGAAAGTAT ATTTTTGTAT GCTTTGTTTT GTGACTTAAA AGTGTTAOCT TTGTAGTCAA 2820 

ATTTCAGATA AGAATGTACA TAATGTTACC GGAGCTGATT TGTTTGGTCA TTAGCTCTTA 2880 

ATAGTTGTGA AAAAATAAAT CTATTCTAAC GCAAAACCAC TAACTGAAGT TCAGATATAA 2940 
TGGATGGTTT GTGACTATAG TGTAAATAAA TACTTTTCAA CAAT 

65 Seq ID NO: B20 Protein sequence 
Protein Accession #: NP_000390.2 

1 11 21 31 41 51 

- A I I I I I i 

70 MMTAKAVDKI PVTLSGPVHQ LSDNIYPVED 1AATSVTIFP NAELGGPFDQ MNGVAGDGMI 60 

NIDMTGEKRS LDLPYPSSFA PVSAPRNQTF TYMGKFSIDP QYPGASCYPE G I IN IV SAG I 120 

LQGVTSPAST TASSSVTSAS PNPIATGPLG VCTMSQTQPD LDHLYSPPPP PPPYSGCAGD 180 

LYQDPSAFLS AATTSTSSSL AYPPPPSYPS PKPATDPGLF PMIPDYPGPP PSQOQRDLHG 240 

TAGPDRKPFP CPLDTLRVPP PLTPLSTIRN FTIXSGPSAGV TGPGASGGSE GPRLPGSSSA 300 

75 AAAAAAAAAY NPHHLPLRPI LRPRKYPNRP SKTPVHERPY PCPAEGCDRR FSRSDELTRH 360 

IRIHTGHKPF QCRICMRNFS RSDHIiTTHIR THTGEKPFAC DYCGRKFARS DERKRHTKIH 420 
LRQKERKSSA PSASVPAPST ASCSGGVQPG GTLCSSNSSS LGGGPLAPCS SRTRTP 



80 



Seq ID NO: B21 DNA sequence 

Nucleic Acid Accession #i NM_004962.2 

Coding sequence : 457 . . 1893 

1 11 21 31 41 51 

I I I I 1 I 

805 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 
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80 



CACACACGGG CGCACGCACA CGGCAGCCGG GCCAGGGACG ACCCTGTCAG CTGCAGOCCC 60 

AGAGGTCOGG GGCGGGCAGC OGGGTCCCCT CGAGGGCGCA GCCGGCCGCC CCGCCCCGCC 120 

CCTCGAAGCA GCCGGGCCGG GCGCGCAGTG GGCTACAAAC TTTOGCAGOG CGAGTCCGCC 180 

AAGGCAGCGC GCCGACTCGG GCTOGGCTCG GCTCTGCGCT GCTCOGGAOG GCTGTGACCG 240 

CTGGCOGGGG GCTCGGGCCG CCGGTACCCA CGGACCGOGC GCCCGGGTGC CTGCTCOGCT 300 

AAGCCCCTCG CCCCGCGOGG ACCTOGGTAT CCAGOGCCCT GCTGCCCGGG CTCTCCCCGC 360 

GCGCCCTACT GCCGCGAGGT CAGTCCGCAG CCTCCGGTGC GCCAGCGCTC GCCTTCCTCC 420 

TCCTGGACTT CGGCCCTTTG COGCCCTCAC CACGCCATGG CTCATGTCCC CGCTCGGACC 480 

AGCCCGGGAC CCGGGCCCCA GCTGCTGCTG CTGCTGCTGC OGTTGTTTCT GCTGTTGCTC 540 

CGGGATGTGG CCGGCAGCCA CAGGGCCCCC GCCTGGTCCG CACTGCCCGC GGCOGCOGAC 600 

GGCCTGCAGG GGGACAGGGA TCTCCAGCGG CACCCTGGGG ACGCGGCCGC CACGTTGGGC 660 

CCCAGCGCCC AGGACATGGT CGCTGTCCAC ATGCACAGGC TCTATGAGAA GTACAGCCGG 720 

CAGGGCGCGC GGCOGGGAGG GGGCAACACG GTCOGCAGCT TCAGGGCCAG GCTGGAAGTG 780 

GTCGACCAGA AGGCGGTGTA TTTCTTCAAC CTGACTTCCA TGCAAGACTC GGAAATGATC 840 

CTTACGGCCA CTTTCCACTT CTACTCAGAG CCGCCTCGGT GGCCTCGAGC GCTOGAGGTG 900 

CTATGCAAGC OGCGGGCCAA GAACGCTTCA GGCOGCCCGC TGCCCCTGGG CCCGCCCACA 960 

CGCCAGCACC TGCTCTTCCG CAGCCTCTCG CAGAACACGG CCACACAGGG GCTACTCCGC 1020 

GGGGCCATGG CCCTGGCGCC CCCACCGOGC GGCCTGTGGC AGGCCAAGGA CATCTCCCCC 1080 

ATCGTCAAGG CGGCCCGCCG GGATGGOGAG CTGCTCCTCT CCGCCCAGCT GGATTCTGAG 1140 

GAGAGGGACC CGGGGGTGCC CCGGCCCAGC CCCTATGCGC CCTACATCCT AGTCTATGCC 1200 

AACGATCTGG CCATCTCGGA GCCCAACAGC GTGGCAGTGA CGCTGCAGAG ATACGACCCC 1260 

TTCCCTGCCG GAGACCCCGA GCCCOGCGCA GCCCCCAACA ACTCAGCGGA CCCCCGCGTG 1320 

CGCCGAGCCG CGCAGGCCAC TGGGCCCCTC CAGGACAACG AGCTGCCGGG GCTGGATGAG 1380 

AGGCCGCOGC GCGCCCAOGC ACAGCACTTC CACAAGCACC AGCTGTGGCC CAGCCCCTTC 1440 

CGGGOGCTGA AACCCCGGCC AGGGCGCAAA GACCGCAGGA AGAAGGGCCA GGAGGTGTTC 1S00 

ATGGCCGCCT CGCAGGTGCT GGACTTTGAC GAGAAGACGA TGCAGAAAGC COGGAGGAAG 1560 

CAGTGGGATG AGCCGAGGGT GTGCTCCOGG AGGTACCTGA AGGTGGACTT CGCAGACATC 1620 

GGCTGGAATG AATGGATAAT CTCACGGAAA TCTTTTGATG CCTACTACTG CGCGGGAGCA 1680 

TGTGAGTTCC CCATGCCTAA GATOGTTCGT CCATCCAACC ATGCCACCAT CCAGAGCATT 1740 

GTCAGGGCTG TGGGCATCAT CCCTGGCATC CCAGAGCCCT GCTGTGTTCC OGATAAGATG 1800 

AACTCCCTTG GGGTCCTCTT CCTGGATGAG AATCGGAATG TGGTTCTGAA GGTGTACCOC I860 

AACATGTCCG TGGACACCTG TGCCTGCCGG TGAGACCACT CCAGGGTGGA AAGAAGCCAC 1920 

GCCCAGCAGA GCTGCCTTCT CGGAGCCTTC TGCAACCAGG ACTTGTGGTG CAGCTGCAGA 1980 

CACAGAGCAC AGCTCATGGG CAACATCACT GGGGGCCAGA GAGAGCTGTC CGCCAGTGCA 2040 

TCATTAGGGG GTCTTTCATT GCTAGTGACT AGCCCCTTAA ATGCCAGCCT GAGTACCTGA 2100 

AGGAATCTGG GAATTAGOCC TGGCCTGAAA GTGGCCCATC ATTCATACCC ACTGTTCTGA 2160 

AGGCTTGAAA ACAAAACATA TCCACAACAT TGGCTTGATG TGATCATCAT CTCATAACTG 2220 

AGCAAGAAGA CTATGCAAAT CTTAGGGCGC TOGCTCCCTG CACACGGAAA GAACTCTGTT 2280 

TAAATGCTCA GTTCAGAACA CTTTGGGCCA CATAGTGATT TTGGAAAACA GGATAATCGT 2340 

GGTGTAAATG AGTGTTTCCT TTCAAAGTCC ACTGCAGAGC TTTTATCCAT ATGGTATGCA 2400 

CATGTAGCCA ATATTGGTTT CTTTTTCTTA ATATATATAT TTTATTTTAA AACAACAAAA 2460 

AGGGAGGGCG TTGACACCAT TCCCCACAGA GATAGTCATG CTGAGTGTGG GTTGTTTAAA 2520 

CATGCATATT GAAATAACAC ATATAGTAAC GTGGGAATAC TAAAAAATAA CCAAGATTTT 2S80 

ATATTTTTGT AAATTATACT TTCTATACTG TAGATTGTGT ATGTTATGTG TTTTTATGGA 2640 
AAGCTAATAA ATTAAAGGTA CAGTGGTATC TTGA 



Seq ID NO: B22 Protein sequence 
Protein Accession #i NP__004953.1 



1 
I 

MAHVPARTSP 
GDAAATLGPS 
SMQDSEMILT 
TATQGLLRGA 
APYILVYAND 
MELFGLDERP 
TMQKARRKQW 
NHATIQSIVR 



11 
I 

QPGPQLLLLL 
AQDMVAVHMH 
ATFHFYSEPP 
KALAPPPRGL 
LAISEPNSVA 
PRAHAQHFHK 
DEPRVCSRRY 
AVGIIPGIPE 



21 
I 

LPLPLLtiLRD 
RLYEKYSRQG 
RWPRALEVLC 
WQAKDISPIV 
VTLQRYDPPP 
HQLWPSPFRA 
LKVDFADIGW 
PCCVPDKMNS 



31 

I 

VAGSHRAPAW 
ARPGGGNTVR 
KPRAKNASGR 
KAARRDGELL 
AGDPEPRAAP 
LKPRPGRKDR 
KEWIISPKSF 
LGVLFLDENR 



41 
I 

SALPAAADGL 
SFRARLEWD 
PLPLGPPTRQ 



NNSADPRVRR 
RKKGQEVFMA 
DAYYCAGACE 
NWLKVYPNM 



51 

I 

QGDRDLQRHP 
QKAVYFFNLT 
HLLFRSLSQN 
DPGVPRPSPY 
AAQATGPLQD 
ASQVUDFDEK 
FPMPKIVRPS 
SVDTCACR 



60 
120 
180 
240 
300 
360 
420 



Seq ZD NO: B23 DNA sequence 

Nucleic Acid Accession #: AK026322.1 



ATTCTTTAAA 
TCCATCATCT 
ACATATCCTT 
TTAGCATTTT 
TAGTTGAATT 
AAAGGCTATG 
CATGTGTTCA 
ATTTCGCAGT 
TATCCAACAA 
CCTTCCTTTG 
CTACTACTCT 
ATCTAAGGGA 
TACTATAOCA 
TTTCTTGTTA 
CTTTTCTTCA 
CAATGACTAG 
GAATGTGTGC 
TTTTTTTTTT 
CTACTATAGG 



11 
I 

TACTTAAACA 
GTCATTTTTG 
GCTTCTTCCC 
GGAGTGACTG 
ACTTGCAAAT 
CTACTTCAAG 
ATTTGAATGT 
AATTTTTTTT 
CAGTCTTGGA 
TGGTTATGTG 
GCTTGGATTC 
ATTTCAGGAG 
CCTGTCAGCC 
CAGGAGCCTC 
TTATGTGTTT 
AGAGCACTTC 
TGCAGATGTC 
TGTATTTTTA 
ATAAATTTGT 



21 
I 

TGGTTATAAC 
CATCTGTTTT 
ATATTTAGTA 
CTTTTGCACT 
CAGCTTGATC 
TATAATAAAA 
CTCCAACCCT 
GTTCAGCCTT 
GATCTCATGA 
CCCTGAAAAC 
CCCTCTTATA 
TCACCTCATT 
AATGTGTGAG 
CATTACTTTT 
ATCTCTGGCA 
TGTTGAGGCT 
ATACAGCATC 
TTGTATTACC 
TTTTATAGAG 



31 
I 

AGCTGTGTTT 
TATTGACTAA 
AAGTTTAGTT 
TGCCTTTAAA 
ATTCTGAGAC 
CCTAGTTTTA 
GTGTGAACTC 
GCAGTCTCAT 
AGATTTCTGA 
TOCAGCCTCC 
CAATGGTCTA 
TATTTTCCTT 
AATGGTGGTT 
CAAACAGTAA 
GTTTGAGCCA 
CATTCATTGA 
ACAGCTTCTT 
TGCTTGGTTG 
CAATTTCTCA 



41 

I 

TAAGCTCCTT 
CTTTCTTGCT 
GGATGCTGGA 
AGTATTGGAC 
TTGTTTTTAA 
GTTTTATCCT 
TGAAAATTGT 
CCTACTCAAG 
AACTTTTGCT 
GTGTCTTCAA 
GCAAGTGACT 
TTTTTCAGGG 
TCATACATTT 
TGCAGTTGCA 
AGAGAGGGCA 
CCCCTACCCC 
CTCTAATTTA 
CAAGAGGATG 
TGGGTCAOGA 



51 
I 

GTCTGTTAAT 
CATCATGGTC 
TTGTTTTAAT 
TTGGTTTTGA 
CTTTTGTOGA 
ATAACTAAGG 
TCAGCTCCCA 
TGTGGCTCTG 
CXGGAGGATA 
TTCAGTGAAA 
GCAAACAGAA 
ATTATAGTAG 
TCTCCAATTC 
GTTGCCTCTC 
CGGAGAAGTA 
AGTGCTTTAT 
TGAGCCATAA 
ATGAGGAGGA 
GGGATAGTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



806 



WO 03/042661 



TCCATTTTCC TCACTQATAT CGGCTTGATC ATCTGAQTCA GTGGGCTCAT TGTGGAAACA 1200 
TATGGAATCC TTTATGCTTT TTCTCAGGCT GCTTCTGTTA CATGAAATAA AGCCAGAGTT 12 SO 
GATTGTGAAA AAAAAAAAAA AAAA 

Seq ID MO: B24 DNA sequence 

Nucleic Acid Accession *h cat cluster 

1 11 21 31 41 51 

1)11)1 

TGTATACATT CCTTTCAAAT AAAGACCTTG AGAAAACAGC AGAGCCAAGT GAAGATCACC 60 

TAAAGAACTT TGTGGCTAAT TTATACTTCA TGTAGTAGCA GTGGGTACTG GGCAGGGTCT 120 

CTTTCCCACT CTGATGATTT GTGCTCTTAT TTTTCCTAGA TTTACCTCAT CTAGGGCATA 180 

TTCTTTTCCC TCTTCCTCTT TACCTTTCCT GGTCCTGATC CCTCT GTACT CAGTTCCCCT 240 

AAATTATTGG ACTACAAACT AATATACTAG AAAAGCATAC ACTTATTTTA TTTGAATGCA 300 

GAAATGCTAT CTATCAGTAT ATATACATAA GAATGTATAT TACAGTATAT CT AT AT AT AC 360 

TTAACACTGT AACTTTCAGT ATTCCCCAGT TAGCGTACCT AACTCTCCTG TGGGTTATGT 420 

TAATTCTTAT TAGACTACTA GAGAAAAACC AACTGGCAGT TTGCTAAGCA TATCTACTGG 480 

T GT T GT TTCT GCGCCCTCTT TTGGCTAATT GATGTAATTA TACTGGCTCT AAAGATTTAC S40 

TGCCCCATAA GTAAATAGTA TAGCCACATT CTGAACATAT CAAAAGTACA AACTTAGGAG 600 

GAGTGTATGT ACAAAAATGT AAAATTTTAT GAAAATGAAC ATGTTTTTAT GATGTTATTT 660 

CTAGTTCATA AGAATGTGAT GACTGCTTTG CTTCATTTAT GTACGTTCCC ATTATATTCT 720 

TGCTGTCAAT CAATCACAAA TTTATATCAG ATTAGGATAA ACTAAGCCAT TTTA TGTA TT 780 

TTATTTTAAA CCTTATTTTG GCAGAGTAAT TCCTTAGAAT TGGAAAAGCT GTTACTTTGA 840 

AATTACCAAT TTATTACAAA ACATAGAAAT GTATTGTAGC TACAAAGACA A CCAAG CATT 900 

TTCTGTGTTT TAATGAATAT CTAAAAAACT ACATTTAGTT TATTTTACTC AGTTTTGAAA 960 

TGATTTTTTT ACTGGCTCTA TTGCCTTAAA ATAACTAAGA GATTAATGAT T CTTTGT ATA 1020 

ATTTTCCTTT TCTTTGTTCT TTTTTTTACA TTTCGCAGAG TTATATCTAT AGTTTTAGTA 1080 

ACAATTTCTT ATGTATTCTG GATAACTGAA AACAACTAAA GGTGTTGGGC ATT AGAAA AT 1140 

AATTGTGAGC AGTAAGATTA CTGATGTAAT ATGTATGTTG GACTGAAGTA TTTCTTTATA 1200 

AACATTCTAT TTGATTTTAA GCAAAATGTA TGTTAAAGCA TGTTTTTACA TCAGTAAAGT 1260 

CATTTGTCGA CCTTCTGGAA ATGAAAGGTT TTTACCTAGA TACTGTAAGT TACACCTCCT 1320 

TAACAATCAT ATTTGTCATT GTTGTTTTCT GCAAACAAAA ATGTTTATGG GCTTCATGTA 1380 

GGCTTAAGAT TGTAGGCAAA AATGGACTGA GTTCAGGACC CTTCAAGCAG TAGGCATTCA 1440 

GTTACAGAGC AGTTGGTACT TTGTAACCCA GACTTACAGT TTAAAAATAT CAAGTTAGCT 1500 

GATGTTTCAT TATAATAAAA ATACTATTTT GCTTAAGAGT TGTATTACAA ATATTTGTGC 1560 

TTAACATTAG AAATAGCTGT TTTAAATTGT AGTTAACATA TTAACTTTTT CAGAAAAAAA 1620 

GCATGGTTTA TTTTTAATAA TGAAATAGAG AACATAATAC GTAATGTTCA GTATAACAGC 1680 

TGAGTTAAAA CATCTGCCAG GATTAACATC AGTGCGTTTT TGCCAATGCA TAGAGGCATT 1740 

TTTCTCTAAG TATGATGGCT AATGATAACT ATTCTTTGTT ACACATTCAA GTCACTCCCA 1800 

TACAAGTAAC TAGTGGGTGA TATGTTTCAC TCCAAAGGGT GTATTAATTC TGAATGCTAA 1860 
TCATGAAGAC TTAAGTTAGG ACAACACTTC AAACCAGGAA GTGTGAACTG ATTTA 

Seq ID NO: B25 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..1144 

1 11 21 31 41 51 

I I I 1 I I 

ATGGAGCGCG GGATGCACCT OGGTGCAGCG GCCGCCGGCG AGGACGACCT CTTCCTGCAC 60 

AAGAGCCTGA GCGCCTCCAC CTCCAAGCGC TTGGAAGCGG CTTTCCGCTC CACGCCCCCG 120 

GGCATGGAOC TGTCCCTGGC GCCGCCGCCT CGGGAACGCC CGGCGTCCTC CTCCTCGTCG 180 

CCCCTGGGCT GCTTCGAGCC GGCTGACCCC GAGGGGGCAG GGCTGCTGTT GCCGCCGCCT 240 

GGAGGAGGCG GCGGCGGCAG CGCGGGAAGT GGCGGCGGCG GCGGCGGCGG GGTGGGTGTC 300 

CCCGGGCTGC TAGTAGGTTC AGCCGGCGTT GGGGGCGACC CTAGCCTAAG CAGCCTGCCG 360 

GCCGGGGCCG CCCTTTGCCT CAAGTACGGC GAAAGCGCGA GCCGGGGCTC GGTGGCCGAG 420 

AGCAGCGGCG GCGAGCAGAG CCCCGACGAC GACAGCGACG GTCGCTGCGA GCTCGTGCTG 480 

CGGGCCGGAG TAGCCGACCC GCGGGCCTCC CCGGGAGOGG GAGGTGGTGG CGCGAAGGCA 540 

GCCGAGGGCT GCTCCAATGC CCACCTCCAC GGCGGCGCCA GCGTCCCCCC GGGGGGCCTG 600 

GGCGGCGGCG GCGGCGGGGG TAGCAGCAGC GGTAGCAGTG GCGGCGGTGG OGGTAGCGGT 660 

AGCGGCAGCG GCGGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAAGAAATC CAAAGAGCAA 720 

AAGGCGCTGC GGCTTAACAT CAATGCCCGA GAGCGCOGGC GGATGCACGA CCTGAACGAC 780 

GCGCTGGACG AGCTGCGCGC GGTGATCCCC TACGCGCACA GCCCCTCGGT GCGAAAGCTC 840 

TCCAAGATCG CCACGCTGCT GCTCGCCAAG AACTACATCC TCATGCAGGC GCAGGCCCTG 900 

GAGGAGATGC GGCGCCTAGT CGCCTACCTC AACCAGGGCC AGGCCATCTC GGCTGCCTCC 960 

CTGCCCAGCT CGGCGGCTGC AGCGGCAGCA GCTGCTGCCC TGCACCCGGC GCTCGGCGCC 1020 

TACGAGCAGG CAGCCGGCTA CCCGTTCAGC GCCGGACTGC CCCCGGCTGC CTCCTGCCCG 1080 

GAGAAGTGCG CCCTGTTTAA CAGCGTCTCC TCCAGCCTCT GCAAACAGTG CACGGAGAAG 1140 
CCTT 

Seq ID NO: B26 Protein sequence 
Protein Accession 8: FGENESH predicted 

1 11 21 31 41 51 

| I I I I I 

MERGMHLGAA AAGEDDLFLH KSIiSASTSKR LEAAPRSTPP GMDLSLAPPP RERPASSSSS 60 

PLGCFBPADP EGAGLLLPPP GGGGGGSAGS GGGGGGGVGV PGLLVGSAGV GGDPSLSSLP 120 

AQAALCLKYG ESASRGSVAE SSGGEQSPDD DSDGRCELVL RAGVADPRAS PGAGGGGAKA 180 

AEGCSNAHLH GGASVPPGGL GGGGGGGSSS GSSGGGGGSG SGSGGSSSSS SSSSKKSKEQ 240 

KALRLNINAR ERRRMHDLND ALDELRAVIP YAHSPSVRKL SKIATLLLAK NYILMQAQAL 300 

EEMRRLVAYL NQGQAISAAS LPSSAAAAAA AAALHPALGA YEQAAGYPPS AGLPPAASCP 360 
EKCALFNSVS SSLCKQCTEK P 

Seq ID NO: B27 DNA sequence 

Nucleic Acid Accession #: cat cluster 



807 



WO 03/042661 



PCT/US02/36810 



1 IX 21 31 41 SI 

I I I I I I 

TTTTTTTTTT TTTTTTTTAA ATGTTTGATC AAAATATTTT AATAAAGATT CTTTCTGACA 60 

5 TAGATACACA TACAAATGGT CGTACATAGC TGTCATAGTC TGATTGACCT ATTTAATATA 120 

TATATCATTC TTTACACATC CAAAACCCGC CAACAGATCC ATCACAGCTC CCAACTCACC 180 

ATCCAACCTG ACAAACTQAA TTTGTATTAT CTGCAAGGAG TGGAAAATAG CAGGACTCCA 240 

TTTTTAAAAA AGATTTTCTT GATTTTCATA GGGCGGAAAG GCAGTCAAAC AGCCATGCAA 300 

AACTAAAACT GAAAGCTCAC TTTGGGTAAA TAGCTTCTTG TTCTTCCTTA GTTTTCTTTC 360 

10 TTTTTAAATT TTATTTTTTT AGAAAAATAA CAAAGGATTT CACACCATAG GCAAATCAAA 420 

CCAGTCTTTT AACTTAAATA ATTCTCCACA GTTAAAATAA CATATATGTA CATATATATT 480 

AAAAGCAATT AAATTAGACC TTTTAAAAAT GCACAGCACA GCCTGGAAAA ATATTTGCTT S40 

AGCATGTTCT TACGTATAGC AACTATTGCT GTGATGTTTT TCCCTTTTGG AATGTAAAGG 600 

AGTCCCCTTT CAAAAAAGAG ATCAATTCAT TCATCAATTA AGAATACACC TTTCCTGTAA 660 

15 TTTTTGGACT GAAGCAATTT ATTAAAGCTC AATTTAAATA CAGGGATGAT GCAACTGAAA 720 

ATATCCAGGT GACCTTTCAT AAACCTAAGC AGCTCAGATA CATCAATATT TCTCTTCATA 780 

CTTGTTGGCA AATAAACCTT TAAACACTTG GCACACAGCA TAAGTAATCT ATAAAACAAT 840 

TTTAGAGGCA TTAAAAAAAT CTGCACATAA GACCGATGAC TTTAACACAT GATAAATACT 900 

GTTGATGTGG AAGGGTCATT GAAGAATAAC AAATAAATAC CATGAATTGT TAATACATCA 960 

20 TTGCAGAGTA GAAAGTAACA AGGTGCACAT AAATATTTTT AAATGCAATT CTTTCAGCCA 1020 

CAGTCAGTTT TTTTATATCA CTCTCGCCAA AACTTTGAGC ATTTTCACAG GATTAAAGTT 1080 

CAGAGACAAT AAAAAATACA AGTCTTTCAT AGTAACATGT TCTCTCTCTC TCGCTCTGGG 1140 

TTTTATTTCC CCCTCACTTA AGTCAACATT TCAAGTTTTT CTCCTGGCTC AGAATCAAAA 1200 

TTTATTTTCA AGTGCCCTTT CTGATTTGTC TGAATGAATA TTCCATCTCT CATGCTACCG 1260 

25 ATCCGCT 



30 



Seq ID NO: B28 DNA sequence 

Nucleic Acid Accession #: NM_002581.2 

Coding sequence: 368.. 5251 



1 11 21 31 41 51 

I I I I I I 

GGGGGAGGGA ATTCAGCGGA TCAGTCTTAA GAGGAGCTTT TTTTTGGAGC GAGAAATCAT 60 

ATAAAATAAA ATGAAATAAA ACAAGGAGGA AGGCAACCAG CTGTTAGGGG GAAAATAAGG 120 

35 CAGATAAAGG AGCGGGGAGA GAAATTAATT GCCAACCAGG AGGAGTTGGG CTGTATTTTT 180 

CAAAGGTGGG GAGAGTGGAG CACACACCTT GAGGAGGAAA GCGAGAAAGA AAAGAAAAAA 240 

GCAAGTGAAG GGGGGCTCGC CCAAGAAGGG TGAAGAAGCG AAGAAAGTCG AGGCGCCGAG 300 

GCTCCCAAAG CTGGCAGCTC CGGGTGGCGG TGCAGGGGCG AAGGGGGGGG CGGGGGGAAC 360 

GTCGGACATG CGGCTCTGGA GTTGGGTGCT GCACCTGGGG CTGCTGAGCG CCGOGCTGGG 420 

40 CTGCGGGCTG GCCGAGCGTC CCCGCCGGGC CCGGAGAGAC CCGCGGGCCG GCCGACCCCC 480 

GCGCCCOGCC GCCGGCCCGG CCACCTGCGC CACCCGCGGC CCGCGGCCGC CGCGCCTCGC 540 

CGCCGCCGCC GCCGCCGCCG GGCGTGCCTG GGAAGCOGTG CGCGTCCCCC GGCGGCGGCA 600 

GCAGCGGGAG GCGAGGGGCG CCACCGAGGA GCCGAGCCOG CCGAGCCGGG CGCTCTATTT 660 

CAGCGGGCGA GGCGAGCAGC TGCGAGTCCT CCGGGCCGAC CTCGAGCTGC CCCGGGACGC 720 

45 GTTCACGCTG CAAGTGTGGC TGCGAGCGGA GGGGGGCCAG AGGTCTCCGG CAGTGATCAC 780 

AGGGCTGTAT GACAAATGTT CTTATATCTC ACGTGACCGA GGATGGGTCG TGGGCATTCA 840 

CACCATCAGT GACCAAGACA ACAAAGACCC ACGCTACTTT TTCTCCTTGA AGACAGACCG 900 

AGCCCGGCAA GTGACCACCA TCAATGCCCA CCGCAGCTAC CTCCCAGGCC AGTGGGTATA 960 

CCTAGCTGCC ACCTATGATG GGCAGTTCAT GAAGCTCTAT GTGAATGGTG CCCAGGTGGC 1020 

50 CACCTCTGGG GAACAAGTGG GTGGCATATT CAGCCCACTG ACCCAGAAGT GCAAAGTGCT 1080 

CATGTTAGGG GGCAGTGCCC TGAATCACAA CTACCGGGGC TACATCGAGC ACTTCAGTCT 1140 

GTGGAAGGTG GCCAGGACTC AGCGGGAGAT ACTGTCTGAC ATGGAAACCC ATGGCGCCCA 1200 

CACTGCTCTA CCTCAGCTCC TCCTCCAGGA GAACTGGGAC AATGTGAAGC ATGCCTGGTC 1260 

CCCCATGAAG GATGGCAGCA GCCCCAAAGT GGAATTCAGC AATGCCCACG GCTTTCTGCT 1320 

55 GGACACGAGT CTGGAGCCTC CTCTGTGCGG ACAGACATTG TGTGACAACA CAGAGGTCAT 1380 

TGCCAGCTAC AATCAGCTCT CAAGTTTCCG CCAGCCCAAG GTGGTGCGCT ACCGCGTGGT 1440 

CAACCTCTAT GAAGATGATC ATAAGAACCC GACGGTGACG CGCGAGCAGG TGGACTTCCA 1500 

GCACCATCAG CTGGCTGAGG CCTTCAAGCA ATACAACATC TCCTGGGAGC TGGACGTGCT 1560 

GGAGGTGAGC AACTCCTCCC TTCGCCGCCG CCTCATCCTG GCCAACTGTG ACATCAGCAA 1620 

60 GATTGGGGAT GAGAACTGTG ACCCCGAGTG CAACCACACG CTGACGGGCC ACGACGGCGG 1680 

GGATTGCCGC CACCTGCGCC ACCCTGCCTT OGTGAAGAAG CAGCACAAOG GGGTGTGTGA 1740 

CATGGACTGC AACTATGAAC GGTTCAACTT TGATGGTGGA GAGTGCTGTG ACCCTGAAAT 1800 

CACCAATGTC ACTCAGACTT GCTTTGACCC CGACTCTCCA CACAGAGCCT ACTTGGATGT 1860 

TAATGAGCTG AAGAACATTC TTAAATTGGA TGGATCAACA CATCTCAATA TTTTCTTTGC 1920 

65 AAAATCCTCA GAGGAGGAGT TGGCAGGAGT AGCAACTTGG CCATGGGACA AGGAGGCCCT 1980 

GATGCACTTA GGTGGCATTG TCTTGAACCC ATCTTTCTAT GGCATGCCTG GGCACACCCA 2040 

CACCATGATC CATGAGATTG GTCACAGCCT GGGCCTCTAT CACGTCTTCC GAGGCATCTC 2100 

AGAAATCCAG TCCTGCAGTG ACCCCTGCAT GGAGACAGAG CCCTCCTTCG AGACTGGAGA 2160 

A CCTCTGCAAT GATACCAACC CAGCCCCTAA ACACAAGTCC TGTGGTGACC CAGGGCCAGG 2220 

70 AAATGACACC TGTGGCTTTC ATAGCTTCTT CAACACTCCT TACAACAACT TCATGAGCTA 2280 

TGCAGATGAC GACTGTACGG ACTCCTTCAC GCCCAATCAA GTCGCCAGAA TGCACTGTTA 2340 

CCTGGACCTG GTCTACCAGG GCTGGCAGCC CTCCAGGAAA CCAGCGCCTG TTGCCCTCGC 2400 

CCCCCAAGTT CTGGGCCACA CAACGGACTC TGTGACACTG GAGTGGTTCC CACCTATAGA 2460 

TGGCCATTTC TTTGAAAGAG AATTGGGATC AGCATGTCAT CTTTGCCTGG AAGGGAGAAT 2520 

75 CCTGGTGCAG TATGCTTCCA ACGCTTCCTC CCCAATGCCC TGCAGCCCAT CAGGACACTG 2S80 

GAGCCCTCGT GAAGCAGAAG GTCATCCTGA TGTTGAACAG CCCTGTAAGT CCAGTGTCCG 2640 

CACCTGGAGC CCAAATTCAG CTGTCAACCC ACACAOGGTT CCTCCAGCCT GCCCTGAGCC 2700 

TCAAGGCTGC TACCTCGAGC TGGAGTTCCT CTACCCCTTG GTCCCTGAGT CTCTGACCAT 2760 

OA TTGGGTGACC TTTGTCTCCA CTGACTGGGA CTCTAGTGGA GCTGTCAATG ACATCAAACT 2820 

80 GTTGGCTGTC AGTGGGAAGA ACATCTCCCT GGGTCCTCAG AATOTCTTCT GTGATGTCCC 2880 

ACTGACCATC AGACTCTGGG ACGTGGGCGA GGAGGTGTAT GGCATCCAAA TCTACACGCT 2940 

GGATGAGCAC CTGGAGATCG ATGCTGCCAT GTTGACCTCC ACTGCAGACA CCCCACTCTG 3000 

TCTACAGTGT AAGCCCCTGA AGTATAAGGT GGTCCGGGAC CCTCCTCTCC AGATGGATGT 3060 

GGCCTCCATC CTACATCTCA ATAGGAAATT CGTAGACATG GATCTAAATC TTGGCAGTGT 3120 

808 



WO 03/042661 



GTACCAGTAT TGGGTCATAA CTATTTCAGG AACTGAAGAG AGTQAGCCAT CACCTGCTGT 3180 
CACATACATC CATGGACGTG GGTACTGTGG CGATGGCATT ATACAAAAAG ACCAAGGTGA 3240 
ACAATGCGAC GACATGAATA AGATCAATGG TGATGGCTGC TCCCTTTTCT GCCGACAAGA 3300 
AGTCTCCTTC AATTGTATTG APGAACCCAG CCGGTGCTAT TTCCATGATG GTGATGGGGT 3360 
ATGTGAGGAG TTTGAACAAA AAACCAGCAT TAAGGACTGT GGTGTCTACA CGCCCCAGGG 3420 
ATTCCTGGAT CAGTGGGCAT CCAATGCTTC AGTATCTCAT CAAGACCAGC AATGCCCAGG 3480 
CTGGGTCATC ATCGGACAGC CAGCAGCATC CCAGGTGTGT CGAACCAAGG TGATAGATCT 3540 
CAGTGAAGGC ATTTCCCAGC ATGCCTGGTA CCCTTGCACC ATCAGCTACC CATATTCCCA 3600 
GCTGGCTCAG ACCACTTTTT GGCTCCGGGC GTATTTTTCT CAACCAATGG TTGCCGCAGC 3660 
TGTCATTGTC CACCTGGTGA CGGATGGGAC ATATTATGGG GACCAAAAGC AGGAGACCAT 3720 
CAGCGTGCAG CTGCTTGATA CCAAAGATCA GAGCCACGAT CTAGGCCTCC ATGTCCTGAG 3780 
CTGCAGGAAC AATCCCCTGA TTATCCCTGT GGTCCATGAC CTCAGCCAGC CCTTCTACCA 3840 
CAGCCAGGCG GTACGTGTGA GCTTCAGTTC GCCCCTGGTC GCCATCTCGG GGGTGGCCCT 3900 
CCGTTCCTTC GACAACTTTG ACCCCGTCAC CCTGAGCAGC TGCCAGAGAG GGGAGACCTA 3960 
CAGCCCTGCC GAGCAGAGCT GOGTGCACTT CGCATGTGAG AAAACTGACT GTCCAGAGCT 4020 
GGCTGTGGAG AATGCTTCTC TCAATTGCTC CAGCAGCGAC CGCTACCACG GTGCCCAGTG 4080 
TACTGTGAGC TGCCGGACAG GCTACGTGCT CCAGATACGG CGGQATGATG AGCTGATCAA 4140 
GAGCCAGACG GGACCCAGCG TCACAGTGAC CTGTACAGAG GGCAAGTGGA ATAAGCAGGT 4200 
GGCCTGTGAG CCAGTCGACT GCAGCATCCC AGATCACCAT CAAGTCTATG CTGCCTCCTT 4260 
CTCCTGCCCT GAGGGCACCA CCTTTGGCAG TCAATGTTCC TTCCAGTGCC GTCACCCTGC 4320 
ACAATTGAAA GGCAACAACA GCCTCCTGAC CTGCATGGAG GATGGGCTGT GGTCCTTCCC 4380 
AGAGGCCCTG TGTGAGCTCA TGTGCCTCGC TCCACCCCCT GTGCCCAATG CAGACCTCCA 4440 
GACCGCCCGG TGCCGAGAGA ATAAGCACAA GGTGGGCTCC TTCTGCAAAT ACAAATGCAA 4500 
GCCTGGATAC CATGTGCCTG GATCCTCTCG GAAGTCAAAG AAACGGGCCT TCAAGACTCA 4560 
GTGTACCCAG GATGGCAGCT GGCAGGAGGG AGCTTGTGTT CCTGTGACCT GTGACCCACC 4620 
TCCACCAAAA TTCCATGGGC TCTACCAGTG TACTAATGGC TTCCAGTTCA ACAGTGAGTG 4680 
TAGGATCAAG TGTGAAGACA GTGATGCCTC CCAGGGACTT GGGAGCAATG TCATTCATTG 4740 
CCGGAAAGAT GGCACCTGGA ACGGCTCCTT CCATGTCTGC CAGGAGATGC AAGGCCAGTG 4800 
CTCGGTTCCA AAGGAGCTCA ACAGCAACCT CAAACTGCAG TGCCCTGATG GCTATGCCAT 4860 
AGGGTCGGAG TGTGCCACCT CGTGCCTGGA CCACAACAGC GAGTCCATCA TCCTGCCAAT 4920 
GAACGTGACC GTGCGTGACA TCCCCCACTG GCTGAACCCC ACACGGGTAG AGAGAGTTGT 4980 
CTGCACTGCT GGTCTCAAGT GGTATCCTCA CCCTGCTCTG ATTCACTGTG TCAAAGGCTG 5040 
TGAGCCCTTC ATGGGAGACA ATTATTGTGA TGCCATCAAC AACCGAGCCT TTTGCAACTA 5100 
TGACGGTGGG GATTGCTGCA CCTCCACAGT GAAGACCAAA AAGGTCACCC CATTCCCTAT 5160 
GTCCTGTGAC CTACAAGGTG ACTGTGCTTG TCGGGACCCC CAGGCCCAAG AACACAGCCG 5220 
GAAAGACCTC CGGGGATACA GCCATGGCTA AGGAAGGACA AGAAGTTGTC AAAGAATTCC 5280 
CAACGCCAGG ACCCACATCC CTTTGGTATT GATTTCACAG TCAGCTGCTC AAOGGAATGG 5340 
CCTCTCCACA CCAGGGATCC TTAGCACCCA ACCGGTCTGC CTTTAATTTT ACCCAGGAAG S400 
GACTCACATT GGGGCGAATG AACCAAGTTT CGCCATGCTG GATGATGAAA TGGATTCCCA 5460 
TCCCAAAGTC TGAGATGGAT TGCATATACA GTGTGCAGTC CCAGAGCCTC CTAAAATTCT 5520 
AGCCATTTGT CACACAACCA CAGCAAAAAA AAA 

Seq ID NO: B29 Protein sequence 
Protein Accession fl: NP_002572.1 

1 11 21 31 41 SI 

I I I I I I 

MRLWSWVLHL GLLSAAIjGCG LAERPRRARR DPRAGRPPRP AAGPATCATR GPRPPRLAAA 60 

AAAAGRAWEA VRVPRRRQQR EARGATEEP8 PPSRALYPSG RGEQLRVLRA DLBLPRDAFT 120 

LQVWLRAEGG QRSPAVITGL YDKCSYISRD RGWWGIHTI SDQDNKDPRY FPSLKTDRAR 180 

QVTTINAHRS YLPGQWVYLA ATYDGQFMKL YVNGAQVATS GEQVGGIFSP LTQKCKVLM^ 240 

GGSALNHNYR GYIEHFSLWK VARTQREILS DMBTHGAHTA LPQLLLQENW DNVKHAWSPM 300 

KDGSSPK7BF SNAHGPLLDT SLEPPLCGQT LCDNTEVIAS YNQLSSFRQP KWRYRWNL 360 

YEDDHKNPTV TREQVDFQHH QLAEAFKQYN ISWELDVLEV SNSSLRRRLI LANCDISKIG 420 

DENCDPECNH TLTGHDGGOC RHLRHPAFVK KQHNGVCDMD CNYERFNPDG GECCDPEITN 480 

VTQTCFDPDS PHRAYLDVNE LKNILKLDGS THmiFFAKS SEEELAGVAT WPWDKEALMH 540 

LGGIVLNPSF YGMPGHTHTM IHEIGHSLGL YHVPRGISEI QSCSDPCMET EPSFETGDLC 600 

NDTNPAPKHK SCGDPGPGND TCGFHSPPNT PYNNFMSYAD DDCTDSFTPN QVARMHCYLD 660 

LVYQGWQPSR KPAPVALAPQ V1GHTTDSVT LEWFPPIDGH PPERELGSAC HLCLEGRILV 720 

QYASNASSPM PCSPSGHWSP REAEGHPDVE QPCK6SVRTVJ SPNSAVNPHT VPPACPEPQG 780 

CYLELEFLYP LVPESLTIWV TFVSTDWDSS GAVNDIKLLA VSGKNISLGP QNVPCDVPLT 840 

IRLWDVGEEV YGIQIYTLDE HLEIDAAMLT STADTPLCLQ CKPLKYKWR DPPLQMDVAS 900 

XLHLNRKFVD MDLNLGSVYQ YWVITISGTE ESBPSPAVTY IHGRGYCQDG IIQKBQGEQC 960 

DDMNKINGDG CSLFCRQEVS FNCIDEPSRC YPHDGDGVCE EFEQKTSIKD CGVYTPQGPL 1020 

DQWASNASVS HQCQQCPGWV I IGQPAASQV CRTKVIDLSE GISQHAWYPC TISYPYSQLA 1080 

QTTFWLRAYP SQPMVAAAVI VHLVTDGTYY GDQKQBTISV QLLDTKDQSH DI/JLHVLSCR 1140 

NNPLIIPWH DLSQPFYHSQ AVRVSPSSPI* VAISGVALRS PDNFDFVTLS SCQRGETYSP 1200 

AEQSCVHFAC EKTDCPELAV ENASLNCSSS DRYHGAQCTV SCRTGYVLQI RRDDBLIKSQ 1260 

TGPSVTVTCT EGKWNKQVAC EPVDCSIPDH HQVYAASPSC PEGTTFGSQC SFQCRHPAQL 1320 

KGNNSLLTOI EDGLWSPPEA LCELMCLAPP PVPNADLQTA RCRENKHKVG SFCKYKCKPG 1380 

YHVPGSSRKS KKRAFKTQCT QDGSWQEGAC VPVTCDPPPP KFHGLYQCTN GFQFNSECRI 1440 

KCEDSDASQG LGSNVIHCRK DGTWNGSPHV OQEKQGQCSV PNEUJSNLKL QCPBGYAIGS 1500 

ECATSCLDHN SESIILPMNV TVRDIPHWLN PTRVERWCT AGLKWYPHPA LIHCVKGCEP 1560 

FKGDNYCDAI NNRAFCNYDG GDCCTSTVKT KKVTPFPMSC DLQGDCACRD PQAQEHSRKD 1620 
LRGYSHG 

Seq ID NO: B30 DNA sequence 

Nucleic Acid Accession #: NM_032 808.1 

Coding sequence! 61.. 1008 

1 11 21 31 41 51 

I I I I I I eA 

CTAGTCTATC TCCGCTTCCT CAACCTCTCC TACAACCCCA TCAGCACCAT TGAGGGCTCC 60 
ATGTTGCATG AGCTGCTCCG GCTGCAGGAG ATCCAGCTGG TGGGCGGGCA GCTGGCCGTG 120 
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GTGGAGCCCT ATGCCTTCCG CGGCCTCAAC TACCTGCGCO TGCTCAATGT CTCTGGCAAC 180 

CAGCTGACCA CACTGGAGGA ATTAGTCTTC CACTCGGTGG GCAACCTGGA GACACTCATC 240 

CTGGACTCCA ACCCGCTGGC CTGCGACTGT CGGCTCCTGT GGGTGTTCCG GCGCCGCTGG 300 

CGGCTCAACT TCAACCGGCA GCAGCCCACG TGCGCCACGC CCGAGTTTGT CCAGGGCAAG 360 

GAGTTCAAGG ACTTCCCTGA TGTGCTACTG CCCAACTACT TCACCTGCCG CCGCGCCCGC 420 

ATCCGGGACC GCAAGGCCCA GCAGGTGTTT GTGGACGAGG GCCACACGGT GCAGTTTGTG 480 

TGCCGGGCCG ATGGCGACCC GCCGCCCGCC ATCCTCTGGC TCTCACCCCG AAAGCACCTG 540 

GTCTCAGCCA AGAGCAATGG GCGGCTCACA GTCTTCCCTG ATGGCACGCT GGAGGTGCGC 600 

TACGCCCAGG TACAGGACAA CGGCACGTAC CTGTGCATCG CGGCCAACGC GGGCGGCAAC 660 

GACTCCATGC CCGCCCACCT GCATGTGOGC AGCTACTCGC CCGACTGGCC CCATCAGCCC 720 

AACAAGACCT TCGCTTTCAT CTCCAACCAG CCGGGCGAGG GAGAGGCCAA CAGCACCCGC 780 

GCCACTGTGC CTTTCCCCTT CGACATCAAG ACCCTCATCA TCGCCACCAC CATGGGCTTC 840 

ATCTCTTTCC TGGGCGTCGT CCTCTTCTGC CTGGTGCTGC TGTTTCTCTG GAGCCGGGGC 900 

AAGGGCAACA CAAAGCACAA CATCGAGATC GAGTATGTGC CCCGAAAGTC GGACGCAGGC 960 

ATCAGCTCCG CCGACGCGCC CCGCAAGTTC AACATGAAGA TGATATGAGG CCGGGGCGGG 1020 

GGGCAGGGAC CCCCGGGCGG CCGGGCAGGG GAAGGGGCCT GGCCGCCACC TGCTCACTCT 1080 

CCAGTCCTTC CCACCTCCTC CCTACCCTTC TACACACGTT CTCTTTCTCC CTCCCGCCTC 1140 

CGTCCCCTGC TGCCCCCCGC CAGCCCTCAC CACCTGCCCT CCTTCTACCA GGACCTCAGA 1200 

AGCCCAGACC TGGGGACCCC ACCTACACAG GGGCATTGAC AGACTGGAGT TGAAAGCCGA 1260 

CGAACCGACA CGCGGCAGAG TCAATAATTC AATAAAAAAG TTACGAACTT TCTCTGTAAC 1320 

TTGGGTTTCA ATAATTATGG ATTTTTATGA AAACTTGAAA TAATAAAAAG AGAAAAAAAC 1380 

TATTTCCTAT AGCTAGTCGG AATGCAAACT TTTGACGTCC TGATTGCTCC AGGGCCCTCT 1440 

TCCAACTCAG TTTCTTGTTT TTCTCTTCCT CCTCCTCCTC TTCTTCCTCC TTTCTCTTCT 1500 

CTTCCCCAGT GGGGAGGGAT CACTCAGGAA AACAGGAAAG GAGGTTCCAG CCCCACCCAC 1560 

CTGCCCACCC CGCCCCAGGC ACCATCAGGA GCAGGCTAGG GGGCAGGCCT GGGCCCAGCT 1620 

CCGGGCTGGC TTTTTGCAGG GCGCAGGTGG AGGGGACAGG TCTGCCGATG GGGGTGGGAG 1680 

CCTGTCTGCT GGGCTGCCAG GCGGCACCAC TGCAAGGGGT GGGAGCCTGG CTTGGGTGTG 1740 

GCTGAGACTC TGGACAGAGG CTGGGGTCCT CCTGGGGGAC AGCACAGTCA GTGGAGAGAG 1800 

CCAGGGGCTG GAGGTGGGGC CCACCCCAGC CTCTGGTCCC AGCTCTGCTG CTCACTTGCT 1860 

GTGTGGCCTC AAGCAGGTCA CTGGCCTCTC TGGGCCTCAG TCTCCACATC TGTACAAATG 1920 

GGAACATTAC CCCCTGCCCT GCCTACCTCA CAGGGCTGTT GTGAGGAATT GATGAGATGA 1980 
TGTATGTGAA ACACTTTGTA ACCTGTAAAG CGCTGTGCAC ACGTG 

Seq ID NO: B31 Protein sequence 
Protein Accession ft: NP_116197.1 

1 11 21 31 41 51 

I I 1 I I 1 

MLHELLRLQE IQLVGGQLAV VEPYAFRGLN YLRVLNVSGN QLTTLEELVF HSVGNLETLI 60 
LDSNPLACDC RLLWVFRERW RLNFNRQQPT CATPEFVQGK EFKDFPDVLL PNYFTCRRAR 120 
IRDRKAQQVF VDEGHTVQFV CRADGDPPPA ILWLSPRKHL VSAKSNGRLT VFPDGTLiEVR 180 
YAQVQDNGTY LCIAANAGGN DSMPAHLHVR SYSPDWPHQP NKTFAFISNQ PGEGEANSTR 240 
ATVPFPFDIK TLI IATTMGP ISFLGWLFC LVLLFLWSRG KGNTKHNIEI BYVPRKSDAG 300 
ISSADAPRKF NMKMI 

Seq ID NO: B32 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 

1 11 21 31 41 51 

CAGCACATAC AAGAAACATA CAGTGTACCT CAAAAGGGGC CCTTGAAATG TCATCAAAGG 60 
GTAATAACCT AGTGAGTGAG TTGTGATGTC ATCTGGAACA TAGGAAATGG GGCTCTTAGC 120 
GTATTCGGTA CGAAGGAAGC CAGGCTGGTC CTGGCAGGAA GTAAATGATA ATCTTTGGGA 180 
AACCAGGACC CTGCCTCCCA GCCCAGAGGT GGAGGAGGGC GGTCAGGGTG GGGTCTACAG 240 
TGGCACAGCA CTGACAAAGG TAGAGGGAAA TGTAATAGCA CATCTACGCT GCAGTCTGGT 300 
GAAAGTGGCC GGGGTGGTCC TTGGAAAACA GTTGGGCTGT TCTTGGCAGG AATTAGTGAC 360 
AGCCTTTCOG TCACGGGCAG GGACGCCTTG ATTTAAAAAA AATAAATAAA TAAAACGTCT 420 
GGGTATAGAA A 

Seq ID NO: B33 DNA sequence 

Nucleic Acid Accession ft: NM_006174.l 

Coding sequence: 71.. 1408 

1 11 21 31 41 51 

GAAAGGCTAT CGGTAACAAC TGACCTGCCA CAAAGTTAGA AGAAAGGATT GATTCAAGAA 60 

AGACTATAAT ATGGATTTAG AGCTCGACGA GTATTATAAC AAGACACTTG CCACAGAGAA 120 

TAATACTGCT GCCACTCGGA ATTCTGATTT CCCAGTCTGG GATGACTATA AAAGCAGTGT 180 

AGATGACTTA CAGTATTTTC TGATTGGGCT CTATACATTT GTAAGTCTTC TTGGCTTTAT 240 

GGGGAATCTA CTTATTTTAA TGGCTCTCAT GAAAAAGCGT AATCAGAAGA CTACGGTAAA 300 

CTTCCTCATA GGCAATCTGG CCTTTTCTGA TATCTTGGTT GTGCTGTTTT GCTCACCTTT 360 

CACACTGACG TCTGTCTTGC TGGATCAGTG GATGTTTGGC AAAGTCATGT GCCATATTAT 420 

GCCTTTTCTT CAATGTGTGT CAGTTTTGGT TTCAACTTTA ATTTTAATAT CAATTGCCAT 480 

TGTCAGGTAT CATATGATAA AACATCCCAT ATCTAATAAT TTAACAGCAA ACCATGGCTA 540 

CTTTCTGATA GCTACTGTCT GGACACTAGG TTTTGCCATC TGTTCTCCCC TTCCAGTGTT 600 

TCACAGTCTT GTGGAACTTC AAGAAACATT TGGTTCAGCA TTGCTGAGCA GCAGGTATTT 660 

ATGTGTTGAG TCATGGCCAT CTGATTCATA CAGAATTGCC TTTACTATCT CTTTATTGCT 720 

AGTTCAGTAT ATTCTGCCCT TAGTTTGTCT TACTOTAAGT CATACAAGTG TCTGCAGAAG 780 

TATAAGCTGT GGATTGTCCA ACAAAGAAAA CAGACTTGAA GAAAATGAGA TGATCAACTT 840 

AACTCTTCAT CCATCCAAAA AGAGTGGGCC TCAGGTGAAA CTCTCTGGCA GCCATAAATG 900 

GAGTTATTCA TTCATCAAAA AACACAGAAG AAGATATAGC AAGAAGACAG CATG TGTGT T 960 

ACCTGCTCCA GAAAGACCTT CTCAAGAGAA CCACTCCAGA ATACTTCCAG AAAACTTTGG 1020 

CTCTGTAAGA AGTCAGCTCT CTTCATCCAG TAAGTTCATA CCAGGGGTCC CCACTTGCTT 1080 

TGAGATAAAA CCTGAAGAAA ATTCAGATGT TCATGAATTG AGAGTAAAAC GTTCTGTTAC 1140 
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AAGAATAAAA AAGAGATCTC GAAGTGTTTT CTACAGACTG ACCATACTGA TATTAGTATT 
TGCTGTTAGT TGGATGCCAC TACACCTTTT CCATGTGGTA ACTGATTTTA ATGACAATCT 
TATTTCAAAT AGGCATTTCA AGTTGGTGTA TTGCATTTGT CATTTGTTGG GCATGATGTC 
CTOTTGTCTT AATCCAATTC TATATGGGTT TCTTAATAAT GGQATTAAAG CTGATTTAGT 
GTCCCTTATA CACTGTCTTC ATATGTAATA ATTCTCACTG TTT 

Seq ID NO: B34 Protein sequence 
Protein Accession #: NP_006165.1 

1 11 21 31 41 51 

I I I I I I 

MDLELDEYYN KTLATENNTA ATRNSDFPVW DDYKSSVDDL QYFLIGLYTF VSLLGFMGNL. 60 

LILMALMKKR NQKTTVNFLI GNIiAFSDILV VLFCSPFTLT SVLLDQWMFG KVMCHIMPPL 120 

QCVSVLVSTL ILISIAIVRY HMIKHPISNN LTANHGYFLI ATVWTLGPAI CSPLPVFHSL 180 

VELQETFGSA LLSSRYLCVE SWPSDSYRIA FTISLLLVQY ILPLVCLTVS HTSVCRSISC 240 

GLSNKENRLE ENEMINLTLH PSKKSGPQVK LSGSHKWSYS FIKKHRRRYS KKTACVLPAP 300 

ERPSQENHSR ILPEKFGSVR SQLSSSSKFI PGVPTCFEIK PBENSDVHEL RVKRSVTRIK 360 

KRSRSVPYRL TILILVFAVS WMPLHLFHW TDFNDNLISN RHFKLVYCIC HLLGMMSCCL 420 
NPIfcYGFUJN GIKADLVSLI HCLHM 

Seq ID NO: B35 DNA sequence 

Nucleic Acid Accession ft: NM_014279.l 

Coding sequence t 286.. 1689 

1 11 21 31 41 51 

I I I 1 I I 

GCGCGGGGGA gccattagga ggcgaggaga GAGGAGGGCG CAGCTCCOGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC ACAGCAGACC 480 

ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540 

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA GTTTAAGGCG ATAAAAGCGA AAATGGATGA ACTTAGGCCT 720 

TTGATACCTG TGTTGGAAGA GTACAAGGCC GATGCCAAAT TGGTATTGCA GTTTAAAGAG 780 

GAGGTCCAGA ATCTGACGTC AGTGCTTAAC GAGCTGCAAG AGGAAATTGG CGCCTATGAC 840 

TACGATGAAC TTCAGAGCAG AGTGTCCAAT CTTGAAGAAA GGCTCCGTGC ATGCATGCAA 900 

AAACTAGCTT GCGGGAAGTT GACGGGCATC AGTGACCCCG TGACTGTCAA GACCTCCGGC 960 

TCGAGGTTCG GATCCTGGAT GACAGACCCT CTCGCCCCTG AAGGCGATAA CCGGGTGTGG 1020 

TACATGGACG GCTATCACAA CAACCGCTTC GTACGTGAGT ACAAGTCCAT GGTTGACTTC 1080 

ATGAACACGG ACAATTTCAC CTCCCACCGT CTCCCCCACC CCTGGTCGGG CAOGGGGCAG 1140 

GTGGTCTACA ACGGTTCTAT CTACTTCAAC AAGTTCCAGA GCCACATCAT CATCAGGTTT 1200 

GACCTGAAGA CAGAGACCAT CCTCAAGACC CGCAGCCTGG ACTATGCCGG TTACAACAAC 1260 

ATGTACCACT ACGCCTGGGG TGGCCACTCG GACATCGACC TCATGGTGGA CGAGAGCGGG 1320 

CTGTGGGCCG TGTACGCCAC CAACCAGAAC GCTGGCAACA TCGTGGTCAG TAGGCTGGAC 1380 

CCCGTGTCCC TGCAGACCCT GCAGACCTGG AACACGAGCT ACCCCAAGCG CAGCGCCGGG 1440 

GAGGCCTTCA TCATCTGCGG CACGCTGTAC GTCACCAACG GCTACTCAGG GGGTACCAAG 1500 

GTCCACTATG CATACCAGAC CAATGCCTCC ACCTATGAAT ACATCGACAT CCCATTCCAG 1560 

AACAAATACT CCCACATCTC CATGCTGGAC TACAACCCCA AGGACCGGGC CCTGTATGCC 1620 

TGGAACAACG GCCACCAGAT CCTCTACAAC GTGACCCTCT TCCACGTCAT CCGCTCCGAC 1680 

GAGTTGTAGC TCCCTCCTCC TGGAAGCCAA GGGCCCAOGT CCTCACCACA AAGGGACTCC 1740 

TGTGAAACTG CTGCCAAAAA GATACCAATA ACACTAACAA TACCGATCTT GAAAAATCAT 1800 

CAGCAGTGCG GATTCTGACA TCGAGGGATG GCATTACCTC CGTGTTTCTC CCTTTCGAGC 1860 

CGGCGGGCCA CAGACGTOGG AAGAAACTCC CGTATTTGCA GCTGGAACTG CAGCCCACGG 1920 

CGCCCCGGTT TTCCTCCCCG CCCTGTCCCT CTCTGGTCAA ACAACATACT AAAGAGGCGA 1980 

GGCAATGACT GTTGGCCAGT TCTCACCGGG GAAAAACCCA CTGTTAGGAT GGCATGAACA 2040 

TTTCCTTAGA TCGTGGTCAG CTCCGAGGAA TGTGGCGTCC AGGCTCTTTG AGAGCCATGG 2100 

GCTGCACCCG GCCGTAGGCT AGTGTAACTC GCATCCCATT GCAGTGCCGT TTCTTGACTG 2160 

TGTTGCTGTC TCTTAGATTA ACCGTGCTGA GGCTCCACAT AGCTCCTGGA CCTGT GTCTA 2220 

GTACATACTG AAGCGATGGT CAGAGTGTGT AGAGTGAAGT TGCTGTGCCC ACATTGTTTG 2280 

AACTCGCGTA CCCCGTAGAT ACATTGTGCA ACGTTCTTCT GTTATTCCCT TGAGGTGGTA 2340 

ACTTCGTATG TTCAGTTTAT GCGATGATTG TTGTAAATGC AATGCCGTAG TTTGGATTAA 2400 

TAAGTGGATG GTTTTTGTTT CTAAAAAGAA AAAAAAAATC AGTGTTCACC CTTATAGAGA 2460 

CATAGTCAAG TTCATGTTGA TAATAATCAA AGGAATTACT CTCTTCTTGT TAAATTAGCT 2520 

AAATCATGTA ACCGCAGATA GGAAGGGCTC ACCTGGGGAA ACTCTGGTTT CCGATGGGAC 2580 

AGGAAAGTCA TACGGGCAAC AGTATGCGGA AAGTACGTTT TTTTAAGTAA AAAACAAAGG 2640 

CAAACTTTGT ACTATCCAGT TATCTAAGGA ACAATAAAAA CATTAGGAGA AAAAAAAAAA 2700 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: B36 Protein sequence 
Protein Accession #: NP_0S5094.1 

1 11 21 31 41 51 

I I I i I I 

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQVLPTNPEB SWQVYSSAQD SEGRCICTW 60 

APQQTMCSRD ARTKQLRQLL BKVQNHSQSI EVLDRRTQRD LQYVBKMENQ MKGLESKFKQ 120 

VEESHKQHLA RQPKAIKAKM DELRPLIPVL EEYKADAKLV LQFKEEVQNL TSVLNELQBB 180 

IGAYDYDELQ SRVSNLEERL RACKQKLACG KLTGISDPVT VKTSGSRFGS WMTDPLAPEG 240 

DNRVWYMDGY HNNRFVREYK SMVDFMNTDN FTSHRLPHPW SGTGQWYNG SIYFNKPQSH 300 

IIIRFDLKTE TILBCTRSLDY AGYNNMYHYA WGGHSDIDLM VDBSGLWAVY ATNQMAGNIV 360 
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VSRLDPVSLQ TLQTWNTSYP KRSAGEAFII CGTLYVTNGY SGGTKVHYAY QTNASTYEYI 420 
DIPFQNKYSH ISMLDYNPKD RALYAKNNGH QILYNVTLFH VIRSDEL 

Seq ID NO: B37 DNA sequence 

Nucleic Acid Accession «: NM_006334.1 

Coding sequence: 28 6.. 693 

1 11 21 31 41 51 

I I I I I 1 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACOGQ GACGOGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGOGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCOGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGOGCG 240 

TCCGCGTCCA CGCAGCCGCC GGCOGGCCAO CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGQ AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTOGCCCC ACAGCAGACC 480 

ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540 

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA GTTTAAGGGC TAACTTAAAA GAGTTTTTTC AATGCTGCAG 720 

TGACTGAAGA AGCAGTCCAC TCCCATGTAA CCATGAAAGA GAGCCAGAGA GCTTTTTGCA 780 

CCATGCATTT TTACTATTAT TTTCCAATAC TTAGCACCAT TTCACTAAGG AACCTTGAAT 840 

ACAACCAGGA TCCTCCTTTG CATGCGACTG TAGCTGCATT TCATGAATAG TTTGAACCCT 900 

TGTCAATGCA TTTTTTGAAA AAGAAAGAAA AAAAAAACTT CGTGTATGTG ACTCAAAGCA 960 
TGTAACCTTA AGATGTTGCA TTCTAAACTG ACAATAAAGA CCTTTCCCC 

Seq ID NO: B38 Protein sequence 
Protein Accession 8: NPJ>06325.1 

1 11 21 31 41 51 

I I I I I I 

MPGRWRWQRD MHPARKLLSL LFLILKGTEL TQVLPTNPEE SWQVYSSAQD SEGRCICTW 60 
APQQTMCSRD ARTKQI*RQLL EKVQNMSQSI BVLDRRTQRD LQYVEKMENQ MKGLESKFKQ 120 
VEESHKQHLA RQPKG 

Seq ID NO: B39 DNA sequence 

Nucleic Acid Accession ft NM_058199.1 

Coding sequence: 286.. 795 

1 11 21 31 41 SI 

I I I I I i 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AAATAAAAGA GAAAACAAAG CAGAGAAGAT GGGAGGGCCA 420 

GAGAGCGAGA GGAAGACCAC AGGAGAGAAG ACACTGAACG AGCTTCCCTT GTTTTGCCTG 480 

GAAGCCCACG CTGGCTCCCT GGCTCTGCCC AGGATGTGCA GTCCAAATCC CAATCCAGCA 540 

GTGGGGTTAT GTCGTCCCGC TTACCCTCAG AGCCCTTCTC CTGGTGCTGC CCAGACGATC 600 

AGCCAGTCCC TCCTGGAGAG GTTCTGCATG GCCTCTAGGA GAGAAGTTTT CTTGGCCCCA 660 

GGAAGGCCTG GTGGAGGGTG GTGGTTGTGC ACTGTTGCTG GACAGATGCA TTCATTCATG 720 

TGCACACACA CACACACACA TGCACACACA GGGGAGCAGA TACCTGCAGA GAAGAGCCAA 780 

CCAGGTCCTG ATTAGTGGCA AGCTGCCCCA CAAAGGGCTA TGCCTGTGTC TTATTGAGAC 840 

ACCTTGGCAA AGAGATGGCT GATTCTGGGT GGTCCTGGAC ATGGCCGCAC CCAAGGGCCC 900 

TCCAAGCCTT AATGGCACCC TGAAGCCTCC ATGCCCAGGC CAAAAGATGC TTTTCCTCCC 960 
TAAAAAAAAA AAAAAAAAAA 

Seq ID NO: B40 Protein sequence 
Protein Accession #* NP_478106.1 

1 11 21 31 41 51 

I I I I I I 

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQNKRENKAE KMGGPESERK TTGEKTLNEL 60 
PLFCLEAHAG SLALPRMCSP NPNPAVGLCR PAYPQSPSPG AAQTISQSLL ERFCMASRRE 120 
VPLAPGRPGG GWWLCTVAGQ MHSPMCTHTH THAHTGEQIP AEKSQPGPD 

Seq ID NO: B41 DNA sequence 

Nucleic Acid Accession 9: AY038071.1 

Coding sequence: 1..1686 

1 11 21 31 41 51 

I I I I I I 

ATGAGCAATC AGTACCAGGA GGAGGGCTGC TCCGAGAGGC CCGAGTGCAA AAGTAAATCT 60 
CCAACTTTGC TCTCCTCCTA CTGCATCGAC AGCATCCTGG GCOGGAGGAG CCCGTGCAAA 120 
ATGCGGTTGC TGGGAGCCGC GCAGAGCTTO CCTGCTCCGC TGACCAGCCG CGCCGACCCG 180 
GAAAAGGCCG TGCAAGGCTC CCCTAAGAGC AGCAGCGCCC CGTTCGAGGC CGAGCTGCAC 240 
CTGCCGCCCA AGCTGCGGCG CCTGTACGGC CCGGGCGGGG GCCGCCTCCT TCAGGGTGCG 300 
GCAGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCAGCGG CCGCCACGGC CACGGCGGGT 360 
CCACGCGGGG AGGCCCCTCC GCCGCCACCG CCAACCGCGC GGCCCGGGGA ACGGCCGGAC 420 
GGCGCAGGGO CCGCCGCGGC AGCCGCGGCC GCGGCCGCCG CGGCCTGGGA CACGCTCAAG 480 
ATCAGCCAGG CGCCGCAGGT GAGCATCAGC CGCAGCAAGT CGTACCGCGA GAACGGGGCG 540 
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CCCTTCGTGC CGCCGCCGCC OGCGCTGGAC GAGCTGGGCG GCCCGGGGGG CGTCACGCAC 600 

CCGGAGGAGC GCCTCGGCGT GGCOGGCGGC COGGGCAGCQ CCCCGGCTGC GGGTGOTGGC 660 

ACCGGCACCG AGGACGACOA GGAGGAGCTG CT6GAG6ACG AAGAAGATGA GGACGAGGAA 720 

GAGGAACTGC TGGAGGAOGA CGAGGAGGAG CTGCTGGAGG ACGACGCCCG CGCGCTGCTC 780 

5 AAGGAGCCCC GGCGCTGTCC TGTGGCCGCC ACTGGCGCCG TGGCCGCAGC AGCTGCCGCT 840 

GCAGTGGCCA CAGAGGGCGG GGAGCTGTCA CCCAAGGAGG AGCTGCTGCT GCACCCGGAA 900 

GACGCTGAGG GCAAGGAOGG CGAGGACAGC GTGTGCCTCT CTGCGGGCAG CGACTCGGAG 960 

GAGGGGCTGC TGAAACGCAA ACAGAGGCGC TACCGCACCA CGTTCACCAG CTACCAGCTG 1020 

GAGGAACTGG AGCGGGCCTT CCAGAAGACG CACTACCCGG ACGTCTTCAC CAGGGAGGAA 1080 

10 CTGGCCATGA GGCTGGACTT GACCGAGGCC CGAGTCCAGG TCTGGTTCCA GAACCGTCGG 1140 

GCCAAGTGGC GCAAGCGGGA GAAGGCAGGC GCGCAGACCC ACCCCCCTGG GCTGCCCTTC 1200 

CCGGGGCCGC TCTCCGCCAC CCACCCGCTC AGCCCCTACC TGGACGCCAG CCCCTTCCCT 1260 

CCGCACCACC CGGCGCTCGA CTCCGCTTGG ACTGCCGCTG COGCOGCCGC CGCCGCCGCC 1320 

TTCCCGAGCC TACCTCCGCC TCCGGGCTCG GCCAGCCTGC CGCCCAGCGG GGCGCCGCTG 1380 

15 GGCCTGAGCA CTTTCCTCGG AGCGGCAGTG TTCCGACACC CAGCTTTCAT CAGCCCGGCA 1440 

TTCGGCAGGC TCTTTTCCAC AATGGCCCCC CTGACCAGCG CGTCGACCGC GGCCGCGCTC IS 00 

CTGAGACAGC CCACACCCGC CGTGGAGGGC GCAGTGGCAT CGGGCGCCCT GGCCGACCCG 1560 

GCCACGGCGG CCGCAGACAG ACGCGCCTCT AGCATAGCCG CGCTGAGGCT CAAGGCCAAG 1620 

GAGCACGCGG CGCAGCTCAC GCAGCTCAAC ATCCTGCCGG GCACCAGCAC GGGCAAGGAG 1680 

20 GTGTGC 

Seq ID NO: B42 Protein sequence 
Protein Accession #: AAK93901.1 

25 1 11 21 31 41 51 

I I I I I I 

MSNQYQBEGC SERPECKSKS PTLLSSYCID SILGRRSPCK MRLLGAAQSL PAPLTSRADP 60 

EKAVQGSPKS SSAPPEAELH LPPKLRRLYG PGGGRLLQGA AAAAAAAAAA AAAAATATAG 120 

PRGEAPPPPP PTARPGERPD GAGAAAAAAA AAAAAWDTLK ISQAPQVSIS RSKSYRENGA 180 

30 PPVPPPPALD ELGGPGGVTH PEERLGVAGG PGSAPAAGGG TGTEDDEEEL LEDEEDEDEE 240 

EELLEDDEEE LLEDDARALL KEPRRCPVAA TGAVAAAAAA AVATEGGELS PKEELLLHPE 300 

DAEGKDGEDS VCLSAGSDSE EGLLKRKQRR YRTTFTSYQL EELERAFQKT HYPDVPTREE 360 

LAMRLDLTEA RVQVWFQNRR AKWRKREKAG AQTHPPGLPF PGPfcSATHPL SPYLDASPFP 420 

PHHPALDSAW TAAAAAAAAA FPSLPPPPGS ASLPPSGAPL GLSTFLGAAV FRHPAFISPA 480 

35 FGRLFSTMAP LTSASTAAAL LRQPTPAVEG AVASGALADP ATAAADRRAS SIAALRLKAK 540 
EHAAQLTQLK ILPGTSTGKE VC 



Seq ID NO i B43 DNA sequence 
. _ Nucleic Acid Accession #: CAT cluster 

40 

1 11 21 31 41 51 

I I i I I I 

CGCCTGCCCC CCCGTTGCTT CTCTCTCTTC CCTTCTGCCT GCGTGCTCGC CCGTCTTTGG 60 

TTCCGCCGGT TCCGTGCGCC CTCTCTGGGC TTCGGCTTCA CCCTCCGTCT GGCTGTCGCT 120 

45 CTGCTCTCCT CCGCTCTCAT GCCCCTTTCA GCTCCTACTC TGGTGTCGTC CCCCTCCCTC 180 

GCTCGTTTCT CTTGTTCCTC TCCTTCCCGG TTCGTGTTCT GCTTCTTCGT GTTCCCTCTC 240 

GTCTGTGTTT GTGGGTCGTC CTCTGCGTCC TGCCTCTTTC TCTTGCGTTT CCGCGCGCTT 300 

CCTGTTTTGG TCCCCCTGGT TCTCGCGCGG TTCTGTGTGC GCGGGCCGCC CCGTTCTTTC 360 

TTGGTGCTTT GCTCCTGCTC TCTGTCTGGT GTTCTCCGCC GTCTGTGTCT TTCTCCTTTT 420 

50 ATCTTCTGTT TTCTCTTTTT CTTGTTTCTT CTTGTTTTTT TTTGTCTTTG TTTTCTCTTA 480 

GCTGAGGTTG GGGAGAGATA ACGCTGTAAA CTTTTATTTT TCAGGAAATC TGGAAACCTA 540 

CAGTCTCCAT GCCTGCTCAG CCAAGAAGGA GCTCACTGTG GGCACCAGAG ACAGGGACCC 600 

AATGTGGAGA CCTGTGAGCC TGTGTCCGGC CCTGAACTCT CAAGCACAGG GCAGGCTTCC 660 

TGAGCATTGA AGAGAATATG TGGGAGAACA AAACAQAAAC TGAAAGAATA TGCAAGGTGT 720 

55 CTTTCTTGGA TGTTATTCCA TGATAGATAG TAGGGGCAGG AGTGAGAGAG GCTGACTAGG 780 

TCTGGACATG GAGGCTGGAA GAGTCAGGGT GTGATTCGGA GAGGCGCATG AGAAGGAAGG 840 

TGGATTTTAA GGCTGGAAAT CTGAGGGTCA GTGGTCCAAG TCACTCAGAG ACAGAATCAC 900 
AGCATAGCCC TTGCTGATGG CAA 

60 Seq ID NO: B44 dna sequence 

Nucleic Acid Accession 8: NMJJ14421.1 
Coding sequence: 718.. 1497 



1 11 21 31 41 51 

65 | | | | | I 

CCACGCGTCC GGCGGGAGCC CGCGGCGAGC GTAGCGCAAG TCCGCTCCCT AGGCATCGCT 60 

GCGCTGGCAG CGATTCGCTG TCTCTTGTGA GTCAGGGGAC AACGCTTCGG GGCAACTGTG 120 

AGTGCGCGTG TGGGGGACCT CGATTCTCTT CAGATCTCGA GGATTCGGTC CGGGGACGTC 180 

TCCTGATCCC CTACTAAAGC GCCTGCTAAC TTTGAAAAGG AGCACTGTGT CCTGCAAAGT 240 

70 TTGACACATA AAGGATAGGA AAAGAGAGGA GAGAAAAGCA ACTGAGTTGA AGGAGAAGGA 300 

GCTGATGCGG GCCTCCTGAT CAATTAAGAG GAGAGTTAAA COGCCGAGAT CCCGGCGGGA 360 

CCAAGGAGGT GCGGGGCAAG AAGGAACGGA AGCGGTGCGA TCCACAGGGC TGGGTTTTCT 420 

TGCACCTTGG GTCAOGCCTC CTTGGCGAGA AAGCGCCTCG CATTTGATTG CTTCCAGTTA 480 

_ _ TTGCAGAACT TCCTGTCCTG GTGGAGAAGC GGGTCTCGCT TGGGTTCCGC TAATTTCTGT 540 

75 CCTGAGGCGT GAGACTGAGT TCATAGGGTC CTGGGTCCCC GAACCAGGAA GGGTTGAGGG 600 

AACACAATCT GCAAGCCCCC GOGACCCAAG TGAGGGGCCC CGTGTTGGGG TCCTCCCTCC 660 

CTTTGCATTC CCACCCCTCC GGGCTTTGCG TCTTCCTGGG GACCCCCTCG CCGGGAGATG 720 

GCCGCGTTGA TGCGGAGCAA GGATTCGTCC TGCTGCCTGC TCCTACTGGC CGCGGTGCTG 780 

_ _ ATGGTGGAGA GCTCACAGAT CGGCAGTTCG CGGGCCAAAC TCAACTCCAT CAAGTCCTCT 840 

80 CTGGGCGGGG AGACGCCTGG TCAGGCCGCC AATOGATCTG CGGGCATGTA CCAAGGACTG 900 

GCATTCGGCG GCAGTAAGAA GGGCAAAAAC CTGGGGCAGG CCTACCCTTG TAGCAGTGAT 960 

AAGGAGTGTG AAGTTGGGAG GTATTGCCAC AGTCCCCACC AAGGATCATC GGCCTGCATG 1020 

GTGTGTCGGA GAAAAAAGAA GCGCTGCCAC CGAGATGGCA TGTGCTGCCC CAGTACCCGC 1080 

TGCAATAATG GCATCTGTAT CCCAGTTACT GAAAGCATCT TAACCCCTCA CATCCOGGCT 1140 
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CTGGATGGTA CTCGGCACAG AGATCGAAAC CACGGTCATT ACTCAAACCA TGACTTGGGA 1200 
TGGCAQAATC TAGGAAGACC ACACACTAAG ATGTCACATA TAAAAGGGCA TGAAGGAGAC 1260 
CCCTGCCTAC GATCATCAGA CTGCATTGAA GGGTTTTGCT GTGCTCGTCA TTTCTGGACC 1320 
AAAATCTGCA AACCAGTGCT CCATCAGGGG GAAGTCTGTA CCAAACAACG CAAGAAGGGT 1380 
TCTCATGGGC TGGAAATTTT CCAGOGTTGC GACTGTGCGA AGGGCCTGTC TT GCAA AGTA 1440 
TGGAAAGATG CCACCTACTC CTCCAAAGCC AGACTCCATG TGTGTCAGAA AATTTGATCA IS 00 
CCATTGAGGA ACATCATCAA TTGCAGACTG TGAAGTTGTG TATTTAATGC ATTATAGCAT 1560 
GGTGGAAAAT AAGGTTCAGA TGCAGAAGAA TGGCTAAAAT AAGAAACGTG ATAAGAATAT 1620 
AGATGATCAC AAAAAGGGAG AAAGAAAACA TGAACTGAAT AGATTAGAAT GGGTG ACAAA 1680 
TGCAGTGCAG CCAGTGTTTC CATTATGCAA CTTGTCTATG TAAATAATGT ACACATTTGT 1740 
GGAAAATGCT ATTATTAAGA GAACAAGCAC ACAGTGGAAA TTACTGATGA GTAGCATGTG 1800 
ACTTTCCAAG AGTTTAGGTT GTGCTGGAGG AGAGGTTTCC TTCAGATTGC TGATTGCTTA 1860 
TACAAATAAC CTACATGCCA GATTTCTATT CAAOGTTAGA GTTTAACAAA ATACTCCTAG 1920 
AATAACTTGT TATACAATAG GTTCTAAAAA TAAAATTGCT AAACAAGAAA TGAAAACATG 1980 
GAGCATTGTT AATTTACAAC AGAAAATTAC CTTTTGATTT GTAACACTAC TTCTGCTGTT 2040 
CAATCAAGAG TCTTGGTAGA TAAGAAAAAA ATCAGTCAAT ATTTCCAAAT AATTGCAAAA 2100 
TAATGGCCAG TTGTTTAGGA AGGCCTTTAG GAAGACAAAT AAATAACAAA CAAACAGCCA 2160 
CAAATACTTT TTTTTCAAAA TTTTAGTTTT ACCTGTAATT AATAAGAACT GATACAAGAC 2220 
AAAAACAGTT CCTTCAGATT CTACGGAATG ACAGTATATC TCTCTTTATC CTATGTGATT 2280 
CCTGCTCTGA ATGCATTATA TTTTCCAAAG TATACCCATA AATTGTGACT AGTAAAATAC 2340 
TTACACAGAG CAGAATTTTC ACAGATGGCA AAAAAATTTA AAGATGTCCA ATATATGTGG 2400 
GAAAAGAGCT AACAGAGAGA TCATTATTTC TTAAAGATTG GCCATAACCT GTATTTTGAT 2460 
AGAATTAGAT TGGTAAATAC ATGTATTCAT ACATACTCTG TGGTAATAGA GACTTGAGCT 2520 
GGATCTGTAC TGCACTGGAG TAAGCAAGAA AATTGGGAAA ACTTTTTCGT TTGTTCAGGT 2 580 
TTTGGCAACA CATAGATCAT ATGTCTGAGG CACAAGTTGG CTGTTCATCT TTGAAACCAG 2640 
GGGATGCACA GTCTAAATGA ATATCTGCAT GGGATTTGCT ATCATAATNT TTCCTATGQI 2700 
GNTGAATTCN GTGTGAGGTC CTGTGTCCGT CCTATCCTCA AATTATTTAT TTTATAGTGC 2760 
TGAGATCCTC AAATAATCTC AATTTCGGAG GTTTCACAAA ATGGACTCCT GAAGTAGACA 2820 
GAGTAGTGAG GTTTCATTGC CCTCTATAAG CTTCTGACTA GCCAATGGCA TCATCCAATT 2880 
TTCTTCCCAA ACCTCTGCAG CATCTGCTTT ATTGCCAAAG GGCTAGTTTC GGTTTTCTGC 2940 
CAGCCATTGC GGTTAAAAAA TATAAGTAGG ATAACTTGTA AAACCTGCAT ATTGCTAATC 3000 
TATAGACACC ACAGTTTCTA AATTCTTTGA AACCACTTTA CTACTTTTTT TAAACTTAAC 3060 
TCAGTTCTAA ATACTTTGTC TGGAGCACAA AACAATAAAA GGTTATCTTA TAGTTGTGAC 3120 
TTTAAACTTT TGTAGACCAC AATTCACTTT TTAGTTTTCT TTTACTTAAA TCCCATCTGC 3180 
AGTCTCAAAT TTAAGTTCTC CCAGTAGAGA TTGAGTTTGA GCCTGTATAT CTATTAAAAA 3240 
TTTCAACTTC CCACATATAT TTACTAAGAT GATTAAGACT TACATTTTCT GCACAGGTCT 3300 
GCAAAAACAA AAATTATAAA CTAGTCCATC CAAGAACCAA AGTTTGTATA AACAGGTTGC 3360 
TATAAGCTTG GTGAAATGAA AATGGAACAT TTCAATCAAA CATTTCCTAT ATAACAATTA 3420 
TTATATTTAC AATTTGGTTT CTGCAATATT TTTCTTATGT CCACCCTTTT AAAAATTATT 3480 
ATTTGAAGTA ATTTATTTAC AGGAAATGTT AATGAGATGT ATTTTCTTAT AGAGATATTT 3540 
CTTACAGAAA GCTTTGTAGC AGAATATATT TGCAGCTATT GACTTTGTAA TTTAGGAAAA 3600 
ATGTATAATA AGATAAAATC TATTAAATTT TTCTCCTCTA AAAACTGAAA AAAAAAAAAA 3660 
AAAAAAAAAA A 

Seq ID NO: B45 Protein sequence 
Protein Accession #: NP_05S236.1 

1 11 21 31 41 51 

1 I I I I I 

MAALMRSKDS SCCLLLLAAV LMVESSQIGS SRAKLNSIKS SLGGBTPGQA ANRSAGMYQG 60 
LAFGGSKKGK NLGQAYPCSS DKECEVGRYC KSPHQGSSAC MVCRRKKKRC HRDGMCCP6T 120 
RCNNGICIPV TESILTPHIP ALDGTRHRDR NHGHYSNHDL GWQNLGRPHT KMSHIKGHEG 180 
DPCLRSSDCI EGFCCARHFW TKICKPVLHQ GBVCTKQRKK GSHGLEIPQR CDCAKGLSCK 240 
VWKDATYSSK ARLHVCQKI 

Seq ID NO: 46 DNA sequence 

Nucleic Acid Accession #: NMJ>19885.1 

Coding sequence: 1..1539 

1 11 21 31 41 51 

I I I I I 1 

ATGCTCTTTG AGGGCTTGGA TCTGGTGTCG GCGCTGGCCA CCCTCGCCGC GTGCCTGGTG 60 

TCCGTGACGC TGCTGCTGGC .CGTGTCGCAG CAGCTGTGGC AGCTGCGCTG GGCCGCCACT 120 

CGCGACAAGA GCTGCAAGCT GCCCATCCCC AAGGGATCCA TGGGCTTCCC GCTCATCGGA 180 

GAGACCGGCC ACTGGCTGCT GCAGGGTTCT GGCTTCCAGT CGTCGCGGAG GGAGAAGTAT 240 

GGCAACGTGT TCAAGACGCA TTTGTTGGGG CGGCCGCTGA TACGCGTGAC OGGCGCGGAG 300 

AACGTGCGCA AGATCCTCAT GGGCGAGCAC CACCTCGTGA GCACCGAGTG GCCTCGCAGC 360 

ACCCGCATGT TGCTGGGCCC CAACACGGTG TCCAATTCCA TTGGCGACAT CCACCGCAAC 420 

AAGCGCAAGG TCTTCTCCAA GATCTTCAGC CAOGAGGCOC TGGAGAGTTA CCTGCCCAAG 480 

ATCCAGCTGG TGATCCAGGA CACACTGCGC GCCTGGAGCA GCCACCCCGA GGCGATCAAC 540 

GTGTACCAGG AGGCGCAGAA GCTGACCTTC CGCATGGCCA TCCGGGTGCT GCTGGGCTTC 600 

AGCATCCCTG AGGAGGACCT TGGGCACCTC TTTGAGGTCT ACCAGCAGTT TGXGGACAAT 660 

GTCTTCTCCC TGCCTGTCGA CCTGCCCTTC AGTGGCTACC GGCGGGGCAT TCAGGCTCGG 720 

CAGATCCTGC AGAAGGGGCT GGAGAAGGCC ATCCGGGAGA AGCTGCAGTG CACACAGGGC 780 

AAGGACTACT TGGACGCCCT GGACCTCCTC ATTGAGAGCA GCAAGGAGGA OGGGAAGGAG 840 

ATGACCATGC AGGAGCTGAA GGAOQGGACC CTGGAGCTGA TCTTTGCGGC CTATGCCACC 900 

AOGGCCAGCG CCAGCACCTC ACTCATCATG CAGCTGCTGA AGCACCCCAC TGTGCTGGAG 960 

AAGCTGCGGG ATGAGCTGCG GGCTCATGGC ATCCTGCACA GTGGCGGCTG CCCCTGCGAG 1020 

GGCACACTGC GCCTGGACAC GCTCAGTGGG CTGCGCTACC TGGACTGOGT CATCAAGGAG 1080 

GTCATGOGCC TGTTCAOGCC CATTTCCGGC GGCTACCGCA CTGTGCTGCA GACCTTCGAG 1140 

CTTGATGGTT TCCAGATCCC CAAAGGCTGG AGTGTCATGT ATAGCATCCG GGACACCCAT 1200 

GACACAGCGC CCGTGTTCAA AGACGTGAAC GTGTTCGACC CCGATCGCTT CAGCCAGGCG 1260 

OGGAGCGAGG ACAAGGATGG CCGCTTCCAT TACCTCCCGT TCGGTGGCGG TGTCCGGACC 1320 

TGCCTGGGCA AGCACCTGGC CAAGCTGTTC CTGAAGGTGC TGGCGGTGGA GCTGGCTAGC 1380 
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ACCAGCCGCT TTGAGCTGGC TACACGGACC TTCCCCCGCA TCACCTTGGT CCCCQTCCTG 1440 
CACCCCGTGG ATGGCCTCAO CGTCAAGTTC TTTGGCCTGG ACTCCAACCA GAACQAGATC 1500 
CTGCCGGAQA OGGAGGCCAT GCTGAGCGCC ACAGTCTAAC CCAAGACCCA CCCGCCTCAG 1S60 
CCCAGCCCAG GCAGCGGGGT GGTGGTTGTG GGAGGTAGAA ACCTGTGTGT GGGAGGGGGC 1620 
CGGAACGGGG AGGGCGAGTG GCCCCCATAC TTGCCCTCCC TTGCTCCCCC TCCCTGGCAA 1680 
ACCCTACCCA AAGCCAGTGG GCCCCATTCC TAGGGCTGGG CTCCCCTTCT GGCTCCAGCT 1740 
TCCCTCCAGC CACTCCCCAT TTACCATCAG CTCAGCCCCT GGGAAGGGCG TGGCAGGGGC 1800 
TCTGCATGCC CGTGACAGTG TTAGGTGTCA GCGCGTGCTA CAGTGTTTTT GTGATGTTCT 1860 
GAACTGCTCC CTTCCCTCCG TTCCTTTCGG ACCCTTTTAG CTGGGGTTGG GGGACGGGAA 1920 
GAGCCGTGCC CCCTTGGGCG CACTCTTCAG CGTCTCCTCC TCCTGCGCCC CCACTGCGTC 1980 
TGCCCAGGAA CAGCATCCTG GGTAGCAGAA CAGGAGTCAA CCTTGGCGGG GCGGGGGCTG 2040 
CGTCCAACCT GGAGATTGCC CTTCCCTATG CCACGGTTCC CACCCTCCCT CACCAGTTTG 2100 
GACAATTTGA AATTACCTAT TGCTGCTACT TGTTCTGTCC TCTGACCTTG GGGCAAAGGA 2160 
GCCCCAGGCC CTGTCTCCCC AGCATCCTCC CTGGTGGCCC TGGGCAGGTG CACTGACACC 2220 
CCCACCTTCC CATCCCCTGC TGAACCAGGC CCTGTTACAC ACAGCCGCCT AAGGCCCGCG 2280 
GCTCATGTGC TGCCCGCCCC CATATTTATT CACTGATAGA GAATCTTGGG GATGCTGGGG 2340 
TCTGGAGTGA ACATCTCCTC CCCTTCATGC CCTAGCCTGT GTTCTAGCTG TCCTGGOGAG 2400 
ACTTCTGTGA GTGAAGAGGA AGGGGTCTCT GGTCAAACCC AGCCCCCAGG GCCTAGGGTT 2460 
GAAAGCCTTC CCCGGCTCOG GGCATTATTT GGGTTTAATC TCGGAGCCTC ACTCCTGGAC 2520 
TGAAGTCCGG TGCCTCTGCC TTATCCCTGG TGGAGATGGA ATGTGGCCCA TTGCCTCCTC 2580 
CCTCTCCTGT CAAAAACCCT GATCAGGTAG ATTTGGAGGC GGCCACGATT TCCTGTTTGG 2640 
CCCCTGTTCA CCCCAGTGCA CTGGCCCTGA CTCCAGGCGT GAGTATGGGG AAGGATACGG 2700 
GTTCTTCTGA CGGGGAGCAA GGGCCTCCGT CTTCCCTTCC TTAACTCTCC CCCTTTGCCC 2760 
TCCGCCCTGA AAAAGGTGTC CTTGAAGTCC CTTCCACCTC TATGCCACTG TCTGCTTAGC 2820 
CCAGCTCAGG GGTGGGGAAG AGGCGAAAGC GTGGGGGAGG TGAGCGCAGC GGCAGTTCTG 2880 
CCTCGGAGCT GATTGCAGGG CCCTGTGTGG TCTCCGGACA GCTGOGGGAA GGCTGCCGCA 2940 
GCTGAAGCTG AAGAGGCGGC TACGTGCGGT TTGTCAGGGG GATTGGGTTG AAAACTGGCC 3000 
AGTCGGGATG ACTGGGTGAA AGAGGAGTAG CTCCTGCCAC TGGOGTTTTG AGTGTTGGCA 3060 
ATTTGGGATG CCTCCTGGGG AAGGTTTCCG GGCGTTTGGT GAGTCTCTAG ATTTTTCCTT 3120 
GCTTTCTGTG TTTATTGGTT TTTGATGTTG TAAAAGCAAT GAATCCCCTT TACAAGAAAA 3180 
TCGAAAACAC AGAAGAATGA AGGACATGCC AGTCCCCGAT CGCTGCTGTG AGCACCTCAG 3240 
TGGCTCCCTC AGACCAGATC CCGTAGGCAG CCCCACAGAC CGACCCTGAC CCCACTCACA 3300 
GCCACCCTGA AGATAGACTA TAGGAACGGG CCCATACCAC ACAGACTGCT CTCCAATCCC 3360 
TGAGTCTCAG ATGTTTCATT TATTTCCTAC TTTTCCACTA CTAAAAAACA GTGTGGAATA 3420 
GACATTATTG GCAAAATTGC TCATCCCTAA TCCTGAAAAA CAGGCCAGAA TGGGTAAAGA 3480 
CTTGTCAAAG CTTGCAACAT AGCTACATGG TGCACCCGGA CCTGTACCCC CTCCCCOCAA 3540 
CACAAAACCA GTGTCTGGGA GGTTCATTTT CCTTTAAACT GATCCAGCTG GCCCTGAACC 3600 
AATTGTTTTT GACTGAGTAT CTAGGAGAGC AGTAAGTGGA ACTTCAGACA AGCCCACTGG 3660 
GTCTGGTCCA GGTGAGGGGC AGGGGGCATG GGGCTGGGAG GTCTCAGGGG CCTTCCCTGG 3720 
GGGTGGCCAG CCTGGTAGGG GGCAGAGAAG GAAAAGCTGA GGGGGGTCCC TGTGAGGGAG 3780 
GAAAGAAGGA TCATTTGCCC CGCTGGGTCT CAAAGGCAGT GAGAAGAGAG CTGAAGAAAG 3840 
CTCTGGCTGG CTGACAGQAT CCCTGTGTTG TAATTGGTCC CTCCTTTCAG CTCTCTAGTG 3900 
AGATGCCCGT GTCTGTGCGT GTGCGTGTGT GTTTCATACA GCTAGCATTA GATGGGTGAT 3960 
GTTTCTTACT TATCATCCCT AACTATTGCA ACTTGACCTT AAAAAGACAA AACCCCACAA 4020 
AACTCTTCCT GCCACGGGCT TGCAGATTGA AGCACTTTCG ATGTTGGGCG CTGGCGTTTG 4080 
TGTTCTGGGC ACCACCGTGA CCCTGCCCAG ATGGCTATAA TATTATTTTA TACACAAACC 4140 
TTTTTTTTCA TAAATGTTAT AATTTTGTGT CTGTCTTTAT AAACTATTAT AAGTACTATT 4200 
TTTGTTATAA TTCAAAATAG ATATTTAGTA TAAAGTTTTT GCTGTTAAAT ATTTGTTATT 4260 
TAGTAAAATA TGAATTTTGC TCTATTGTAA ACATGGTTCA AAATATTAAT ATGTTTTTAT 4320 
CACAGTCGTT TTAATATTGA AAAAGCACTT GTGTGTTTTG TTTTGATATG AAACTGGTAC 4380 
GGTGTGAGTG TTTTTGCTGT CGTGGTTTTA ATCTGTATAT AATATTCCAT GTTGCATATT 4440 
AAAAA 

Seq ID NO: B47 Protein sequence 
Protein Accession #: NP_063938.1 



1 
I 

MLFEGLDLVS 
ETGHWLLQGS 
TRMLLGPNTV 
VYQEAQKLTF 
QILQKGLEKA 
TASASTSLIM 
VMRLFTPISG 



HPVDGLSVKP 



11 
I 

ALATLAAdV 
GFQSSRREKY 
SKSIGDIHRN 
RMAIRVLLGF 
IREKLQCTQG 
QLLKHPTVLE 
GYRTVLQTFE 
YLPFGGGVRT 
PGLDSNQNEI 



21 
I 

SVTLLLAVSQ 
GNVFKTHLLG 
KRKVPSKIPS 
SIPEEDLGHL 
KDYLDALDLL 
KLRDELRAHG 
LDGFQIPKGW 
CLGKHLAKLP 
LPETEAMLSA 



31 
I 

QXjKQLRWAAT 
RPLIRVTGAE 
HEALBSYLPK 
PBVYQQFVDN 
IBSSKEHGKE 
ILHSGGCPCB 
SVMYSIRDTH 
LKVLAVELAS 
TV 



41 
I 

RDKSCKLPIP 
NVRKlUtGEH 
IQLVIQDTLR 
VFSLPVDLPF 
MTKQELKDGT 
GTLRLDTLSG 
DTAPVPKDVN 
TSRFELATRT 



51 
I 

KGSMGFPLIG 
HLVSTEWPRS 
AWSSHPKAIN 
SGYRRGIQAR 
LELIFAAYAT 
LRYLDCVIKE 
VFDFDRFSQA 
FPRITLVPVL 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ZD NO: B48 DNA sequence 

Nucleic Acid Accession #» AB040527.1; AL136582.1 
Coding sequence! 94.. 2319 



1 
I 

GCGGCTGCGG 
TGAGGGAAGC 
CGACGAAGTC 
TGGTGGCTAT 
GGGTCCAGGC 
CCAGACCCTG 
CCGTGCTGCC 
CGCTGCCCGT 
CTCAGGAGAG 
GCCACCCCTT 



11 
I 

GGTTCCTGGT 
TTCAGCGTGC 
GGGGAGGAAG 
GGCACCCTCA 
CTTCGCATCT 
GTGGAGGCAT 
AACGTAGCCC 
ACCGCCTTCA 
GATACCCAGC 
GCTTCTCCGC 



21 
I 

GCTGAGGACG 
AATCGGAAAG 
AGATGGTTGA 
CCAGCTTTGA 
TATCGAATGA 
TGCAGCTGGA 
GCGCCGCCGC 
GTCAGGTGGT 
CCACGACCTA 
AGACCTCCCA 



31 

! 

GACGCCATTG 
CTACAGTGTT 
AGGCAACGAC 
CATCCATATC 
GCCCTGGGAA 
TCCGGAAACA 
CTCCAACCGT 
CGCTAGCCAC 
CGCCGCCGAG 
GATGTTAGTC 



41 

i 

GAGTTCCCGA 
GAAGACATGG 
TATGAAGAAT 
CTCAGAGCCT 
CTGGAAAACC 
CTTGCCAATG 
GCGGCTOGGG 
CGGGTGGCCA 
GCTCAGGGGC 
ACCAGTAAGA 



51 
I 

GAAGCATGGC 
ATGAGGGTAG 
TCGGTGCGTT 
TCGGAAGCTT 
CTGTGCTGGC 
AGACGGCCGC 
CCGCTGCCGC 
CGCOGCAGGT 
CCACCCCTGA 
TGGCTGCCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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CGAGGCTCCG GCAACCTCCG CACAGTCCCA GACAGGCTCC CCGGCCCAGG AGGCTGCTAC 660 

TGAGGGCCCT AGTAGCGCCT GTGCTTTCTC TCAGGCTOCG TGTGCCAGGG AGGTGGACGC 720 

CAACOGGCCC AGCACAOCCT TCCTGGGCCA GAATGATGTC TTCGATTTCA CTCAGCCGGC 780 

AGGTGTCAGT GGCATGGCCT TCCCGOGCCC CAA6AGACCT GCCCCAGCCC AAGAGGCTGC 840 

CACAGAGGGC CCCAGTGCTG CCTCTGGTGT GCCCCAGACG GGACCTGGCA GGGAGGTGGC 900 

AGCCACCOGG CCCAAGACCA CCAAGTCGGG GAAGGCGCTG GCCAAGACTC GGTGGGTGGA 960 

GCCTCAGAAT GTTGTGGCAG CAGCTGCTGC CAAGGCCAAG ATGGCCACGA GCATCCCTGA 1020 

GCCGGAGGGT GCAGCTGCTG CCACTGCTCA GCACAGTGCT GAGCCCTGGG CCAGGATGGG 1080 

AGGCAAGAGG ACCAAGAAGT CCAAGCACCT GGATGATGAG TATGAGAGCA GCGAGGAGGA 1140 

GAGAGAGACT CCCGCGGTCC CACCCACCTG GAGAGCATCA CAGCCCTCAT TGACGGTGCG 1200 

GGCTCAGTTG GCCCCTCGGC CCCCGATGGC CCCGAGGTCC CAGATACCCT CAAGGCACGT 1260 

ACTGTGCCTG CCCCCCCGCA ACGTGACCCT TCTGCAGGAG AGGGCAAATA AGTTGGTGAA 1320 

ATACCTGATG ATTAAGGACT ACAAGAAGAT CCCCATCAAG CGCGCAGACA TGCTGAAGGA 1380 

TGTCATCAGA GAATATGATG AACATTTCCC TOAGATCATT GAACGAGCAA CGTACACCCT 1440 

GGAAAAGAAG TTTGGGATCC ACCTGAAGGA GATCGACAAG GAAGAACACC TGTATATTCT 1S00 

TGTCTGCACA CGGGACTCCT CAGCTCGCCT CCTTGGAAAA ACCAAGGACA CTCCCAGGCT 1560 

GAGTCTCCTC TTGGTGATTC TGGGCGTCAT CTTCATGAAT GGCAACCGTG CCAGCGAGGC 1620 

TGTCCTCTGG GAGGCACTAC GCAAGATGGG ACTGOGCCCT GGGGTGAGGC ACCCATTCCT 1680 

CGGCGATCTG AGGAAGCTCA TCACAGATGA CTTTGTGAAG CAGAAGTACC TGGAATACAA 1740 

GAAGATCCCC AACAGCAACC CACCTGAGTA TGAATTCCTC TGGGGCCTGC GAGCCCGCCA 1800 

TGAGACCAGC AAGATGAGGG TCCTGAGATT CATCGCCCAG AATCAGAACC GAGACCCCCG 1860 

GGAATGGAAG GCTCATTTCT TGGAGGCTGT GGATGATGCT TTCAAGACAA TGGATGTGGA 1920 

TATGGCCGAG GAACATGCCA GGGCCCAGAT GAGGGCCCAG ATGAATATCG GGGATGAAGC 1980 

GCTGATTGGA CGGTGGAGCT GGGATGACAT ACAAGTCGAG CTCCTGACCT GGGATGAGGA 2040 

CGGAGATTTT GG0GATGCCT GGGCCAGGAT CCCCTTTGCT TTCTGGGCCA GATACCATCA 2100 

GTACATTCTG AATAGCAACC GTGCCAACAG GAGGGCCACG TGGAGAGCTG GCGTCAGCAG 2160 

TGGCACCAAT GGAGGGGCCA GCACCAGCGT CCTAGATGGC CCCAGCACCA GCTCCACCAT 2220 

CCGGACCAGA AATGCTGCCA GAGCTGGCGC CAGCTTCTTC TCCTGGATCC AGCACCGTTG 2280 

ACGAACTGCA GCGATCTTAC TGGCCAAGCC AGAGCGCCTC CTCTCAGATT CCTTCTCGAC 2340 

ACAGCACCCT AGGCGGCTTC TTCCTGTCAG TCGGAGGTGG CATGCAAGAT GAAGCTCTCT 2400 

TTGCTCTTCC TGCTTTCATT TTGTGCTTTT CCTTGTGTTT TCATGTTTTG GGTATCAGTG 2460 
TTACATTAAA GTTGCAAAAT TAAA 

Seq ID NO: B49 Protein sequence 

Protein Accession #: BAB33 378.1; CAB66S17.1 

1 11 21 31 41 51 

1 I I I I I 

MAEGSPSVQS ESYSVEDMDE GSDEVGSEEM VEGNDYEEFG AFGGYGTLTS FDIHILRAFG 60 

SLGPGLRILS NEPWELENPV LAQTLVEALQ LDPETLANET AARAANVARA AASNRAARAA 120 

AAAARTAFSQ WASHRVATP QVSGEDTQPT TYAAEAQGPT PEPPLASPQT SQMLVTSKMA 180 

APEAPATSAQ SQTGSPAQEA ATEGPSSACA PSQAPCAREV DANRPSTAFL GQNDVFDPTQ 240 

PAGVSGMAPP RPKRPAPAQE AATEGPSAAS GVPQTGPGRE VAATRPKTTK SGKALAKTRW 300 

VEPQNWAAA AAKAKMATSI PEPEGAAAAT AQHSAEPWAR MGGKRTKKSK HLDDEYESSE 360 

EERETPAVPP TWRASQPSLT VRAQLAPRPP MAPRSQIPSR HVLCLPPRNV TLLQERANKL 420 

VKYLMIKDYK KIPIKRADML KBVIREYDEH FPEIIERATY TLEKKFGIHL KEIDKEEHLY 480 

ILVCTRDSSA RLLGKTKDTP RLSLLLVILG VIFMNGNRAS EAVLWEALRK NGLRPGVRHP 540 

FLGDLRKLIT DDPVKQKYLE YKKIPNSNPP EYEFLWGLRA RHETSKMRVL RFIAQNQNRD 600 

PREWKAHFLE AVDDAPKTMD VDMAEEHARA QMRAQMNIGD EALIGRWSWD DIQVELLTWD 660 

EDGDFGDAWA RIPPAFWARY HQYILNSNRA NRRATWRAGV SSGTNGGAST SVLDGPSTSS 720 
TIRTRNAARA GASPFSWIQH R 

Seq ID NO: BS0 DHA sequence 

Nucleic Acid Accession #: XMJ) 84 965.1 

Coding sequence: 3S6..2014 

1 11 21 31 41 51 

I I I I I I 

GACCTAGCGT GTGCTCAGCT CTGGACAAGA CATGGATGTA GGCAGACATG CTTCCTGCCT 60 

GCTGGGCTTA GTGCACCAAT GCTAAGGACA GACATGAAGG GCTTGGGAAA GGCATGAAGA 120 

TGCAGGGAGT CTAGGAGAGA GGATAAGAAA AACAAGCATT TTGTGGTGGG TTGAATTGCG 180 

TTGTCGTTTT TAGAGGCATT TCGTGATCCA TGGTTGCGCT CCCGTGAAGA GOGAGTCTAG 240 

AGAAGGAATC CGAAGCGGCG GCGGCGAGCG CGGCTCAGGT TGATTTAGAA TACGGGTGAC 300 

AGTGGCCTGG CGCGAGCCCA CTGCTGAOGA AAGCGGCTTA TCCCGCGCGG TTTCCATGGA 360 

GACGAGCCGG AGCCGCGGCG GOGGCGGGGC TGTCAGCGAG CGCGGCGGAG CTGGCGCGTC 420 

CGTGGGGGTC TGCAGGAGGA AGGCGGAGGC CGGGGCCGGG ACOGGGACCC TCGCGGCAGA 480 

CATGGACTTG CATTGTGACT GTGCCGCCGA AAOGCCGGCC GCCGAGCCGC CGTOGGGGAA 540 

GATTAATAAA GCTGCCTTCA AATTATTCAA GAAGAGGAAA TCGGGTGGCA CCATGCCCAG 600 

CATTTTTGGG GTCAAAAACA AAGGGGAOGG GAAAAGCTCG GGTCCGACGG GGCTGGTGAG 660 

GAGCAGGACC CACGACGGAC TTGCCGAGGT GCTGGTGCTG GAGAGCGGCA GGAAGGAGGA 720 

GCCGCGCGGC GGGGGCGACA GCGGCGGGGG OGGCGGGGGG CGGCCGAACC CGGGGCCCCC 780 

CAGAGCCGCA GGGCCCGGCG GGGGCTCCCT CGCCAGCAGC TOGGTGGCCA AGTCGCACAG 840 

CTTCTTCTCG CTGCTGAAGA AGAACGGGCG CTCGGAAAAC GGCAAGGGAG AGCCTGTGGA 900 

CGCGAGCAAG GCOGGCGGCA AACAAAAGCG GGGGCTGCGG GGGCTGTTCA GCGGCATGOG 960 

CTGGCACAGG AAAGACAAGC GGGCCAAGGC GGAGGCCGCG GAGGGGOGOG CGGCGGGGGG 1020 

CGGCTTGATC CTACCCGGCT CGCTCACCGC CAGCCTGGAG TGCGTCAAGG AGGAGACGCC 1080 

CAGAGCCGCG CGCGAGCCGG AGGAGCCCAG CCAGGACGCC CCGCGAGACC CAGCAGGCTG 1140 

TGGAGATATT ATTGCAGACC AAGAGGAAGA GGCAGGTCCC AGCTGTGACA AGCATGTCCC 1200 

CGGGCCAGGC AAGCCGGCTC TGTCTAAAAA GAACCCCGGC GTGGTGGCCT ACCAAGGAGG 1260 

CGGGGAAGAG ATGGCCAGCC CGGACGAGGT GGACGACACC TATCTACAGG AGTTCTGGGA 1320 

CATGCTCTCC CAGACCGAGG AGCAGGGACC CGAGCCCCAG GAGGGCGCGG CTAAGGTGGC 1380 

AGCTGCGCTG GAAACCAAGG TGGTGCCCGA GACCCCCAAA GACACCAGGT GTGTGGAAGC 1440 

GGCCAAGGAC GCGTCCTCGG TCAAGCGCAG GAGGCTCAAC CGGATTCCCA TCGAGCCCCA 1500 

TCCTAAGGAG GAGCCCAAGC ACCCGGAGAA GGAGCAGCAG GAAGGCGTCC CCAACAGCGA 1560 



816 



WO 03/042661 



CGAGGGCTAC TGGGACTCCA CCACGCCAGG CCCAGAGGAA GACAGCTCGA GCAGCGGGAA 1620 

GAAGGCGGGC ATCCCCOGGG ATAGCTACAG CGGGGACGCG CTCTATGATC TCTATGCTGA 1680 

CCOGGAOGGA AGTCCAGCAA O CC TTCCTG G AGGGAAGQAC AAOGAGGAGA OGTCCTCCCT 1740 

GTCCCGGTTA AAGCCCGTAT CTCCAGGCAC CATCACCTGT CCACTGCGAA CACCAGGCAG 1800 

CTTGCTGAAG GACTCTAAGA TCCCTATTAG CATCAAGCAC CTGACCAACC TTCCATCTAG 1860 

CCATCCCGTG GTGCACCAGC AACCCTCCAG GAGTGAGATG CCCAGAACAA AAATCCGGGT 1920 

TTCCAAAGTG CTGGTCCGCA GAGTCAGCAA COGGGGCTTG GCTGGGACCA CCATCAGAGC 1980 

AACGGCCPGC CAOGACAGTG CCAAAAAGTT GTGAGGTCTT CCAGGCCAAG GTGGATGGGC 2040 

CCCATGCCAA GGAATACAAC TTTTCCCTGG AAACCACTAA AGTAAGTTTT GCTTTTCCTA 2100 

AAGAAAGTCT TTTAGGACAC CACCCGTCCC CCGCCCTGCT CCAGAGOGTG GACOGAGGAG 2160 

GTCTTTGTGC CCTGAGCAGG GACOGGATAA CACCAGAAAG .AGGGATGCTA CACGGGGGTT 2220 

TCTCCTCTCA AGATAAGTCC CTGAGAATTA TTTTCAAGCA CTTTTTTCTT TTTTACCTTT 2280 

AAGTTTTTCT TCCTTTTGCT TTAATATACT GAACACTTGG AAGTCACCTT TACTTGCCTT 2340 

TGCAGAAAAC AGAACTTAGC CAAACCTAAG TAAGAGTCAT GCCTGGATAT TGGGATAAGC 2400 

CAGTGTCTAG AGGCCTGAAG GAACCGCTGA AGAACCAGAG GAGATCTCCT CTTCCAAGAC 2460 
ACGTTTCCTT CCTTCCCTTC TTTCCTCTTT CCTTTCCAGT T 

Seq ID NO: BS1 Protein sequence 
Protein Accession #: XP_084965.1 

1 11 21 31 41 51 

1 I I I I I 

METSRSRGGG GAVSERGGAG ASVGVCRRKA EAGAGTGTLA ADMDLHCDCA AETPAAEPPS 60 
GKINKAAFKL PKKRKSGGTM PSIFGVKNKG DGKSSGPTGL VRSRTHDGLA BVLVLESGRK 120 
EEPRGGGDSG GGGGGRPNPG PPRAAGPGGG SLASSSVAKS HSFFSLLKKN GRSENGKGEP 180 
VDASKAGGKQ KRGLRGLFSG MRHHRKDKRA KAEAAEGRAP GGGLILPGSL TASLECVKEE 240 
TPRAAREPEE PSQDAPRDPA GCGDIIADQE EEAGPSCDKH VPGPGKPALS KKNPGWAYQ 300 
GGGEEMASPD EVDDTYLQEF WDMLSQTEEQ GPEPQEGAAK VAAALETKW PETPKDTRCV 360 
EAAKDASSVK RRRLNRIPIE PHPKEBPKHP BKEQQEGVPN SDEGYWDSTT PGPEEDSSSS 420 
GKKAGIPRDS YSGDALYDLY ADPDGSPATL PGGKDNEETS SLSRLKPVSP GTITCPLRTP 480 
GSLLKDSKIP ISIKHLTNLP SSHPWHQQP SRSEMPRTKI PVSKVLVRRV SNRGLAGTTI 540 
RATACHDSAK KL 

Seq ID NO: B52 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..2016 

1 11 21 31 41 51 

I I I I I I 

ATGGAGACGA GCCGGAGCCG CGGCGGCGGC GGGGCTGTCA GCGAGCGCGG OGGAGCTGGC 60 

GCGTCCGTGG GGGTCTGCAG GAGGAAGGCG GAGGCCGGGG CCGGGACCGG GACCCTCGCG 120 

GCAGACATGG ACTTGCATTG TGACTGTGCC GCCGAAACGC CGGCCGCCGA GCOGCCGTCG 180 

GGGAAGATTA ATAAAGCTGC CTTCAAATTA TTCAAGAAGA GGAAATCGGG TGGCACCATG 240 

CCCAGCATTT TTGGGGTCAA AAACAAAGGG GACGGGAAAA GCTOGGGTCC GACGGGGCTG 300 

GTGAGGAGCA GGACCCACGA CGGACTTGCC GAGGTGCTGG TGCTGGAGAG CGGCAGGAAG 360 

GAGGAGCCGC GCGGCGGGGG CGACAGCGGC GGGGGCGGCG GGGGGCGGCC GAACCCGGGG 420 

CCCCCCAGAG CCGCAGGGCC CGGCGGGGGC TCCCTCGCCA GCAGCTCGGT GGCCAAGTCG 480 

CACAGCTTCT TCTCGCTGCT GAAGAAGAAC GGGCGCTCGG AAAACGGCAA GGGAGAGCCT S40 

GTGGACGOGA GCAAGGCCGG CGGCAAACAA AAGGGGGGGC TGCGGGGGCT GTTCAGCGGC 600 

ATGCGCTGGC ACAGGAAAGA CAAGCGGGCC AAGGCGGAGG CCGCGGAGGG GOGCGCGCCC 660 

GGGGGCGGCT TGATCCTACC CGGCTCGCTC ACCGCCAGCC TGGAGTGCGT CAAGGAGGAG 720 

ACGCCCAGAG CCGCGCGCGA GCOGGAGGAG CCCAGCCAGG ACGCCCCGCG AGACCCAGCA 780 

GGTGAGCCOG CAGGGGGAGA GGAGGTGCCC GCCCCCGOCG ACCGCGCCCC AGCGCGGAGC 840 

TGCCGAGAGG CAGAGGGCCT CGCGCACCCC GGCGACACCG GCGCCCGGGG AGAGGACGCC 900 

GCGGGGCATC GGCGCGCCGA GCOGGGGCCC GGGGAGGTCC GCACGGCAGA GGACGCTTCC 960 

AGGACGGGGG COGTTCCCGT AAAGACGGTC CCCCTTGTCG ACTOCGAAGG CGGCAGCGGC 1020 

CGGGCGCCCG CCGCCCCAGA CCCTGCCTCT GTCGATCCAC CCTCAGACCC GTCGGCAGAT 1080 

CGTATTTGTT TGATGTTTTC TGACGTGACT TCACTGAAAA GCTTTGACTC TCTTACAGGC 1140 

TGTGGAGATA TTATTGCAGA CCAAGAGGAA GAGGCAGGTC CCAGCTGTGA CAAGCATGTC 1200 

CCCGGGCCAG GCAAGCCGGC TCTGTCTAAA AAGAACCCCG GCGTGGTGGC CTACCAAGGA 1260 

GGCGGGGAAG AGATGGCCAG CCCGGAOGAG GTGGACGACA CCTATCTACA GGAGTTCTGG 1320 

GACATGCTCT CCCAGACCGA GGAGCAGGGA CCCGAGCCCC AGGAGGGCGC GGCTAAGGTG 1380 

GCAGCTGCGC TGGAAACCAA GGTGGTGCCC GAGACOCCCA AAGACACCAG GTGTGTGGAA 1440 

GCGGCCAAGG ACGCGTCCTC GGTCAAGCGC AGGAGGCTCA ACCGGATTCC CATCGAGCCC 1500 

CATCCTAAGG AGGAGCCCAA GCACCCGGAG AAGGAGCAGC AGGAAGGCGT CCCCAACAGC 1560 

GACGAGGGCT ACTGGGACTC CACCACGCCA GGCCCAGAGG AAGACAGCTC GAGCAGCGGG 1620 

AAGAAGGCGG GCATCCCOCG GGATAGCTAC AGCGGGGACG CGCTCTATGA TCTCTATGCT 1680 

GACCCGGACG GAAGTCCAGC AACCCTTCCT GGAGGGAAGG ACAACGAGGA GACGTCCTCC 1740 

CTGTCCCGGT TAAAGCCCGT ATCTCCAGGC ACCATCACCT GTCCACTGCG AACACCAGGC 1800 

AGCTTGCTGA AGGACTCTAA GATCCCTATT AGCATCAAGC ACCTGACCAA CCTTCCATCT 1860 

AGCCATCCCG TGGTGCACCA GCAACCCTCC AGGAGTGAGA TGCCCAGAAC AAAAATCCCG 1920 

GTTTCCAAAG TGCTGGTGOG CAGAGTCAGC AACCGGGGCT TGGCTGGGAC CACCATCAGA 1980 
GCAACGGCCT GCCACGACAG TGCCAAAAAG TTGTGA 

Seq ID NO: B53 Protein sequence 
Protein Accession ft: FGENESH predicted 

1 11 21 31 41 SI 

I I 1 I I 1 

METSRSRGGG GAVSERGGAG ASVGVCRRKA EAGAGTGTLA ADMDLHCDCA AETPAAEPPS 60 
GKINKAAFKL FKKRKSGGTM PSIPGVKNKG DGKSSGPTGL VRSRTHDGLA BVLVLESGRK 120 
EEPRGGGDSG GGGGGRPNPG PPRAAGPGGG SLASSSVAKS HSFFSLLKKN GRSENGKGEP 180 
VDASKAGGKQ KRGLRGLFSG MRWHRKDKRA KAEAAEGRAP GGGLILPGSL TASLECVKEE 240 
TPRAAREPEE PSQDAPRDPA GEPAGGEEVP APADRAPARS CREAEGLAHP GDTGARGEDA 300 

817 



WO 03/042661 



PCT/US02/36810 



AGHRRAEPGP GEVRTAEDAS RTGAVPVKTV PLVDSEGGSG RAPAAPDPAS VDPPSDPSAD 360 

RICLMFSDVT SLKSFDSLTG CGDIIADQEE EAGPSCDKHV PGPGKPALSK KNPGWAYQG 420 

GGEEMASPDE VDDTYLQEPW DMLSQTEEQG PEPQEGAAKV AAALETKWP ETPKDTRCVE 480 

AAKDASSVKR RRLNRIPIEP HPKEEPKHPE KEQQEGVPNS DEGYWDSTTP GPEEDSSSSG 540 

5 KKAGIPRDSY SGDALYDLYA DPDGSPATLP GGKDNEETSS LSRLKPVSPG TITCPLRTPG 600 

SLLKDSKIPI SIKHLTNLPS SHPWHQQPS RSEMPRTKIP VSKVLVRRVS NRGIAGTTIR 660 
ATACHDSAKX L 



Seq ID NO: B54 DNA sequence 
10 Nucleic Acid Accession #: NM_014138.1 
Coding sequence : 60 . . 854 

1 11 21 31 41 51 

15 CTGCAGAGAC TTCCCAGGAA GGTCCAGCGC CCTCTCAGCC TTCGTACTCA GAACAGCCGA 60 

TGATGGGCCT CAGTAACCTG AGCCCCGGTC CTGGCCCCAG CCAGGCCGTG CCTCTCCCAG 120 

AGGGGCTOCT CCGCCAGCGG TACAGAGAGG AGAAGACCCT GGAAGAGCGG CGGTGGGAGA 180 

GGCTGGAGTT CCTTCAGAGG AAGAAAGCAT TCCTGCGGCA TGTGAGGAGG AGACACCGOG 240 

ATCACATGGC CCCCTATGCT GTTGGGAGGG AAGCCAGAAT CTCCCCATTA GGTGACAGAA 300 

20 GTCAGAATCG ATTCCGATGT GAATGTCGAT ACTGCCAGAG CCACAGGCCG AATCTTTCTG 360 

GGATCCCTGG GGAGAGTAAC AGGGCCCCAC ATCCCTCCTC CTGGGAGACG CTGGTGCAGG 420 

GCCTCAGTGG CTTGACTCTC AGCCTAGGCA CCAACCAGCC CGGGCCTCTG CCTGAAGCGG 480 

CACTCCAGCC ACAGGAGACA GAGGAGAAGC GCCAGCGAGA GAGGCAGCAG GAGAGCAAAA 540 

TAATGTTTCA GAGGCTGCTC AAGCAGTGGT TAGAGGAAAA CTGAGACGTG CACCCCCATG 600 

25 GGATGGAGAC CCGAAGGGAC TCAGACGGAG CCGCCGTGTT GGCAGCGCCT GGGTGTGGGC 660 

CCATTTTGGG GACCAAACAG CAAGCTGTGG TCGGATGAGT GCCAGGACCT GTGTACCGGG 720 

ACACGTGGGA GTCCTCCCAG CATGATGCTT GACTGACCCG AGGAAGGTCC TCATGTTTCG 780 

TGCCTGTCAT TCTCGGATGG CTGTCAGGCA TTCCTTGGCA AGGGACGCTG CGTACCAGCG 840 

GTCCTCACCG CATCTCACAT GGCTCCTGTG ATGCATGTTG TOGCTTTCCC ACCCGGGATC 900 

30 TCCATCTCTC TTCCCTTCCT GCTGTCAGTA AGAGATCACA TGTCTGTGTA GTGTGAATGC 960 

CTTGTCGCTG TCCTGTGCTT TTGCACCATT GAGTTGACTG CCTCTGAGAA GCAGCACTAG 1020 

GCCTGTTGAA ATGCAATGTG CTGCCCTGAG ATCCAGTTTC AAGAATGGGC AGGTAAAOGC 1080 

AGTGTGGGAA AGGAATGTGG AATGAGAACT TGGTGGTTCA CCGCTGTACT ATTTGTGTAA 1140 

ATGTTTACGT ATGTGATAAG CTACATGTAT GTAAATGTTG CAATACCCCT AACAGTCGAG 1200 

35 TAGTAGTCTC CCTTACAGGA ATTTTTGACG GGGTTCCTCA TCATCAATAC CAAATAAATA 1260 

TATGTAGGAA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

40 Seq ID NO: B55 Protein sequence 
Protein Accession NP__054857.1 

1 11 21 31 41 51 

45 MMGLSNLSPG PGPSQAVPLP EGLLRQRYRE BKTLEERRWE RLEFLQRKKA FLRHVRRRHR 
DHMAPYAVGR EARISPLGDR SQNRFRCBCR YCQSHRPNLS GIPGESNRAP HPSSWETLVQ 
GLSGLTLSLG TNQPGPLPEA ALQPQETEEK RQRERQQESK IMFQRLLKQW LEEN 



60 
120 



60 
120 



Seq ID NO: B56 DNA sequence 
50 Nucleic Acid Accession ft: NMJ>00025.1 
Coding sequence: 198.-1424 

1 11 21 31 41 51 

55 GCTACTCCTC CCCCAAGAGC GGTGGCACCG AGGGAGTTGG GGTGGGGGGA GGCTGAGCGC 
TCTGGCTGGG ACAGCTAGAG AAGATGGCCC AGGCTGGGGA AGTCGCTCTC ATGCCTTGCT 

GTCCCCTCCC CTGAGCCAGG TGATTTGGGA GACCCCCTCC TTCCTTCTTT CCCTACCGCC 180 

CCACGCGCGA CCCGGGGATG GCTCCGTGGC CTCACGAGAA CAGCTCTCTT GCCCCATGGC 240 

CGGACCTCCC CACCCTGGCG CCCAATACOG CCAACACCAG TGGGCTGCCA GGGGTTCCGT 300 

60 GGGAGGCGGC CCTAGCCGGG GCCCTGCTGG CGCTGGCGGT GCTGGCCACC GTGGGAGGCA 360 

ACCTGCTGGT CATCGTGGCC ATCGCCTGGA CTCCGAGACT CCAGACCATG ACCAAOGTGT 420 

TCGTGACTTC GCTGGCOGCA GCCGACCTGG TGATGGGACT CCTGGTGGTG COGCCGGCGG 480 

CCACCTTGGC GCTGACTGGC CACTGGCCGT TGGGCGCCAC TGGCTGCGAG CTGTGGACCT 540 

CGGTGGACGT GCTGTGTGTG ACCGCCAGCA TCGAAACCCT GTGCGCCCTG GCCGTGGACC 600 

65 GCTACCTGGC TGTGACCAAC CCGCTGCGTT ACGGOGCACT GGTCACCAAG CGCTGCGCCC 660 

GGACAGCTGT GGTCCTGGTG TGGGTCGTGT CGGCCGCGGT GTCGTTTGCG CCCATCATGA 720 

GCCAGTGGTG GCGCGTAGGG GCCGACGCCG AGGCGCAGCG CTGCCACTCC AACCCGCGCT 780 

GCTGTGCCTT CGCCTCCAAC ATGCCCTACG TGCTGCTGTC CTCCTCCGTC TCCTTCTACC 840 
TTCCTCTTCT CGTGATGCTC TTCGTCTAOG CGCGGGTTTT CGTGGTGGCT ACGCGCCAGC 900 
70 TGOGCTTGCT GCGCGGGGAG CTGGGCCGCT TTCCGCCCGA GGAGTCTCCG CCGGCGCCGT 960 

OGCGCTCTCT GGCCCCGGCC CCGGTGGGGA CGTGCGCTCC GCCCGAAGGG GTGCCCGCCT 1020 

GCGGCCGGCG GCCCGCGOGC CTCCTGCCTC TCCGGGAACA CCGGGCCCTG TGCACCTTGG 1080 

GTCTCATCAT GGGCACCTTC ACTCTCTGCT GGTTGCCCTT CTTTCTGGCC AACGTGCTGC 1140 

GCGCCCTGGG GGGCCCCTCT CTAGTCCCGG GCCCGGCTTT CCTTGCCCTG AACTGGCTAG 1200 

75 GTTATGCCAA TTCTGCCTTC AACCCGCTCA TCTACTGCCG CAGCCCGGAC TTTCGCAGCG 1260 

CCTTCCGCCG TCTTCTGTGC CGCTGCGGCC GTCGCCTGCC TCGGGAGCCC TGCGCOGCCG 1320 

CCCGCCCGGC CCTCTTCCCC TCGGGCGTTC CTGCGGCCCG GAGCAGCCCA GCGCAGCCCA 1380 

GGCTTTGCCA ACGGCTCGAC GGGGCTTCTT GGGGAGTTTC TTAGGCCTGA AGGACAAGAA 1440 

GCAACAACTC TGTTGATCAG AACCTGTGGA AAACCTCTGG CCTCTGTTCA GAATGAGTCC 1500 

80 CATGGGATTC CCCGGCTGTG ACACTCTACC CTCCAGAACC TGACGACTGG GCCATGTGAC 1560 

CCAAGGAGGG ATCCTTACCA AGTGGGTTTT CACCATCCTC TTGCTCTCTG TCTGAGAGAT 1620 

GTTTTCTAAA CCCCRGCCTT GAACTTCACT CCTCCCTCAG TGGTAGTGTC CAGGTGCCGT 1680 

GGAGCAGCAG GCTGGCTTTG GTAGGGGCAC OCATCACCCG GCTTGCCTGT GCAGTCAGTG 1740 

AGTGCTTAGG GCAAAGAGAG CTCCCCTGGT TCCATTCCTT CTGCCACCCA AACCCTGATG 1800 
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AGACCTTAGT GTTCTCCAGG CTCTGTGGCC CAGGCTGAGA GCAGCAGGGT AGAAAAGACC 1860 

AAGATTTGGG GTTTTATCTC TGGTTCC C TT ATTACTGCTC TCAAGCAGTG GCCTCTCTCA 1920 

CTTTAGCCAT GGAATGGCTC CGATCTACCT CACAGCAGTG TCAGAAGGAC TTGGCCAGGG 1980 

TTTTGGGAGC TCCAGGGTTC ATAAGAAGGT GAACCATTAG AACAGATCCC TTCTTTTCCT 2040 

TTTGCAATCA GATAAATAAA TATCACTGAA TGCAGTTCAT CCTOGGCOCA CTTTCCCTCC 2100 

GTTTGTTTTC TTTTCATAAT CCACTTACTC CCTTCCCTTC TACTCTGCGC TGGCTTTTGA 2160 

CAGAGGCAGT AAATTAGGCC TAATCCTCAC TCTTTTCTTC CTAATCTTCA TCAAACAAAA 2220 

AATGAAAAGT CTGTCTGGAC GAAGGGGAGT GAGCTTGAGC CTTTGATATC TTGCTCCCCC 2280 

ACCCTTCCTG AAACTCTTGA AATCCAGTTG CCATTGAGTA GCAAAGCCAC GCTCCCCACA 2340 

GGACTTGGAC AGAGGGCCCA CAGGGGGATG GGCTGGCTGT GGCCAGGTTT AGGGCAGGGG 2400 

GCATTTGTCC CCTCCATGCT ATAATCCAGT GGTGCCTTAC ATGGTGTGTG TGTGTGTGTG 2460 

TGCGTGTGTG TGTGTGTGTG TGTGTCTGGA GGCACAGGCA CAAAGCATTG CTTGGGTTGG 2520 

TCAAATGTCT TGTGTCATAA ATATATTCTG ATGTTTCCCA GCCTTTCCAC AACCTCTACC 2580 

TTCCCACTCA CCTTCCCCAG CTACAAAAAT CTGTATTATC CTCTTAAAGT AAAACTGGAG 2640 
TTAC 



Seq ID NO: B57 Protein sequence 
Protein Accession #: NP_000016.1 

1 11 21 31 41 51 

I I I I I I 

MAPWPHBNSS LAPWPDLPTL APNTANTSGL PGVPWEAALA GALLALAVLA TVGGNLLVIV 60 
AIAWTPRLQT MTNVFVTSLA AADLVMGLLV VPPAATLALT GHWPLGATGC ELWTSVDVLC 120 
VTASIETLCA LAVDRYLAVT NPLRYGALVT KRCARTAWIi VWWSAAVSF APIMSQWWRV 180 
GADAEAQRCH SNPRCCAFAS NMPYVLLSSS VSPYLPLLVM LPVYARVFW ATRQLRLLRG 240 
ELGRFPPEES PPAPSRSLAP APVGTCAPPE GVPACGRRPA RLLPLREHRA LCTLGLIMGT 300 
FTLCWLPFFL ANVLRALGGP SLVPGPAFLA LNWLGYANSA FNPLIYCRSP DFRSAFRRLL 360 
CRCGRRLPPE PCAAARPALF PSGVPAARSS PAQPRLCQRL DGASWGVS 

Seq ID NO: B58 DNA sequence 

Nucleic Acid Accession # : NMJ)32553 . 1 

Coding sequence : 37 . . 1038 



1 11 21 31 41 51 

I 1 I I I 1 

CACCATTAGG CAAAGATAGT TTCTCTAGAG AGAATCATGC CTGCTAATTA CACGTGTACC 60 

AGGCCAGATG GAGACAATAC AGATTTTCGA TACTTTATTT ATGCAGTGAC ATACACTGTC 120 

ATTCTTGTGC CAGGTCTCAT AGGGAATATA TTAGCCCTGT GGGTATTCTA TGGTTATATG 180 

AAAGAAACAA AACGAGCTGT GATATTTATG ATAAACTTAG CCATTGCTGA CTTACTACAA 240 

GTTCTTTCCT TGCCACTGAG GATCTTCTAC TACTTGAATC ATGACTGGCC ATTTGGGCCT 300 

GGTCTCTGCA TGTTCTGTTT CTACCTGAAG TATGTCAACA TGTATGCAAG CATCTACTTC 360 

TTGGTCTGCA TCAGTGTGCG ACGATTTTGG TTTCTCATGT ACCCCTTTCG CTTCCATGAC 420 

TGCAAACAGA AATATGACCT GTACATCAGC ATTGCTGGCT GGCTGATCAT CTGCCTTGCC 480 

TGTGTACTCT TTCCACTCCT CAGAACCAGT GATGATACCT CTGGCAATAG GACCAAATGC 540 

TTTGTGGATC TTCCTACCAG GAATGTCAAC CTGGCCCAGT COGTTGTTAT GATGACCATT 600 

GGCGAGTTGA TTGGGTTTGT AACTCCGCTT CTGATTGTCC TATATTGTAC CTGGAAGACG 660 

GTTTTATCAC TGCAAGATAA ATATCCCATG GCCCAAGATC TTGGAGAGAA ACAGAAAGCC 720 

TTGAAGATGA TTCTAACCTG TGCAGGGGTA TTCCTAATTT GCTTTGCACC TTATCATTTC 780 

AGTTTTCCTT TAGATTTCCT GGTGAAGTCC AATGAAATTA AAAGCTGCCT AGCCAGAAGG 840 

GTGATTCTAA TATTTCATTC TGTGGCATTG TGTCTTGCTA GTCTGAATTC ATGTCTTGAC 900 

CCAGTCATAT ACTACTTTTC CACTAATGAG TTCCGAAGAC GGCTTTCAAG ACAAGATTTG 960 

CATGACAGCA TCCAACTCCA TGCAAAATCC TTTGTGAGTA ACCATACAGC TTCCACCATG 1020 

ACACCTGAAT TATGCTAAAA CAAAAAACCA AACTGAATGT GACCTGAAAT GCAAGTACAT 1080 

CAGAACATAT CTGCAATACC CAAGCCACAG GGAA6AACTT GCAAAACAAC ACAGCTTTTC 1140 

AGTTCTGCTC TATCTTACTG CTATGGGGAA TTCACTTCTT CAAAGCAGGA CCTATTTGGA 1200 
GCATTACGAT CCACGATTAT TGATGTTGAC ATGTCCATGT AGTAATTTTT CTTCAAGT 



Seq ID NO: B59 Protein sequence 
Protein Accession #: NP_115942.1 

1 11 21 31 41 51 

1 I I I I ! 

MPANYTCTRP DGDNTDFRYF IYAVTYTVIL VPGLIGNILA LWVFYGYMKE TKRAVIFMIN 60 

LAIADLLQVL SLPLRIFYYIj NHDWPFGPGL CMFCFYLKYV NMYASIYFLV CISVRRFWFL 120 

MYPFRFHDCK QKYDLYISIA GWLIICLACV LFPLLRTSDD TSGNRTKCFV DLPTRNVNLA 180 

QSWMMTIGE LIGFVTPLLI VLYCTWKTVL SliQDKYPMAQ DLGEKQKALK MILTCAGVFL 240 

ICFAPYHFSF PLDFLVKSNE IKSCLARRVI LIFHSVALCL ASLNSCLDPV IYYFSTNEFR 300 
RRLSRQDLHD SIQLHAKSFV SNHTASTMTP ELC 

Seq ID NO: B60 DNA sequence 

Nucleic Acid Accession #x CAT cluster 

1 11 21 31 41 51 

I I I I I I 

GATTCGGATT TTAATCCGAC TCACTATAGG GAATTTTGGC CCTCGAGGCC A AGAATT CGG 60 

CCCGAGGGTT AGACATTTAA CTTGAGTCTT TTCATATCAC GGTATGCTTT TATTTTACCC 120 

TCATATATAA TTAATTTTCA TCTGGGCATA TATTTCCAGA TTAGAAATCA TTTCTTTTCA 180 

TACATTTTAA GGAATTTATA CATTTGCATC TGAAATTGTA TTCATTATCT TCT AGAAACC 240 

AAGAGTGGAC TTGGTATCCG TCTGATTATA TTTTCTTTGT TGGTA AAGAG CCTCTTTTTT 300 

CCCTTTTTAT TTTCATCTTC TCTTTCTCTA GTGTTTATAA TCTTATTTTT ATGAATGTAG 360 

TCTAATTATT TAGAACACTG TCTCCGAGTA TGTGTTTATT TGGGTTTGTG TTTTGAACAT 420 

TGCCTCCAGG TGCGAGTTTA TGTGTGTTTA TGTTTCCTAT TTG AGGTTG G ACACTTAGAA 480 

AATTTTCTCA ATTCGAAAAC TATTTTCCTT CAGCACTCAG AAATTTTTAT CAAAGATTAT S40 

TTCCACGATT ATTTCCTGTC CTTGGTTTTT CGTGTTCTTA ACCTAGAATT CAATTGTTTA 600 
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ATGTTAGACC TTACACATCT TCAATGTATC ATATATATTT CACTCATATT ATTTATTGTC 660 

ATATCTTTTG GACTATTGTG TGAAAGCAAT AAATGTATAT TACTTTAAAA AAAAAAAAAA 720 

AAAACTHCAT TTTCAAATTT AATACACACA AATTGTGCAT ACCCACACAC ATATACCACA 780 

GATATATTCA TTTTCAGGAT TTCTTTAAGT GTTATTTTAA AAATAATCAT ATTCTTATTT 840 

TGTGAACTGT GGGATTTCTT QAAACTTTTT GAAAACTCTC ATTAGTCTGT GTT TGTGT GG 900 

TTTCTTTAAA TTATCTCTTC CTTCAGAATT ACTTTTGTGT TTTGAATTTG AGTCTTTGCC 960 

ATTCATAAGG CGGGCTTTCC TTGAATACCA GATAACTCTG AGTTTCCTGT TCATATTTAA 1020 
GAATAAATAA CCTAATTTGA AAAAAAAAAA AAAAA 

Seq ID NO: B61 DNA sequence 

Nucleic Acid Accession ft: NM_014522.l 

Coding sequence: 846.. 3911 

1 11 21 31 41 51 

CTGGTGGTCC AGTACCTCCA AAGATATGGA ATACACTCCT QAAATATCCT GAAAACTTTT 60 
TTTTTTCAGA ATCCTTTAAT AAGCAGTTAT GTCAATCTGA AAGTTGCTTA CTTGTACTTT 120 
ATATTAATAG CTATTCTTGT TTTTCTTATC CAAAGAAAAA TCCTCTAATC CCCTTTTCAC 180 
ATGATAGTTG TTACCATGTT TAGGCATTAG TCACATCAAC CCCTCTCCTC TCCCA AACTT 240 
CTCTTCTTCA AATCAAACTT TATTAGTCCC TCCTTTATAA TGATTCCTTG CCTCGTTTTA 300 
TCCAGATCAA TTTTTTTTCA CTTTGATGCC CAGAGCTGAA GAAATGGACT ACTGTATAAA 360 
TTATTCATTG CCAAGAGAAT AATTGCATTT TAAACCCATA TTATAACAAA GAATAATGAT 420 
TATATTTTGT GATTTGTAAC AAATACCCTT TATTTTCCCT TAACTATTGA ATTAAATATT 480 
TTAATTATTT GTATTCTCTT TAACTATCTT GGTATATTAA AGTATTATCT TTTATATATT 540 
TATCAATGGT GGACACTTTT ATAGGTACTC TGTGTCATTT TTGATACTGT AGGTATCTTA 600 
TTTCATTTAT CTTTATTCTT AATGTACGAA TTCATAATAT TTGATTCAGA ACAAATTTAT 660 
CACTAATTAA CAGAGTGTCA ATTATGCTAA CATCTCATTT ACTGATTTTA ATTTAAAACA 720 
GTTTTTGTTA ACATGCATGT TTAGGGTTGG CTTCTTAATA ATTTCTTCTT CCTCTTCTCT 780 
CTCTCCTCTT CTTTTGGTCA GTGTTGTGCG GGTTAATACA ACAAACTGTA ACAAGTGTAC 840 
CTGGTATGGA CTTGTTGTCC GGGACGTACA TTTTCGCGGT CCTGCTAGCA TGCGTGGTGT 900 
TCCACTCTGG CGCCCAGGAG AAAAACTACA CCATCCGAGA AGAAATGCCA GAAAACGTCC 960 
TGATAGGCGA CTTGTTGAAA GACCTTAACT TGTCGCTGAT TCCAAACAAG TCCTTGACAA 1020 
CTGCTATGCA GTTCAAGCTA GTGTACAAGA CCGGAGATGT GCCACTGATT CGAATTGAAG 1080 
AGGATACTGG TGAGATCTTC ACTACTGGCG CTCGCATTGA TCGTGAGAAA TTATGTGCTG 1140 
GTATCCCAAG GGATGAGCAT TGCTTTTATG AAGTGGAGGT TGCCATTTTG CCGGATGAAA 1200 
TATTTAGACT GGTTAAGATA CGTTTTCTGA TAGAAGATAT AAATGATAAT GCACCATTGT 1260 
TCCCAGCAAC AGTTATCAAC ATATCAATTC CAGAGAACTC GGCTATAAAC TCTAAATATA 1320 
CTCTCCCAGC GGCTGTTGAT CCTGACGTAG GAATAAACGG AGTTCAAAAC TACGAACTAA 1380 
TTAAGAGTCA AAACATTTTT GGCCTCGATG TCATTGAAAC ACCAGAAGGA GACAAGATGC 1440 
CACAACTGAT TGTTCAAAAG GAGTTAGATA GGGAAGAGAA GGATACCTAC GTGATGAAAG 1500 
TAAAGGTTGA AGATGGTGGC TTTCCTCAAA GATCCAGTAC TGCTATTTTG CAAGTGAGTG 1560 
TTACTGATAC AAATGACAAC CACCCAGTCT TTAAGGAGAC AGAGATTGAA GTCAGTATAC 1620 
CAGAAAATGC TCCTGTAGGC ACTTCAGTGA CACAGCTCCA TGCCACAGAT GCTGACATAG 1680 
GTGAAAATGC CAAGATCCAC TTCTCTTTCA GCAATCTAGT CTCCAACATT GCCAGGAGAT 1740 
TATTTCACCT CAATGCCACC ACTGGACTTA TCACAATCAA AGAACCACTG GATAGGGAAG 1800 
AAACACCAAA CCACAAGTTA CTGGTTTTGG CAAGTGATGG TGGATTGATG CCAGCAAGAG 1860 
CAATGGTGCT GGTAAATGTT ACAGATGTCA ATGATAATGT CCCATCCATT GACATAAGAT 1920 
ACATCGTCAA TCCTGTCAAT GACACAGTTG TTCTTTCAGA AAATATTCCA CTCAACACCA 1980 
AAATTGCTCT CATAACTGTG ACGGATAAGG ATGCGGACCA TAATGGCAGG GTGACATGCT 2040 
TCACAGATCA TGAAATCCCT TTCAGATTAA GGCCAGTATT CAGTAATCAG TTCCTCCTGG 2100 
AGACTGCAGC ATATCTTGAC TATGAGTCCA CAAAAGAATA TGCCATTAAA TTACTGGCTG 2160 
CAGATGCTGG CAAACCTCCT TTGAATCAGT CAGCAATGCT CTTCATCAAA GTGAAAGATG 2220 
AAAATGACAA TGCTCCAGTT TTCACCCAGT CTTTCGTAAC TGTTTCTATT CCTGAGAATA 2280 
ACTCTCCTGG CATCCAGTTG ACGAAAGTAA GTGCAATGGA TGCAGACAGT GGGCCTAATG 2340 
CTAAGATCAA TTACCTGCTA GGCCCTGATG CTCCACCTGA ATTCAGCCTG GATTGTCGTA 2400 
CAGGCATGCT GACTGTAGTG AAGAAACTAG ATAGAGAAAA AGAGGATAAA TA TTTA TTCA 2460 
CAATTCTGGC AAAAGATAAC GGGGTACCAC CCTTAACCAG CAATGTCACA GTCTTTGTAA 2520 
GCATTATTGA TCAGAATGAC AATAGCCCAG TTTTCACTCA CAATGAATAC AACTTCTATG 2580 
TCCCAGAAAA CCTTCCAAGG CATGGTACAG TAGGACTAAT CACTGTAACT GATCCTGATT 2640 
ATGGAGACAA TTCTGCAGTT ACGCTCTCCA TTTTAGATGA GAATGATGAC TTCACCATTG 2700 
ATTCACAAAC TGGTGTCATC CGACCAAATA TTTCATTTGA TAGAGAAAAA CAAGAATCTT 2760 
ACACTTTCTA TGTAAAGGCT GAGGATGGTG GTAGAGTATC ACGTTCTTCA AGTGCCAAAG 2820 
TAACCATAAA TGTGGTTGAT GTCAATGACA ACAAACCAGT TTTCATTGTC CCTCCTTCCA 2880 
ACTGTTCTTA TGAATTGGTT CTACOGTCCA CXAATCCAGG CACAGTGGTC TTTCAGGTAA 2940 
TTGCTGTTGA CAATGACACT GGCATGAATG CAGAGGTTCG TTACAGCATT GTAGGAGGAA 3000 
ACACAAGAGA TCTGTTTGCA ATCGACCAAG AAACAGGCAA CATAACATTG ATGGAGAAAT 3060 
GTGATGTTAC AGACCTTGGT TTACACAGAG TGTTGGTCAA AGCTAATGAC TTAGGACAGC 3120 
CTGATTCTCT CTTCAGTGTT GTAATTGTCA ATCTGTTCGT GAATGAGTOG GTGACCAATG 3180 
CTACACTGAT TAATGAACTG GTGCGCAAAA GCACTGAAGC ACCAGTGACC CCAAATACTG 3240 
AGATAGCTGA TGTATCCTCA CCAACTAGTG ACTATGTCAA GATCCTGGTT GCAGCTGTTG 3300 
CTGGCACCAT AACTGTCGTT GTAGTTATTT TCATCACTGC TGTAGTAAGA TGTCGCCAGG 3360 
CACCACACCT TAAGGCTGCT CAGAAAAACA AGCAGAATTC TGAATGGGCT ACCCCAAACC 3420 
CAGAAAACAG GCAGATGATA ATGATGAAGA AAAAGAAAAA GAAGAAGAAG CATTCCCCTA 3480 
AGAACTTGCT GCTTAATTTT GTCACTATTG AAGAAACTAA GGCAGATGAT GTTCACAGTG 3540 
ATGGAAACAG AGTCACACTA GACCTTCCTA TTGATCTAGA AGAGCA AACA ATGGGAAAGT 3600 
ACAATTGGGT AACTACACCT ACTACTTTCA AGCCCGACAG CCCTGATTTG GCCCGACACT 3660 
ACAAATCTGC CTCTCCACAG CCTGCCTTCC AAATTCAGCC TGAAACTCCC CTGAATTCGA 3720 
AGCACCACAT CATCCAAGAA CTGCCTCTCG ATAACACCTT TGTGGCCTGT GACTCTATCT 3780 
CCAAGTGTTC CTCAAGCAGT TCAGATCCCT ACAGCGTTTC TGACTGTGGC TATCCAGTGA 3840 
CGACCTTCGA GGTACCTGTG TCCGTACACA CCAGACOGGT AGGTATCCAA GTTTCTAACA 3900 
CAACTTTCTA ACTATTTTTT TATTATTATT TTCAGTTGAT GTAGAACTTT ACAAAATCTA 3960 
TTGACTTCAA AGAGGGATCA AAACAATCAT ATTCTACAGA TGTACCCAAT AGATATATGG 4020 
ATTCAATTAA GTTTGGTAGA AGATGAGAAC AAAATAACTA CTGATTTAGG AAAATTGGAT 4080 
GCAGAATAAT AATTATAGTA GGGGCAATTT TGTCTGTAGA TGGCAGTATG ACAATTCTTG 4140 
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CTAGAGAATA TATTGAAAAA AACTTCAACA CAAAGGGTTG TAQCACTGTC CTCAGTACCA 4200 

TTGTGTGCAT GAGGATCAGA ATAGTCTGGG CTAGATACAT CACATTAAAG CTTTTCAGAA 4260 

TCTGATAAAT AGCTCTAAAT ACTAATGATA TTGAGAAGCC TAGCTTCACT TGGGAAAATC 4320 

TGTGGCTGTT CACAGAAATT CAGCACCAAG TTATTCCCCC CATACTCTAC CAGGCCTTCA 4380 

GGTCCTCATA AAGAAAAGTG TCGTTTTCAG ATTAGGAACT CAAAATTATT TTGGTGCATC 4440 

AAATCTACAG TCACACAATA TAACAAGAAT GGGATTAGAA AAATGAAAGC CTACTCATTC 4500 

TCATCTTTAA GCCAGAGAAT GAAATATATA TGAGGTCTCT GGATAGCTAT TTAAATATTT 4560 
GCATATTTAT GCAAGGTATT TTGAGCCCTT CAGAAGACAT TCT 

Seq ID NO: B62 Protein sequence 
Protein Accession #: NP_055337.1 

1 11 21 31 41 51 

I I I I I I 

MDLLSGTYXF AVLLACWFH SGAQEKNYTI REEMPENVLI GDLLKDLNLS LIPNKSLTTA 60 

MQFKLVYKTG DVPLIRIEED TGEIPTTGAR IDREKLCAGI PRDEHCFYEV EVAILPDEIF 120 

RLVKIRFLIE DINDNAPLFP ATVINISIPE NSAINSKYTL PAAVDPDVGI NGVQNYELIK 180 

SQNIPGLDVI ETPEGDKMPQ LIVQKELDRE EKDTYVMKVK VEDGGPPQRS STAILQVSVT 240 

DTNDNHPVPK ETEIEVSIPE NAPVGTSVTQ LHATDADIGE NAKIHFSFSN LVSNIARRLF 300 

HLNATTGLIT IKEPLDREET PNHKLLVLAS DGGLMPARAM VLVNVTDVND NVPSIDIRYI 360 

VNPVNDTWL SENIPLNTKI ALITVTDKDA DHNGRVTCFT DHEIPFRLRP VFSNQFLLET 420 

AAYLDYESTK EYAIKLLAAD AGKPPLNQSA MLPIKVKDEN DNAPVFTQSF VTVSIPENNS 480 

PGIQLTKVSA MDADSGPNAK INYLLGPDAP PEFSLDCRTG MLTWKKLDR EKEDKYLFTI 540 

LAKDKGVPPL TSNVTVFVSI IDQNDNSPVP THNEYNFYVP ENLPRHGTVG LITVTDPDYG 600 

DNSAVTLSIL DENDDPTIDS QTGVIRPNIS PDREKQESYT FYVKAEDGGR VSRSSSAKVT 660 

INWDVNDNK PVFIVPPSNC SYELVLPSTK PGTWFQVIA VDNDTGKNAE VRYSIVGGNT 720 

RDLFAIDQET GNITLMEKCD VTDLGLHRVL VKANDLGQPD SLFSWIVNL FVNESVTMAT 780 

LINELVRKST EAPVTPNTEI ADVSSPTSDY VKILVAAVAG TITWWIPI TAWRCRQAP 840 

HLKAAQKNKQ NSEWATPNPE NRQMIMMKKK KKKKKHSPKN LLLMFVTIEE TKADDVDSDG 900 

NRVTLDLPID LEEQTMGKYN WVTTPTTFKP DSPDUVRHYK SASPQPAFQI QPBTPLNSKH 960 

HI IQELPLDN TFVACDSISK CSSSSSDPYS VSDCGYPVTT FEVPVSVHTR PVGIQVSNTT 1020 
F 

Seq ID NO: B63 DNA sequence 

Nucleic Acid Accession ft: XM_0S9180.2 

Coding sequence: 276.. 3740 

1 11 21 31 41 51 

I I I I I I 

GCGGCGGCCG CGGAGTATCC TGGAGCTGCA GACAGTGCGG GCCTGCGCCC AGTCCCGGCT 60 

GTCCTCGCCG CGACCCCTCC TCAGCCCTGG GCGCGCGCAC GCTGGGGCCC CGCGGGGCTG 120 

GCCGCCTAGC GAGCCTGCCG GTCGACCCCA GCCAGCGCAG CGACGGGGCG CTGCCTGGCC 180 

CAGGCGCACA CGGAAGTGCG CTTCTCTGAA GTAGCTTTGG AAAGTAGAGA AGAAAATCCA 240 

GTTTGCTTCT TGGAGAACAC TGGACAGCTG AATAAATGCA GTATCTAAAT ATAAAAGAGG 300 

ACTGCAATGC CATGGCTTTC TGTGCTAAAA TGAGGAGCTC CAAGAAGACT GAGGTGAACC 360 

TGGAGGCCCC TGAGCCAGGG GTGGAAGTGA TCTTCTATCT GTCGGACAGG GAGCCCCTCC 420 

GGCTGGGCAG TGGAGAGTAC ACAGCAGAGG AACTGTGCAT CAGGGCTGCA CAGGCATGCC 480 

GTATCTCTCC TCTTTGTCAC AACCTCTTTG CCCTGTATGA CGAGAACACC AAGCTCTGGT 540 

ATGCTCCAAA TCGCACCATC ACCGTTGATG ACAAGATGTC CCTCCGGCTC CACTACCGGA 600 

TGAGGTTCTA TTTCACCAAT TGGCATGGAA CCAACGACAA TGAGCAGTCA GTGTGGCGTC 660 

ATTCTCCAAA GAAGCAGAAA AATGGCTACG AGAAAAAAAA GATTCCAGAT GCAACCCCTC 720 

TCCTTGATGC CAGCTCACTG GAGTATCTGT TTGCTCAGGG ACAGTATGAT TTGGTGAAAT 780 

GCCTGGCTCC TATTCGAGAC CCCAAGACCG AGCAGGATGG ACATGATATT GAGAACGAGT 840 

GTCTAGGGAT GGCTGTCCTG GCCATCTCAC ACTATGCCAT GATGAAGAAG ATGCAGTTGC 900 

CAGAACTGCC CAAGGACATC AGCTACAAGC GATATATTCC AGAAACATTG AATAAGTCCA 960 

TCAGACAGAG GAACCTTCTC ACCAGGATGC GGATAAATAA TGTTTTCAAG GATTTCCTAA 1020 

AGGAATTTAA CAACAAGACC ATTTGTGACA GCAGCGTGTC CACGCATGAC CTGAAGGTGA 1080 

AATACTTGGC TACCTTGGAA ACTTTGACAA AACATTAOGG TGCTGAAATA TTTGAGACTT 1140 

CCATGTTACT GATTTCATCA GAAAATGAGA TGAATTGGTT TCATTCGAAT GACGGTGGAA 1200 

ACGTTCTCTA CTACGAAGTG ATGGTGACTG GGAATCTTGG AATCCAGTGG AGGCATAAAC 1260 

CAAATGTTGT TTCTGTTGAA AAGGAAAAAA ATAAACTGAA GCGGAAAAAA CTGGAAAATA 1320 

AACACAAGAA GGATGAGGAG AAAAACAAGA TCOGGGAAGA GTGGAACAAT TTTTCTTACT 1380 

TCCCTGAAAT CACTCACATT GTAATAAAGG AGTCTGTGGT CAGCATTAAC AAGCAGGACA 1440 

ACAAGAAAAT GGAACTGAAG CTCTCTTCCC ACGAGGAGGC UTWr UU TTT GTGTCCCTGG 1500 

TAGATGGCTA CTTCOGGCTC ACAGCAGATG CCCATCATTA CCTCTGCACC GACGTGGCCC 1560 

CCCCGTTGAT CGTCCACAAC ATACAGAATG GCTGTCATGG TCCAATCTGT ACAGAATA0G 1620 

CCATCAATAA ATTGCGGCAA GAAGGAAGCG AGGAGGGGAT GTACGTGCTG AGGTGGAGCT 1680 

GCACCGACTT TGACAACATC CTCATGACCG TCACCTGCTT TGAGAAGTCT GAGCAGGTGC 1740 

AGGGTGCCCA GAAGCAGTTC AAGAACTTTC AGATOGAGGT GCAGAAGGGC CGCTACAGTC 1800 

TGCACGGTTC GGACCGCAGC TTCCCCAGCT TGGGAGACCT CATGAGCCAC CTCAAGAAGC 1660 

AGATCCTGCG CACGGATAAC ATCAGCTTCA TGCTAAAACG CTGCTGCCAG CCCAAGCCCC 1920 

GAGAAATCTC CAACCTGCTG GTGGCTACTA AGAAAGCCCA GGAGTGGCAG CCCGTCTACC 1980 

CCATGAGCCA GCTGAGTTTC GATCGGATCC TCAAGAAGGA TCTGGTGCAG GGCGAGCACC 2040 

TTGGGAGAGG CACGAGAACA CACATCTATT CTGGGACCCT GATGGATTAC AAGGATGACG 2100 

AAGGAACTTC TGAAGAGAAG AAGATAAAAG TGATCCTCAA AGTCTTAGAC CCCAGCCACA 2160 

GGGATATTTC CCTGGCCTTC TTCGAGGCAG CCAGCATGAT GAGACAGGTC TCCCACAAAC 2220 

ACATCGTGTA CCTCTATGGC GTCTGTGTCC GCGACGTGGA GAATATCATG GTGGAAGAGT 2280 

TTGTGGAAGG GGGTCCTCTG GATCTCTTCA TGCACCGGAA AAGCGATGTC CTTACCACAC 2340 

CATGGAAATT CAAAGTTGCC AAACAGCTGG CGAGTGCCCT GAGCTACTTG GAGGATAAAG 2400 

ACCTGGTCCA TGGAAATGTG TGTACTAAAA ACCTCCTCCT GGCCCGTGAG GGCATCGACA 2460 

GTGAGTGTGG CCCATTCATC AAGCTCAGTG ACCCCGGCAT CCCCATTACG GTGCTGTCTA 2520 

GGCAAGAATG CATTGAAOGA ATCCCATGGA TTGCTCCTGA GTGTGTTGAG GACTCCAAGA 2580 

ACCTGAGTGT GGCTGCTGAC AAGTGGAGCT TTGGAACCAC GCTCTGGGAA ATCTGCTACA 2640 

ATGGOGAGAT CCCCTTGAAA GACAAGACGC TGATTGAGAA AGAGAGATTC TATGAAAGCC 2700 
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GGTGCAGGCC AGTGACACCA TCATGTAAGG AGCTGGCTGA CCTCATGACC CGCTGCATGA 2760 

ACTATGACCC CAATCAGAGG CCTTTCTTCC GAGCCATCAT GAGAGACATT AATAAGCTTG 2B20 

AAGAGCAGAA TCCAGATATT GTTTCAGAAA AAAAACCAGC AACTGAAGTG GACCCCACAC 2880 

ATTTTGAAAA GCGCTTCCTA AAGAGGATCC GTGACTTGGG AGAGGGCCAC TTTGGGAAGG 2940 

5 TTGAQCTCTG CAGGTATGAC CCCGAAGGGG ACAATACAGG GGAGCAGGTG GCTGTTAAAT 3000 

CTCTGAAGCC TGAGAGTGGA GGTAACCACA TAGCTGATCT GAAAAAGGAA ATCGAGATCT 3060 

TAAGGAACCT CTATCATGAG AACATTGTGA AGTACAAAGG AATCTGCACA GAAGACGGAG 3120 

GAAATGGTAT TAAGCTCATC ATGGAATTTC TGCCTTCGGG AAGCCTTAAG GAATATCTTC 3180 

CAAAGAATAA GAACAAAATA AACCTCAAAC AGCAGCTAAA ATATGCOGTT CAGATTTGTA 3240 

10 AGGGGATGGA CTATTTGGGT TCTCGGCAAT ACGTTCACOG GGACTTGGCA GCAAGAAATG 3300 

TCCTTGTTGA GAGTGAACAC CAAGTGAAAA TTGGAGACTT CGGTTTAACC AAAGCAATTG 3360 

AAACOGATAA GGAGTATTAC ACCGTCAAGG ATGACOGGGA CAGCCCTGTG TTTTGGTATG 3420 

CTCCAGAATG TTTAATGCAA TCTAAATTTT ATATTGCCTC TGACGTCTGG TCTTTTGGAG 3480 

TCACTCTGCA TGAGCTGCTG ACTTACTGTG ATTCAGATTC TAGTCCCATG GCTTTGTTCC 3540 

15 TGAAAATGAT AGGCCCAACC CATGGCCAGA TGACAGTCAC AAGACTTGTG AATACGTTAA 3600 

AAGAAGGAAA ACGCCTGCCG TGCCCACCTA ACTGTCCAGA TGAGGTTTAT CAACTTATGA 3660 

GGAAATGCTG GGAATTCCAA CCATCCAATC GGACAAGCTT TCAGAACCTT ATTGAAGGAT 3720 

TTGAAGCACT TTTAAAATAA GAAGCATGAA TAACATTTAA ATTCCACAGA TTATCAAGTC 3780 

CTTCTCCTGC AACAAATGCC CAAGTCATTT TTTAAAAATT TCTAATGAAA GAAGTTTGTG 3840 

20 TTCTGTCCAA AAAGTCACTG AACTCATACT TCAGTACATA TACATGTATA AGGCACACTG 3900 

TAGTGCTTAA TATGTGTAAG GACTTCCTCT TTAAATTTGG TACCAGTAAC TTAGTGACAC 3960 

ATAATGACAA CCAAAATATT TGAAAGCACT TAAGCACTCC TCCTTGTGGA AAGAATATAC 4020 

CACCATTTCA TCTGGCTAGT TCACCATCAC AACTGCATTA CCAAAAGGGG ATTTTTGAAA 4080 

ACGAGGAGTT GACCAAAATA ATATCTGAAG ATGATTGCTT TTCCCTGCTG CCAGCTGATC 4140 

25 TGAAATGTTT TGCTGGCACA TTAATCATAG ATAAAGAAAG ATTGATGGAC TTAGCCCTCA 4200 

AATTTCAGTA TCTATACAGT ACTAGACCAT GCATTCTTAA AATATTAGAT ACCAGGTAGT 4260 

ATATATTGTT TCTGTACAAA AATGACTGTA TTCTCTCACC AGTAGGACTT AAACTTTGTT 4320 

TCTCCAGTGG CTTAGCTCCT GTTCCTTTGG GTGATCACTA GCACCCATTT TTGAGAAAGC 4380 

TGGTTCTACA TGGGGGGATA GCTGTGGAAT AGATAATTTG CTGCATGTTA ATTCTCAAGA 4440 

30 ACTAAGCCTG TGCCAGTGCT TTCCTAAGCA GTATACCTTT AATCAGAACT CATTCCCAGA 4500 

ACCTGGATGC TATTACACAT GCTTTTAAGA AACGTCAATG TATATCCTTT TATAACTCTA 4560 

CCACTTTGGG GCAAGCTATT CCAGCACTGG TTTTGAATGC TGTATGCAAC CAGTCTGAAT 4620 

ACCACATACG CTGCACTGTT CTTAGAGGGT TTCCATACTT ACCACCGATC TACAAGGGTT 4680 

GATCCCTGTT TTTACCATCA ATCATCACCC TGTGGTGCAA CACTTGAAAG ACCCGGCTAG 4740 

35 AGGCACTATG GACTTCAGGA TCCACTAGAC AGTTTTCAGT TTGCTTGGAG GTAGCTGGGT 4800 

AATCAAAAAT GTTTAQTCAT TGATTCAATG TGAACGATTA OGGTCTTTAT GACCAAGAGT 4860 

CTGAAAATCT TTTTGTTATG CTGTTTAGTA TTCGTTTGAT ATTGTTACTT TTCACCTGTT 4920 

GAGCCCAAAT TCAGGATTGG TTCAGTGGCA GCAATGAAGT TGCCATTTAA ATTTGTTCAT 4980 

AGCCTACATC ACCAAGGTCT CTGTGTCAAA CCTGTGGCCA CTCTATATGC ACTTTGTTTA 5040 
40 CTCTTTATAC AAATAAATAT ACTAAAGACT TT 
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MQYLNIKEDC 
CIRAAQACRI 



DGHDIENSCL 
NNVFKDPLKE 
WPHSNDGGNV 
EEWNNFSFFP 
HYLCTDVAPP 
CPEKSEQVQG 
KRCCQPKPRE 
TLMDYKDDEG 
VENIMVEEFV 
LLAREGIDSE 
TTLWEICYNG 
ZMROINKLEE 
TGEQVAVKSL 
SGSLKEYLPK 
DFGLTKAIET 
DSSPMALPLK 
SPQNLIEGFB 
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1 

NAMAFCAKMR 
SPLCHNLFAL 
PKKQKNGYEK 
GMAVLAISHY 
FNNKTICDSS 
LYYEVMVTGN 
EITHIVIKES 
LIVHNIQNGC 
AQKQFKNFQI 
ISNLLVATKK 
TSEEKKZKVI 
EGGPLDLFMH 
OGPFIKLSDP 
EIPLKDKTTiI 
QNPDIVSRKK 
KPESGGNHIA 
NKNKINLKQQ 
DKEYYTVKDD 
MIGPTHGQMT 
ALLK 



21 
I 

SSKKTEVNXiE 
YDENTKLWYA 
KKIPDATPLL 
AMMKKMQLPE 
VSTEDIiKVKY 
LGIQWRHKPN 
WSINKQDNK 
HGPICTEYAI 
EVQKGRYSLH 
AQEWQPVYPM 
LKVLDPSHRD 
RKSDVLTTPW 
GIPITVLSRQ 



31 
I 

APEPGVEVIF 
FNRTITVDDK 
DASSLEYLFA 
LPKDISYKRY 
LATLETLTKH 
WSVEKEKNK 



NQPTEVDPTH 
DLKKEIEILR 
LKYAVQICKG 
RDSPVFWYAP 
VTRLVNTLKE 



GSDRSFPSLG 
SQLSFDRILK 
ISLAFFEAAS 
KFKVAKQLAS 
ECIERIPWIA 
RFVTPSCKEL 
FEKRFLKRIR 
NLYHENIVKY 
MDYLGSRQYV 
ECLKQSKFYI 
GKRliPCPPNC 



41 
i 

YLSDREPLRL 
MSLRLHYRMR 
QGQYDLVKCL 
IPETLNKSIR 
YGAEIFETSM 
LKRKKLEKKD 
EALSFVSLVD 
GMYVLRWSCT 
DLKSHLKKQI 
KDLVQGEHLG 
MMRQVSHKHI 
ALSYLEDKDL 
PECVEDSKNL 

adlmtrcmny' 

DLGEGHFGKV 
KGICTEDGGN 
HRDLAARNVL 
ASDVWSFGVT 
PDBVYQLMRK 



51 
I 

GSGEYTAEEL 
FYFTNWHGTN 
APIRDPKTEQ 
QRNLLTRMR1 
LLISSENEMN 



GYFRLTADAH 
DFDNILMTVT 
LRTDNISFML 
RGTRTHIYSG 
VYLYGVCVRD 
VHGNVCTKNL 
SVAADKWSFG 
DPNQRPFFRA 
ELCRYDFEDN 
GIKLXMEFLP 
VESEHQVKIG 
LHELLTYCDS 
CWEFQPSNRT 



Seq ID NO: B65 DNA sequence 

Nucleic Acid Accession #: NM_004867. 

Coding sequence: 140.. 931 



1 
I 

GATCCCAGAC 
TAACAGGATC 
CGCAGCCOGA 
GGAGGAGGGG 
GACCGGCAAG 
GCTTACTCTC 
TTACAAGTAC 
TGAGGATCCT 
GGCTGACATT 
TGATAGTGAC 
CTTGTTGCTG 



11 

I 

CTCGGCTTGC 
TCCTCTTGCA 
AGATTCACTA 
CGGCAAGACG 
GAGCTCCGAG 
TTAGGCCTTT 
TTCATGCCCA 
GCAAATTCCC 
CGTGAGGATG 
CCTGCAGCAA 
GGGAACTGCT 



21 
I 

AGTAGTGTTA 
GTCTGCAGCC 
TGGTGAAAAT 
TGGAGGCCCT 
TTGCCACCCA 
CATTCATCTT 
AGAGCACCAT 
TTCGTGGAGG 
ACAACATTGC 
TTATTCATGA 
ATCTGATGCC 



31 

I 

GACTGAAGAT 
CAGGAOGCTG 
CGCCTTCAAT 
CCTGAGCCGC 
GGAAAAAGAG 
GGCAGGACTT 
TTACCGTGGA 
AGAGCCTAAC 
AATCATTGAT 
CTTTGAAAAG 
CCTCAATACT 



41 
I 

AAAGTAAGTG 
ATTCCAGCAG 
AGCCCTACOG 
ACGGTCAGAA 
GGCTCCTCTG 
ATTGTTGGTG 
GAGATGTGCT 
TTCCTGCCTG 
GTGCCTGTCC 
GGAATGACTG 
TCTATTGTTA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



51 

I ; 

CTGTTTGGGC 
CGCCTTACCG 
CCGTGCAAAA 
CTCAGATACT 
GGAGATGTAT 
GAGCCTGCAT 
TTTTTGATTC 
TGACTGAGGA 
CCAGTTTCTC 
CTTACCTGGA 
TGCCTCCAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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AAATCTGGTA GAGCTCTTTG GCAAACTGGC GAGTGGCAGA TATCTGCCTC AAACTTATGT 720 
GGTTCGAGAA GACCTAGTTG CTGTGGAGGA AATTCGTGAT GTTAGTAACC TTGGCATCTT 780 
TATTTACCAA CTTTGCAATA ACAGAAAGTC CTTCCGCCTT CGTCGCAGAG ACCTCTTGCT 840 
GGGTTTCAAC AAACGTGCCA TTGATAAATG CTGGAAGATT AGACACTTCC CCAACGAATT 900 
5 TATTGTTGAG ACCAAGATCT GTCAAGAGTA AGAGGCAACA GATAGAGTGT CCTTGGTAAT 960 
AAGAAGTCAG AGATTTACAA TATGACTTTA ACATTAAGGT TTATGGGATA CTCAAGATAT 1020 
TTACTCATGC ATTTACTCTA TTGCTTATGC CGTAAAAAAA AAAAAAAAAA AAAAAAAAAA 1080 
AA 

10 Seq ID NO: B66 Protein sequence 
Protein Accession #: NP_O04858.l 

1 11 21 31 41 51 

1C | I I I I ! 

ID mvkiafntpt avqkkearqd veallsrtvr TQILTGKELR VATQEKEGSS GRCMLTLLGL 60 

SFILAGLIVG GACIYKYFMP KSTIYRGEMC PFDSEDPANS LRGGEPNFLP VTEEADIRED 120 

DNIAI IDVPV PSFSDSDPAA IIHDFEKGMT AYLDLLLGNC YLMPLNTSIV MPPKNLVELF 180 

GKLASGRYLP QTYWREDLV AVEEIRDVSN LGIFIYQLCN NRKSFRLRRR DLLLGFNKRA 240 
IDKCWKIRHF PNEFIVETKI CQE 



20 



Seq ID NO: B67 DNA sequence 

Nucleic Acid Accession #: XM_083862.1 

Coding sequence: 121.. 813 



25 1 11 21 31 41 51 

I I I I I I 

ACGCGGGAGG TCTGAGCTGT GGGCTGAGGC AGCGCAGCCG CTGCGCCAGG GTGCGCGATG 60 

CCTTGAACCT GGGAAACTAT GTGAAGCAAC ACTCTGGATT TTGAAAGACA TCTTTTCATC 120 

ATGGGACAGC AAATTTCGGA TCAGACACAG TTGGTTATTA ACAAGTTACC AGAAAAAGTA 180 

30 GCAAAACATG TTACGTTGGT TOGAGAGAGT GGCTCCTTAA CTTATGAAGA ATTTCTCGGG 240 

AGAGTAGCTG AGCTTAATGA TGTAACGGCf AAAGTGGCTT CTGGCCAGGA AAAACATCTT 300 

CTCTTTGAGG TACAACCTGG GTCTGATTCC TCTGCTTTTT GGAAAGTGGT TGTACGGGTG 360 

GTCTGTACCA AGATTAACAA AAGCAGTGGC ATTGTGGAGG CATCACGGAT CATGAATTTA 420 

TACCAGTTTA TTCAACTTTA TAAAGATATC ACAAGTCAAG CAGCAGGAGT ATCGGCACAG 480 

35 AGCTCCACCT CTGAAGAACC TGATGAAAAC TCATCCTCTG TAACATCTTG TCAGGCTAGT 540 

CTTTGGATGG GAAGGGTGAA GCAGCTGACC GATGAGGAGG AGTGTTGTAT CTGTATGGAT 600 

GGGCGGGCTG ACCTCATCCT GCCTTGTGCT CACAGCTTTT GTCAGAAGTG TATTGATAAA 660 

TGGAGTGATC GACACAGGAA TTGCCCTATT TGTCGCCTAC AGATGACTGG AGCAAATGAA 720 

TCTTGGGTGG TATCAGATGC ACCCACTGAA GATGATATGG CTAACTATAT TCTTAACATG 780 

40 GCTGATGAGG CAGGCCAGCC CCACAGGCCA TGACCTTGAA GTGAAAGTCT TCTGTTGCTA 840 

TTGTGGGCTC AAATATTTGG TCATGGGGGA AGAATGTAGG GTTGTGGCAC TGGCACAGAC 900 

ACAGGAAAAT CCATTTTCCC CACTCTTTTA TTTTTGCTAT TCTGATCATT TGTCCCCCTT 960 
TTAAAAATAA ACTTCCCATG TCTTCCAAAA AAAAAAAAAA AAAAA 

45 Seq ID NO: B68 Protein sequence 
Protein Accession #: XP_083862.1 

1 11 21 31 41 51 

CA I i 1 I I I 

J\) MGQQXSDQTQ LVINKLPEKV AKHVTLVRES GSLTYEEPLG RVAELNDVTA KVASGQEKHL 60 

LFEVQPGSDS SAFWKVWRV VCTKINKSSG IVEASRIMNL YQFIQLYKDI TSQAAGVLAQ 120 

SSTSEEPDEN SSSVTSCQAS LWMGRVKQLT DEEECCICMD GRADLILPCA HSFCQKCIDK 180 
WSDRHRNCPI CRLQMTGANE SWWSDAPTE DDMANYILNM ADEAGQPHRP 

55 Seq ID NO: B69 DNA sequence 

Nucleic Acid Accession §: NM_002975.1 
Coding sequence: 180.. 1151 

1 11 21 31 41 51 

60 | I 1 | I I 

CGACCAACGG ACCGGACAGA GACGAGGAGA GGAACAGGAA GAGAGAAGCT GGGAGAATCG 60 

GGAACCTGGG GGCTAGTGAC CTGCACACAG GGCAGGGGCA CTCGGCAGTT CCCAGAGGCC 120 

ACCCCTCCCA CCCCAGACAT CCAGACATCT GGAACTTTGG GTGCCAAGAG TCCAGCTTAA 180 

TGCAGGCAGC CTGGCTTTTG GGGGCTTTGG TGGTCCCCCA GCTCTTGGGC TTTGGCCATG 240 

65 GGGCTCGGGG AGCAGAGAGG GAGTGGGAGG GAGGCTGGGG AGGTGCCCAG GAGGAGGAGC 300 

GGGAGAGGGA GGCCCTGATG CTGAAGCATC TGCAGGAAGC CCTAGGACTG CCTGCTGGGA 360 

GGGGGGATGA GAATCCTGCC GGAACTGTTG AGGGAAAAGA GGACTGGGAG ATGGAGGAGG 420 

ACCAGGGGGA GGAAGAGGAG GAGGAAGCAA CGCCAACCCC ATCCTCCGGC CCCAGCCCCT 480 

CTCCCACCCC TGAGGACATC GTCACTTACA TCCTGGGCCG CCTGGCOQGC CTGGACGCAG 540 

70 GCCTGCACCA GCTGCACGTC CGTCTGCACG CGTTGGACAC CCGCGTGGTC GAGCTGACCC 600 

AGGGGCTGCG GCAGCTGCGG AACGCGGCAG GCGACACCCG CGATGCCGTG CAAGCCCTGC 660 

AGGAGGCGCA GGGTCGCGCC GAGCGCGAGC ACGGCCGCTT GGAGGGCTGC CTGAAGGGGC 720 

TGCGCCTGGG CCACAAGTGC TTCCTGCTCT CGCGCGACTT CGAAGCTCAG GCGGCGGCGC 780 

c AGGCGCGGTG CACGGCGCGG GGCGGGAGCC TGGCGCAGCC GGCAGACCGC CAGCAGATGG 840 

75 AGGCGCTCAC TCGGTAOCTG CGCGCGGOGC TCGCTCCCTA CAACTGGCCC GTGTGGCTGG 900 

GCGTGCACGA TCGGCGCGCC GAGGGCCTCT ACCTCTTCGA AAACGGCCAG CGCGTGTCCT 960 

TCTTCGCCTG GCATCGCTCA CCCCGCCCCG AGCTOGGCGC CCAGCCCAGC GCCTCGCCGC 1020 

ATCCGCTCAG CCCGGACCAG CCCAAOGGTG GCACGCTCGA GAACTGCGTG GCGCAGGCCT 1080 

_ _ CTGACGACGG CTCCTGGTGG GACCACGACT GCCAGCGGCG TCTCTACTAC GTCTGCGAGT 1140 

80 TCCCCTTCTA GCGGGGCCGG TACCCCGCCT CCTTGCCCAT CCCACCACCC GGCCTTTCCC 1200 

TGCGCCGTGC CCACCCTCCT CCGGAATCGC CCTTCCCTTC CTGGCCAGGA ATGGCAGCGT 1260 

CCTCCCCGAC CCCCAGTCTG GGCGCTTCTG GGAGGGCTCT TGCGGTGCCG GCACTCCTCC 1320 

TTGTTAGTGT CTTTCCTTGA AGGGGCGGGC ACCAGGCTAG GTCCGGTGCC AATAAATCCT 1380 
TGTGGAATCT GAAAAAAAAA AAAAAAA 
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Seq ID NO: B70 Protein sequence 
Protein Accession #: NPJ>02966.1 

1 11 21 31 41 51 

|| j 1 I I 

MQAAWLLGAL WPQLLGFGH GARGAEREWE GGWGGAQEBB REREALMLKH LQEALGLPAG 60 

RGDENPAGTV EGKEDWEMEE DQGEBEEEEA TPTPSSGPSP SPTPEDIVTY ILGRIAGLDA 120 

GLHQLHVRLH ALDTRWELT QGLRQLRNAA GDTRDAVQAL QEAQGRAERE HGRLEGCLKG 180 

LRLGHKCFLL SRDFEAQAAA QARCTARGGS LAQPADRQQM EALTRYIiRAA LAPYNWPVWL 240 

GVHDRRAEGL YLFENGQRVS FFAWHRSPRP ELGAQPSASP HPLSPDQPNG GTLENCVAQA 300 
SDDGSWWDHD CQRRLYYVCE FPP 

Seq ID NO: B71 DNA sequence 

Nucleic Acid Accession #: BC000839.1 

I 11 21 31 41 51 

II i I I i 

CTGGCCAGGG CGACTGGCGG ATAAGGTCTT GTGCGTGGCC TCGAGGCTTA AAAGTAGCAG 60 

TGGGGCTTTG TGAAGGACAA AATGGCGATG GCGGGCCGTG TAGGTCCCCC TTCCTATGAT 120 

GAGGACCTTT TCACAGACCT GTACTGAGCT CCGTGAGGAT AAGTAACTCT GAGGAGATGG 180 

GCCCTGCAAG CCTCCTTCTT AGCCGTCTGT TCAGAAAATA GCGTTTTCGA AATGCCCTGA 240 

GTTGACCTAA TGTCTTATTG GGCTCCTGTC TGCAGGATTT ACGCGCACGT TGGAACCGAA 300 

GAGAGCTCTG TTGTTGCAAT GTTCAGCCCA CAAGAGCTTA CTGGTGAAGG AATGGGACAA 360 

GACCCATCTT TATGCAAAGC CAGCGTTACA GTAATGTTCC AGCATCTCAT AATCTATCCT 420 

GGGGAATTCA GCTGCCTCCC AGGGTGAATA CAGGTATTCC TGATGACAGT CTGCCTCTAT 480 

CTTACAGAGC AGCTTGTTGC TATATACCAT TGAAAAGCCT TCAGAGCTGA GAGGTACTAC 540 

TAACCAATAA CCTGCTTGGC TCAAAGGGCC AGCACCTTCT CTCTAAAGCC CAAGAGGAGT 600 

TTGAGGAAAA CTAGGTGTCT GTGTTCACTC CAGGCTGAAG TTACAGGTCT GAGCAAATAA 660 

GGTGTATAAA AAATGGAATC TGTCTTGGAG GACATCAGAA GGTGAATTTT CCAAGTTCTT 720 

GGACAACCTA GCTGTTGAAA AGCTTTCTGG GTTTGGGGGG TATTTCAGAT GTACCTTAAA 780 

GTGTTAGCAG ATACAGATTA AGACACTGGG AGCCAATGAA ACAGCAGTTG AGGGTTTGCT 840 

GTGTATCACA TTTCTGTATT TTATCACCCC CTTCCTGCAA CATTATTTAT CTGGAATCTA 900 

CCTGCCCTTT TGTTTTTTAG ATACAAGGGC TTGGTTTTGT TACCCAGGCT GGTTTCAAGG 960 

CCATAGCTTT AAGAGATCCT CTCACCACAG ATTTCCAAAG TGCTGGGATT GCAGGTGTGA 1020 

TTCATGGCAC CCAGACTTTG CTGCCTTTCT TACATGATCC AGGCCCAGAA CCCAAACTCA 1080 

GGCACTGTAT AGATGACCAC TTTCGTAAAC TACTGACCTA GCTTGTTGCC AATTGTTGAT 1140 

TGAACTTCCC ATAACTCCAC TTCGTGTCTG TTCCTCTGTA TACAGCCACC TTCTGTTCCC 1200 

GTCATGAGCC TTTAGGTCTC CATTTGCATA TTGCAAATAC TATGTTCCAT GTAGGTAGCT 1260 

CATTCAGGGC CTTGCTCTTC ACTTCAAAAA AGGTTCCCTT GAGGACTGGC TGTCAATTTG 1320 

TGTTGCTGTG TTGGTTGTTG ATGAAAATAA TAAAATGATT GATTACATAA AAAAAAAAAA 1380 
AAAAAAA 

Seq ID NO: B72 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

TGCTAGTAGG GCCTGGGTTA ATCGGCCGAG GGTGGCTTOG TGGTCTTTAT AGCTGTTACT 60 

CTTTGTACTT GTCTTTTTCT TTTATTTTCT TTTGAGCGAT TGTGCGAACA TAGCATAGCA 120 

CGCACTATGG CACGAGGCTC GTGCTGCCTG GCCAGGGCGA CTGGCGGATA AGGTCTTGTG 180 

CGTGGCCTCG AGGCTTAAAA GTAGCAGTGG GGCTTTGTGA AGGACAAAAT GGCGATGGCG 240 

GGCOGTGTAG GTCCCCCTTC CTATGATGAG GACCTTTTCA CAGACCTGTA CTGAGCTCCG 300 

TGAGGATAAA TAACTCTGAG GAGATGGGCC CTGCAAGCCT CTTGCTTAGC CGTCTGTTCA 360 

GAAAATAGCG TTTTCGAAAT GCCCTGAGTT GACCTAATGT CTTATTGGGC TCCTGTCTGC 420 

AGGATTTACG CGCACGTTGG AACCGAAGAG AGCTCTGTTG TTGCAATGTT CAGCCCACAA 480 

GAGCTTACTG GTGAAGGAAT GGGACAAGAC CCATCTTTAT GCAAAGCCAG CGTTACAGTA 540 

ATGTTCCAGC ATCTCATAAT CTATCCTGGG GAATTCAGCT GCCTCCCAGG GTGAATACAG 600 

GTATTCCTGA TGACAGTCTG CCTCTATCTT ACAGAGCAGC TTGTTGCTAT ATACCATTGA 660 

AAAGCCTTCA GAGCTGAGAG GTACTACTAA CCAATAACCT GCTTGGCTCA AAGGGCCAGC 720 

ACCTTCTCTC TAAAGCCCAA GAGGAGTTTG AGGAAAACTA GGTGTCTGTG TTCACTCCAG 780 

GCTGAAGTTA CAGGTCTGAG CAAATAAGGT GTATAAAAAA TGGAATCTGT CTTGGAGGAC 840 

ATCAGAAGGT GAATTTTCCA AGTTCTTGGA CAACCTAGCT GTTGAAAAGC TTTCTGGGTT 900 

TGGGGGGTAT TTCAGATGTA CCTTAAAGTG TTAGCAGACA CAGATTAAGA CACTGGGAGC 960 

CAATGAAACA GCAGTTGAGG GTTTGCTGTG TATCACATTT CTGTATTTTA TCACCC CCTT 1020 

CCTGCAACAT TATTTATCTG GAATCTACCT GCCCTTTTGT TTTTTAGATA CAAGGGCTTG 1080 

GTTTTGTTAC CCAGGCTGGT TTCAAGGCCA TAGCTTTAAG AGATCCTCTC ACCACAGATT 1140 

TCCAAAGTGC TGGGATTGCA GGTGTGATTC ATGGCACCCA GACTTTGCTG CCTTTCTTAC 1200 

ATGATCCAGG CCCAGAACCC AAACTCAGGC ACTGTATAGA TGACCACTTT OGTAAACTAC 1260 

TGACCTAGCT TGTTGCCAAT TGTTGATTGA ACTTCCCATA ACTCCACTTC GTGTCTGTTC 1320 

CTCTGTATAC AGCCACCTTC TGTTCCCGTC ATGAGCCTTT AGGTCTCCAT TTGCATATTG 1380 

CAAATACTAT GTTCCATGTA GGTAGCTCAT TCAGGGOCTT GCTCTTCACT TCAAAAAAGG 1440 

TTCCCTTGAG GACTGGCTGT CAATTTGTGT TGCTGTGTTG GTTGTTGATG AAAATAATAA 1500 

AATGATTGAT TACATAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACAAA CACAAAAAAC 1560 
CGCCGCGCTT TTTCCCGGGC GCACAAAGTT ATAAAACGCC GTCCATC 

Seq ID NOt B73 sequence 

Nucleic Acid Accession gs NM_000222.1 

Coding sequence: 22.. 2952 

1 11 21 31 41 SI 

1 I I I I I 

GATCCCATCG CAGCTACCGC GATGAGAGGC GCTCGCGGCG CCTGGGATTT TCTCTGCGTT 60 

CTGCTCCTAC TGCTTCGCGT CCAGACAGGC TCTTCTCAAC CATCTGTGAG TCCAGGGGAA 120 
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COGTCTCCAC CATCCATCCA TCCAGGAAAA TCAGACTTAA TAGTCCGCGT GGGCGACGAG 180 
ATTAGGCTGT TATGCACTGA TCGGGGCTTT GTCAAATGGA CTTTTGAGAT CCTGGATGAA 240 
ACGAATGAGA ATAAGCAGAA TGAATGGATC AOGGAAAAGG CAGAAGCCAC CAACACOGGC 300 
AAATACACGT GCACCAACAA ACACGGCTTA AGCAATTCCA TTTATGTGTT TGTTAGAGAT 360 
5 CCTGCCAAGC TTTTCCTTGT TGACCGCTCC TTGTATGGGA AAGAAGACAA CGACACGCTG 420 
GTCCGCTGTC CTCTCACAGA CCCAGAAGTG ACCAATTATT CCCTCAAGGG GTGCCAGGGG 480 
AAGCCTCTTC CCAAGGACTT GAGGTTTATT CCTGACCCCA AGGCGGGCAT CATGATCAAA 540 
AGTGTGAAAC GCGCCTACCA TCGGCTCTGT CTGCATTGTT CTGTGGACCA GGAGGGCAAG 600 
TCAGTGCTGT CGGAAAAATT CATCCTGAAA GTGAGGCCAG CCTTCAAAGC TGTGCCTGTT 660 
10 GTGTCTGTGT CCAAAGCAAG CTATCTTCTT AGGGAAGGGG AAGAATTCAC AGTGACGTGC 720 
ACAATAAAAG ATGTGTCTAG TTCTGTGTAC TCAACGTGGA AAAGAGAAAA CAGTCAGACT 780 
AAACTACAGG AGAAATATAA TAGCTGGCAT CACGGTGACT TCAATTATGA ACGTCAGGCA 840 
ACGTTGACTA TCAGTTCAGC GAGAGTTAAT GATTCTGGAG TGTTCATGTG TTATGCCAAT 900 
AATACTTTTG GATCAGCAAA TGTCACAACA ACCTTGGAAG TAGTAGATAA AGGATTCATT 960 
15 AATATCTTCC CCATGATAAA CACTACAGTA TTTGTAAACG ATGGAGAAAA TGTAGATTTG 1020 
ATTGTTGAAT ATGAAGCATT CCCCAAACCT GAACACCAGC AGTGGATCTA TATGAACAGA 1080 
ACCTTCACTG ATAAATGGGA AGATTATCCC AAGTCTGAGA ATGAAAGTAA TATCAGATAC 1140 
GTAAGTGAAC TTCATCTAAC GAGATTAAAA GGCACCGAAG GAGGCACTTA CACATTCCTA 1200 
GTGTCCAATT CTGACGTCAA TGCTGCCATA GCATTTAATG TTTATGTGAA TACAAAACCA 1260 
20 GAAATCCTGA CTTACGACAG GCTOGTGAAT GGCATGCTCC AATGTGTGGC AGCAGGATTC 1320 
CCAGAGCCCA CAATAGATTG GTATTTTTGT CCAGGAACTG AGCAGAGATG CTCTGCTTCT 1380 
GTACTGCCAG TGGATGTGCA GACACTAAAC TCATCTGGGC CACOGTTTGG AAAGCTAGTG 1440 
GTTCAGAGTT CTATAGATTC TAGTGCATTC AAGCACAATG GCACGGTTGA ATGTAAGGCT 1500 
TACAACGATG TGGGCAAGAC TTCTGCCTAT TTTAACTTTG CATTTAAAGG TAACAACAAA 1560 
25 GAGCAAATCC ATCCCCACAC CCTGTTCACT CCTTTGCTGA TTGGTTTCGT AATCGTAGCT 1620 
GGCATGATGT GCATTATTGT GATGATTCTG ACCTACAAAT ATTTACAGAA ACCCATGTAT 1680 
GAAGTACAGT GGAAGGTTGT TGAGGAGATA AATGGAAACA ATTATGTTTA CATAGACCCA 1740 
ACACAACTTC CTTATGATCA CAAATGGGAG TTTCCCAGAA ACAGGCTGAG TTTTGGGAAA 1800 
ACCCTGGGTG CTGGAGCTTT OGGGAAGGTT GTTGAGGCAA CTGCTTATGG CTTAATTAAG 1860 
30 TCAGATGCGG CCATGACTGT OGCTGTAAAG ATGCTCAAGC CGAGTGCOCA TTTGACAGAA 1920 
CGGGAAGCCC TCATGTCTGA ACTCAAAGTC CTGAGTTACC TTGGTAATCA CATGAATATT 1980 
GTGAATCTAC TTGGAGCCTG CACCATTGGA GGGCCCAOCC TGGTCATTAC AGAATATTGT 2040 
TGCTATGGTG ATCTTTTGAA TTTTTTGAGA AGAAAACGTG ATTCATTTAT TTGTTCAAAG 2100 
CAGGAAGATC ATGCAGAAGC TGCACTTTAT AAGAATCTTC TGCATTCAAA GGAGTCTTCC 2160 
35 TGCAGCGATA GTACTAATGA GTACATGGAC ATGAAACCTG GAGTTTCTTA TGTTGTCCCA 2220 
ACCAAGGCCG ACAAAAGGAG ATCTGTGAGA ATAGGCTCAT ACATAGAAAG AGATGTGACT 2280 
CCCGCCATCA TGGAGGATGA CGAGTTGGCC CTAGACTTAG AAGACTTGCT GAGCTTTTCT 2340 
TACCAGGTGG CAAAGGGCAT GGCTTTCCTC GCCTCCAAGA ATTGTATTCA CAGAGACTTG 2400 
GCAGCCAGAA ATATCCTCCT TACTCATGGT CGGATGACAA AGATTTGTGA TTTTGGTCTA 2460 
40 GCCAGAGACA TCAAGAATGA TTCTAATTAT GTGGTTAAAG GAAACGCTCG ACTACCTGTG 2520 
AAGTGGATGG CACCTGAAAG CATTTTCAAC TGTGTATACA CGTTTGAAAG TGACGTCTGG 2580 
TCCTATGGGA TTTTTCTTTG GGAGCTGTTC TCTTTAGGAA GCAGCCCCTA TCCTGGAATG 2640 
CCGGTCGATT CTAAGTTCTA CAAGATGATC AAGGAAGGCT TCCGGATGCT CAGCCCTGAA 2700 
CACGCACCTG CTGAAATGTA TGACATAATG AAGACTTGCT GGGATGCAGA TCCCCTAAAA 2760 
45 AGACCAACAT TCAAGCAAAT TGTTCAGCTA ATTGAGAAGC AGATTTCAGA GAGCACCAAT 2820 
CATATTTACT CCAACTTAGC AAACTGCAGC CCCAACCGAC AGAAGCCCGT GGTAGACCAT 2880 
TCTGTGCGGA TCAATTCTGT CGGCAGCACC GCTTCCTCCT CCCAGCCTCT GCTTGTGCAC 2940 
GACGATGTCT GAGCAGAATC AGTGTTTGGG TCACCCCTCC AGGAATGATC TCTTCTTTTG 3000 
GCTTCCATGA TGGTTATTTT CTTTTCTTTC AACTTGCATC CAACTCCAGG ATAGTGGGCA 3060 
50 CCCCACTGCA ATCCTGTCTT TCTGAGCACA , CTTTAGTGGC CGATGATTTT TGTCATCAGC 3120 
CACCATCCTA TTGCAAAGGT TCCAACTGTA TATATTCCCA ATAGCAACGT AGCTTCTACC 3180 
ATGAACAGAA AACATTCTGA TTTGGAAAAA GAGAGGGAGG TATGGACTGG GGGCCAGAGT 3240 
CCTTTCCAAG GCTTCTCCAA TTCTGCCCAA AAATATGGTT GATAGTTTAC CTGAATAAAT 3300 
GGTAGTAATC ACAGTTGGCC TTCAGAACCA TCCATAGTAG TATGATGATA CAAGATTAGA 3360 
55 AGCTGAAAAC CTAAGTCCTT TATGTGGAAA ACAGAACATC ATTAGAACAA AGGACAGAGT 3420 
ATGAACACCT GGGCTTAAGA AATCTAGTAT TTCATGCTGG GAATGAGACA TAGGCCATGA 3480 
AAAAAATGAT CCCCAAGTGT GAACAAAAGA TGCTCTTCTG TGGACCACTG CATGAGCTTT 3540 
TATACTACCG ACCTGGTTTT TAAATAGAGT TTGCTATTAG AGCATTGAAT TGGAGAGAAG 3600 
GCCTCCCTAG CCAGCACTTG TATATACGCA TCTATAAATT GTCCGTGTTC ATACATTTGA 3660 
60 GGGGAAAACA CCATAAGGTT TCGTTTCTGT ATACAACCCT GGCATTATGT OCACTGTGTA 3720 
TAGAAGTAGA TTAAGAGCCA TATAAGTTTG AAGGAAACAG TTAATACCAT TTTTTAAGGA 3780 
AACAATATAA CCACAAAGCA CAGTTTGAAC AAAATCTCCT CTTTTAGCTG ATGAACTTAT 3840 
TCTGTAGATT CTGTGGAACA AGCCTATCAG CTTCAGAATG GCATTGTACT CAATGGATTT 3900 
GATGCTGTTT GACAAAGTTA CTGATTCACT GCATGGCTCC CACAGGAGTG GGAAAACACT 3960 
65 GCCATCTTAG TTTGGATTCT TATGTAGCAG GAAATAAAGT ATAGGTTTAG CCTCCTTCGC 4020 
AGGCATGTCC TGGACACCGG GCCAGTATCT ATATATGTGT ATGTACGTTT GTATGTGTGT 4080 
AGACAAATAT TTGGAGGGGT ATTTTTGCCC TGAGTCCAAG AGGGTCCTTT AGTACCTGAA 4140 
AAGTAACTTG GCTTTCATTA TTAGTACTGC TCTTGTTTCT TTTCACATAG CTGTCTAGAG 4200 
TAGCTTACCA GAAGCTTCCA TAGTGGTGCA GAGGAAGTGG AAGGCATCAG TCCCTATGTA 4260 
70 TTTGCAGTTC ACCTGCACTT AAGGCACTCT GTTATTTAGA CTCATCTTAC TGTACCTGTT 4320 
CCTTAGACCT TCCATAATGC TACTGTCTCA CTGAAACATT TAAATTTTAC CCTTTAGACT 4380 
GTAGCCTGGA TATTATTCTT GTAGTTTACC TCTTTAAAAA CAAAACAAAA CAAAACAAAA 4440 
AACTCCCCTT CCTCACTGCC CAATATAAAA GGCAAATGTG TACATGGCAG AGTTTGTGTG 4500 
TTGTCTTGAA AGATTCAGGT ATGTTGCCTT TATGGTTTCC CCCTTCTACA TTTCTTAGAC 4560 
75 TACATTTAGA GAACTGTGGC CGTTATCTGG AAGTAACCAT TTGCACTGGA GTTCTATGCT 4620 
CTCGCACCTT TCCAAAGTTA ACAGATTTTG GGGTTGTGTT GTCACCCAAG AGATTGTTGT 4680 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4740 
AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4800 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4860 
80 AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4920 
AATGTCTTTT GAATATTCCC AAGCCCATGA GTCCTTGAAA ATATTTTTTA TATATACAGT 4980 
AACTTTATGT GTAAATACAT AAGCGGCGTA AGTTTAAAGG ATGTTGGTGT TCCACGTGTT 5040 
TTATTCCTGT ATGTTGTCCA ATTGTTGACA GTTCTGAAGA ATTC 
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Seq ID NO: B74 protein sequence 
Protein Accession 8: NP_000213.l 

,.1 11 21 31 41 51 

5 I 1 | I I 1 

MRGARGAWDF LCVLLLLLRV QTGSSQPSVS PGBPSPPSIH PGXSDLIVRV GDEIRLLCTD 60 

PGFVKWTFEI LDETNENKQN EWITEKAEAT NTGKYTCTNK HGLSNSIYVP VRDPAKLFLV 120 

DRSLYGKEDN DTLVRCPLTD PEVTNYSLKG CQGKPLPKDL RFIPDPKAGI MIKSVKRAYH 180 

RLCLHCSVDQ EGKSVLSBKF ILKVRPAFKA VPWSVSKAS YLLREGEEPT VTCTIKDVSS 240 

10 SVYSTHKREN SQTKLQEKYN SWHHGDFNYE RQATLTISSA RVNDSGVFMC YANNTFGSAN 300 

VTTTLEWDK GFINIFPMIN TTVFVNDGEN VDLIVEYEAF PKPEHQQWIY MNRTFTDKWE 360 

DYPKSENESN IRYVSELHLT RLKGTEGGTY TFLVSNSDVN AAIAFNVYVN TKPEILTYDR 420 

LVNGMLQCVA AGFPEPTIDW YFCPGTEQRC SASVLPVDVQ TLNSSGPPFG KLWQSSIDS 480 

SAFKHNGTVE CKAYNDVGKT SAYFNFAFKG NNKEQIHPHT LFTPLLIGFV IVAGMMCIIV 540 

15 MILTYKYLQK PMYEVQV3KW EEINGNNYVY IDPTQLPYDH KWBFPRNRLS FGKTLGAGAF 600 

GKWEATAYG LIKSDAAMTV AVKMLKPSAH LTEREALMSE LKVLSYLGNH MNIVNIiLGAC 660 

TIGGPTLVIT EYCCYGDLLN FLRRKRDSFI CSKQEDHAEA ALYKNLLHSK ESSCSDSTNE 720 

YMDMKPGVSY WPTKADKRR SVRIG3YIER DVTPAIMEDD ELALDLEDLL SFSYQVAKGM 780 

AFLASKNCIH RELAARNILL THGRITKICD FGLARDIKND SNYWKGNAR LPVKWMAPES 840 

20 IFNCVYTFES DVWSYGIFLW ELFSLGSSPY PGMPVDSKFY KMIKEGFRML SPEHAPAEMY 900 

DIMKTCWDAD PLKRPTFKQI VQLIEKQISE STNHIYSNLA KCSPNRQKPV VDHSVRINSV 960 
GSTASSSQPIi LVHDDV 



Seq ID NOs B75 DNA sequence 
25 Nucleic Acid Accession #: NMJ)04456 
Coding sequence: 58.. 2298 

1 11 21 31 41 51 

„ ! I I I 1 ! 

30 GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60 

GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGOGGAAGCG TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300 

35 GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360 

AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA GGGGGATAGA 540 

GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 600 

40 AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660 

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCOGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840 

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900 

45 CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

50 AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320 

CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380 

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500 

55 GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACOGGTT GTGGGCTGCA 1560 

CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 1620 

CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTOGTGCC CTTGTGTGAT AGCACAAAAT 1680 

TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAOTGTCAAA ACCGCTTTCC GGGATGCCGC 1740 

TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC 1800 

60 CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 1860 

AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 1920 

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 2040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 2100 

65 AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 2160 

GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220 

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 2280 

GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340 

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 2400 

70 AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA 



75 



Seq ID NO; B76 Protein sequence 
Protein Accession #: NP 004447 



1 11 21 31 41 51 

I I I I I I 

MGQTGKKSBK GPVCWRKRVK SEYMRLRQLK RFRRADEVKS MFSSNRQKIIi ERTEILNQEW 60 

80 KQRRIQPVHI LTSVSSLRGT RECSVTSDLD FPTQVIPLKT LNAVASVPIM YSWSPLQQNF 120 

MVEDETVLHN IPYMGDBVU) QDGTFIEELI KNYDGKVHGD RECGPINDBI FVELVNALGQ 180 

YNDDDDDDDG DDPEEREEKQ KDLEDHRDDK ESRPPRKFPS DKILEAISSM FPDKGTAEEL 240 

KEKYKELTEQ QLPGALPPEC TPNIDGPNAK SVQREQSLHS FHTLFCRRCF KYDCFLHPFH 300 

ATPNTYKRKK TETALDNKPC GPQCYQHLEG AKEPAAALTA ERIKTPPKRP GGRRRGRLPN 360 
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NSSRPSTPTI NVLBSKDTDS DRBAGTETGG ENNDKEEEEK KDETSSSSEA NSRCQTPIKM 
KPNIEPPENV EWSGAEASMF RVLIGTYYDN FCAIARLIGT KTCRQVYEPR VKBSSIIAPA 
PAEDVDTPPR KKKRXHRLWA AHCRKIQLKK DGSSNHVYNY QPCDHPRQPC DSSCPCVIAQ 
NPCEKPCQCS SECQNRPPGC RCKAQCNTKQ CPCYLAVREC DPDLCLTCGA ADHWDSKNVS 
CKNCSIQRGS KKHLLLAPSD VAGWGIFIKD PVQKNEFISE YCGEIISQDE ADRRGKVYDK 
YMCSFLFNLN NDFWDATRK GNKIRFANHS VNPNCYAKVM MVNGDHRIGI FAKRAIQTGE 
ELFVDYRYSQ ADALKYVGIE REMEIP 
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Seq ID NO: B77 DNA sequence 
Nucleic Acid Accession If: NM_007015 
Coding sequence: 1..1005 



ATGACAGAGA 
TGCAGCCCCC 
AAGGTGGGAG 
GCCTTCTACT 
ATCAATGGGA 
TTTAAAATGG 
ACAGGAATTC 
ATTCCTGAGG 
ATGCCAGTCA 
GACAACAGCT 
CTTAAACCAA 
GTTCCAACTA 
CTGAATAATG 
CCTTATCATC 
ATCTGTTGTA 
GGGGGCTATT 
ATGCCATGTA 
CACGTGCTGT 
ATGCTGATGG 
TAACAGAATT 
AGTTCAAGTC 
CATAAGTCTT 
AGTTTGCC 




AAAATAAQAA 
GACCATAAAA 
TTTTTAATCG 
TAAAATGCCA 
CCCTTGCTTG 



21 
I 

AGTTCCCATT 
TACGCTGACG 
CATTTCGGGA 
GAGCGACAGT 
TGGGTCAATG 
TGAAGAAGCA 
AGGAGAGAAG 
GACCAAACAG 
AAATTCTCTT 
TAAGGTGTTA 
AGAAATCCAG 
ACCACACAGT 
CAGTGTTCAA 
GGAAAGCATG 
GCGGAGCTAC 
TTATAATTAT 
GGCCCGTATC 
CTAGCTGAAG 
TATTTTTACA 
TTTTCCAGAA 
TAACCCCGTT 
CATCTTCCAA 



31 
I 

GCCCTGGTGG 
GTGAAGCCCT 
GCTGTGCTGC 
CACATTTACA 
GAAATAGACG 
ATTGCAGTTA 
TGCTACATTA 
AGCATCTCCT 
ATCTGGGTGG 
GAACTCTGCG 
AGGGAAAGAA 
GGACCACGGA 
GAGGACTCAC 
ACATTCGACC 
ACCCACTGCC 
CAAGGCTGCC 
TTGGGCATGG 
AGACAACCAA 
CGCAGCCTGA 
CTTTAGTATA 
ATTTGTTATT 
AGCTATTTCG 



41 
I 

GACCTGATGA 
CCAGCCCCGC 
TGCTCTTTGG 
ATGTCCATTA 
CTGGGAACAA 
ATGATTTCCA 
AAGCGCAAGT 
CCAAACTGGA 
CTGTAGATCA 
GTGACCTTCC 
GAGAAGTGGT 
GCAACCCAGG 
AAGCCTTCAA 
CTAGACTGGA 
AGAAGATCTG 
GTTCGGCCTG 
TGTGAAATCA 
AGAAGCATTA 
GCGGTTATTC 
TGCAAATGCA 
TTTTATTTGC 
AAATAAACAC 



51 
I 

CGTGGAATTC 
GCGGCTGCTC 
GGCCATCGGG 
CACCATGAGT 
CTTGGAGACC 
GAATGGCATC 
GAAGGCTCGT 
AGGCAAGATC 
GCCTGTGAAG 
TATTTTCTGG 
AAGAAAAATT 
CGCTGGAAGA 
k TCCTGATAAT 
TCACGAAGGA 
TGAACCCCTG 
CAGAGTCATC 
CTTCATATAT 
AGGCAGGTTG 
TTGACACTCT 
CTGAAAGGGT 
ATTGATTTGC 
GAAAATTTAC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: B78 Protein sequence 
Protein Accession # : NP 008946 



1 11 

I I 
MTENSDKVPI ALVGPDDVEF 
AFYFWKGSDS HIYNVHYTMS 
TGIRFAGGEK CYIKAQVKAR 
DKSFLSSKVL ELCGDLPIFW 
LNNETRPSVQ EDSQAFNPDN 
GGYYPWPYNY QGCRSACRVI 



21 31 41 51 

1 I I I 

CSPPAYATLT VKPSSPARLL KVGAWLISG AVLLLFGAIG 
INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 
IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 
LKPTYPKEIQ RERREWRKI VPTTTKRPHS GPRSNPGAGR 
PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 
MPCSWWVARI LGMV 



Seq ID NO: B79 DNA sequence 

Nucleic Acid Accession #: NH_012449.1 

Coding sequence: 6 6.. 10 85 



CCGAGACTCA 
AATTAATGGA 
GGAGAAATTT 
AAAGACCTGT 
CAGAACTTCA 
CTATTATAGC 
CAACTTCCCA 
CAATGGTTTC 
TCCAACTTCA 
TAACAAGAAA 
GTCTGTCTTA 
AGGTCCAACA 
ATGTGTCTCT 
CATCTGTGAG 
TTGTTTCCCT 
ATATAAAACA 
TTGTTGTCCT 
AGATTAGACA 
TGTAGAATTA 
TCAAGTTTGT 



11 
I 

CGGTCAAGCT 
AAGCAGAAAA 
AGAAGAAGAC 
GCTTTTGCAT 
GCACACACAG 
ATCTCTGACT 
TCAACAATAT 
CATCACTCTC 
TAATGGAACC 
GCAGTTTGGG 
CCCAATGAGG 
AAATAAAGAA 
GGGAATTGTG 
TGACTCTTTG 
TCTACTGGGC 
ATTTGTATGG 
GATATTTAAA 
TGGTTGGGAA 
CTGTTTACAC 
ATTTGTTAAT 



21 
I 

AAGGCGAAGA 
GACATCACAA 
GATTATTTGC 
TTGCACCAAA 
GAACTCTTTC 
TTTCTTTACA 
TTTTATAAAA 
TTGGCATTGG 
AAGTATAAGA 
CTTCTCAGTT 
CGATCCTACA 
GATGCCTGGA 
GGATTGGCAA 
ACATGGAGAG 
ACAATACACG 
TATACACCTC 
AGCATACTAT 
GAOGTCACCA 
ACATTTTTGT 
AAAATGATTA 



31 
I 

GTGGGTGGCT 
ACCAAGAAGA 
ATAAGGACAC 
CAGCCCATGC 
CACAGTGGCA 
CTCTTCTGAG 
TTCCAATCCT 
TTTACCTGCC 
AGTTTCCACA 
TCTTTTTTGC 
GATACAAGTT 
TTGAGCATGA 
TACTGGCTCT 
AATTTCACTA 
CATTGATTTT 
CAACTTTTAT 
TCCTGCCATG 
AAATTAACAA 
TCAATATTGA 
TTCAAGGAAA 



41 
I 

GAAGCCATAC 
ACTTTGGAAA 
GGGAGAOACC 
TGATGAATTT 
CTTGCCAATT 
GGAAGTAATT 
GGTCATCAAC 
AGGTGTGATA 
TTGGTTGGAT 
TGTACTGCAT 
GCTAAACTGG 
TGTTTGGAGA 
GTTGGCTGTG 
TATTCAGAGC 
TGCCTGGAAT 
GATAGCTGTT 
CTTGAGGAAG 
AACTGAGATA 
TATATTTTAT 
AAAAAAAAAA 



60 
120 
180 
240 
300 



51 
I 

TATTTTATAG 
ATGAAGCCTA 
AGCATGCTAA 
GACTGCCCTT 
AAAATAGCTG 
CACCCTTTAG 
AAAGTCXTGC 
GCAGCAATTG 
AAGTGGATGT 
GCAATTTATA 
GCATATCAAC 
ATGGAGATTT 
ACATCTATTC 
AAGCTAGGAA 
AAGTGGATAG 
TTCCTTCCAA 
AAGATACTGA 
TGTTCCCAGT 
CACCAACATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: B80 Protein sequence 
Protein Accession ft: NP_036581.1 



1 11 21 31 41 51 

I I I I I I 

KBSRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADEFDCPSE 
LQHTQELFPQ WHLPIKIAAI IASLTPLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 
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120 
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VSITLLALVY LPGVIAAIVQ LHNGTKYKKP PHWLDKWMLT RKQFGLLSFP FAVLRAIYSL 180 

SYPMRRSYRY KLLNWAYQQV QQNKBDAWIE HDVWRMEIYV SLGIVGIAIL ALIAVTSIPS 240 

VSDSLTWREF HYIQSKLQIV SUULGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 300 
VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL 

Seq ID NO: B81 DNA sequence 
Nucleic Acid Accession ih NM_000684 
Coding sequence: 87.. 1520 

1 11 21 31 41 51 

TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 60 

CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 120 

AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 180 

TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT TGCTGCCTCC CGCCAGCGAA AGCCCCGAGC 240 

CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300 

TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGGAGACGC 360 

TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 420 

TGCOGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 

AGCTGTGGAC CTCAGTGGAC GTGCTGTGCG TGACGGCCAG CATCGAGACC CTGTGTGTCA 540 

TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 600 

GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660 

TGCCCATCCT CATGCACTGG TGGCGGGCGG AGAGCGACGA GGCGCGCCGC TGCTACAACG 720 

ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780 

CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 840 

AGAAGCAGGT GAAGAAGATC GACAGCTGCG AGCGCCGTTT CCTCGGCGGC CCAGCGCGGC 900 

CGCCCTCGCC CTCGCCCTCG CCCGTCCCCG CGCCCGCGCC GCCGCCOGGA CCCCCGCGCC 960 

CCGCCGCCGC CGCCGCCACG GCCCCGCTGG CCAACGGGCG TGCGGGTAAG CGGCGGCCCT 1020 

CGCGCCTCGT GGCCCTAOGC GAGCAGAAGG CGCTCAAGAC GCTGGGCATC ATCATGGGCG 1080 

TCTTCACGCT CTGCTGGCTG CCCTTCTTCC TGGCCAACGT GGTGAAGGCC TTCCACCGCG 1140 

AGCTGGTGCC CGACCGCCTC TTCGTCTTCT TCAACTGGCT GGGCTACGCC AACTCGGCCT 1200 

TCAACCCCAT CATCTACTGC CGCAGCCCCG ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 1260 

GCTGCGCGCG CAGGGCTGCC CGCCGGCGCC ACGOGACCCA CGGAGACCGG CCGCGCGCCT 1320 

CGGGCTGTCT GGCCCGGCCC GGACCCCCGC CATCGCCCGG GGCCGCCTCG GACGACGACG 1380 

ACGACGATGT CGTCGGGGCC ACGCCGCCCG CGCGCCTGCT GGAGCCCTGG GCCGGCTGCA 1440 

ACGGCGGGGC GGCGGCGGAC AGCGACTCGA GCCTGGACGA GCCGTGCCGC CCCGGCTTCG 1500 

CCTCGGAATC CAAGGTGTAG GGCCCGGCGC GGGGCGCGGA CTCCGGGCAC GGCTTCCCAG 1560 

GGGAACGAGG AGATCTGTGT TTACTTAAGA CCGATAGCAG GTGAACTCGA AGCCCACAAT 1620 

CCTCGTCTGA ATCATCCGAG GCAAAGAGAA AAGCCACGGA CCGTTGCACA AAAAGGAAAG 1680 
TTTGGGAAGG GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTG 

Seq ID NO: B82 Protein sequence 
Protein Accession #: NP_000675.1 

1 11 21 31 41 SI 

I 1 I I I I 

MGAGVLVLGA SEPGNLSSAA PLPDGAATAA RLLVPASPPA SLLPPASESP EPLSQQWTAG 60 
MGLLMALIVL LIVAGNVLVI VAIAKTPRLQ TLTNLFIMSL ASADLVMGLL WPFGATIW 120 
WGRWEYGSFF CELWTSVDVL CVTASIETLC VIALDRYLAI TSPFRYQSLL TRARARGLVC 180 
TVWAISALVS FLPILMHWWR ABSDBARRCY NDPKCCDFVT NRAYAIASSV VSFYVPLCIM 240 
AFVYIiRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPY PAPAPPPGPP RPAAAAATAP 300 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPP FLANWKAFH RELVPDRLFV 360 
FFNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 420 
PPSPGAASDD DDDDWGATP PARLLEPWAG CNGGAAADSD 6SLDEPCRPG FASESKV 

Seq ID NO: B83 DNA sequence 

Nucleic Acid Accession tf: NM_000729.2 

Coding sequence: 2.. 421 

1 11 21 31 41 51 

| | j I I I 

GGCTCAGCTG CCGGGCTGCT CCGGTTGGAA ACGCCAAGCC AGCTGCCGTC CTAATCCAAA 60 
AGCCATGAAC AGCGGCGTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG CTGGCGCCCT 120 
GACGCAGCCG GTGCCTCCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG CAGAGGAGGC 180 
GCCCCGTAGG CAGCTGAGGG TATCGCAGAG AACGGATGGC GAGTCCCGAG CGCACCTGGG 240 
CGCCCTGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGCT CCTTCTGGAC GAATGTCCAT 300 
CGTTAAGAAC CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG 360 
CTGGATGGAT TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT AGAGGACCCA 420 
GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 480 
TCACACTCAT AACTCATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 540 
TCAATGTGAA AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CACCAGAAGT 600 
TGTGCAAACT GAAGACAAAA C l tJ lTi ' T C V r CATCTGTGAC TCCTGTTCTG AAAATGTTGT 660 
TATGCTATTA AAGTGATTTC ATTCTGCC 

Seq ID NO: B84 Protein sequence 
Protein Accession S: NP_000720.l 

1 11 21 31 41 51 

I I I I I I 

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGIiQRAEEAP RRQLRVSQRT DGESRAHLGA 60 
LLARYIQQAR KAPSGRMSIV KNLQNLDPSH RISDRDYMGW MDFGRRSABB YEYPS 



828 



WO 03/042661 



PCT/US02/36810 



TABLE 74 Gene Sequences Up- or Down -Regulated in Cancer 



Angiogenesis 

Al DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 
Coding sequence: 



endothelial differentiation, sphingolipid G-protein-coupled receptor, l 

Hs. 154210 

M31210 

M31210 

251-1396 



TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60 

AACGCAACTT CGCCCTGCTT GAGCGAGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120 

AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180 

CTCGCCTCGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300 

TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360 

CAGCGCGGAC AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420 

CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480 

CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540 

CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660 

CATOGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GGAGCAATAA 720 

CTTCCGCCTC TTCCTGCTAA TCAOCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780 

GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840 

CTACCACAAG CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 

CGTCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC GCCTGACGTT 960 

CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAT GTGGCGCTGC TCAAGACCGT 1020 

AATTATCGTC CTGAGCGTCT TCATCGCCTG CTGGGCACCG CTCTTCATCC TGCTCCTGCT 1080 

GGATGTGGGC TGCAAGGTGA AGACCTGTGA CATCCTCTTC AGAGCGGAGT ACTTCCTGGT 1140 

GTTAGCTGTG CTCAACTCCG GCACCAACCC CATCATTTAC ACTCTGACCA ACAAGGAGAT 1200 

GCGTCGGGCC TTCATCCGGA TCATGTCCTG CTGCAAGTGC CCGAGCGGAG ACTCTGCTGG 1260 

CAAATTCAAG CGACCCATCA TCGCCGGCAT GGAATTCAGC CGCAGCAAAT CGGACAATTC 1320 

CTCCCACCCC CAGAAAGACG AAGGGGACAA CCCAGAGACC ATTATGTCTT CTGGAAACGT 1380 

CAACTCTTCT TCCTAGAACT GGAAGCTGTC CACCCACCGG AAGCGCTCTT TACTTGGTCG 1440 

CTGGCCACCC CAGTGTTTGG AAAAAAATCT CTGGGCTTCG ACTGCTGCCA GGGAGGAGCT IS 00 

GCTGCAAGCC AGAGGGAGGA AGGGGGAGAA TACGAACAGC CTGGTGGTGT CGGGTGTTGG IS 60 

TGGGTAGAGT TAGTTCCTGT GAACAATGCA CTGGGAAGGG TGGAGATCAG GTCCCGGCCT 1620 

GGAATATATA TTCTACCCCC CTGGAGCTTT GATTTTGCAC TGAGCCAAAG GTCTAGCATT 1680 

GTCAAGCTCC TAAAGGGTTC ATTTGGCCCC TCCTCAAAGA CTAATGTCCC CATGTGAAAG 1740 

CGTCTCTTTG TCTGGAGCTT TGAGGAGATG TTTTCCTTCA CTTTAGTTTC AAACCCAAGT 1800 

GAGTGTGTGC ACTTCTGCTT CTTTAGGGAT GCCCTGTACA TCCCACACCC CACCCTCCCT 1860 

TCCCTTCATA CCCCTCCTCA ACGTTCTTTT ACTTTATACT TTAACTACCT GAGAGTTATC 1920 

AGAGCTGGGG TTGTGGAATG ATCGATCATC TATAGCAAAT AGGCTATGTT GAGTACGTAG 1980 

GCTGTGGGAA GATGAAGATG GTTTGGAGGT GTAAAACAAT GTCCTTCGCT GAGGCCAAAG 2040 

TTTCCATGTA AGCGGGATCC GTTTTTTGGA ATTTGGTTGA AGTCACTTTG ATTTCTTTAA 2100 

AAAACATCTT TTCAATGAAA TGTGTTACCA TTTCATATCC ATTGAAGCCG AAATCTGCAT 2160 

AAGGAAGCCC ACTTTATCTA AATGATATTA GCCAGGATCC TTGGTGTCCT AGGAGAAACA 2220 

GACAAGCAAA ACAAAGTGAA AACCGAATGG ATTAACTTTT GCAAACCAAG GGAGATTTCT 2280 

TAGCAAATGA GTCTAACAAA TATGACATCC GTCTTTCCCA CTTTTGTTGA TGTTTATTTC 2340 

AGAATCTTGT GTGATTCATT TCAAGCAACA ACATGTTGTA TTTTGTTGTG TTAAAAG TAC 2400 

TTTTCTTGAT TTTTGAATGT ATTTGTTTCA GGAAGAAGTC ATTTTATGGA TTTTTCTAAC 2460 

CCGTGTTAAC TTTTCTAGAA TCCACCCTCT TGTGCCCTTA AGCATTACTT TAACTGGTAG 2520 

GGAACGCCAG AACTTTTAAG TCCAGCTATT CATTAGATAG T AATTGA AGA TATGTATAAA 2580 

TATTACAAAG AATAAAAATA TATTACTGTC TCTTTAGTAT GGTTTTCAGT GCAATTAAAC 2640 

CGAGAGATGT ClTUTrriT T TAAAAAGAAT AGTATTTAAT AGGTTTCTGA CTTTTGTGGA 2700 
TCATTTTGCA CATAGCTTTA TCAACTTTTA AACATTAATA AACTGATTTT TTTAAAG 



A2 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



endothelial differentiation, sphingolipid G-protein-coupled receptor, 

Hs. 154210 

M31210 

AAA52336 

none found 

50-71, 92-110, 122-140, 160-177, 202-222, 251-269, 283-301 
plasma membrane 



1 11 21 31 41 51 

| 1 1 I I I 

MGPTSVPLVK AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VPILICCFII 
LENIPVLLTI WKTKKFHRPM YYFIGNLALS DLLAGVAYTA NLLLSGATTY KLTPAQWFLR 
EGSMPVALSA SVFSLLAXAI ERYITMLKMK LHNGSNNPRL FLLISACWVI SLILGGLPIM 
GWNCISALSS CSTVLPLYHK HYILPCTTVP TLLLLSIVII* YCRIYSLVRT RSRRLTFRKN 
ISKASRSSEN VALLKTVIIV LSVFIACWAP LFILLLLDVG CKVKTCDILF RAEYFLVLAV 
LNSGTNPIIY TLTNKEMRRA PIRIMSCCKC PSGDSAGKPK RPIIAGMEFS RSKSDNSSHP 
QKDEGDNPET IMSSGNVNSS S 



A3 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 



G protein-coupled receptor 51 
Hs. 198612 
AA4S2928 
NM 004624.1 



829 



60 
120 
180 
240 
300 
360 



WO 03/042661 



PCTYUS02/36810 



Coding sequence: 



1-2826 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

5 | | | I I I 

ATGGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC OGCCGCCGCC ACCGCCGCCC 
GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTOCTGC CTCTGGCGCC CGGGGCCTGG 
GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 
CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 
10 GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 
CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 
ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC OGTCACATCC 
ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 
CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 
15 GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 
CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 
GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 
AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATOCTTGGCC AGTTTGACCA GAATATGGCA 
GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 
20 ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 
CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 
CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 
GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCAGGGGTA CGCCTACGAT 
GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 
25 CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 
AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 
GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 
GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 
TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 
30 CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 
TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 
AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 
GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 
ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 
35 ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 
GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 
CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATGGTC 
TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 
40 GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAAOGTGGGG 
ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 
TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 
GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 
CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 
45 AACCAAGCGA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 
AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 
GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 
GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 
AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 
50 GATATAAACT CTCCAGAACA CATCCAGCGT OGGCTGTCCC TCCAGCTCCC CATCCTCCAC 
CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 
CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 
CTQTAA 

55 A4 Protein sequence: 

Gene name: 

unigene number: 

Probeset Accession ft: 

Protein Accession 
60 signal sequence: 

Pfam domains: 

Transmembrane domains: 

Cellular Localization: 



65 



70 



75 



80 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



G protein- coupled receptor 51 
Hs. 198 612 
AA452928 
NP 005449.1 
1-42 

7tm_3 [481-754] , ANF_receptor [130-204] 

22-44. 477-499, 517-539, 552-574, 59S-617, 653-675, 692-713, 
plasma membrane 



722-744 



1 
I 

MASPRRSGQP 
LMPLTKEVAK 
IKYGPNHLMV 
AVNPAILKLL 
KKLKGNDVRI 
RCLRKNLLAA 
GIWVIAKTLQ 
ERMGTIKFTQ 
LYSILSALTI 
GSFVSEKTFE 
VGGMLLIDLC 
YAYKGLLMLF 
PCIVALVIIF 
NQASTSRLEG 
GNFTBSTDGG 
KAYLPSIGGV 



11 
I 

GRPPPPPPPP 
GSIGRGVLPA 
FGGVCPSVTS 
KHYQWKRVGT 
ILGQFDQNMA 
MEGYIGVDFE 
RAMBTLHASS 
FQDSREVKVG 
LGMIMASAFL 
TLCTVRTWIL 
ILICWQAVDP 
GCFLAWETRN 
CSTITLCLVP 



KAILKNHLDQ 
DASCVSPCVS 



21 
I 

ARLT..T.T.T.T.LP 
VELAIEQIRN 
IIABSLQGWN 
LTQDVQRPSB 
AKVFCCAYEE 
PLSSKQIKTI 
RHQRIQDFNY 
EYNAVADTLE 
PPNIKNRNQK 
TVGYTTAFGA 
LRRTVEKYSM 
VSIPALNDSK 
VPKLITLRTN 
KITSLDKDLB 
NPQLQWNTTE 
PTASPRHRHV 



31 
I 

LLLPLAPGAW 
BSLLRPYFLD 
LVQLSPAATT 
VRNDLTGVLY 
NMYGSKYQWI 
SGKTPQQYER 
TDHTLGRIIL 
IIKDTIRFQG 
liXKMSSPYMH 
MFAKTWRVHA 
BPDPAGRDIS 
Y1GMSVYNVG 
PDAATQNRRF 
EVTKQLQDTP 
PSRTCKDPIB 
PPSFRVMVSG 



41 . 
I 

GWARGAPRPP 
LRLYDTECDN 
PVLADKKKYP 
GBDIEISDTE 
IPGWYEPSHW 
EYNNKRSGVG 
NAMNETNFFG 
SEPPKDKTII 
NLIILGGMLS 
IFKNVKMKKK 
IRPLLEHCEN 
IMCIIGAAVS 
QFTQNQKKED 
EKTTYIKQNH 
DINSPEHIQR 
L 



51 
I 

PSSPPLSIMG- 
AKGLKAFYDA 
YFFRTVPSDN 
SFSNDPCTSV 
EQVHTEANSS 
PSKFHGYAYD 
VTGQWFRNG 
LEQLRKISLP 
YASIPLFGLD 
IIKDQKLLVI 
THMTIWLGIV 
FLTRDQPNVQ 
SKTSTSVTSV 
YQELMDILNL 
RLSLQLPILH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



830 



WO 03/042661 



PCT/US02/36810 



AS DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 



ESTs 

Hs. 293616 
AW043782 
none found 



Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



irv 1 11 21 ■ 31 41 51 

10 I I 1 I I I 

AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60 

CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180 

GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 240 

lj GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360 

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480 

0 . AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

ZU TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 

Zj TGCGTGGTAT GACCTTCCTC CACCGCCCXA CTCTTCTGAC ACX3GAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GOCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080 

_ _ AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

5 J CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGOCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

4 (J CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

45 ctccaaagtt cccttaacac ttgcaaagtc CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

5U TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 

55 TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820 

AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880 

rA TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

OU AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

- - TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

OJ GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA OGCCAACCAG 3540 

70 AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC riX^ TrTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

75 AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



A6 Protein sequence: 
Gene name: 



ESTs 



831 



WO 03/042661 



PCT/US02/36810 



Unigene number: 
Probeset Accession B : 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
LDLa domains: 
Cellular Localization: 



1 11 21 

t I 1 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN 
EKECPKAKSK CGPTFPPCAS GIHCIIGRFH 
KNGLCIDKSP ICDGQNNCQD NSDEESCESS 
VIFVLWALL ALVLHHQRKR NNLMTLPVHR 
YVASQAEQNA SEVGSPPSYS EALLDQRPAW 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT 



Hs. 293616 
AW0437B2 
none found 
1-17 
169-191 

28-66, 70-108, 112-149 
plasma membrane 



31 41 51 

I I I 

IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60 

CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 

QEPGSGQVFV TSENQLVYYP S1TYAIIGSS 180 

LQHPVLLSRL WLDHPHHCN VTYNVNNGIQ 240 

YD1»PPPPYSS DTESLNQADL PPYRSRSGSA 300 
AEPRDSEPSQ GTEEV 



A7 DNA SEQUENCE 
Gene name: 

Unigene number: 
Probeset Accession tf: 
Nucleic Acid Accession #: 
Coding sequence: 



Integrin, alpha 5 (fibronectin receptor, alpha polypeptide) 

Hs. 149609 
X06256 

NM 002205 o , . 

1-3150 (underlined sequences correspond to start and stop cooonsj 



1 11 21 31 41 SI 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 
CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 
GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 
CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 
CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 
TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 
AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 
GATAACTTCA CCCGAATTCT GGAGTATGCA COCTGCCGCT CAGATTTCAG CTGGGCAGCA 
GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 
TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 
ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 
CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 
TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 
GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 
CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 
GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 
GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCOG GCATAGAGCC CACGCCCACC 
CTTACCCTCA CTGGCCATGA TGAGTTTGGC OGATTTGGCA GCTC CTTGA C CCCCCTGGGG 
GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 
CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 
CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 
CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 
GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 
ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 
GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 
GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 
CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 
CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 
CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 
CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 
ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 
GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCOC TGAACCTCAC TTTCCATGCC 
CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 
GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 
TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 
GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 
ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 
TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 
GAGGCAGTGC TATTCOCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 
GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA AOCAAGGCCC CAGCTCCATT 
AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 
GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 
GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 
TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGA GTGTT T CAGGCTGCGC 
TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 
TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 
TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 
CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 
ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 
TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 



60 

120 

180 

240 

300 

360 

420 

460 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



832 



WO 03/042661 



PCT/US02/36810 



A8 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession B: 
Signal sequence: 
Transmembrane domains: 



Integrin, alpha 5 (fibronectin receptor, alpha polypeptide) 

Hs. 149609 

X06256 

NP 002196 

1-42 

998-1020 



Integrin alpha domains: 56-115, 268-318. 322-384, 388-444, 452-503, 1022-1036 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I II 

MGSRTPESPL HAVQLRWGPR RRPPLLPUiL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GPSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SS1LACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNPTRILEYA PCRSDFSWAA GQGYCQGGFS AB F T K T GR W LGGPGSYPWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE PSGDDTEDPV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RPGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAWYRGR PXVSASASI*T IFPAMFNPEB RSCSLEGNPV ACINLSPCLN ASGKHVADSI 540 

GPTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNPSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 760 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHRVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQRBH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 



A9 DNA SEQUENCE 

Gene name: Integrin, beta 1 {fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12) 

Unigene number t Hs. 2 87797 

Probeset Accession #: X07979 
Nucleic Acid Accession #: NM_002211.1 

Coding sequence: 1-2397 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I I I I I 

ATGA ATTTAC AACCAATTTT CTGGATTGGA CTGATCAGTT CAGTTTGCTG TGTGTTTGCT 60 

CAAACAGATG AAAATAGATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 120 

GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 180 

TCTGCACGAT GTGATGATTT AGAAGCCTTA AAAAAGAAGG GTTGCCCTCC AGATGACATA 240 

GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA CCGTAGCAAA 300 

GGAACAGCAG AGAAGCTCAA GCCAGAGGAT ATTACTCAGA TCCAACCACA GCAGTTGGTT 360 

TTGCGATTAA GATCAGGGGA GCCACAGACA TTTACATTAA AATTCAAGAG AGCTGAAGAC 420 

TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 480 

AATGTAAAAA GTCTTGGAAC AGATCTGATG AATGAAATGA GGAGGATTAC TTCGGACTTC 540 

AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600 

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGCGC 720 

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC 840 

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900 

TGTCACCTGG AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCTGCAAATT CTAGCAATGT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140 

GAAGTCATTT TGGAAAAOGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200 

TGCAAGAACG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGCGAAGGC ATCCCTGAAA GTCCCAAGTG TCATGAAGGA 1440 

AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGGATG CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT 1680 

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TGTGTGTGAG 1740 

TGCAACCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800 

GCCAGCAACG GACAGATCTG CAATGGCOGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 1860 

ACAGATCCGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAQAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAG AAAGTCGGGA CAAATTACCC 2040 

CAGCC GGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100 

TTCTATTTTA CGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160 

CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220 



833 



WO 03/042661 



PCT/US02/36810 



GTTCTTATTG GCCTTGCATT ACTGCTQATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 
AGGGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC GGGTGAAAAT 2340 
CCTATTTATA AGAGTGCCGT AACAACTGTQ GTCAATCCGA AGTATGAGGG AAAATGA 



A10 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
INB domain: 
PSI domain: 

Cellular Localization: 



Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12) 

HS. 287797 

X07979 

NP 002202.1 

1-21 

732-754 

34-464 

26-76 

plasma membrane 



1 11 21 31 41 51 

I I I i 1 1 

MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGBCIQ AGPNCGWCTN STFLQEGMPT 60 

SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTABKLKPED ITQIQPQQLV 120 

LRLRSGEPQT FTLKPKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NEMRRITSDF 180 

RIGFGSFVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR 240 

ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GPHPAGDGKL GGIVLPNDGQ 300 

CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTBE FQPVYKELKM LIPKSAVGTL 360 

SANSSNVIQL IIDAYNSLSS EV1LENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420 

GDEVQFEISI TSNKCPKKDS DSFKIRPLGP TEEVEVILQY ICECECQSEG IPESPKCHEG 480 

NGTFECGACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 540 

RDNTNEIYSG KFCECDNFNC DRSNGLIOGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE 600 

ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660 

CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCW PYFTYSVNGN NBVMVHWEN 720 

PECPTGPDII PIVAGWAGI VLIGIALLLI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780 
PIYKSAVTTV VNPKYEGK 

All DNA SEQUENCE 

Gene name: ~~ selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs.89546 

Probeset Accession 8: M24736 

Nucleic Acid Accession #i NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 

GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 

CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 

TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 

TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 

GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGEAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 

CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGASTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 

GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A12 Protein sequence: 

Gene name: Selectin E 

Unigene number: Hs.89546 

Probeset Accession th M24736 

Protein Accession ft: NP_000441 

Signal sequence: 1-22 

Transmembrane domains: 555-573 

C-lectin domain: 23-139 



(endothelial adhesion molecule 1) 



834 



WO 03/042661 



PCT/US02/36810 



Cellular Localization: plasma membrane 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 



1 
i 

MIASQFLSAL 
SILSYSPSYY 
DVGMWNDERC 
TALES PEHGS 
ECDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCEAVRC 
WTEEVPSCQV 
SGLLPTCEAP 
GSYQKPSYIL 



11 
I 

TLVLL1KESG 
WIGIRKVNNV 
SKKKLALCYT 
LVCSHPLGNF 
GFVECFQNPG 



SNPERGYMNC 
DAVHQPPKGL 
VKCSSLAVPG 
TESNIPLVAG 



21 
I 

AWSYNTSTEA 
WVWVGTQKPL 
AACTNTSCSG 
SYNSSCSISC 
SFPWNTTCTF 
PAGEFTFRSS 
LPSASGSFRY 
VRCAHSPIGE 
KINMSCSGEP 
LSAAGLSLLT 



31 
I 

MTYDEASAYC 
TEEAKKWAPG 
HGECVETINN 
DROYLPSSME 
DCEEGFELMG 
CNFTCEEGFM 



FTYKSSCAFS 
VFGTVCKFAC 
LAPFLLWLRK 



41 
I 

QQRYTHLVAI 
EPNNRQKDED 
YTCKCDPGFS 
TMQCMSSGEW 
AQSLQCTSSG 
LQGPAQVECT 
GFVLKGSKRL 
CEEGFELYGS 
PEGWTLNGSA 
CLRKAKKPVP 



SI 
I 

QNKEEIEYLN 
CVEIYIKREK 
GLKCEQIVNC 
SAPIPACNW 
NWDNEKPTCK 
TQGQWTQQIP 
QCGPTGEWDN 
TQLECTSQGQ 
ARTCGATGHW 
ASSCQSLESD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



A13 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



G protein- coupled receptor 39 

Hs. 85339 

AA349893 

NM_001S08 

1-1362 (underlined sequences correspond to start and stop codons) 



1 11 

I I 
ATGGCTTCAC CCAGCCTCCC 
CCOGAGTTTG AGGTGGCCAC 
TTCGTGATGG GCCTTCTGGG 
AAAGGATACT TGCAGAAGGA 
TTGGTGTTCC TCATCGGCAT 
ACGTCCAGCT ACACCCTGTC 
GCTACGCTGC TGCACGTGCT 
TTCAGGTACA AGGCTGTGTC 
GTCACCTCCG CCCTGGTGGC 
GTGAACGTGC CCAGCCACCG 
CAGCCCGAGA CCTCCAATAT 
CAGTCCAGCA TCTTCGGCGC 
ATGTGCTGGA ACATGATGCA 
ACGCGGCCTC CGCAGCTGAG 
ACCATCATCT TCCTGAGGCT 
ATTCGGAGGA TCATGGCTGC 
GCGTACATGA TCCTCCTCCC 
CCGCTCCTGT ACACGGTGTC 
TGCCGCCTGT CGCTGCAGCA 
ACCACCGACA GCGCCCGCTT 
TCTGCAAGGA GAACTGAGAA 
TCTAAGTCCC AGTCATTGAG 
AATTCTGCTG CAGAGAATGG 



21 
i 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGCC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACACTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCQCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 
I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CACATGGTGA 
TTCTACAQCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 
ACATTGGCCG 
AAGCACGACT 
AOGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGCC 
CCGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAGTTT 



41 

I 

TCATTGATCA 
TTCTGGTGTA 
TCACCCAGGT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



51 
t 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



A14 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Pfam domains: 
Transmembrane domains: 
Cellular Localization: 



G protein-coupled receptor 39 

Hs. 85339 

AA349893 

NMJ>01508, NPJJ01409 
none found 

7tm_l [72-172, 224-344] 

32-54, 68-90, 111-133, 151-173, 

plasma membrane 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



221-243, 280-301, 320-342 



1 11 21 

65 MASPSLPGSD CSQIIDHSHV PEFEVATWIK 
KGYLQKEVTD HMVSLACSDI LVPLIGMPME 
ATLLHVLTLS FERYIAICKP FRYKAVSGPC 
VNVPSHRGLT CNRSSTRHRE QPETSKMSIC 
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE 

70 IRRIMAAAKP KHDWTRSYFR AYMILLPFSE 
CRLSLQHANH EKRLRVHAHS TTDSARPVQR 
SKSQSLSLES LEPNSGAKPA KSAAENGFQE 



31 


41 


SI 
1 




1 

ITL1LVYLII 


1 

FVKGLLGNSA 


TIKVTQVLQK 


60 


FYSIIWNPLT 


TSSYTLSCKL 


HTFLFEACSY 


120 


QVKLLIGFVW 


VTSALVALPL 


LFAKGTEYPL 


180 


TNLSSRWTVF 


QSSIFGAFW 


YLWLLSVAF 


240 


SEESRTARRQ 


TIIFLRLTW 


TLAVCWMPNQ 


300 


TFFYLSSVIN 


PLLYTVSSQQ 


PRRVFVQVLC 


360 


PLLFASRRQS 


SARRTEKIFL 


STPQSEAEPQ 


420 


HEV 









75 AI Prostate 

A15 PNA sequence 
Gene name: 
Unigene number: 
80 Probeset Accession ftt 

Nucleic Acid Accession 8: 
Coding sequence: 



CEGP1 
Hs. 222399 
AA256485 
AJ400877 

81-3080 (underlined sequences correspond to start and stop codonsj 



11 



21 



31 



41 



51 

835 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG C08CCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGGGGGTOG CGGGCCGCAA CCGTCCCGGG GOGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGOGGGG GCCGTCCCGC 180 

CGGGTOGGGG CCGTGCCGOG GG6CG6CAGX3 AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGQAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATOGT TGCACTTOTT 420 

TTGATGGCTT CATGTTGGCT CATGAOGGTC ATAATTGTCT TGATGTQGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT S40 

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACOGC TOGGAAGAGG 600 

GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTTGACCTG TAACCATGGG AAOGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAAOGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 

CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT OGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACOG 2S80 

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 

CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 

TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 

AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 

GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 

CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 

CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 

TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 

TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 

CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 

ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 



A16 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
PFAM i 



Cellular Localization: 



CEGP1 

Hs. 222399 

AA25648S 

CAB92285 

1-31 

none 

EGF-like_domains [49-84, 132-167,177-213,286-321, 407-442] 
CUB_domain [809-918) 
may be secreted 



1 11 21 31 41 51 

I I I I 1 I 

KGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 

LCQNTPTSYK CSCKPGYQGE GRQCBDLDEC GNELKGGCVH DCLNIPGNYR CTCFDGFMLA 120 

HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGPFL SDNQHTCIHR SEEGLSCMNK 180 

DHGCSHICKE APRGSVACBC RPGFBLAKNQ RDCILTQJHG NGGCQHSCDD TADGPECSCH 240 

PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

836 



WO 03/042661 



PCT/US02/36810 



STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVQSPDOGCK KGFKLLTDEK 360 

SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGPTH CGDTNECSIN NGGCQQVCVN 420 

TVGSYECQCH PGYKLHWNKK DCVEVKQLLP TSVSPRVSLH OGKSGQGDGC FLRCHSGIHL 480 

SSDVTTIRTS VTFKLNEGXC SLKNAELPPE GLRPALPBKH SSVKESPRYV NLTCSSGKQV 540 

PGAPGRPSTP KEMPITVEPE ZiBTNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 

TPQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGPAP CQLCALGTFQ 720 

PEAGRTSCFP CGGG LATKHQ GATSFQDCET RVQCSPGHFY KTTTHRCIRC PVGTYQPEFG 780 

KNNCVSCPGN TTTDFDGSTN ITQCKNRROG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 

PPPKRRILIV VPEIFLPIED DOGDYLVMRK TSSSNSVTTY ETCQTYBRPI APTSRSKKLW 900 

IQPKSNEGNS ARGPQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALPDV 960 
IAHPQNYPKY TAQESREMFP RSFIRLLRSK VSRPLRPYK 



A17 DNA sequence 

Gene name: ESTs 
Unigene number: Hs. 2 93 102 

Probeset Accession #: AI951118, AW170035, AL044891, AI908272 

Nucleic Acid Accession ft: none found 

Coding sequence: 1-489 (entire sequence is an open-reading- frame) 

1 11 21 31 41 si 

I 1 I I I I 

CAAAAAGAAA TAGATAAAAT AAATGGAAAA TTAGAAGAGT CTCCTGATAA TGATGGTTTT 60 

CTGAAGGCTC CCTGCAGAAT GAAAGTTTCT ATTCCAAGTA AAGCCTTAGA ATTGATGGAC 120 

ATGCAAACTT TCAAAGCAGA GCCTCCCGAG AAGCCATCTG GCTTCGAGCC TGCCATTGAA 180 

ATGCAAAAGT CTGTTCCAAA TAAAGCCTTG GAATTGAAGA ATGAACAAAC ATTGAGAGCA 240 

GATCAGATGT TCCCTTCAGA ATCAAAACAA AAGAASGTTG AAGAAAATTC TTGGGATTCT 300 

GAGAGTCTCC GTGAGACTGT TTCACAGAAG GATGTGTGTG TACCCAAGGC TACACATCAA 360 

AAAGAAATGG ATAAAATAAG TGGAAAATTA GAAGATTCAA CTAGCCTATC AAAAATCTTG 420 ' 

GATACAGTTC ATTCTTGTGA AAGAACAAGG GAACTTCAAA AAGACCCCTG TGACCCACGT 480 
TCAGGAAAA 



A16 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession #i 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs. 2931 02 

AI951118. AW170035, AL044891. 

none found 

none 



AI908272 



nuclear 



1 11 21 31 41 51 

I I I I I I 

QKEIDKINGK LEESPDNDGF LKAPCRMRVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE 60 

MQKSVPNKAL ELXNEQTLRA DQMPPSESKQ KXVEENSWDS ESLRETVSQK DVCVPKATHQ 120 

KEMDKISGKL EDSTSLSKIL DTVHSCERTR ELQKDPCDPR SGK 



Breast • 

A19 DNA SEQUENCE 

Gene name: TMPRSS3a 

Unigene number: Hs. 2 9 82 41 

Probeset Accession tf: AIS38613 

Nucleic Acid Accession #: AB038157 

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I | | 

ACCGGGCACC GG ACGG CTOG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT OGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTOCTG AA ATAGTC AC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

QATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGQAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 



837 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 

50 
55 
60 
65 
70 
75 
80 



GTGGACAGCT 
AAGTTAGTGG 
TACACCCGTG 
ACCTGAAGAG 
TCCTCCCCTG 
TTCCGGCACC 
CAAGCTGCTT 
AGTGCAGTGG 
TTGCCTCAGC 
TTGTATTTTT 
CCTCAAATGA 
ACGCCTAGCC 
AGGGCGGCCT 
ACGAGATAAG 
GCACCAGCCC 
CCAAACCCAC 
ATGACTCGTT 
TTGTCCTTTG 
AAAAAAAA 



GCCAGGGGGA 
GAGCGACCAG 
TCACCTCCTT 
GAAGGGGACA 
GACTCCCGTG 
AGTAGCAGGC 
TTTGTTTTTT 
CGAAATCCCT 
TTCCCCAGTA 
AGTAGAGACA 
TGTGCCTGCT 
TCAOGCTCCT 
TTCCCACTGG 
CAGTTATGTG 
AGAAGTGCAG 
CCTTTCTACT 
TAAGGCCTAT 
ATTCCAAATA 



CAGCGGGGGG 
CTTTGGCATC 
CCTGGACTGG 
AGTAGCCACC 
TAGGAACCTG 
COGAAAGAGG 
GTTTTTTTGA 
GCTCACTGCA 
GCTGGGACCA 
GGGTTTCACC 
TCAGCCTCCC 
TTCTGATCTT 
TCCATCTGGT 
ACCTCAOGTG 
AACTGCAGTC 
TCCAAGACTT 
TTTCATGATT 
ATATGTTTCC 



CCCCTGGTGT 
GGCTGCGCAG 
ATCCACGAGC 
TGAGTTCCTG 
CACAOGAGCA 
CACCCTTCCA 
GGTGGAGTCT 
GCCTCCGCTT 
CAGGTGCCCG 
ATGTTGGCCA 
ACAGTGCTGG 
CACTAAGAAC 
TTTCTCTCCA 
CAAAGCCACC 
ACTGCACGTT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTCAAA 



GTCAAGAGAG 
AGGTGAACAA 
AGATGGAGAG 
AGGTGATGAA 
GACACCCTTG 
TCTGATTCCA 
CGCTCTGTTG 
CCCTGGTTCA 
CCACCACACC 
GGCTGCTCTC 
GATTACAGGC 
AAAAGAAGCA 
GGGGTCTTGC 
AACAGCCACT 
TTCATCTCTA 
GTGGGGAGGT 
ATTTGGTGCT 
AAAAAAAAAA 



GAGGCTGTGG 
GCCTGGGGTG 
AGACCTAAAA 
GACAGCCOGA 
GAGCTCTGAG 
GCACAACCTT 
CCCAGGCTGG 
AGCGATTCTC 
CAACTAATTT 
AAACCCCTGA 
ATGGGCCACC 
GCAACTTGCA 
AAAATTCCTO 
CAGAAAAGAC 
GGGACCAGAA 
TAATCTAGGA 
TGACGTATTA 
AAAAAAAAAA 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



A20 Protein sequence: 
Gene name: 
Unigene numbers 
Probeset Accession §: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Tryp_SPc domain: 
Cellular Localization: 



TMPRSS3a 

Hs. 298241 

AIS38613 

BAB20077 

none found 

43-65, 239-261 

216-444 

not determined 



1 
I 

MGENDPPAVE 
LALAIGLG1H 
TAASWKTMCS 
VTALHHSVYV 
LCGGSVITPL 
LGNDIALMKL 
AVPLISNKIC 
FGIGCAEVNK 



11 
I 

APFSFRSLFG 
FDCSGKYRCR 
DDWKGHYANV 
REGCASGHW 
WIITAAHCVY 
AGPLTFNBMI 
NHRDVYGGII 
PGVYTRVTSF 



21 
I 

LDDLKISPVA 
SSFKCIELIA 
ACAQLGFPSY 
TLQCTAGGHR 
DLYLPKSWTI 
QPVCLPNSEE 
SPSMLCAGYL 
LDWIHEQMER 



31 
I 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVSS 
RGYSSRIVGG 
QVGLVSLLDN 
NFPDGKVCWT 
TGGVDSCQGD 
DLKT 



41 
I 

LSLLPLKFFP 
GEDBYRCVRV 
LBGQFREEFV 
NHSLLSQWPW 
PAPSHLVEK2 
SGWGATEDGA 
SGGPLVCQER 



51 
I 

IIVIGIIALI 
GGQNAVLQVF 
SIDHLLPDDK 
QASLQFQGYH 
VYHSKYKPKR 
GDASPVLNHA 
RLWKLVGATS 



60 
120 
180 
240 
300 
360 
420 



A21 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession Jh 



ESTs; opposite strand to TRPS1 
none 

AA428090 
AA42B090 



Coding sequence: 1-558 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

111)11 

ATGAAGCCCA GTTTGAAACA CTGGGCTGAT ATAAAAATGT TTTCTGAAAT AGACCAAAGG 60 

CAAGTTGTTG GAGAAGAAAT TCATCTACAA GTAGTATCTG TATCTTATCT TGTAGAAAAC 120 

TTTTCAGACA CAGATGATCT TATGTCCATA GTGGAGGAAT CAGACAGCTG CTACAACCGT 180 

GATTCTGTTG GGCTTCCTGG TAGAGCTGGA GACAGACTGC AACCCAAGAC AAAACCCAGA 240 
GGAAAAACAA CTAACCTCAG CCATTTGCCC AATTTCCACC TTGCTAGTTA TCAGAGTTCA 300 
CTATTGTTTA AGCTCCAGGG GTCATACTAT GGCAATCTAC TGGTGGAATT TATTCTGAGC 360 
AAGTGTTTTG TTCAGCTTGC AGTATTAAAA CAAAAAAAAC ATTGCCTCCA GCTGCAAAGC 420 
AAGGGCATTG CCATTATGAA AGCCCCTCAA AGACTCTCTG CTATTTTCAA AACATGGAAA 480 
GAAAAAGGGA AAAAAGAAAA AAAAATAATA ATTAGAAGGA TTTGTTCCTT AATTTGGGCT 540 
CCCAAAAATG AGAAATGAAG ATTGTATAAT GAGGGGAGAT ACTAATTATT TTAAACTCTC 600 
CAAGCAAATC TTCTGAAGCA ATCAATTATT TATATACTTT ATGTTCTGTC TTTTTGTATT 660 
TTTCCTTCTG GTTAAAAACA TGCAGGTGAG TCTTGCCAAC GTCCTTTCCT ATCTGGATCT 720 
GTTCTGCCTC ATTTCTCTTT CAAAGTCATC TTTCAGGGAA CTTGOCCTGA TTAATTTGAT 780 
TTTAACCAAA CAAATAAGAT ATTTGATATA TTAATTTAAA CTTTTTGAGA TGATTGATTA 840 
GGAATTGCAT CATGTTCACA TGAGTATACC GAATTCAAAG TTAAACTTCA TAAGCAGGAG 900 
TTTTTACACA TCGTAACATA ATCATTACCC AATACTCGAC ACTCAATATT TGATACTCAA 960 

CTGAATGTTT TTGAAATAAA CACATTTTTA TGTTATCTCT CTGGAGAAAG TAGTATATAT 1020 

CTTTTTACAC AAAATATATC AGTGAGAGAG TGTTTGTTTA AGAAAAAAAA TCAAAGCACA 1080 

ACAAGTTGAG AGAGTCCAGG CTTTATCAAT ATAAGTAATA ATTTTTTAGA ATGGTGATTT 1140 

GATTTCACCA TTTCAATTCA GCAGAGCCTG TATATATATA TATATATATA TATATATATA 1200 

TATATATATA TATATATATA TATATTACAA TGATCTGTAT TTCCTATTGC TAGAAGGATG 1260 

AAAGTGAATC CATATAAACC ATACCAACGC CGTTATGTGT AACTGGTGGT AAAACTTTAT 1320 

TATTCAAGTT TAGATGTAAC AGACATCTTT GCTGCCTGAA GATTGTTTGC ATAAGAAATA 1380 

CACCAAGAAC ATGTTTGTGA GTAGAAATGA ACATGCACTA TGAAAACAAA ATAAAATAAA 1440 

ACGAAAAAAT TTCATGTGTT GTAAGAACAG AACTATTATA GCCAACATTC TAGTATTCAA 1500 

ATCAGGACTA CAAATTGAAT lirmTiCTT AGCAACATGA AATCATTCCA TATGAAAGAC 1S60 

ATTTTCTGCT GGTGAATATT GCTGTAAGTT AAATTTTACA TTGGCATTTT GAGATGTTCC 1620 
CCCCTCATGC CTCCCCCAAA GTTTTCCATG TGGTTGTCAA ATAGTCCGC 



A22 Protein sequence: 



838 



WO 03/042661 



PCT/US02/36810 



Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: . 
Transmembrane domains: 
Cellular Localization: 



ESTs; opposite strand to TRPS1 



AA42S090 
none found 
none found 
113-129 

not determined 



10 



15 



1 11 21 31 41 SI 

I I I I I I 

MKPSLKHWAD IKMFSEIDQR QWGEEIHLQ WSVSYLVEN FSDTDDLMSI VEESDSCYNR 60 
DSVGLPGRAG DRLQPKTKPR GKTTNLSHLP NFHLASYQSS LLFKLQGSYY GNLLVEFILS 120 
KCFVQLAVLK QKKHCLQLQS KGIAIMKAPQ RLSAIFKTWK EKGKKEKKI I IRRICSLIWA 180 
PKNEK 



20 

25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A23 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession ft: 



ESTs 
HS. 29383 
AW2 07206 
AL133619 



Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 



ATGAGCGGTG 
CGGCGCCGGC 
CTCAGGCAGA 
CAGCAGCAGC 
GAAAACAAGG 
ACACTGCCGC 
GGGGGAACAC 
CCTGTATGCC 
AGCCGTGGCT 
GGGCCTGAGG 
CCAAGTAGAG 
CCTCAGATTG 
ATGCTGGGGG 
GCAGCAACCA 
GCACTTCCCC 
GCTCACTTCC 
TGGAGCCAGC 
GACATGGAGA 
CTGTTCTGGG 
GACAGGACAC 
CCCTCCTGCT 
GGCGCTCGCT 
AGGCTGAAGG 
GGCGGTAGCG 
TCTGTCAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAG 
AGGCAGATGG 
ACCACACTTA 
ACCCAAGAGC 
CCGGAGGAAG 
AAGAGCCTCT 
CTGAAGCAGA 
AAACGGCGCC 



11 
I 

CGGGGGTGGC 
GCCAGCGCCC 
GCGACCCGCA 
ACTCGGAGAT 
GTGAGCCGGC 
TCCCGCAGCA 
AGGACGGGGA 
AACCCAGTGG 
GGACGATGTT 
TCATTGCAGG 
CTGAAATGGG 
CTGCTGTGGC 
CCCAGGGGAT 
TGGGGACAAA 
ATCCTGACAG 
CATTATCTTT 
CTGGGAACAT 
AGGGGGTTGA 
CAAAGTGTGG 
GGGAAGAGGC 
TTCCAGATGG 
GGGTCTGCAT 
AGGGCTCCTC 
CCGACACTGT 
CCATCTCTAA 
AGCAAGATTC 
ACAACAGCAA 
AGGCCTCTAA 
GGGCGGGGGC 
GGCAGTGCGA 
TGCGGCACCT 
CTAGCTTTCC 
CCAAGAAATG 
CCCCGAAGAA 
TGCATCGCTC 




CAGAAACACA 
GCCCCTCCAG 
GTACAGGTTC 
ATGCAGCCAA 
GCGGCAGGTG 
AAGGAACCCC 
CAGGCCCAGG 
ATGGACACAC 
GGGAGGAAGC 
CGGCCCCCAC 
GGGGCTGGGG 
CGCAGCTGGG 
GGGAGGGCCC 
CCCAAGTCGG 
CATGCTTTCC 
CCCCTCAGGA 
CAACGGAGTG 
ACGGACACAC 
GCGCTCTCCT 
TTCAGCCAAC 
AAAAGCTGAC 
GCTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AGTGCTCATC 
CAAGTCCCTC 
CAGGGACCAA 
CCTGAGCCCA 
CAACTTTGCC 
AGTGCTTTGA 



31 
I 

CGGCCCCCCA 
GTCCAGTCCT 
CTGGACCTGG 
CTCCATGAGG 
AGGCCGGCCC 
GCCATCAACT 
ACTGTCCTTG 
TGGGGGACCT 
GCACAGCACG 
GCCACAGGGT 
TGGGACAGCC 
ATTTCCAGCC 
TCCATCCAGG 
AGAGTCCTGT 
CCAGCCCAGG 
CTGACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAGCCCCAGC 
CTCGGGACCT 
AACCACCTTT 
TGGGTAGAGC 
AGGCCAGGAG 
GCAGACAGCC 
TCrCAAGGCA 
GTCTCCCAGA 
GTTCCTGGGG 
GCCTGTATGG 
ATGATCCTGC 
CGCGAGCTGT 
CTGGAAGGGA 
GAAGCCACGC 
CCTGTGGCGG 
GAGAGGCAGA 



CTATGGCTCT 
GATCCCTTCC 
TTCCTTGCCA 
ATCCTGGGCT 
GAGGACATCT 
GGGCTCTCCC 
GCTGTGGCAA 
CCTGCAGTGC 
GCTGTTCCAT 
CCAGGGCCTC 
CGGGAGGACC 
GCAAGCGTGG 
TCTCCATGTC 
AGGCCAGGCC 
AGGCGGACCT 
TACAAGGGCA 
GGAACAGCCA 
CCCTTCCCCT 
GGAATACCAA 
GCCAGAGGCC 
ATTTCCCCAA 
AGCGTGCCAT 
AGAGGCTGCA 



51 
I 

CCCGGGCTCT 
GAGCCCGCAG 
GCAGTTCCTG 
TCTGAAGCGG 
GGCACACTCA 
CCTGGGCTCA 
TGCACTGGCC 
CGCTACCTCT 
GGGAAGCCCA 
CCTCCCTCCT 
TAGATCTTTG 
GAGTCCTCAC 
TGCCATCTGG 
CTTGTCCAAG 
GTGGTCTCAA 
GACTGGTGGA 
TTCCCAGGGA 
CTCCAGTGAG 
TGGGGACGCT 
GTGTCCCAAG 
TGCTCCCTTG 
CAGCCCTGCC 
GCGTCTTGCG 
AAGCTTCCAG 
CCAGCCCGGC 
GGAAGAGGAG 
GGCCAGAAAG 
GCACCAGGGC 
GCGAAAGCCC 
CCTCCTGCAG 
CCAGGCAGCC 
GGTCTCCACC 
CCTGCCCGCA 
GGCAATGCAG 



A24 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs.29383 

AW207206 

T434S7 

hone found 

303-322 

not determined 



1 
I 

MSGAGVAAGT 
QQQHSEMLAK 
GGTQDGEPLQ 
GPEVIAGRQV 
MLGAQGIWTH 
AUFPLSLGLG 
LFWAKGGPSR 
GARWVCINGV 
SVKSISNSAN 



11 
I 

RPPSSPTPGS 
LHEEIEHLKR 
TVLAHUVALA 
ATGCSPDLPP 
SIQGSLPAIW 
LTSGGHLTGG 
QPQPCSAGDA 
WVEPGGPSPA 
SQGKARPQPG 



21 
I 

RRRRQRPSVG 
ENKGEPARGP 
PVCQPSGYRF 



31 
I 

VQSLRPQSPQ 
RPALPPQAHS 
WGTWTDAATS 
WDSPCPARSL 
AATMGTKGGS RVLFPCHLSK 
WSQPGNIAAG AVFRALPSQG 
DRTRBBAMLS LGTOCSMCPK 
RLREGSSRTB RPGGKRGRLA 
SFNKQDSKAD VSQKADLEEE 



41 

! 

LRQSDPQKRN 
TLPLPQKRNT 
SRGWTMLCSQ 
PQIAAVARPR 
ALPHPDSGPH 
DMEKGVBGGP 
PSCFPDGPSG 
GGSADTVRSP 
PLLHNSKLDK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



SI 
I 

LDLBKSLQFL 
AXNSSTRLGS 
AQHVLLSGSP 
XSSPMALSPH 
PAQDPGLWSQ 
FPSRCGNSSE 
NHLSRASAPL 



VPGVOGQARK 



60 
120 
180 
240 
300 
360 
420 
480 
540 



839 



WO 03/042661 



PCT/US02/36810 



EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELKNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAIIiPA 660 
UCQTPKNNPA ERQKRLQAMQ KRRLHRSVL 



A25 DNA SEQUENCE 
Gene name: 
Uhigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



ESTs; prolactin receptor 

Ks. 25252 

AA057193 

NM_000949 

285-2153 {underlined sequences correspond to start and stop codons) 



1 

I 

GGAGGCTGAA 
CTCCCTCTTT 
ATGTCCTCGT 
TAAAGAACTC 
GTGAACTTCT 
CATCTGCAAC 
TACCTCCTGG 
GCTGGTGGAG 
GGGAAGGAGA 
GCCACTTTGG 
CTAACCAGAT 
AGCCAGACCC 
ACCTGTGGAT 
TCCTGTATGA 
GGCAGCAAAC 
TTCGCTGCAA 
TACCTAGTGA 
CTGTCATCTG 
GCATCTTTCC 
AGGGCAAGTC 
ACTATGAGGA 
TGTCAGTCCA 
ACACTGACTC 
AACCCCAGGC 
CTGAAACAAC 
TTCATGCTGG 
CCAGATCCTC 
CACCGGCCAC 
AGTCTAGAGA 
CTGACCAGGA 
CCTTGGATTA 
AACAGAGAGA 
ATGCCAAGGT 
CTAAAAACGT 
ATCAAGCTGA 
TGGGTGGTTT 
ACTAATGGAA 
AATGCTCAAG 
TTTTTAACCA 
AACTGTGATT 
AAAAGCACAC 
GCTTTCATGA 
AAGATGACAA 
AACTGCATAA 
AAAGAAAGAA 
TTGCTGATAT 



11 
I 

ATCCCCAGAC 
CTGGATTTTA 
GCAGGAAGTA 
TCCTATTCAT 
GATACATTTC 
CGTTTTCACT 
AAAACCTGAG 
GCCTGGGACA 
GACACTCATG 
CAAGCAGTAC 
GGGAAGCAGT 
TCCTTTGGAG 
TAAATGGTCT 
AATTCGATTA 
AGAGTTTAAG 
ACCAGACCAT 
CTTCACCATG 
TTTGATTATT 
GCCAGTTCCT 
TGAAGAACTA 
CTTGCTGGTG 
TTCAAAAGAA 
AGGCCGGGGG 
CAATCCCTCC 
CCACACCTGG 
TGGATCCAAA 
TTACCACAAT 
TCTGTTGAAT 
AGAGGGAAAG 
TACGCCCTGG 
TGTGGAGATT 
GAACAGCGGC 
GTCCGGGGTC 
GGCTTGCTTT 
GAAAGCCCTG 
GGATTACCTG 
TGATTGGTTA 
AATGTAGTCA 
CTTGCCTCTT 
TGTAGATTTA 
TGCTTAGTAT 
TTTGGCATGG 
AAGAAAATTT 
CCTTTACACT 
TGCATTCCAG 
GCAAGTAAGA 



21 
I 

GCCGGTTTTC 
CCGACCGTTC 
CTCATCGACT 
GGAGGCGAAC 
CTGCAGCAAG 
CTGCTACTTT 
ATCTTTAAAT 
GATGGAGGAC 
CATGAATGTC 
ACCTCCATGT 
TTCTOGGATG 
CTGGCTGTGG 
CCACCTACCC 
AAACCCGAGA 
ATTCTCAGCC 
GGATACTGGA 
AATGATACAA 
GTCTGGGCAG 
GGGCCAAAAA 
CTGAGTGCCT 
GAGTATTTAG 
CACCCAAGTC 
AGCTGTGACA 
ACATTCTATG 
GACCCCCAGT 
TGTTCAACAT 
ATTACTGATG 
GAAGCAGGTA 
GCAACCCAGC 
CTGCTGCCCC 
CACAAGGTCA 
AAGCCCAAGA 
ATGGATAACA 
GAAGAATCAG 
GCCAACTTCA 
GATCCCGCAT 
AAATGTGATT 
GACTGACACT 
TCTCCAGCAG 
CTTTTTGCTG 
TCTTGAGGGA 
GACAGACGGA 
TCCACATAGG 
CCTCGTCCAT 
AACAAATTGT 
AAT 



31 
1 

TGGGCTGGGC 
GCGAAACAGC 
GATGTGGCAG 
ACTGAGGATG 
AGAAGGCAGC 
TTCTCAACAC 
GTCGTTCTCC 
TTCCTACCAA 
CAGACTACAT 
GGAGGACATA 
AACTTTATGT 
AAGTAAAACA 
TGATTGACTT 
AAGCAGCTGA 
TACATCCAGG 
GTGCATGGAG 
CCGTGTGGAT 
TGGCTTTGAA 
TAAAAGGATT 
TGGGATGCCA 
AAGTAGATGA 
AAGGTATGAA 
GCCCTTCCCT 
ATCCTGAGGT 
GCATAAGCAT 
GGCCCTTACC 
TGTGTGAGCT 
AAGATGCTTT 
AGAGGGAGGT 
AGGAGAAAAC 
ACAAAGATGG 
AGCCCGGGAC 
ACATCCTGGT 
CCAAAGAGGC 
CTGCAACATC 
GTTTTACACA 
TTTCTTCAGG 
ACTAAAGCTC 
CTGATTCCAG 
TTAGTTATAA 
CAATGCCAAT 
AATGAAATTG 
AAAATGCCAT 
TTTATTAGGA 
TTACATAAGT 



41 

I 

TTTCTGCTTA 
TTTCCACACA 
ACTTTGCTCC 
CTTTCCACAT 
CAACATGAAG 
CTGCCTTCTG 
CAATAAGGAA 
TTATTCACTG 
AACCGGTGGC 
CATCATGATG 
GGACGTGACT 
GCCAGAAGAC 
AAAAACTGGT 
GTGGGAGATC 
ACAGAAATAC 
TCCAGCGACC 
CTCTGTGGCT 
GGGCTATAGC 
TGATGCTCAT 
AGACTTTCCT 
TAGTGAGGAC 
ACCCACATAC 
TTTGTCTGAA 
CATTGAGAAG 
GGAAGGCAAA 
ACAGCCCAGC 
GGCTGTGGGC 
AAAATCCTCT 
AGAAAGCTTC 
CCCCTTTGGC 
TGCATTATCA 
TCCTGAGAAC 
GTTGGTGCCA 
CCCACCATCA 
AAGCAAGTGC 
CTCCTTTCAC 
TAACACTACA 
CCAGCTCCTT 
AACAAATCAT 
AACTATGTGT 
AGGTATATCC 
TCAAAATTGT 
GAAAATTGCT 
TTACCCAAAT 
TCCTATACCT 



51 
I 

CTCACTCCTT 
ATGGAGCTTC 
CTGACAAAAC 
GAACCCTGAA 
GAAAATGTGG 
AATGGACAGT 
ACATTCACCT 
ACTTACCACA 
CCCAACTCCT 
GTCAATGCCA 
TACATAGTTC 
AGAAAACCCT 
TGGTTCACGC 
CATTTTGCTG 
CTTGTCCAGG 
TTCATTCAGA 
GTCCTTTCTG 
ATGGTGACCT 
CTGTTGGAGA 
CCCACTTCTG 
CAGCATCTAA 
CTGGATCCTG 
AAGTGTGAGG 
CCAGAGAATC 
ATCCCCTATT 
CAGCACAACC 
CCTGCAGGTG 
CAAACCATTA 
CATTCTGAGA 
TCCGCTAAAC 
TTGCTACCAA 
AATAAGGAGT 
GATCCACATG 
CTTGAACAGA 
AGGCTCCAGC 
TGATAGCTTG 
GAGTACGTGA 
TCATGCTCCA 
TATGTTTCCT 
TCAATGAAAT 
TCTGGAAAAG 
TTACCATAGA 
TTTGAAAAAC 
ATAACCATTT 
TACTGACACA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



A26 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
FK3 domains: 
Cellular Localization: 



ESTs; prolactin receptor 

H9.2S252 

AA057193 

NP 000940 

1-23 

237-253 

28-112, 127-215 
plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MKENVASATV PTLLLPLNTC LLNGQLPPGK PBIPKCRSPN KETPTCWWRP GTDGGLPTNY 60 

SLTYHREGET LKHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATOQMG SSFSDELYVD 120 

VTYIVQPDPP LELAVBVRQP EDRKPYLWIK WSPPTLIDLK TGWPTLLYEI RLKPEKAAEW 180 

EXHFAGQQTE FRZLSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDP TKNDTTVWIS 240 

VAVLSAVICL IIVWAVALKG YSKVTCIPPP VPGPKIKGFD AHLLBKGK8E ELLSALGCQD 300 

PPPTSDYEDL LVEYLEVDDS BDQHLKSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 

SEKCEEPQAN PSTPYDPEVI EKPENPBTTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 



840 



WO 03/042661 



PCT/US02/36810 



PSQHNPRSSY HNITDVCKLA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQRBVE 480 

SFHSETDQDT PWLLPQEKTP FQSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 

ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAKK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 



A27 DMA SEQUENCE 

Gene name: Human neuropeptide Y receptor Yl (NPYY1) 



Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession #: 
Coding sequence: 



Hs. 169266 
L07615 
NM_00 0909.1 

20 9-1363 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I I 1 I I 

CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGCGAAAAA GACAAATTCC AAAGAGGATT 60 

GTTCAGTTCA AGGGAATGAA GAATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 120 

AATAAGAATA AGCTGAACAG TTGACCTGCT TTGAAGAAAC ATACTGTCCA TTTGTCTAAA 180 

ATAATCTATA ACAACCAAAC CAATCAA AAT GA ATTCAACA TTATTTTCCC AGGTTGAAAA 240 

TCATTCAGTC CACTCTAATT TCTCAGAGAA GAATGCCCAG CTTCTGGCTT TTGAAAATGA 300 

TGATTGTCAT CTGCCCTTGG CCATGATATT TACCTTAGCT CTTGCTTATG GAGCTGTGAT 360 

CATTCTTGGT GTCTCTGGAA ACCTGGCCTT GATCATAATC ATCTTGAAAC AAAAGGAGAT 420 

GAGAAATGTT ACCAACATCC TGATTGTGAA CCTTTCCTTC TCAGACTTGC TTGTTGCCAT 480 

CATGTGTCTC CCCTTTACAT TTGTCTACAC ATTAATGGAC CACTGGGTCT TTGGTGAGGC 540 

GATGTGTAAG TTGAATCCTT TTGTGCAATG TGTTTCAATC ACTGTGTCCA TTTTCTCTCT 600 

GGTTCTCATT GCTGTGGAAC GACATCAGCT GATAATCAAC CCTOGAGGGT GGAGACCAAA 660 

TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC CTTGCTGTGG C TT CTTCTT T 720 

GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 780 

GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCATCG GACTCTCATA GGTTGTCTTA 840 

TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TGTTTTATAT TTATTTGCTA 900 

CTTCAAGATA TATATACGCC TAAAAAGGAG AAACAACATG ATGGACAAGA TGAGAGACAA 960 

TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 1020 

ATTTGCAGTC TGCTGGCTCC CTCTTACCAT CTTTAACACT GTGTTTGATT GGAATCATCA 1080 

GATCATTGCT ACCTGCAACC ACAATCTGTT ATTCCTGCTC TGCCACCTCA CAGCAATGAT 1140 

ATCCACTTGT GTCAACCCCA TATTTTATGG GTTCCTGAAC AAAAACTTCC AGAGAGACTT 1200 

GCAGTTCTTC TTCAACTTTT GTGATTTCCG GTCTCGGGAT GATGATTATG AAACAATAGC 1260 

CATGTCCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA GCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TGAAAAAATC TGAAACIACT TATAGCCTAT 1380 

GGTCCCGGAT GACATCTGTT TAAAAACAAG CACAACCTGC AACATACTTT GATTACCTGT 1440 

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AG A TTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTGTAGTT GTCATAATTA CATTTGGAAC AAAAGGTGTG GGCTTTGGGG 1560 

TCTTCTGGAA ATAGTTTTGA CCAGACATCT TTGAAGTGCT TTTTGTGAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAGT ATTACGATGC 1680 

GCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG ATTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA CTTTATGATA GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCOGAA AGATAGTCTT 1860 

GAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTTGGT GGTTTTTGTT TGTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCCGACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTGCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATOG TTTTCCATAC AGCAGTGCCT ATATAGTOAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTACCTAGC 2400 

AGGGAA AAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTGTGACTTG TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAATA GTTGTGTCAT 2520 

GTTAATGTGC CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCATTTGGAG 2S80 

AAACTATATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640 

TGTTTGATTT TAAAAGGGCG GACATTTTAT TAAAATCAAT ATTGTTTTTG CTTTTTCTGA 2700 
GGAGTCTCTT TCAGTTTCAT TTTTTCTCAT CCCATGACTT CCCTOCGATG GT 



A28 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Pfam domains: 
Transmembrane domains: 

Cellular Localization: 



Human neuropeptide Y receptor Yl (NPYY1) mSNA, exon 2-3 and complete cds 

Hs. 169266 

L07615 

NPJ)00900.1 

none found 

7tm 1 [57-91] 

39-61, 77-99, 118-139, 157-179, 212-234, 264-286, 
300-322 

plasma membrane 



1 11 21 31 41 51 

I I I I I I 

KNSTLFSQVE NHSVHSNPSE KNAQLLAFEN DDCHLPLAMI FTLALAYGAV IILGVSGHLA 60 

LIIIILKQKE MRNVTNILIV NLSFSDLLVA IMCLPFTFVY TLMDEWVFGE AMCKLNPFVQ 120 

CVSITVSIFS LVLIAVERHQ LIINPRGWRP NNRHAYVGIA VIWVLAVASS LPFLIYQVMT 180 

DEPFQNVTLD AYXDKYVCFD QFPSDSHRLS YTTLLLVLQY FGPLCFIFIC YFKIYIRLKR 240 

RNNMMDKMRD NKYRSSETKR INIMLLSIW AFAVCWLPLT IFNTVFDWNH QIIATCHHNL 300 

LFLLCHLTAM ISTCVMPIFY GFLNKNFQRD LQPFFNFCDF RSRDDDYETI AMSTKHTDVS 360 

841 



WO 03/042661 



PCT/US02/36810 



KTSLKQASPV AFKKINNNDD NEKI 



A29 DNA SEQUENCE 

Gene name: Homo sapiens G protein- coupled receptor (HOT7T17S) 
Unigene number: Hs. 2 082 2 9 

Probeset Accession ft: AI819198 
Nucleic Acid Accession ft: NM_032551.1 

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I ! I 

ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 60 

GGCTGCCCGG GCTGTGGCGC CAACGCCTCG GACGGCCCAG TCCCTTCGCC GCGGGCCGTG 120 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGTGGGGAAC 180 

TCGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCTAC 240 

ATCGCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT GCTGCGTCCC CTTCACGGCC 300 

CTGCTGTACC CGCTGCCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATCCAGCAGG TCTCGGTGCA GGCCACGTGT GCCACTCTGA CCGCCATGAG TGTGGACCGC 420 

TGGTACGTGA CGGTGTTCCC GTTGCGCGCC CTGCACCGCC GCACGCCCCG CCTGGCGCTG 480 

GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGCCGGT GCTCGCCCTG 540 

CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTCCCCAG CCGCGCCCTG 600 

GAGCGCGCCT TCGCACTGTA CAACCTGCTG GCGCTGTACC TGCTGCCGCT GCTCGCCACC 660 

TGCGCCTGCT ATGCGGCCAT GCTGCGCCAC CTGGGCCGGG TCGCCGTGCG CCCCGCGCCC 720 

GCCGATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCG CAGGCGCCGT GCGGGCCAAG 780 

GTCTCGCGGC TGGTGGCGGC CGTGGTCCTG CTCTTCGCCG CCTGCTGGGG CCCCATCCAG 840 

CTGTTCCTGG TGCTGCAGGC GCTGGGCCCC GCGGGCTCCT GGCACCCACG CAGCTACGCC 900 

GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

CTGCTCTACG CCTTCCTGGG CTCGCACTTC CGACAGGCCT TCCGCCGCGT CTGCCCCTGC 1020 

GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080 

GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GCOCCCGCCA GGGCGCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG CCGCGCGCGG GCTGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA , 

A30 Protein sequence 

Gene name: Homo sapiens G protein- coupled receptor (HOT7T17S) , mRNA 

Unigene number: Hs. 20822 9 

Protein Accession #: AI819198 

Signal sequence: none found 

Pfam domains: 7tm_l [59-323] 

Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MHTVATSGPN ASWGAPANAS GCPGCGANAS DGFVPSPRAV DAWLVPLFFA ALMLLGLVGN 60 

SLVIYVICRH KPMRTVTNFY lANLAATDVT FLLCCVPFTA LLYPLPGWVI* GDFMCKFVNY 120 

IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AVSLSIWVGS AAVSAPVLAL 180 

HRLSPGPRAY CSEAPPSRAL ERAFALYNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240 

ADSALQGQVL AERAGAVRAK VSRLVAAWL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 300 

AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR PGPSDPAAPH 360 
AELHRLGSHP APARAQKPGS SGLAARGLCV LGEDNAPL 

A31 DNA SEQUENCE 

Gene name: CH22_FGENES.706; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog 
(CELSR1) 

Unigene number: Hs. 2 52 3 87 

Probeset Accession #: NM 014246 

Nucleic Acid Accession ft: NMJU4246 

Coding sequence: 1-9045 (underlined sequences correspond to start and stop codons) 



1 11 21 

I 1 i 

ATGGCGCCGC CGCCGCCGCC OGTGCTGCCC 
CTGCCGGCGA TGGGGCTGCG AGCGGCCGCC 
GCCTTCGCCC TCCGGCCCGG CTGTACCTAC 
CCGCGGGAGC TGCTGGAOGT GGGCCGCGAT 
GGCGCGGGGC GCCCGCTGCC GCTGCAAGTC 
CTGAGCCGCC GCCTGCGGGC GCGCACGCAC 
TGCGGAACCG GTGCCOGGCT CTGCGGGGCG 
GCCGCGCAGC ATTCGGCGCT CGCAGCTCCG 
CGCCCCAGGC CCCGCTGTCC CGGCCGTCCC 
CTGCGTCTGC TGTGCGOCCT GCGGCGCGCG 
GAGGCCGCCA CCGCGGGGAC GCCCTCCGCG 
AACTTGCCCG AAGCCCGGGC GGGGCCGGCG 
GGGAGCCTGA AGTTTCCGAT GCCCAACTAC 
GGCACCCTCA TCCTCCAGCT GCACGCGCAC 
AGCTATTACA TGGAGGGGCT GTTCGACGAG 
GCCACGGGCG CCGTGAGCAC GGACAGCGTA 
CTCAGGGTGA AAGCCGTGGA CTACAGTACG 
GTCTTGGTCA AAGACACCAA OGACCACAGC 
CGCGTGCGGG AGAACCTGGA GGTGGGCTAC 



31 41 51 

I I i 

GTGCTGCTGC TCCTGGCCGC CGCCGCCGCC 60 

TGGGAGCCGC GCGTACCCGG CGGGACCCGC 120 

GCGGTGGGCG CCGCTTGCAC GCCCCGGGCG 180 

GGGCGGCTGG CAGGACGTCG GCGCGTCTCG 240 

CGCTTGGTGG CCCGCAGTGC CCCGACGGCG 300 

CTTCCCGGCT GCGGAGCCCG TGCCCGGCTC 360 

CTCTGCTTCC CCGTCCCCGG CGGCTGCGCG 420 

ACCACCTTAC CCGCCTGCCG CTGCCCGCCG 480 

ATCTGCCTGC CGCCGGGCGG CTCGGTCCGC 540 

GCTGGCGCCG TCCGGGTGGG ACTGGCGCTG 600 

TCGCCATCCC CATCGCCGCC CCTGCCGCCG 660 

CGACGGGCCC GGCGGGGCAC GAGCGGCAGA 720 

CAGGTGGCGT TGTTTGAGAA CGAACCGGCG 780 

TACACCATCG AGGGCGAGGA GGAGCGCGTG 840 

CGCTCCCGGG GCTACTTCCG AATCGACTCT 900 

CTGGACCGCG AGACCAAGGA GACGCACGTC 960 

CCGCCGCGCT CGGCCACCAC CTACATCACT 1020 

CCGGTCTTCG AGCAGTCGGA GTACCGCGAG 1080 

GAGGTGCTGA CCATCCGCGC CAGCGACCGC 1140 
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GACTCGCCCA TCAACOCCAA CTTGCGTTAC OGOGTGT TG C GGGGCGCGTG GGACGTCTTC 1200 

CAGCTCAACG AGAGCTCTGG CGTGGTGAGC ACACGGGCGG TGCTGGACCG GQAGGAGGCG 1260 

GCCGAGTACC AGCTCCTGGT GGAGGCCAAC GACCAGGGGC GCAATCCGGG CCCGCTCAGT 1320 

GCCACGGCCA CCGTGTACAT CGAGGTGGAG QAOGAGAACG ACAACTACCC CCAGTTCAGC 1380 

GAGCAGAACT ACGTGGTCCA GGTGCCCGAG GACGTGGGGC TCAACACGGC TGTGCTGCGA 1440 

GTGCAGGCCA CGGACCGGGA CCAGGGCCAG AACGCGGCCA TTCACTACAG CATCCTCAGC 1500. 

GGGAACGTGG CCGGCCAGTT CTACCTGCAC TCGCTGAGCG GGATCCTGGA TGTGATCAAC 1560 

CCCTTGGATT TCGAGGATGT CCAGAAATAC TCGCTGAGCA TTAAGGCCCA GGATGGGGGC 1620 

OGGCCCCCGC TCATCAATTC TTCAGGGGTG GTGTCTGTGC AGGTGCTGGA TGTCAACGAC 1680 

AACGAGCCTA TCTTTGTGAG CAGCCCCTTC CAGGCCACGG TGCTGGAGAA TGTGCCCCTG 1740 

GGCTACCCCG TGGTGCACAT TCAGGCGGTG GACGCGGACT CTGGAGAGAA CGCCCGGCTG 1800 

CACTATCGCC TGGTGGACAC GGCCTCCACC TTTCTGGGGG GCGGCAGCGC TGGGCCTAAG 1860 

AATCCTGCCC CCACCCCTGA CTTCCCCTTC CAGATCCACA ACAGCTCCGG TTGGATCACA 1920 

1 c GTGTGTGCCG AGCTGGACCG CGAGGAGGTG GAGCACTACA GCTTCGGGGT GGAGGCGGTG 1980 

10 GACCACGGCT CGCCCCCCAT GAGCTCCTCC ACCAGCGTGT CCATCACGGT GCTGGACGTG 2040 

AATGACAACQ ACCCGGTGTT CACGCAGCCC ACCTACGAGC TTCGTCTGAA TGAGGATGCG 2100 

GCCGTGGGGA GCAGCGTGCT GACCCTGCAG GCCCGCGACC GTGACGCCAA CAGTGTGATT 2160 

ACCTACCAGC TCACAGGCGG CAACACCCGG AACCGCTTTG CACTCAGCAG CCAGAGAGGG 2220 

OH GGCGGCCTCA TCACCCTGGC GCTACCTCTG GACTACAAGC AGGAGCAGCA GTACGTGCTG 2280 

ZU GCGGTGACAG CATCCGACGG CACACGGTCG CACACTGCGC ATGTCCTAAT CAACGTCACT 2340 

GATGCCAACA CCCACAGGCC TGTCTTTCAG AGCTCCCATT ACACAGTGAG TGTCAGTGAG 2400 

GACAGGCCTG TGGGCACCTC CATTGCTACC CTCAGTGCCA ACGATGAGGA CACAGGAGAG 2460 

AATGCCCGCA TCACCTACGT GATTCAGGAC CCCGTGCCGC AGTTCCGCAT TGACCCGGAC 2520 

AGTGGCACCA TGTACACCAT GATGGAGCTG GACTATGAGA ACCAGGTCGC CTACACGCTG 2580 

ZD ACCATCATGG CCCAGGACAA CGGCATCCCG CAGAAATCAG ACACCACCAC CCTAGAGATC 2640 

CTCATCCTCG ATGCCAATGA CAATGCACCC CAGTTCCTGT GGGATTTCTA CCAGGGTTCC 2700 

ATCTTTGAGG ATGCTCCACC CTCGACCAGC ATCCTCCAGG TCTCTGCCAC GGACCGGGAC 2760 

TCAGGTCCCA ATGGGCGTCT GCTGTACACC TTCCAGGGTG GGGACGACGG CGATGGGGAC 2820 

TTCTACATCG AGCCCACGTC CGGTGTGATT CGCACCCAGC GCCGGCTGGA CCGGGAGAAT 2880 

JU GTGGCCGTGT ACAACCTTTG GGCTCTGGCT GTGGATCGGG GCAGTCCCAC TCCCCTTAGC 2940 

GCCTCGGTAG AAATCCAGGT GACCATCTTG GACATTAATG ACAATGCCCC CATGTTTGAG 3000 

AAOGACGAAC TGGAGCTGTT TGTTGAGGAG AACAACCCAG TGGGGTCGGT GGTGGCAAAG 3060 

ATTCGTGCTA ACGACCCTGA TGAAGGCCCT AATGCCCAGA TCATGTATCA GATTGTGGAA 3120 

or GGGGACATGC GGCATTTCTT CCAGCTGGAC CTGCTCAACG GGGACCTGCG TGCCATGGTG 3180 

OD GAGCTGGACT TTGAGGTCCG GCGGGAGTAT GTGCTGGTGG TGCAGGCCAC GTCGGCTCCG 3240 

CTGGTGAGCC GAGCCACGGT GCACATCCTT CTCGTGGACC AGAATGACAA CCCGCCTGTG 3300 

CTGCCCGACT TCCAGATCCT CTTCAACAAC TATGTCACCA ACAAGTCCAA CAGTTTCCCC 3360 

ACCGGCGTGA TCGGCTGCAT CCCGGCCCAT GACCCCGACG TGTCAGACAG CCTCAACTAC 3420 

ACi ACCTTCGTGC AGGGCAACGA GCTGCGCCTG TTGCTGCTGG ACCCCGCCAC GGGCGAACTG 3480 

W CAGCTCAGCC GCGACCTGGA CAACAACCGG CCGCTGGAGG CGCTCATGGA GGTGTCTGTG 3S40 

TCTGATGGCA TCCACAGCGT CACGGCCTTC TGCACCCTGC GTGTCACCAT CATCACGGAC 3600 

GACATGCTGA CCAACAGCAT CACTGTCCGC CTGGAGAACA TGTCCCAGGA GAAGTTCCTG 3660 

TCCCCGCTGC TGGCCCTCTT CGTGGAGGGG GTGGCCGCCG TGCTGTCCAC CACCAAGGAC 3720 

. GACGTCTTCG TCTTCAACGT CCAGAACGAC ACCGACGTCA GCTCCAACAT CCTGAACGTG 3780 

4D ACCTTCTCGG CGCTGCTGCC TGGCGGOGTC CGOGGCCAGT TCTTCCCGTC GGAGGACCTG 3840 

CAGGAGCAGA TCTACCTGAA TCGGACGCTG CTGACCACCA TCTCCACGCA GCGCGTGCTG 3900 

CCCTTCGACG ACAACATCTG CCTGCGCGAG CCCTGCGAGA ACTACATGAA GTGCGTGTCC 3960 

GTTCTGCGAT TCGACAGCTC CGCGCCCTTC CTCAGCTCCA CCACCGTGCT CTTCOGGCCC 4020 

C(\ ATCCACCCCA TCAACGGCCT GCGCTGCCGC TGCCCGCCCG GCTTCACCGG CGACTACTGC 4080 

3U GAGACGGAGA TOGACCTCTG CTACTCCGAC CCGTGCGGCG CCAACGGCCG CTGCCGCAGC 4140 

CGCGAGGGCG GCTACACCTG CGAGTGCTTC GAGGACTTCA CTGGAGAGCA CTGTGAGGTG 4200 

GATGCCCGCT CAGGCCGCTG TGCCAACGGG GTGTGCAAGA ACGGGGGCAC CTGCGTGAAC 4260 

CTGCTCATCG GCGGCTTCCA CTGCGTGTGT CCTCCTGGCG AGTATGAGAG GCCCTACTGT 4320 

rr GAGGTGACCA CCAGGAGCTT CCOGCCCCAG TCCTTOGTCA CCTTCCGGGG CCTGAGACAG 4380 

DO CGCTTCCACT TCACCATCTC CCTCACGTTT GCCACTCAGG AAAGGAACGG CTTGCTTCTC 4440 

TACAACGGCC GCTTCAATGA GAAGCACGAC TTCATCGCCC TGGAGATCGT GGACGAGCAG 4500 

GTGCAGCTCA CCTTCTCTGC AGGCGAGACA ACAACGACCG TGGCACCGAA GGTTCCCAGT 4560 

GGTGTGAGTG ACGGGCGGTG GCACTCTGTG CAGGTGGAGT ACTACAACAA GCCCAATATT 4620 

GGCCACCTGG GCCTGCOCCA TGGGCCGTCC GGGGAAAAGA TGGCCGTGGT GACAGTGGAT 4680 

OU GATTGTGACA CAACCATGGC TGTGCGCTTT GGAAAGGACA TCGGGAACTA CAGCTGCGCT 4740 

GCCCAGGGCA CTCAGACCGG CTCCAAGAAG TCCCTGGATC TGACCGGCCC TCTACTCCTG 4800 

GGGGGTGTCC CCAACCTGCC AGAAGACTTC CCAGTGCACA ACCGGCAGTT OGTGGGCTGC 4860 

ATGCGGAACC TGTCAGTCGA CGGCAAAAAT GTGGACATGG CCGGATTCAT CGCCAACAAT 4920 

GGCACCCGGG AAGGCTGCGC TGCTCGGAGG AACTTCTGCG ATGGGAGGCG GTGTCAGAAT 4980 

03 GGAGGCACCT GTGTCAACAG GTGGAATATG TATCTGTGTG AGTGTCCACT CCGATTOGGC 5040 

GGGAAGAACT GTGAGCAAGC CATGCCTCAC CCCCAGCTCT TCAGCGGTGA GAGCGTCGTG 5100 

TCCTGGAGTG ACCTGAACAT CATCATCTCT GTGCCCTGGT ACCTGGGGCT CATGTTCCGG 5160 

ACCCGGAAGG AGGACAGCGT TCTGATGGAG GCCACCAGTG GTGGGCCCAC CAGCTTTCGC 5220 

A CTCCAGATCC TGAACAACTA CCTOCAGTTT GAGGTGTCCC ACGGCCCCTC CGATGTGGAG 5280 

fV TCCGTGATOC TGTCCGGGTT GCGGGTGACC GACGGGGAGT GGCACCACCT GCTGATOGAG 5340 

CTGAAGAATG TTAAGGAGGA CAGTGAGATG AAGCACCTGG TCACCATGAC CTTGGACTAT 5400 

GGGATGGACC AGAACAAGGC AGATATCGGG GGCATGCTTC CCGGGCTGAC GGTAAGGAGC 5460 

GTGGTGGTCG GAGGCGCCTC TGAAGACAAG GTCTCCGTGC GCCGTGGATT CCGAGGCTGC 5520 

ATGCAGGGAG TGAGGATGGG GGGGACGCCC ACCAACGTCG CCACCCTGAA CATGAACAAC 5580 

/O GCACTCAAGG TCAGGGTGAA GGACGGCTGT GATGTGGACG ACCCCTGTAC CTCGAGCCCC 5640 

TGTCCCCCCA ATAGCCGCTG CCAOGACGCC TGGGAGGACT ACAGCTGCGT CTGTGACAAA 5700 

GGGTACCTTG GAATAAACTG TGTGGATGCC TGTCACCTGA ACCCCTGCGA GAACATGGGG 5760 

GCCTGCGTGC GCTCCCCCGG CTCCCOGCAG GGCTACGTGT GCGAGTGTGG GCCCAGTCAC 5820 

CA TACGGGCCGT ACTGTGAGAA CAAACTCGAC CTTCCGTGCC CCAGAGGCTG GTGGGGGAAC 5880 

OU CCOQTCTGTQ GACCCTGCCA CTGTGCOGTC AGCAAAGGCT TTGATCCCGA CTGTAATAAG 5940 

ACCAACGGCC AGTGCCAATG CAAGGAGAAT TACTACAAGC TCCTAGCCCA GGACACCTGT 6000 

CTGCCCTGCG ACTGCTTCCC CCATGGCTCC CACAGCCGCA CTTGCGACAT GGCCACCGGG 6060 

CAGTGTGCCT GCAAGCCCGG CGTCATCGGC CGCCAGTGCA ACCGCTGCGA CAACCCGTTT 6120 

GCCGAGGTCA CCACGCTCGG CTGTGAAGTG ATCTACAATG GCTGTCCCAA AGCATTTGAG 6180 
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GCCGGCATCT GGTGGCCACA GACCAAGTTC GGGCAGCCGG CTGCGOTGCC ATGCCCTAAG 6240 , 
GG ATCC GTTG GAAATGOGGT CCGACACTGC AGCGGGGAGA AGGGCTGGCT GCCCCCAGAG 6300 
CTCTTTAACT GTACCACCA* CTCCTTCGTG GACCTCAGGG CCATGAATGA GAAGCTGAGC 6360 
CGCAATGAGA CGCAGGTGGA CGGCGCCAGG GCCCTGCAGC TGGTGAGGGC GCTGCGCAGT 6420 
0 GCTACACAGC ACACGGGCAC GCTCTTTGGC AATGACGTGC GCACGGOCTA CCAGCTGCTG 6480 
GGCCA CGTC C TTCAGCACGA QAGCTGGCAG CAGGGCTTCG ACCTGGCAGC CACGCAGGAC 6540 
GCCGACTTTC ACGAGGACGT CATCCACTOG GGCAGCGCCC TCCTGGCCCC AGCCACCAGG 6600 
GCGGCGTGGG AGCAGATCCA GCGGAGCGAG GGCGGCACGG CACAGCTGCT CCGGCGCCTC 6660 
GAGGGCTACT TCAGCAACGT GGCACGCAAC GTGCGGGGGA CGTACCTGCG GCCCTTCGTC 6720 
ATCGTCACCG CCAACATGAT TCTTGCTGTC GACATCTTTG ACAAGTTCAA CTTTACGGGA 6780 
GCCAGGGTCC CGCGATTOGA CACCATCCAT GAAGAGTTCC CCAGGGAGCT GGAGTCCTCC 6840 
GTCTCCTTCC CAGCCGACTT CTTCAGACCA CCTGAAGAAA AAGAAGGCCC CCTGCTGAGG 6900 
CCGGCTGGCC GGAGGACCAC CCOGCAGACC ACGOGCCCGG GGCCTGGCAC CGAGAGGGAG 6960 
GCCCCGATCA GCAGGCGGAG GCGACACCCT GATGACGCTG GCCAGTTCGC CGTCGCTCTG 7020 
GTCATCATTT ACCGCACCCT GGGGCAGCTC CTGCCCGAGC GCTACGACCC CGACCGTCGC 7080 
AGCCTCCGGT TGCCTCACCG GCCCATCATT AATACCCGGA TGGTGAGCAC GCTGGTGTAC 7140 
AGCGAGGGGG CTCCGCTOCC GAGACCCCTG GAGAGGCCCG TCCTGGTGGA GTTCGCCCTG 7200 
CTGGAGGTGG AGGAGCGAAC CAAGCCTGTC TGCGTGTTCT GGAACCACTC CCTGGCCGTT 7260 
GGTGGGACGG GAGGGTGGTC TGCCOGGGGC TGCGAGCTCC TGTCCAGGAA CCGGACACAT 7320 
GTOGCCTGCC AGTGCAGCCA CACAGCCAGC TTTGCGGTGC TCATGGATAT CTCCAGGOGT 7380 
GAGAACGGGG AGGTCCTGCC TCTGAAGATT GTCACCTATG CCGCTGTGTC CTTGTCACTG 7440 
GCAGCCCTGC TGGTGGCCTT CGTCCTCCTG AGCCTGGTCC GCATGCTGCG CTCCAACCTG 75 00 
CACAGCATTC ACAAGCACCT CGCCGTGGCG CTCTTCCTCT CTCAGCTGGT GTTCGTGATT 7560 
GGGATCAACC AGACGGAAAA CCCGTTTCTG TGCACAGTGG TTGCCATCCT CCTCCACTAC 7620 
ATCTACATGA GCACCTTTGC CTGGACCCTC GTGGAGAGCC TGCATGTCTA CGGCATGCTG 7680 
ACCGAGGTGC GCAACATCGA CACGGGGCCC ATGCGGTTCT ACTACGTCGT GGGCTGGGGC 7740 
ATCCCGGCCA TTGTCACAGG ACTGGCGGTC GGCCTGGACC CCCAGGGCTA CGGGAACCCC 7800 
GACTTCTGCT GGCTGTCGCT TCAAGACACC CTGATTTGGA GCTTTGCGGG GCCCATCGGA 7860 
GCTGTTATAA TCATCAACAC AGTCACTTCT GTCCTATCTG CAAAGGTTTC CTGCCAAAGA 7920 
AAGCACCATT ATTATGGGAA AAAAGGGATC GTCTCCCTGC TGAGGACCGC ATTCCTCCTG 7980 
CT GCTQ CTCA TCAGCGCCAC CTGGCTGCTG GGGCTGCTGG CTGTGAACOG CGATGCACTG 8040 
AGCTTTCACT ACCTCTTCGC CATCTTCAGC GGCTTACAGG GCCCCTTCGT CCTCCTTTTC 8100 
CACTGCGTGC TCAACCAGGA GGTCCGGAAG CACCTGAAGG GOGTGCTCGG CGGGAGGAAG 8160 
CTGCACCTGG AGGACTCCGC CACCACCAGG GCCACCCTGC TGACGOGCTC CCTCAACTGC 8220 
AACACCACCT TCGGTGACGG GCCTGACATG CTGCGCACAG ACTTGGGCGA GTCCACCGCC 8280 
TCGCTGGACA GCATCGTCAG GGATGAAGGG ATCCAGAAGC TCGGCGTGTC CTCTGGGCTG 8340 
GTGAGGGGCA GCCACGGAGA GCCAGACGCG TCCCTCATGC CCAGGAGCTG CAAGGATCCC 8400 
CCTGGCCACG ATTCCGACTC AGATAGCGAG CTGTCCCTGG ATGAGCAGAG CAGCTCTTAC 8460 
GCCTCCTCAC ACTCGTCAGA CAGCGAGGAC GATGGGGTGG GAGCTGAGGA AAAATGGGAC 8520 
CCGGCCAGGG GCGCCGTCCA CAGCACCCCC AAAGGGGACG CTGTGGCCAA CCACGTTCCG 8580 
GCCGGCTGGC CCGACCAGAG CCTGGCTGAG AGTGACAGTG AGGACCCCAG CGGCAAGCCC 8640 
CGCCTGAAGG TGGAGACCAA GGTCAGCGTG GAGCTGCACC GCGAGGAGCA GGGCAGTCAC 8700 
CGTGGAGAGT ACCCCCCGGA CCAGGAGAGC GGGGGOGCAG CCAGGCTTGC TAGCAGCCAG 8760 
CCCCCAGAGC AGAGGAAAGG CATCTTGAAA AATAAAGTCA CCTACCOGCC GCCGCTGACG 8820 
CTGACGGAGC AGACGCTGAA GGGCOGGCTC CGGGAGAAGC TGGCCGACTG TGAGCAGAGC 8880 
CCCACATCCT CGCGCACGTC TTCCCTGGGC TCTGGCGGCC CCGACTGCGC CATCACAGTC 8940 
AAGAGCCCTG GGAGGGAGCC GGGGOGTGAC CACCTCAACG GGGTGGCCAT GAATGTGCGC 9000 
ACTGGGAGCG CCCAGGCCGA TGGCTCOGAC TCTGAGAAAC CGTGAGGCAA GCCCGTCACC 9060 
CCACACAGGC TGCGGCATCA CCCTCAGACC TTGGAGCCCA AGGGGCCACT GCCCTTGAAG 9120 
TGGAGTGGGC CCAGAGTGTG GCGGTCCCCA TGGTGGCAGC CCCCCGACTG ATCATCCAGA 9180 
CACAAAGGTC TTGGTTCTCC CAGGAGCTCA GGGCCTGTCA GACCTGGTGA CAAGTGCCAA 9240 
AGGCCACAGG CATGAGGGAG GCGTGGACCA CTGGGCCAGC ACCGCTGAGT CCTAAGACTG 9300 
CAGTCAAAGC CAGAACTGAG AGGGGACCCC AGACTGGGCC CAGAGGCTGG CCAGAGTTCA 9360 
GGAACGCCGG GCACAGACCA AAGACCGOGG TCCAGCCCCG CCCAGGCGGG CATCTCATGG 9420 
CAGTGCGGAC CCGTGGCTGG CAGCCOGGGC AGTCCTTTGC AAAGGCACCC CTTGTCTTAA 9480 
AATCACTTCG CTATGTGGGA AAGGTGGAGA TACTTTTATA TATTTGTATG GGACTCTGAG 9540 
GAGGTGCAAC CTGTATATAT ATTGCATTCG TGCTGACTTT GTTATCCOGA GAGATCCATG 9600 
CAATGATCTC TTGCTGTCTT CTCTGTCAAG ATTGCACAGT TGTACTTGAA TCTGGCATGT 9660 
GTTGACGAAA CTGGTGCCCC AGCAGATCAA AGGTGGGAAA TACGTCAGCA GTGGGGCTAA 9720 
AACCAAGCGG CTAGAAGCCC TACAGCTGCC TTCGGCCAGG AAGTGAGGAT GGTGTGGGCC 9780 
CTCCCCGCCG GCCCCCTGGG TCCCCAGTGT TCGCTGTGTG TGCGTTTGTC CTCTGCTGOC 9840 
ATCTGCCCCG GCTGTGTGAA TTCAAGACAG GGCAGTGCAG CACTAGGCAG GTGTGAGGAG 9900 
CCCTGCTGAG GTCACTGTGG GGCACGGTTG CCACACGGCT GTCATTTTTC ACCTGGTCAT 9960 
TCTGTGACCA CCACCCCCTC CCCTCACCGC CTCCCAGGTG GCCCGGGAGC TGCAGGTGGG 10020 
GATGGCTTTG TCCTTTGCTC CTGCTCCCCG TGGGACCTGG GACCTTAAAG CGTTGCAGGT 10O8O 
TCCTGATTTG GACAGAGGTG TGGGGCCTTC CAGGCCGTTA CATACCTCCT GCCAATTCTC 10140 
TAACTCTCTG AGACTGCGAG GATCTCCAGG CAGGGTTCTC CCCTCTGGAG TCTGACCAAT 10200 
TACTTCATTT TGCTTCAAAT GGCCAATTGT GCAGAGGGAC AAAGCCACAG CCACACTCTT 10260 
CAACGGTTAC CAAACTGTTT TTGGAAATTC ACACCAAGGT CGGGCCCACT GCAGGCAGCT 10320 
GGCACAGCGT GGCCCGAGGG GCTGTGGAAC GGGTCCCGGA ACTGTCAGAC ATGTTTGATT 10380 
TTAGCGTTTC CTTTGTTCTT CAAATCAGGT GCCCAAATAA GTGATCAGCA CAGCTGCTTC 10440 
CAAATAGGAG AAACCATAAA ATAGGATGAA AATCAAGTAA AATGCAAAGA TGTCCACACT 10500 
GTTTTAAACT TGACCCTGAT GAAAATGTGA GCACTGTTAG CAGATGCCTA TGGGAGAGGA 10560 
AAAGCGTATC TGAAAATGGT CCAGGACAGG AGGATGAAAT GAGATCCCAG AGTCCTCACA 1062O 
/J CCTGA ATGAA TTATACATGT GCCTTACCAG GTGAGTGGTC TTTOGAAGAT AAAAAACTCT 10680 
AGTCCCTTTA AACGTTTGCC CCTGGCGTTT CCTAAGTACG AAAAGGTTTT TAAGTCTTCG 10740 
AA CAGTCT CC TTTCATGACT TTAACAGGAT TCTGCCCCCT GAGGTGTAAT T TTTTTOTTC 10800 
TATTTTTTTC CAOGTACTCC ACAGCCAACA TCAOGAGGTG TAATTTTTAA TTTGATCAGA 10860 
ACTGTTACCA AAAAACAACT GTCAGTTTTA TTGAGATGGG AAAAATGTAA ACCTATTTTT 10920 
ATTACTTAAG ACTTTATGGG AGAGATTAGA CACTGGAGGT TTTTAACAGA ACGTGTATTT 10980 
ATTAATGTTC AAAACACTGG AATTACAAAT GAGAAGAGTC TACAATAAAT TAAGATTTTT 11040 
GAATTTGTAC TTCTGOGGTG CTGGTTJTTC TCCACAAACA CCCCCGOCCC TCCCCATGCC 11100 
CAGGGTGGCC GTGGAAGGGA O QG TTTACGG AOGTGCAGCT GAGCTGTCCG TGTCCCATGC 11160 
TCCCTCAGCC AGTGGAACGT GCCGGAACTT TTTGTCCATT CCCTAGTAGG CCTGCCACAG 11220 
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CCTAGATGGQ CAOTTTTTGT CTTTCACCAA ATTTGAGGAC TTTTTTTTTT TGCCATTATT 11280 
tcttcagttt tcttttcttg cactgatctt TCTCCTCTCC TTCTGTGACT CCAGTGACTC 11340 
AGACGTTAGA CCTCTTQATQ TTTTCCCACT GGTCCCTGAG GCTCTGTTC 
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MAPPPPPVLP VLLLLAAAAA LPAKGLRAAA WEPRVPGGTR APALRPGCTY AVGAACTPRA 60 

PRELLDVGRD GRLAGRRRVS GAGRPLPLQV RLVARSAPTA LSRRLRARTH LPGCGARARL 120 

CGTGARLCGA LCFPVPGGCA AAQHSALAAP TTLPACRCPP RPRPRCPGRP ICLPPGGSVR 180 

LRLLCALRRA AGAVRVGLAL EAATAGTPSA SPSPSPPLPP NLPEARAGPA RRARRGTSGR 240 

GSLKFPMPNY QVALF ENBPA GTLILQLHAH YTIEGEEERV SYYMEGLFDE RSRGYFRIDS 300 

ATGAVSTDSV LDRETKETHV LRVKAVDYST PPRSATTYIT VLVKDTNDHS PVFEQSEYRE 360 

RVREMLEVGY EVLTIRASDR DSPINANLRY RVLGGAWDVF QLNESSGWS TRAVLDREBA 420 

AEYQLLVKAN DQGRNPGPLS ATATVYIEVE DENDNYPQFS EQNYWQVPE DVGLNTAVLR 480 

VQATDRDQGQ NAAIHYSILS GNVAGQFYLH SLSGILDVIN PLDFEDVQKY SLSIKAQDGG 540 

RPPLINSSGV VSVQVLDVND NEPIFVSSPF QATVLENVPL GYPWHIQAV DADSGENARL 600 

HYRLVDTAST FLGGGSAGPK NPAPTPDFPP QIHNSSGWIT VCABLDREBV EHYSFGVEAV 660 

DHGSPPMSSS TSVSITVLDV NDNDPVFTQP TYELRLNEDA AVGSSVLTLQ ARDRDANSVT 720 

TYQLTGGNTR NRFALSSQRG GGLITLALPL DYKQEQQYVL AVTASDGTRS HTAHVLINVT 780 

DANTHRPVFQ SSHYTVSVSE DRPVGTSIAT LSAKDEDTGE NARITYVIQD PVPQPRIDPD 840 

SGTMYTMMEL DYENQVAYTL TIMAQDNGIP QKSDTTTLEI LILDANDNAP QFLWDFYQGS 900 

IPEDAPPSTS ILQVSATDRD SGPNGRLLYT FQGGDDGDGD PYIEPTSGVI RTQRRLDREN 960 

VAVYNLWALA VDRGSPTPLS ASVEIQVTIL DXNDNAPMFE KDELELPVEE NNPVGSWAK 1020 

IRANDPDEGP NAQIMYQIVE GDMRHFFQLD LLNGDLRAMV ELDFEVRRBY VLWQATSAP 1080 

LVSRATVHIL LVDQNDNPPV LPDFQILPNN YVTMKSNSFP TGVIGCIPAH DPDVSDSLNY 1140 

TFVQGNELRL LLLDPATGEL QLSRDLDKNR PLEALMEVSV SDGIHSVTAP CTLRVTIITD 1200 

DMLTNSITVR LENMSQEKFL SPLLALPVEG VAAVLSTTKD DVFVFNVQND TDVSSNILNV 1260 

TFSALLPGGV RGQFFPSEDL QEQIYLNRTL LTTISTQRVL PFDDNICLRE PCENYMKCVS 1320 

VLRFDSSAPF LSSTTVLFRP IHPINGLRCR CPPGFTGDYC ETEIDLCYSD PCGANGRCRS 1380 

REGGYTCECF EDFTGEHCEV DARSGRCAKG VCKNGGTCVN LLIGGFHCVC PPGEYERPYC 1440 

EVTTRSFPPQ SFVTFRGLRQ RFHFTISLTF ATQERNGLLL YNGRFNEKHD FIALEIVDEQ 1500 

VQLTFSAGET TTTVAPKVPS GVSDGRWHSV QVQYYNKPNI GHLGLPHGPS GEKMAWTVD 1560 

DCOTTMAVRF GKDIGNYSCA AQGTQTGSKK SLDLTGPLLL GGVPNLPEDF PVHNRQFVGC 1620 

MRNLSVDGKN VDMAGFIANM GTREGCAARR NFCDGRRCQN GGTCVNRWNM YLCECPLRFG 1680 

GKNCEQAMPH PQLFSGESW SWSDLNIIIS VPWYLGLMFR TRKEDSVLME ATSGGPTSFR 1740 

LQILNNYLQF EVSHGPSDVE SVMLSGLRVT DGEWHHLLIE LKNVKEDSEM KHLVTMTLDY 1800 

GMDQNKADIG GMLPGLTVRS VWGGASEDK VSVRRGFRGC MQGVRMGGTP TNVATLNMNN 1860 

ALKVRVKDGC DVDDPCTSSP CPPNSRCHDA WEDYSCVCDK GYLGINCVDA CHLNPCENMG 1920 

ACVRSPGSPQ GYVCECGPSH YGPYCENKLD LPCPRGWWGN PVCGPCHCAV SKGFDPDCNK 1980 

TNGQCQCKEN YYKLLAQDTC LPCDCFPHGS HSRTCDMATG QCACKPGVIG RQCNRCDNPF 2040 

AEVTTLGCBV ZYNGCPKAPE AGIWWPQTKP GQPAAVPCPK GSVGNAVRHC SGEKGWLPPB 2100 

LFNCTTISPV DLRAMNEKLS RNETQVDGAR ALQLVRALRS ATQHTGTLFG NDVRTAYQLL 2160 

GHVLQHESWQ QGFDLAATQD ADFHEDVIHS GSALLAPATR AAWEQIQRSE GGTAQLLRRL 2220 

EGYFSNVARN VRRTYLRPFV IVTANMILAV DIFDKFNFTG ARVPRFDTIH EBFPRELESS 2280 

VSFPADFFRP PEEKEGPLLR PAGRRTTPQT TRPGPGTERE APISRRRRHP DDAGQFAVAL 2340 

VIIYRTLGQL LPERYDPDRR SLRLPHRPII NTPMVSTLVY SEGAPLPRPL ERPVLVEFAL 2400 

LEVEERTKPV CVFWNHSLAV GGTGGWSARG CELLSRNRTH VAOQCSHTAS FAVLMDISRR 2460 

ENGEVLPLKI VTYAAVSLSL AALLVAFVLL SLVRMLRSNL HSIHKRLAVA LFLSQLVFVI 2520 

GIKQTENPFL CTWAILLHY IYKSTFAWTL VESLHVYRML TEVRNIDTGP MRFYYWGWG 2580 

IPAIVTGLAV GLDPQGYGNP DFCWLSLQDT LIWSFAGPIG AVIIINTVTS VLSAKVSCQR 2640 

KHHYYGKKGI VSLLRTAFLL LLLISATWLL GLLAVNRDAL SFHYLFAIFS GLQGPPVLLF 2700 

HCVLNQEVRX HLKGVLGGRK LHLEDSATTR ATLLTRSLHC NTTFGBGPDM LRTDLGESTA 2760 

SLDSIVRDEG IQKLGVSSGL VRGSHGEPDA SLMPRSCKDP PGHDSDSDSE LSLDEQSSSY 2820 

A5SHSSDSED DGVGAEEKWD PARGAVHSTP KGDAVANHVP AGWPDQSLAE SDSEDPSGKP 2680 

RLKVETKVSV BLHREEQGSH RGEYPPDQES GGAARLASSQ PPEQRKGILK NKVTYPPPLT 2940 

LTEQTLKGRL REKLADCEQS PTSSRTSSLG SGGPDCAITV KSPGREPGRD HLNGVAMKVR 3000 
TGSAQADGSD SEKP 



A33 DMA SEQUENCE 

Gene name: CXCR3 (G protein- coupled receptor 9); chemokine (C-X-C) receptor 3 
Unigene number: Hs. 198252 

Probeset Accession 8: X9S876 
Nucleic Acid Accession #: X9S876 

Coding sequence: 69-1175 (underlined sequences correspond to start and stop codons) 



l 



11 



21 



31 



41 



51 




WO 03/042661 



PCT/US02/36810 



CCCTCCTGGA GAACTTCAGC TCTTCCTATG ACTATGGAGA AAACGAGAGT GACTCGTGCT 180 

GTACCTCCCC GCCCTGCCCA CAGGACTTCA GCCTGAACTT OGACOGGGCC TTCCTGCCAG 240 

CCCTCTACAG CCTCCTCTTT CTGCTGGGGC TGCTGGGCAA CGGCGCGGTG GCAGCCGTGC 300 

TGCTGAGCCG GCGGACAGCC CTGAGCAGCA CCGACACCTT CCTGCTCCAC CTAGCTGTAG 360 

CAGACACGCT GCTGGTGCTG ACACTGCCGC TCTGGGCAGT GGACGCTGCC GTCCAGTGGG 420 

TCTTTGGCTC TGGCCTCTGC AAAGTGGCAG GTGCCCTCTT CAACATCAAC TTCTACGCAG 480 

GAGCCCTCCT GCTGGCCTGC ATCAGCTTTG ACCGCTACCT GAACATAGTT CATGCCACCC 540 

AGCTCTACCG CCGGGGGCCC CCGGCCCGCG TGACCCTCAC CTGCCTGGCT GTCTGGGGGC 600 

TCTGCCTGCT TTTCGCCCTC CCAGACTTCA TCTTCCTGTC GGCCCACCAC GACGAGCGCC 660 

TCAACGCCAC CCACTGCCAA TACAACTTCC CACAGGTGGG CCGCACGGCT CTGCGGGTGC 720 

TGCAGCTGGT GGCTGGCTTT CTGCTGCCCC TGCTGGTCAT GGCCTACTGC TATGGCCACA 780 

TCCTGGCCGT GCTGCTGGTT TCCAGGGGCC AGCGGCGCCT GOGGGCCATG CGGCTGGTGG 840 

TGGTGGTCGT GGTGGCCTTT GCCCTCTGCT GGACCCCCTA TCACCTGGTG GTGCTGGTGG 900 

ACATCCTCAT GGACCTGGGC GCTTTGGCCC GCAACTGTGG OOGAGAAAGC AGGGTAGACG 960 

TGGCCAAGTC GGTCACCTCA GGCCTGGGCT ACATGCACTG CTGCCTCAAC CCGCTGCTCT 1020 

ATGCCTTTGT AGGGGTCAAG TTCCGGGAGC GGATGTGGAT GCTGCTCTTG CGCCTGGGCT 1080 

GCCCCAACCA GAGAGGGCTC CAQAGGCAGC CATCGTCTTC CCGCCGGGAT TCATCCTGGT 1140 

CTGAGACCTC AGAGGCCTCC TACTOGGGCT TGTGAGGCCG GAATCCGGGC TCCCCTTTCG 1200 

CCCACAGTCT GACTTCCCCG CATTCCAGGC TCCTCCCTCC CTCTGCCGGC TCTGGCTCTC 1260 

CCCAATATCC TCGCTCCCGG GACTCACTGG CAGCCCCAGC ACCACCAGGT CTCCCGGGAA 1320 

GCCACCCTCC CAGCTCTGAG GACTGCACCA TTGCTGCTCC TTAGCTGCCA AGCCCCATCC 1380 

TGCCGCCCGA GGTGGCTGCC TGGAGCCCCA CTGCCCTTCT CATTTGGAAA CTAAAACTTC 1440 

ATCTTCCCCA AGTGCGGGGA GTACAAGGCA TGGCGTAGAG GGTGCTGCCC CATGAAGCCA 1500 

CA GCCCA GGC CTCCAGCTCA GCAGTGACTG TGGCCATGGT CCCCAAGACC TCTATATTTG 1560 

CTCTTTTATT TTTATGTCTA AAATCCTGCT TAAAACTTTT CAATAAACAA GATCGTCAGG 1620 

ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA BHCLITERAT 1680 

UREBHCBLAS TPBHCORTHO LOGUESMOUS ECXCRMYLEV SERQVLDASD PAPLLENSTS 1740 

PYDYGENESD PSDSPPCPQD FSLNFDRTFL PALYSLLFLL GLLGNGAVAA VLLSQRTALS 1800 

STDTFLLHIA VADVLLVLTL PLWAVDAAVQ WVFGPGLCKV AGALFKINPY AGAFLLACIS 1860 

FDRYLSIYHA TQIYRRDPRV RVALTCIWW GLCLLFALPD FIYLSANYDQ RLNATHCQYN 1920 

FPQVGRTALR VLQLVAGFLL PLLVMAYCYA HILAVLLVSR GQRRFRAMRL VWWAAFAV 1980 

CWTPYHLWL VDILMDVGVI* ARNCGRESHV DVAKSVTSGM GYMHCCLNPL LYAPVGVKFR 2040 
EQMWMLFTRL GRSDQRGPQR QPSSSRRESS WSBTTEASYL GL 



A3 4 Protein sequence 

Gene name: CXCR3 (G protein-coupled receptor 9) ;chemokine(C-X-C) receptor 3 

Unigene number: Hs. 1982 52 
Protein Accession ft: P49682 

Signal sequence: none found 

Pfam domains: 7tm 1 [70-318) 

Transmembrane domains: 57-78, 92-113, 129-147, 169-190, 222-243, 257-275, 307-323 

DRY box: 148-149 

Cellular localization: plasma membrane 



1 11 21 31 41 51 

I 1 I I I I 

MVLEVSDHQV LNDAEVAALL ENFSSSYDYG ENESDSCCTS PPCPQDFSLN FDRAFLPALY 60 

SLLFLLGLLG NGAVAAVLLS RRTALSSTDT FLLHLAVADT LLVLTLPLWA VDAAVQKVFG 120 

SGLCKVAGAL FNINFYAGAL LLACXSFDRY LNIVHATQLY RRGPPARVTL TCLAVWGLCL 180 

LFALPDFIFL SAHHDERLNA THOQYNPPQV GRTALRVLQL VAGFLLPLLV MAYCYAHILA 240 

VLLVSRGQRR LRAMRLWW WAFALCWTP YHLWLVD1L MDLGALARNC GRESRVDVAK 300 

SVTSGLGYMH CCLNPLLYAF VGVKFRERMW MLLLRLGCPN QRGLQRQPSS SRRDSSWSET 360 
SEASYSGL 



A35 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft i 
Coding sequence* 



Differentially expressed C016 gene (clone MGC:5257) 

Hs. 69517 

AA447522 

BC001291 

44-541 (start and stop codons are underlined) 



1 11 21 31 41 51 

I I I I 1 I 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 60 

GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 

AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180 

TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240 

CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300 

CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 

GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420 

ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480 

GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG 540 

AGCCACGGQA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACCGTTGTC 600 

ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 

GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 

ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTQACAAGTT TTTCTCTTTG 780 

AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840 

CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900 

TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC TGGGGGAGTC TGAATGATTG 960 

GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACAOGTT 1020 

AGGGCTGCCC CCATTCCAGT GGTGGAGGOG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080 



846 



WO 03/042661 



PCT/US02/36810 



CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC 1140 

ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200 

ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTOTT AAGATGCAGC 1260 

CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



10 



15 



A3 6 Protein sequence: 
Gene names 
Unigene number: 
Probeset Accession #: 
Protein Accession fl: 
Signal sequence: 
Transmembrane domain: 
Cellular localization: 



Differentially expressed C016 gene (clone MGC:5257) 

Hs. 69517 

AA447522 

AAH01291 
1-17 (first underlined sequence) 
146 - 162 

plasma membrane 



20 
25 

30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



1 11 21 31 41 SI 

I I I I I I 

MALLALLLW ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 
KWTEPYCVIA AVKIPPRPFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF PYLKCCKIRY 
CNLEGPP1NS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS 

A3 7 DKA 



60 
120 



Gene name: ESTs 
Unigene number: Hs. 2 93616 

Probeset Accession #: AW043782 
Nucleic Acid Accession #: none found 

Ceding sequence: 38-107S (underlined sequence corresponds to start and stop codon) 



AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 
GTGTGACGGG 
GTCGAAATGT 
CTTCOGGTGC 
AAACCCTCTG 
GAGCTTCATC 
AAGTTCTCAA 
TTACCCCAGC 
CCTGCTGGCA 
GCACCGGCTG 
CTGCAACGTC 
QAATGCGTCG 
TGCGTGGTAT 
CGACCTGCCC 
CAGCAGCCTC 
GGGCACTGCT 
AGTTATTCCA 
TGCTCATGGG 
AACTATCTCT 
TGACATGATC 
CACCCTCATT 
AAATAGGCTG 
CGCTGGACCC 
ATGATCTAAC 
ATCAAAACCT 
AAGAAAACTT 
AAGGACTCTG 
CTCATTCTGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTGCCCGT 
GTATGTCCCT 
CTCCAAAGTT 
ACTGGTTTCT 
CTGCACTGTG 
GGTCAGGGTC 
AGACAATTTG 
TGAAACAGTG 
AGCTGTCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGIGT 
AGGT G TTGTT 
CCACTCCGGG 
AACCTGTTTG 
TGATCCTGTT 
AAGCTAGCCA 



11 
I 

CCGGGCAGCG 
AGCAGCGCCG 
CCAGGCAACT 
CTGCCTGACT 
GGCCCAACCT 
AATGGGTTTG 
CTTTGCTCCA 
TGCGATGGAC 
GAACCCGGCA 
ATCACCTATG 
CTGGTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 
CCCTACCGCT 
CTGAGCGTGG 
GAGCCCAGGG 
AAGTCCATAT 
AAGCTCTTTA 
GCATTCCCCT 
TGTTGTGCGT 
TTTCACATTA 
GGAGAGAGCA 
AATTCTCTCT 
CAGGAGGCCA 
GCTTTGCACA 
TGGACGTGAG 
AAACCATCTA 
GAGCTTTCCT 
CATGAGTTTA 
CTGGCTCTAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGGTG 
CACGCTCCTC 
AGGCCTCTCC 
GAGTCAAGAT 
TGTTTGTTTT 
TTTTTTGTTT 
CCCGCTGAGC 
ATTGTTGCAC 
CTCTCTCCCT 
CCAGTCAGCC 
TGGCAAGAAA 
CAGCTGTCAC 
ACGCTAATTA 
CTGTAGACTT 
CTGGTATTTT 





CCAAGGCTAA 
TCATTGGTCG 
ACTGCACAGC 
GTATTGACAA 
AA AGCTGT GA 
AACTTGTGTA 
TGGTGGTGGC 
CGCTGCCCGT 
ACCCCCACCA 
AGGCGGAGCA 
ACCAGAGGCC 
TGAACCAAGC 
CCCAGGCAGC 
GCCCCCAGGA 
TATAAGTCCC 
TAACAATTTG 
TTGGGATATT 
GCGTCTCAGT 
CCCGAGATCA 
AACAGTATTG 
GTGCAGGAGA 
TTTGGGTTAG 
CCATTTGAGC 
TCAGTGGCCA 
TGGTTTTGTG 
CCCAAGAATG 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGACCTGGCT 
CAACACTGGC 
ACTTGAGGAC 
TCCTGGCT CC 
TTAGAGTTAG 
CATGGGCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGGAGATC 
TGTGTGCCAG 
GGAAGCAGAA 
TTTCTTTTTT 
GTAAAACGTT 
CCAGGTAGAG 
AATGTTCAGT 
TGGCATTCAG 
TGTTACAGAA 
AAGAAAGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



847 



WO 03/042661 



PCTAJS02/36810 



AACGGAAAGG 
CTGAGTAATC 
CATTTCATCT 
TTTCTGGTGC 
GAGTTAATCT 
AGATAAGGGA 
TGATTTTTTT 
TTTTTGGGGG 
TTTTTAACTC 
AAAATAGTCT 
AGTTACACTG 
TTTTTAAAAA 
GAAAGGTTGT 
TTATACTTTC 
AAMMAAAAAC 
TGGGGCGGCG 
GAAACCCTGG 
GGAAACOGCA 
GGACTCAACC 



AACCTAGCTG 
CAATAAAGAA 
CCTGTGAGTC 
TCTGGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GAGGGGGTTT 
ATTCCAACCA 
CATCTCTTTT 
TGATGACTGG 
ATGCAACTAA 
GTGTOGTTGC 
TAATAAATTT 
AWYWTTGGGG 
GGGCCCACGT 
CCAAGAAAAA 
GAGTGTTGCG 
AGGAGGACCC 



CCTGTATCTT 
CTTTTGATGA 
AGAAGGGCTT 
GTTTAGAGGA 
TCTGCTTCCA 
TGCTTTTTTA 
TTTAAAAATA 
GTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
CCTATTACCT 
GTGGTTAATA 
TTTTTGTGTT 
GCAGTTTCAT 
GGGCTTGGGC 
AGGTACGGCG 
GGTGGCGAGA 
TAAACCACAC 
AAGGGAACCC 



TCATTTTTAA 
CAGCCAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GGCATCTTAG 
AAACAAACAG 
TATAAATAGG 
TCAAGATGGC 
TTATACATTG 
GAGATCOGTG 
GACTCAGCTC 
GTGTGTGACG 
TTGGTTAGGC 
TCTTTCTGTT 
CTCGGAAAAA 
ACCACGCGGG 
ATTCTCCACA 
CCGAAGAGAG 
GATAGAGTAC 



AATAGCACTT 
GTGTTAGAAC 
TTTGATGGGG 
ATTTTAATTA 
GAAAAACAAA 
GGACATTTTT 
ACACCAAAGC 
ACATTAGTGG 
CCTAAATCTA 
TTTTATTTTA 
CCTCTACCTT 
CTCAAAGTTA 
TTGGTTTTGT 
TGTGCAAAWG 
GTTTTTAACA 
CCCAAACGGG 
CCAGAAAAAA 
AACTCAGAAG 
G 



GAGTTATTTT 
TCTGGCTGAA 
CCCCTTCTTC 
ATTGCGCAGT 
TGGTTTTAGT 
ATTATAGATT 
GGCAGGGTTT 
CCAGCAATAT 
CGCCAACCAG 
GCATTAAATT 
GAAATTGACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMARM 
CCACTTCGGG 
ACCCCAGAAG 
ACGCGCCGGG 
CACACAAGCG 



A38 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequences 
Transmembrane domains: 
LDLa domains: 
Cellular Localization: 



ESTs 

Hs -293616 
AW043782 
none found 
1-17 
169-191 

28-66, 70-108, 112-149 
plasma membrane 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



} » ?1 31 41 51 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60 

EKECPKAKSK CGPTFPPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 

KKGLCIDKSF ICDGQNNCQD NSDBESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 

^™ WALL A"™**** NNLMTLPVHR LQHPVLLSRL WLDHPHHCN VTYNVNNGIQ 240 

YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEBV 



A3 9 DMA sequence 

Gene name: 

Unigene number: 

Probeset Accession #: 

Nucleic Acid Accession 

Coding sequence: 1-1749 

1 11 21 

> I i 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT 
CTTCTGGGAG ACCGTCCTGC GCTCTCTGGA 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT 
CGGGCGGACA AAGACGGGCC CCOACGGCTC 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC 
GCCCCAGGGG GTAACCGCTT GATGGAGACC 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG 
AACAACGTGT ATGTCAACAT GCCCACGAAT 
CAGATTGTGC CACATCAAGG GCAGTATCTG 
CACGACTCTG TGCCCATGAC AGCTGTGCCA 
AGGCAGATTC AGTCCCCCTT CCCTCACACC 
ACTGT ATAAC CQAGAGTCAC TGGTGGGTTC 
GATTCTCGAG GTGGAAGTCC GCACATGTCG 
GCTTCATTTG CCCCCAGACT GTATGAAAAC 
CATCCAGGGT ATCATTGATT TATGATGOAA 
TTGCTGATGG GTGTATAACA AATGCTTGAG 
AAGAATTTTA TAAACTGATA AATTAAGGAT 



ESTs 

Hs. 128899 

AA983251 

AA983251 

(underlined 

31 

I 

ACCCAGCACA 
GGTGCGCACA 
GCCGGGCTCC 
CCGCCCGCCT 
GGGACTGCGG 
GTCCGGTTGC 
TGCATCCCGA 
CCACCGGCTG 
CTCCTGGGCG 
GACGAGGCCC 
GACGCGCTGT 
CATCCTGGAA 
CGAGGGCCCA 
GGCTTCCAGT 
GCGTTGCGCT 
GACCGCCAGC 
GGCAGGGCTT 
AGGGCCTGGC 
TTCCCGGGGC 
CCCTCTCCCC 
GTGTTTGTCG 
CGGCCTAAGC 
ATCCCCATGA 
GCTGCCAGTT 
ACCAACTGTT 
TTCTCTGTGC 
CATCCCCCAT 
CCTTTCATGG 
AACAGTGAAC 
CTTTACTGAA 
GTGGTATTTA 
ATCTCCGAAT 
AACCGGCCTC 
TCCGAAGTGC 
TTTTATTATG 



sequences 
41 
I 

GAGCACAGTA 
CCTCCCGAGG 
TGTGGGACCG 
GGATCOGCGC 
CTGGGGGCGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 
TCGCGGCAGA 
GGGGGTCGCC 
CCGCGCCCAG 
CGCCGCTTCG 
GCGGCGAGTA 
GTCCCGAGCG 
ACTGCTGCTC 
AGGGCGCTGG 
CATGTCTTAG 
AGCGGTGCTC 
TGCTGCCCCG 
TGCAGCGGCC 
CCTTTATCAT 
AGGATCCCCA 
TCCCCAGTGC 
CCAGCTCCAG 
GCTTGCCGGA 
TGAACTGTCA 
AOGTGGGGTA 
ACGGCCTGCA 
AGAAGATGTA 
GGGAGAOGAA 
TGGCACGATT 
TAGCATTTCT 
AGCTGGAGAT 
CCTTGAGATA 
TTGTTATTAT 



correspond 

51 

I 

CACTCCCGGA 
GCGAGGCAGC 
CGCTGCAGCC 
CCAGCAGCAG 
GCAGGACCCT 
TCCAGAGGCT 
AGOGAGTGCA 
TGGCTTAGGA 
GGGGAGTGGC 
GGGGCCACGA 
GGTGGTGCCA 
CTCCTGCAGC 
CTGCCACGGC 
CTTCGACGGC 
CAGCGCCGAG 
CGAGCCTGGC 
GGGTACCCAA 
CCCTGAAGGC 
TGCCAGACGC 
OGCCTTGCCC 
CTTGGGGTCC 
GCAGAGCCGA 
CAGCACCTCC 
CGCCAACTCC 
AGGGACCATG 
GCAGGCCACC 
CACGGTGCAG 
GCCTGGCTAC 
CCCAGCGGTG 
GGCAGGGGTG 
CCTTTGGATG 
GGATATGTTT 
GACTGTGATG 
TGGTTGACGA 
TATTTCTTTT 



to start and stop codons) 

60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



848 



WO 03/042661 



PCT/US02/36810 



TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGOAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC OTGGCTGGQA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATO GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA QTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAQATA TATATATTTT GTCTATTTTT 6TGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT 6GTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TT6TCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTC ATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A40 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular localization! 



BSTS 

Hs. 128899 

none found 

1-11 
402-424 

not determined 



1 11 21 31 41 51 

I I I I I | 

MLSGPLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRPPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GPQCPERFDG GDATICOGSC ALRYCCSSAB 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA PPGLLPRARR RGPPSSPRGG PSPLQRPAIiP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPPPHT NSEQKMYPAV TV 

A41 DNA SEQUENCE t 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Unigene number i Bs. 100686 

Probeset Accession #: AA487468 

Nucleic Acid Accession #: AA487468 

Coding sequence? 55-555 (underlined sequences correspond to start and stop codons) 



1 11 21 

i I I 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT 
CTACACTCAG CTTTGGGTCT CTGCCTCTTA 
GCAATAAAAA AGGAAAAGAG GCCTCCTCAG 
ACTTGGGTAC AAACTTATGA AGAAGGTCTC 
ATGGTTATTC ATCACCTGGA GGATTGTCAA 
CAAAATGAAG AAATACAAGA AATGGCTCAG 
GAAAOCACTG ATAAGAATTT ATCACCTGAT 
GACCCTTCTT TAACAGTTAG AGCTGACATA 
TATGAGCCTC GGGATTTACC CCTATTGATA 
CAGTCAGAGC T ATAAG AGAT GATAGAAAAA 
GAAGAAAACC TCTGGCACAT TGACAAATAC 
TTACTATTTA GTTTTTTTAA TGTGTTTGCA 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



31 


41 


51 




1 

CTTCTGAGAG 


1 

AGTCTCTAGA 


1 

AGACATGATG 


60 


CTCGTCACAG 


TTTCTTCCAA 


CCTTGCCATT 


120 


ACACTCTCAA 


GAGGATGGGG 


AGATGACATC 


180 


TTTTATGCTC 


AAAAAAGTAA 


GAAGCCATTA 


240 


TACTCTCAAG 


CACTAAAGAA 


AGTATTTGCC 


300 


AATAAGTTCA 


TCATGCTAAA 


CCTTATGCAT 


360 


GGGCAATATG 


TGCCTAGAAT 


CATGTTTGTA 


420 


GCTGGAAGAT 


ACTCTAACAG 


ATTGTACACA 


480 


GAAAACATGA 


AGAAAGCATT 


AAGACTTATT 


540 


AGCCTTCACT 


TCAAAGAAGT 


CAAATTTCAT 


600 


TAAATGTGCA 


AGTATATAGA 


TTTTGTAATA 


660 


ATAGTCTTAT 


TAAAATAAAT 


git rr IT AAA 


720 



A42 Protein sequence: 

Gene name: BSTs. Weakly similar to JE0350 Anterior gradient-2 (H. sapiens] 

Unigene number: Hs. 100686 

Probeset Accession 8: AA487468 

Protein Accession §: none found 

Signal sequence: 1-23 

Transmembrane domains: none found 

Cellular Localization: secreted 



849 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

11)111 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV PAQNEEIQEM AQNKFIMLNL MHETTDKKLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 



A43 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs. 106778 

Probeset Accession #: N51919 
Nucleic Acid Accession fl: AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGATTCCTG 
ATTCTCCAAG 
TTTCATGGCT 
TCTCAGTTTA 
ATGCATCAGT 
GCCTTTGTTC 
CCAGAATGCC 
CCAGGTGATA 
GAGGCTGTGG 
AAGGTGACAG 
GCCTTTATGG 
GAAAATTCTG 
CCTCTGCAGA 
ATAGGAATCA 
ATTAGTGTAA 
ACGCTAGCTC 
ATTGTTGAAA 
AAGAATGAAA 
GGAGTTGGCT 
TATAACCCAG 
AGAAACAATA 
ATGGGTCTTG 
GAGCAAAAGT 
TGTTTTATGA 
GGGCAGACCT 
ATGGGCTCAG 
ACATTTCTTG 
ACAACACTCA 
GCAGTTGCAA 
GAAGAAATAG 
TTTTACAGAG 
TCAGTTGTAG 
ATTGGAGTTG 
CTAGTGGATG 
AATAACATTA 
ATCTCATPGG 
ATCAATATTA 
GATGTCATTC 
CTTAAAATAC 
GAGCTA CGAG 
TTTTTTGACA 
GGACTCTGCA 
CTAGTTATTT 
GATCTGTTGT 
AAGGTTGAAA 
CTTGAAGTAT 



11 
I 

TATTGACATC 
CTGATCTTCA 
GGAATGAGTT 
AAAATCCCCT 
TTGATGATGC 
AGGAATATCG 
ATTGTGTGOG 
CAGTTTGCCT 
ATCTTTCCAT 
CTCCTCAGCC 
GAACACTGGT 
AATTTGGGGA 
AGAGCATGGA 
TCATGTTGGT 
GTTTGGCTGT 
TTGGTGTTAT 
CTCTGGGCTG 
TGACTGTTAC 
ATAATCAATT 
CTGTTAGCAG 
CTCTAATGGG 
ATGGACTTCA 
GGATGGCTGT 
AAGGTGCTTA 
TGACACTTAC 
CGGGACTCAG 
GCTTGGTGGG 
TTGCCTCAGG 
TCGCCAGTCG 
ATGCAATGGA 
CTAGCCCAAG 
CCATGACAGG 
CGATGGGCCA 
ATGATTTTCA 
AAAATTTCGT 
CTACATTAAT 
TTATGGATGG 
GTAAACCTCC 
TTGTTTCATC 
ACAATGTGAT 
TGTTCAATGC 
GTAATAGAAT 
ACTTTCCTCC 
TTCTTTTGGG 
GGAGCAGGGA 
GA 



21 
I 

AAAAAAAGCA 
GAATGGTCTA 
TGATATTAGT 
TATTATGCTG 
CGTCAGTATC 
TTCAGAAAAA 
7GAAGGAAAA 
TTCTGTTGGG 
TGATGAGTCC 
AGCTGCAACT 
CAGATGTGGC 
GGTTTTTAAA 
CCTCTTAGGA 
TGGCTGGTTA 
AGCAGCAATT 
GAGAATGGTG 
CTGTAATGTG 
TCACATATTT 
TGGGGAAGTG 
AATTGTTGAG 
GAAGCCAACA 
ACAAGACTAC 
TAAGTGTGTA 
CGAACAAGTA 
TCAGGAGCAG 
AGTTCTTGCT 
AATCATTGAT 
AGTATCAATA 
TCTGGGATTG 
TGTTCAGCAG 
GCACAAGATG 
AGATGGAGTA 
GACTGGTACA 
AACCATAATG 
TAGATTCCAG 
GAACTTTCCT 
ACCCCCAGCT 
TCGCAACTGG 
AATAATCATT 
TACACCTCGA 
ACTAAGTTCC 
GTTTTGCTAT 
GCTTCAGAAG 
TCTCACCTCA 
AAAGATCCAG 



31 
I 

AGTGAATTAC 
AACAAATGTG 
GAAGATGAGC 
CTTCTGGCTT 
ACTGTGGCAA 
TCTCTTGAAG 
TTGGAGCATA 
GATAGAGTTC 
AGCTTGACAG 
AATGGAGATC 
AAAGCAAAGG 
ATGATGCAAG 
AAACAACTTT 
CTGGGAAAAG 
CCTGAAGGTC 
AAGAAAAGGG 
ATTTGTTCAG 
ACTTCAGATG 
ATTGTTGATG 
GCGGGCTGTG 
GAAGGGGCCT 
ATCAGAAAAG 
CACCGAACAC 
ATTAAGTACT 
AGAGATGTGT 
TTGGCTTCTG 
CCACCTAGAA 
AAAATGATTA 
TATTCCAAAA 
CTTTCACAAA 
AAAATTATTA 
AATGATGCAG 
GATOTTTGCA 
TCTGCAATCG 
CTGAGCACGA 
AATCCTCTCA 
CAGAGCCTTG 
AAAGACAGCA 
GTTTGTGGGA 
GACACAACAA 
AGATCCCAGA 
GCAGTTCTTG 
GTTTTTCAGA 
TCAGTGTGCA 
AAGCATGTTA 



41 
I 

CAGTCAGTGA 
AAGTTAGTCA 
CACTGTGGAA 
CTGCAGTCAT 
TACTTATCGT 
AATTGAGTAA 
CACTTGCCCG 
CTGCTGACTT 
GTGAGACAAC 
TTGCATCGAG 
GTGTTGTCAT 
CAGAAGAGGC 
CCTTTTACTC 
ATATCCTGGA 
TCCCCATTGT 
CCATTGTGAA 
ATAAAACTGG 
GTCTGCATGC 
GTGATGTTGT 
TGTGCAATGA 
TAATTGCTCT 
CTGAATACCC 
AGCAGGACAG 
GTACTACATA 
ACCAACAAGA 
GTCCTGAACT 
CTGGTGTGAA 
CTGGAGATTC 
CTTCCCAGTC 
TAGTACCAAA 
AGTOGCTACA 
TTGCTCTGAA 
AAGAGGCAGC 
AAGAGGGTAA 
GTATAGCAGC 
ATGCCATGCA 
GAGTAGAACC 
TTTTGACTAA 
CTTTGTTTGT 
TGACCTTCAC 
CCAAGTCTGT 
GATCCATCAT 
CTGAGAGCCT 
TAGTGGCAGA 
GTTCGACATC 



51 
I 

AGTTGCAAGC 
TAGGCGAGCC 
GAAGTATATT 
CAGTGTTTTA 
TGTTACAGTT 
ACTTGTGCCA 
AGACTTGGTT 
ACGCTTGTTT 
GCCTTGTTCT 
AAGTAACATT 
TGGAACAGGA 
ACCAAAAACC 
CTTTGGTATA 
AATGTTTACT 
GGTCACAGTG 
AAAGCTGCCT 
AACACTGACG 
TGAGGTTACT 
TCATGGATTC 
TGCTGTAATT 
TGCAATGAAG 
TTTTAGCTCT 
ACCAGAGATT 
CCAGAGCAAA 
GAAGGCACGC 
GGGACAGCTG 
AGAAGCTGTT 
ACAGGAGACT 
AGTCTCAGGA 
GGTTGCAGTA 
GAAGAACGGT 
GGCTGCAGAC 
AGACATGATC 
AGGGATTTAT 
ATTAACTTTA 
GATTTTGTGG 
AGTGGATAAA 
AAACTTGATA 
CTTCTGGCGT 
ATGCTTTGTG 
GTTTGAGATT 
GGGACAATTA 
AAGCATACTG 
AATTATAAAG 
ATCATCTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



A44 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains 
Pfam domains: 
[744-889] 

Cellular Localization 

1 11 21 31 41 51 

I I I I I I 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60 

SQFKNPLIML LLASAVISVL MHQPDDAVSI TVAILIWTV AFVQEYRSEK SLEELSKLVP 120 

PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDBS SLTGETTPCS 180 

KVTAPQPAAT NGDLASRSNI AFKGTLVRCG KAKGWIGTG ENSBFGEVFK MMQAEEAPKT 240 

PLQKSMDLLG KQI*SFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIWTV 300 

850 



ATPase, Ca++ transporting, type 2C, member 1 
Hs. 106778 
N51919 
AAP27813 
none found 

60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878 
Cation ATPase N [6-83] , E1-B2 ATPase [89-324] , Hydrolase [328-649] , 

: not determined 



Cation ATPase C 



WO 03/042661 



PCT/US02/36810 



TL^GVMRMV KKRAIVKKLP IVBTLGCCNV ICSDKTCTLT KNENTVTHIP TSDGLHAEVT 360 

GVGYNQPGEV IVDGDWHGF YNPAVSRIVE AGCVCNDAVI RNNTLKGKPT EGALIALAMK 420 

%XZ SS W™K* HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK til 

ZEZESS S^!^ m3AGLRV ^ LASQPELGQL TPLGLVGIID PPRTGVKEAV 540 

TTLIASGVSI KHITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAHDVQQ LSQIVPKVAV 600 

FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAMGQTGT 660 

SESSE* m ™"> LSTSIAALTL ISlSS NP^H 20 

E^n^™ ^S!f! VDK W »»WW" KDSILTKNLI LKILVSSIII VCGTLFVFWR 780 

™ FFDMFNALSS RSQTKSVPEI GLCSNRMFCY AVLGSIMGQL 840 

LVIYPPPLQK VFQTESLSIL DLLFLLGLTS SVCIVABIIK KVERSREKIQ KKVSSTSSSF 900 



A4S DMA sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding Sequence: 



ESTs 

Hs. 157601 

W07459 

AC005383 

328-2751 (underlined sequences correspond to start and stop codons) 



51 



I i* ? X 31 41 

SiS^^I C GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60 • 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC M^CCTC 120 

SESSS CCICAGCCBG GTCGGGTCGT GCCGCCCTCT CG^gS l B0 

*^^GT GTCCCAOGTG GCAGCCGCGC CCCGGGCGCC CCTCCTCTGA TCCCGTAGCG 240 

TcSSSS CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGcS 300 

GTtISSS TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360 

™ HISSSS! GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

n^^I^ GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480 

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540 

^ESS ^Sf^ GACATCAGCC ctoS^ 600 

r^JIS^ T TCAGTTCCAC TCCTCATCTG GAATTGCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720 

S^SSS* AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC ^CTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

AAAGGGGTGT CACTGTQTTT GCTGTGGGGG TCAGGTTTCC SgGGAG sSo 

gSSS CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGcS 960 

CCAACGGCCT CTTCAGCACC CTCAGCAGCT GGGCCATCTG CTCCAGCGCC 1020 

£££££ ^ GGGTCG ^ GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTCCTGGCT 11 In 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCAOC^ CA^S ^200 

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTCTT llsO 

CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAG3CTAAC lltl 

2S£?2£! V**™** GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 

r^S^ CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGOGG 1440 

CTGGTGGC^ TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 3^500 

15^2^ ^TACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 

^S££S£ TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 

CTCACTpIot n^fcSvvi^ CAGGACAGGC CAGGACGGGC CACGTAGAGT GGTGGTTTTG 1680 

^S? 3 * 07 CACACTCCGA GGATGAGGTT GCGGGOCCAG CGCGTCACGC AAGGGCGCGA 1740 

gSSS l^SZ ^ tGhG GCCGTGCGGG CAGAGCTGGA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

cSSS ( ^ GCAG «^<^GGT gSggS W20 

Sc??S CTCIGCCTCfc ^AGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC ?n*n 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC tSaCACCAA AC^CC^ 2^00 

arrrnnplr^ TGOGGGCCAT TAGCCAGGCC CXXTACCTAG GTGGGGTGGG CTCAG^OGGC 2160 

^? TCTA ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTT^r llln 

GOCGAGAAGC JGAGGAACAA TGGCATCTCT S?S ^SgS SccS 2 !o 

AGTGAGGCTC TGGGGAGGCT TGCAGGTCCC OGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 

rrarrrsRw GGTACCACCA GGAGGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTCCAGAAT 2520 

™ ~« ™ SS "2 

ATGGCTOTfYJ Tnr^^n^rt^ TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640 

^SSS^ AC CCOCCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760 

SgSS ^S^ CC ACTATTCTCA CTGAGGGAGG A^CcMa^CA^ 2 2 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTC5AAAAGTT 2880 

^^^^ ^I^ ACC ^^CTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 3000 

CGTTCCTTTG CACACAATCA ATGCTCXX^ GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

orarprlw TAGAGCATCC TTTGGAOGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120 

C^^T^ GAGACATT CT GGATGCATTT GCATTGAGTC TGAAAG^ 31sS 

OTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

ISaaI S^T™ GAGGGCCA0G TAAAATCGTT CTGAGTCGTG 3360 



A46 Protein sequence 
Gene name: 



ESTs 



851 



WO 03/042661 



PCT/US02/36810 



Unigene number: Hs. 157601 

Protein Accession #: none found 

Signal sequence: 1-17 

Transmembrane domains: none found 

VOW domains: 49-223; 341-518; 529-706 

EGF domains: 298-333; 715-748 

Cellular Localization: secreted 



J H 21 31 41 51 

I I t t I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV D1MFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQPSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REPAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGBA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRPV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
RTPPSNYREG LGTEMVPTFW NVCAPGP 



COLON 



A47 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession *h 
Coding sequence: 



ESTs, Weakly similar to JE0350 Anterior gradient -2 fH. sapiens] 

Hs. 100686 

AA487468 

AA487468 

55-S55 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I 1 

CGGCACCAAG AGCACTGGCC AAOTCAGCTT 
CTACACTCAG CTTTGGGTCT CTGCCTCTTA 
GCAATAAAAA AGGAAAAGAG GCCTCCTCAG 
ACTTGGGTAC AAACTTATGA AGAAGGTCTC 
ATGGTTATTC ATCACCTGGA GGATTGTCAA 
CAAAATGAAG AAATACAAGA AATGGCTCAG 
GAAACCACTG ATAAGAATTT ATCACCTGAT 
GACCCTTCTT TAACAGTTAG AGCTGACATA 
TATGAGCCTC GGGATTTACC CCTATTGATA 
CAGTCAGAGC T ATAAG AGAT GATAGAAAAA 
GAAGAAAACC TCTGGCACAT TGACAAATAC 
TTACTATTTA GTTTTTTTAA TGTGTTTGCA 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



31 41 51 

I I I 

CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

ACACTCTCAA GAGGATGGGG AGATGACATC 180 

TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

GAAAACATGA AGAAAGCATT AAGACTTATT 540 

AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 



A48 Protein sequence: 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Unigene number: Hs. 100686 

Probeset Accession AA487468 

Protein Accession #t none found 

Signal sequence t 1-23 

Transmembrane domains: none found 

Cellular Localization: secreted 



1 11 21 31 41 51 

I I I I I I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGKGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 



A49 DNA SEQUENCE 

Gene name: G protein-coupled receptor 56 
Unigene number: Hs.6527 
Probeset Accession #: AA478599 
Nucleic Acid Accession NM_005682 

Coding sequence: 163-2244 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I 1 I I 

CGGCAGCAGG GTCTCGCTCT GTCACACAGG CTGGAGTGCA GTGGTGTGAT CTTGGCTCAT 60 
CGTAACCTCC ACCTCCCGGG TTCAAGTGAT TCTCATGCCT CAGCCTCCCG AGTAGCTGGG 120 
ATTACAGGTG GTGACTTCCA AGAGTGACTC CGTCGGAGGA AAATGACTCC CCAGTCGCTG 180 



852 



WO 03/042661 



PCT/US02/36810 



CTGCAGACGA CACTGTTCCT GCTGAGTCTG CTCTTCCTGG TCCAAGGTGC CCACGGCAGG 240 

GGCCACAGGG AAGACTTTCG CTTCTGCAGC CAGCGGAACC AGACACACAG GAGCAGCCTC 300 

CACTACAAAC CCACACCAGA CCTGCOCATC TCCATCGAGA ACTCCGAAOA GGCCCTCACA 360 

GTCCATGCCC CTTTCCCTGC AGCCCACCCT GCTTCCCGAT CCTTCCCTQA CCCCAGGQGC 420 

CTCTACCACT TCTGCCTCTA CTGGAACCGA CATQCTGGGA GATTACATCT TCTCTATGGC 480 

AAGCGTGACT TCTTGCTGAQ TGACAAAGCC TCTAGCCTCC TCTGCTTCCA GCACCAGQAG 540 

GAGAGOCTGG CTCAGGGCCC CCCGCTGTTA GCCACTTCTG TCACCTCCTG GTGGAGCCCT 600 

CAGAACATCA GCCTGCCCAG TGCCGCCAGC TTCACCTTCT CCTTCCACAG TCCTCCCCAC 660 

ACGGCCGCTC ACAATGCCTC GGTGGACATG TGCGAGCTCA AAAGGGACCT CCAGCTGCTC 720 

AGCCAGTTCC TGAAGCATCC CCAGAAGGCC TCAAGGAGGC CCTOGGCTGC CCCCGCCAGC 780 

CAGCAGTTGC AGAGCCTGGA GTCGAAACTG ACCTCTGTGA GATTCATGGG GGACATGGTG 840 

TCCTTOGAGG AGGACCGGAT CAACGCCACG GTATGGAAGC TCCAGCCCAC AGCCGGCCTC 900 

CAGGACCTGC ACATCCACTC CCGGCAGGAG GAGGAGCAGA GCGAGATCAT GGAGTACTCG 960 

GTGCTGCTGC CTCGAACACT CTTCCAGAGG ACGAAAGGCC GGAGCGGGGA GGCTGAGAAG 1020 

AGACTCCTCC TGGTGGACTT CAGCAGCCAA GCCCTGTTCC AGGACAAGAA TTCCAGCCAA 1080 

GTCCTGGGTG AGAAGGTCTT GGGGATTGTG GTACAGAACA CCAAAGTAGC CAACCTCACG 1140 

GAGCCCGTGG TGCTCACTTT CCAGCACCAG CTACAGCCGA AGAATGTGAC TCTGCAATGT 1200 

GTGTTCTGGG TTGAAGACCC CACATTGAGC AGCCOGGGGC ATTGGAGCAG TGCTGGGTGT 1260 

GAGACCGTCA GGAGAGAAAC CCAAACATCC TGCTTCTGCA ACCACTTGAC CTACTTTGCA 1320 

GTGCTGATGG TCTCCTCGGT GGAGGTGGAC GCCGTGCACA AGCACTACCT GAGCCTCCTC 1380 

TCCTACGTGG GCTGTGTCGT CTCTGCCCTG GCCTGCCTTG TCACCATTGC CGCCTACCTC 1440 

TGCTCCAGGG TGCCCCTGCC GTGCAGGAGG AAACCTCGGG ACTACACCAT CAAGGTGCAC 1500 

ATGAACCTGC TGCTGGCCGT CTTCCTGCTG GACACGAGCT TCCTGCTCAG CGAGCCGGTG 1S60 

GCCCTGACAG GCTCTGAGGC TGGCTGCCGA GCCAGTGCCA TCTTCCTGCA CTTCTCCCTG 1620 

CTCACCTGCC TTTCCTGGAT GGGCCTCGAG GGGTACAACC TCTACCGACT CGTGGTGGAG 1680 

GTCTT TGGC A CCTATGTCCC TGGCTACCTA CTCAAGCTGA GCGCCATGGG CTGGGGCTTC 1740 

CCCATCTTTC TGGTGACGCT GGTGGCCCTG GTGGATGTGG ACAACTATGG CCCCATCATC 1800 

TTGGCTGTGC ATAGGACTCC AGAGGGCGTC ATCTACCCTT CCATGTGCTG GATCCGGGAC 1860 

TCCCTGGTCA GCTACATCAC CAACCTGGGC CTCTTCAGCC TGGTGTTTCT GTTCAACATG 1920 

GCCATGCTAG CCACCATGGT GGTGCAGATC CTGCGGCTGC GCCCCCACAC CCAAAAGTGG 1980 

TCACATGTGC TGACACTGCT GGGCCTCAGC CTGGTCCTTG GCCTGCCCTG GGCCTTGATC 2040 

TTCTTCTCCT TTGCTTCTGG CACCTTCCAG CTTGTCGTCC TCTACCTTTT CAGCATCATC 2100 

ACCTCCTTCC AAGGCTTCCT CATCTTCATC TGGTACTGGT CCATGCGGCT GCAGGCCCGG 2160 

GGTGGCCCCT CCCCTCTGAA GAGCAACTCA GACTGCGCCA GGCTCCCCAT CAGCTCGGGC 2220 

AGCACCTCGT CCAGCCGCAT CTAGGCCTCC AGCCCACCTG CCCATGTGAT GAAGCAGAGA 2280 

TGCGGCCTCG TCGCACACTG CCTGTGGCCC COGAGCCAGG CCCAGCCCCA GGCCAGTCAG 2340 

CCGCAGACTT TGGAAAGCCC AACGACCATG GAGAGATGGG CCGTTGCCAT GGTGGACGGA 2400 

CTCCCGGGGC TGGGGCTTTT GAATTGGCCT TGGGGACTAC TCGGCTCTCA CTCAGCTCCC 2460 

ACGGGACTCA GAAGTGCGCC GCCATGCTGC CTAGGGTACT GTCCCCACAT CTGTCCCAAC 2520 

CCAGCTGGAG GCCTGGTCTC TCCTTACAAC CCCTGGGCCC AGCCTCATTG CTGGGGGCCA 2580 

GGCCTTGGAT CTTGAGGGTC TGGCACATCC TTAATCCTGT GCCCCTGCCT GGGACAGAAA 2640 

TGTGGCTCCA GTTGCTCTGT CTCTCGTGGT CACCCTGAGG GCACTCTGCA TCCTCTGTCA 2700 

TTTTAAOCTC AGGTGGCACC CAGGGCGAAT GGGGCCCAGG GCAGACCTTC AGGGCCAGAG 2760 

CCCTGGCGGA GGAGAGGCCC TTTGCCAGGA GCACAGCAGC AGCTCGCCTA CCTCTGAGCC 2820 
G 



A50 Protein sequence 
Gene name: 
Uaigene number: 
Protein Accession #: 
Signal sequence: 
GPS domain: 
Pfam domain: 
Transmembrane domains: 
Cellular Localization: 



G protein-coupled receptor 56 

HS.6527 

NM 005682.1 

1-26 

342-394 

7tm_2 (400-6653 

410-432, 446-468, 482-504, 517-539, 
plasma membrane 



575-597, 608-630, 638-659 



1 11 21 31 41 51 

I I I I I I 

MTPQSLLQTT LFLLSLLFLV QGAHGRGHRE DFRFCSQRNQ THRSSLHYKP TPDLRISIEN 60 

SEEALTVHAP PPAAHPASRS PPDPRGLYHF CLYWNSHAGR LHLLYGKRDP LLSDKASSLL 120 

CFQHQEESLA QGPPLLATSV TSWWSPQNIS LPSAASPTPS FHSPPHTAAH NASVDMCBLK 180 

KDLQLLSQFL KHPQKASRRP SAAPASQQLQ SLESKLTSVR FMGDMVSFEE DRINATVWKL 240 

QPTAGLQDLH 1HSRQEEBQS EIMEYSVLLP RTLFQRTKGR SGEAEKRLLL VDFSSQALFQ 300 

DKNSS QVLG E KVLGIWQNT KVANLTEPW LTFQHQLQPK NVTLQCVFWV EDPTLSSPGH 360 

WSSAGCBTVR RETQTSCPCN HLTYFAVLMV SSVEVDAVHK HYLSLLSYVG CWSALACLV 420 

TIAAYLCSRV PLPCRRKPRD YTIKVHMNLL IAVFLLDTSP LLSEPVALTG SEAGCRASAI 480 

PLHPSU*TCL SWMGLEGYNI* YRLWEVFGT YVPGYLLKLS AMGWGPPIPL VTLVALVDVD 540 

NYGPIIIAVH RTPEGVIYPS KCWIRDSLVS YITNLGLPSL VPLFNMAMLA TMWQILRLR 600 

PHTQKWSHVL TLLGLSLVLG LPWALIFFSP ASGTFQLWL YLFSIITSPQ GFLIPIWYWS 660 
MRLQARGGPS PLKSNSDCAR LPISSGSTSS SRI 

A51 DNA SEQUENCE 

Gene name: Hypothetical protein PLJ20063 

Unigene number: Hs.5940 

Probeset Accession fh AA053660 

Nucleic Acid Accession tt: AA053660 

Coding sequence: 218-1360 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

11)11! 

CCCCCATGAC TTTGTACAGC TACTTCACTG CTTCCCCCCA ATTAGTACAC ATAGTTCCTC 60 

853 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 
20 
25 
30 
35 
40 
45 
50 
55 



CACAATTCCT 
ACCTACTGCT 
AATCACOGCT 
TAACAATGAA 
CGCTTTATTG 
TCCCTGTGCA 
AGAAGGGTAT 
TTCAGTGACA 
CTTGCATAGT 
GACTGTAATT 
CAAGTTTGTT 
GACTGTGACT 
TGATTTGACC 
TGGTTTAGCA 
CGTTGCTTCC 
AAAGAAGAGT 
TGGGAACTGC 
GCTGATCCTC 
ATTGATTGTC 
TGACGAAGAC 
AGGGAGOGTC 
CTATTCAAGA 
CGCCATGGCC 
GAAGTGAGCA 
CCTCTCTGAA 
TCTATGACAT 
TCAGTACAAA 
CAACAAGAAC 
CATTGCTCTT 
TCCGAGGGGA 
AAGTCAGAAT 
CAGCCC AGAG 
CTTTTTCAGG 
ACTAAAAATG 
GGCAGAGAGT 
AGGATCAATT 
CAGCAAAGGA 
AACAAAAAAT 
CAAAGGGTGC 
TTTCTGAGGT 
ACCAACACAC 
CAAGCTTTTA 
GTTATTTTAT 
AAGTGCCTTT 



ACACCTGCTC 
GCAGACAGTG 
TCATCTCCAA 
ATGTCCCCCA 
GAGACCAGCA 
GATAATTCGT 
TACTACAACT 
GTATCAGAAA 
GAAATTACTA 
CTTACTGTAA 
GATGTAACAA 
GAGAAAATTA 
CTTOGGTGTG 
TGCGATTGCA 
AGTCTCAAGT 
GGTGGGGCCC 
CAAAAGTGTG 
ACTATTGTGG 
ACAGCAAGAT 
TTTCAAAATC 
TTTCCTAAGG 
CACAGCAGCA 
CCCAACCAAT 
CCAGTAAAGA 
TGGAAGTTGT 
TAAATGTAGT 
GTACTGAGAC 
CATTTCCAAT 
GAGTTAAGTG 
TGAGAAGGGA 
AGGGACACTG 
CTTGCAACCT 
AGGCGTGCCT 
TCCACTATGG 
AAAAAACATG 
AGAGAGGAGG 
GCACTTCTCT 
CCAGGAGTAA 
GAGAGGGAGC 
GTGCCATTGG 
AACCACACAC 
CAAATGTTAT 
TTGTTATTAT 
GTTAGATGGT 



CCCCCATAAT 
AGTCAACCAC 
ATGATGGATT 
CCACAGAAGA 
CCCTAAACAG 
TATGTGTTAA 
CTTCTACATG 
CATTTGACCC 
GCTTGTTTAA 
GCACATCTCT 
TAGTAACAAT 
ATAAAGCAAT 
ATTATTATGG 
AATCTGACCT 
GTCCTGATGC 
CTGAGTGTGC 
CATTTGGCTA 
GCACCATCGC 
CAAATAACAA 
TAAAACTGCG 
TCAGGATAAC 
TGCCCCGCCC 
GTACAAGCTA 
TCTGGCCTCC 
GAATGTTTGC 
AGATGCTATT 
AATGGTTAGG 
CTAGAGGAAA 
ACCTAATTCC 
TACCCACCAT 
CTTCTATCCC 
AGCCTCACCC 
GGGAATCCAG 
GGTGCACTCT 
ACCTGGTAGA 
CACCTGGGAT 
AATCATGCCC 
GAGCCTTAGG 
TTGTGCTCAG 
GGCCTCAGCC 
ACACAACCAC 
TAGTGTCCTT 
TATTTGTTCT 
GAAAAAAAAA 



CAGTACACAT 
AAATGTAAAT 
■AATCACAATC 
CAATCAATCA 
CACAGGTCCC 
GCTGCATAAT 
TAAGAAAGGA 
AGAAGAGAAA 
AGATGTATTT 
GTCACCAAGA 
TTTGGCAGAA 
TAGAAGTAGC 
CTGTAACCAG 
GCAAAGGCCT 
CTGCAACGCA 
GTGCGTGCCC 
CAGTGGACTC 
TGGCATTGTC 
AACGAAGCAT 
GTCGACAGGC 
GGCCTCCAGA 
TGACTA TTAG 
TTATTTAGAG 
GGGGTTTTTC 
AACGAATCCA 
AGCGCTTGTC 
GTTGTTTTCT 
GCTCCCCAGC 
CCTGGGAGAC 
CTTTCAAGGG 
TCCAATGGAG 
AAGAAGACTG 
GAACTTTTTG 
ACAGTTTTTG 
AGGAAGAGAG 
CCACCTTCTT 
TCCOGAAGAC 
TCAGTTTGAA 
GAGTCCAGCC 
TTCTCTGGTG 
ACACACAAAT 
TTTTATTTCT 
TGACTGTTAA 
AAAAAAAAAA 



AGTTCCTCCA 
TCATTAGCTA 
GTri X TllTG 
TCAGGGCCTC 
AGCAATCCTT 
ACAAGTTTTT 
AAGGTATTCC 
CATTCCATGG 
GGCACATCTG 
TCTGAAATGC 
ACCACAAGTG 
TCAAGCAACT 
ACTGCGGATG 
AACCCACAGA 
CAGCACAAGC 
GGCTACCAGG 
GACTGTAAGG 
ATTCTCAGCA 
ATTGAAGAAG 
TTCACCAATC 
GACAGCCAGA 
AATCATAAGA 
TGTTTAGAAA 
TTCCATCTGA 
GCTCACTTGC 
AGAGAGGTGG 
TAATTCTTTT 
ATTGCTTGCT 
ATACGCATCA 
TCACAAGCTC 
AGATTCTGGC 
GAAAGAGACA 
ATGCTAATTA 
AAATGCTAGG 
GCAAAGGAAA 
CCTTAGGTCC 
TGGCTGGGAG 
ATTGGAGACA 
GTCCAGCCTC 
ACAGAGGCTC 
GGGGGCAACC 
AATGCCTTGT 
TTGTGAATGG 
AAAAAAAAAA 



CAATTCCTAT 
CCTCTGACAT 
AAACACAAAG 
CCACTGGCAC 
GCCAAGATGA 
GCCTGTGTTT 
CTGGGAAGAT 
CCTATCAAGA 
TTTATGGACA 
GTGCTGATGA 
ACAATGAGAA 
TTCTAAACTA 
ACTGCCTCAA 
GCCCTTTCTG 
AATGCTTAAT 
AAGATGCTAA 
ACAAATTTCA 
TGATAATTGC 
AGAACTTGAT 
TTGGAGCAGA 
TGCAAAATCC 
ATGTGGAACC 
GACTGATGGA 
CATCTGCCAG 
TAAATAAGAA 
TTTTCTTCAA 
OCTGGTAGGG 
CCTGGGCAAA 
ACTGTGGAGG 
ACTCTCTGAC 
AACCTTTGAA 
TATCTCTCAG 
GAAGGCCTGG 
AGGCAGAAGG 
CTGGGTGGGG 
CCTCCTCCAT 
AAGGTTTAAA 
AACTGTCTGG 
GGGGTGTAGG 
AGCTGTGGCC 
ACATCCAGTA 
CCTCTTAAAA 
TAATGCAATA 
A 



120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



A52 Protein sequence: 
Gene name: 
Uhlgene number: 
Probeset Accession 9: 
Protein Accession §: 
Signal sequence: 
Transmembrane domains: 
EGF domain: 
SKA domain: 

Cellular Localization: 



Hypothetical protein PLJ20063 

Hs.5940 

AA053660 

none found 

none found 

289-311 

45-74 

80-196 

plasma membrane 



60 
65 
70 
75 
80 



1 
I 

MVPSETQSNN 
NTSFCLCLEG 
FGTSVYGQTV 
SSSNPLNYDL 
AQHKQCIiIKK 
VILSMIIALI 
RDSQMQNPYS 



11 
I 

EMSPTTEDNQ 
YYYNSSTCKK 
ILTVSTSLSP 
TLRCDYYGCN 
SGGAPECACV 
VTARSNNKTK 
RHSSMPRPDY 



21 
I 

SSGPPTGTAL 
GKVPPGKISV 
RSEMRADDKP 
QTADDCLNGL 
PGYQEDANGN 
HIEEENLIDE 



31 
I 

LETSTLNSTG 
TVSETFDPEE 
VDVTIVTILA 
ACDCKSDLQR 
CQKCAFGYSG 
DFQNLKLRST 



41 



51 
I 



PSNPOQDDPC ADNSLCVKLH 60 

KHSMAYQDLH SEITSLPKDV 120 

ETTSDNEKTV TEKINKAIRS 180 

PNPQSPFCVA SSLKCPDACN 240 

LDCXDKFQLI LTIVGTIAGI 300 

GFTNLGAEGS VFPKVRITAS 360 



A53 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession # : 
Nucleic Acid Accession 8 : 
Coding sequence: 



THPRSS3a 
Bo. 298241 
AIS38613 
AB038157 

202-1S66 (underlined sequences correspond to start and stop codons) 



11 
I 



21 



31 41 51 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 

854 



60 
120 
180 
240 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TCATTCCGAT 
GATGCTGTTG 
ATTGGGATCA 
TCAGGGAAGT 
GGAGTCTOGG 
AATGCCGTGC 
AAGGGTCACT 
GATAACCTCA 
CACCTCTTGC 
TGTGCCTCTG 
AGCTCACGCA 
CTTCAGTTCC 
ACTGCTGCAC 
CTAGTTTCCC 
AGCAAGTACA 
CTCACGTTCA 
GATGGAAAAG 
TCCCCTGTCC 
GACGTGTACG 
GTGGACAGCT 
AAGTTAGTGG 
TACACCCGTG 
ACCTGAAGAG 
TCCTCCCCTG 
TTCCGGCACC 
CAAGCTGCTT 
AGTGCAGTGG 
TTGCCTCAGC 
TTGTATTTTT 
CCTCAAATGA 
AOGCCTAGCC 
AGGGCGGCCT 
ACGAGATAAG 
GCACCAGCCC 
CCAAACCCAC 
ATGACTCGTT 
TTGTCCTTTG 
AAAAAAAA 



CGCTTTTTGG 
CTGCACAGAT 
TTGCATTGAT 
ACAGATGTCG 
ATTGCAAAGA 
TCCAGGTGTT 
ACGCAAATGT 
GAGTGAGCTC 
CAGATGACAA 
GCCACGTGGT 
TCGTGGGTGG 
AGGGCTACCA 
ACTGTGTTTA 
TGTTGGACAA 
AGCCAAAGAG 
ATGAAATGAT 
TGTGCTGGAC 
TGAACCACGC 
GTGGCATCAT 
GCCAGGGGGA 
GAGCGACCAG 
TCACCTCCTT 
GAAGGGGACA 
GACTCCCGTG 
AGTAGCAGGC 
TTTGTTTTTT 
CGAAATCCCT 
TTCCCCAGTA 
AGTAGAGACA 
TGTGCCTGCT 
TCACGCTCCT 
TTCCCACTGG 
CAGTTATGTG 
AGAAGTGCAG 
CCTTTCTACT 
TAAGGCCTAT 
ATTCCAAATA 



CCTTGATGAT 
CCTGTCACTG 
ATTAGCACTG 
CTCATCCTTT 
CGGGGAGGAC 
CACAGCTGCT 
TGCCTGTGCC 
GCTGGAGGGG 
GGTGACTGCA 
TACCTTGCAG 
AAACATGTCC 
CCTGTGOGGG 
TGACTTGTAC 
TCCAGCCCCA 
GCTGGGCAAT 
CCAGCCTGTG 
GTCAGGATGG 
GGCCGTCCCT 
CTCCCCCTCC 
CAGCGGGGGG 
CTTTGGCATC 
CCTGGACTGG 
AGTAGCCACC 
TAGGAACCTG 
CCGAAAGAGG 
GTTTTTTTGA 
GCTCACTGCA 
GCTGGGACCA 
GGGTTTCACC 
TCAGCCTCCC 
TTCTGATCTT 
TCCATCTGGT 
ACCTCACGTG 
AACTGCAGTC 
TCCAAGACTT 
TTTCATGATT 
ATATGTTTCC 



TTGAAAATAA 
CTGCCATTGA 
GCCATTGGTC 
AAGTGTATCG 
GAGTACCGCT 
TCGTGGAAGA 
CAACTGGGTT 
CAGTTCCGGG 
TTACACCACT 
7GCACAGCCT 
TTGCTCTCGC 
GGCTCTGTCA 
CTCCCCAAGT 
TCCCACTTGG 
GACATCGCCC 
TGCCTGCCCA 
GGGGCCACAG 
TTGATTTCCA 
ATGCTCTGCG 
CCCCTGGTGT 
GGCTGCGCAG 
ATCCACGAGC 
TGAGTTCCTG 
CACACGAGCA 
CACCCTTCCA 
GGTGGAGTCT 
GCCTCCGCTT 
CAGGTGCCCG 
ATGTTGGCCA 
ACAGTGCTGG 
CACTAAGAAC 
TTTCTCTCCA 
CAAAGCCACC 
ACTGCACGTT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTCAAA 



GTCCTGTTGC 
AGTTTTTTCC 
TGGGCATCCA 
AGCTGATAGC 
GTGTCOGGGT 
CCATGTCCTC 
TCCCAAGCTA 
AGGAGTTTGT 
CAGTATATGT 
GTGGTCATAG 
AGTGGCCCTG 
TCACGCCCCT 
CATGGACCAT 
TGGAGAAGAT 
TTATGAAGCT 
ACTCTGAAGA 
AGGATGGAGC 
ACAAGATCTG 
CGGGCTACCT 
GTCAAGAGAG 
AGGTGAACAA 
AGATGGAGAG 
AGGTGATGAA 
GACACCCTTG 
TCTGATTCCA 
CGCTCTGTTG 
CCCTGGTTCA 
CCACCACACC 
GGCTGCTCTC 
GATTACAGGC 
AAAAGAAGCA 
GGGGTCTTGC 
AACAGCCACT 
TTCATCTCTA 
GTGGGGAGGT 
ATTTGGTGCT 
AAAAAAAAAA 



ACCAGATGCA 
AATCATCGTC 
CTTCGACTGC 
TCGATGTGAC 
GGGTGGTCAG 
CGATGACTGG 
TGTGAGTTCA 
GTCCATOGAT 
GAGGGAGGGA 
AAGGGGCTAC 
GCAGGCCAGC 
GTGGATCATC 
CCAGGTGGGT 
TGTCTACCAC 
GGCCGGGCCA 
GAACTTCCCC 
AGGTGACGCC 
CAACCACAGG 
GACGGGTGGC 
GAGGCTGTGG 
GCCTGGGGTG 
AGACCTAAAA 
GACAGCCCGA 
GAGCTCTGAG 
GCACAACCTT 
CCCAGGCTGG 
AGCGATTCTC 
CAACTAATTT 
AAACCCCTGA 
ATGGGCCACC 
GCAACTTGCA 
AAAATTCCTG 
CAGAAAAGAC 
GGGACCAGAA 
TAATCTAGGA 
TGACGTATTA 
AAAAAAAAAA 



AS 4 Protein sequence: 
Gene name: 
Unigene number : 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Tryp_SPc domain: 
Cellular Localization: 



TMPRSS3a 

Hs. 298241 

AI538613 

BAB20077 

none found 

43-65, 239-261 

216-444 

plasma membrane 



MGENDPPAVE 
LALAIGLGIH 
TAASWKTMCS 
VTALHHSVYV 
LCGGSVITPL 
LGNDIALMKXi 
AVPLISUKIC 
PGIGCAEVNK 



11 
I 

APFSPRSLFG 
FDCSGKYRCR 
DDWKGHYANV 
REGCASGHW 
WIITAAHCVY 
AGPLTPNEMI 
NHRDVYGGII 
PGVYTRVTSF 



21 
I 

LDDLKISPVA 
SSPKCIBLIA 
ACAQLGFPSY 
TLQCTACGHR 
DLYLPKSWTI 
QPVCLPNSEE 
SPSMLCAGYL 
LDWIHEQMER 



31 
I 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVSS 
RGYSSRIVGG 
QVGLVSLLDN 
NPPDGKVCWT 
TGGVDSCQGD 
DLKT 



41 

I 

LSLLPLKFFP 
GEDEYRCVRV 
LEGQFREEFV 
NMSLLSQWPW 
PAPSHLVEKI 
SGWGATEDGA 
SGGPLVCQER 



51 

I 

lIVIGIIAIiI 
GGQNAVLQVF 
SIDHLLPDDK 
QASLQFQGYH 
VYHSKYKPKR 
GDASPVLNHA 
RLWKLVGATS 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920' 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 
240 
300 
360 
420 



A55 DNA SEQUENCE 

Gene name: Putative G protein- coupled receptor GPCR150 
Unigene number: Hs. 97101 

Probeset Accession #: AA215333 
Nucleic Acid Accession #: KM 014373 

Coding sequence: 322-1338 {underlined sequences correspond to start and stop codons) 



I 

GTGGCCTCGA 
GCCTCCGGAG 
ATGTCTCCGA 
AAATAACATA 
GACCCACTGG 
TCTTGAAATT 
CAGTTACGTC 
GGGAAAATAT 
TTTATGGAAT 
TCCATTATAT 
ATCTGCCTAT 
CTGACAGCTT 
TGTCAAAAAT 



11 
I 

GGTGGTGGCA 
GCAGGTTCGG 
GCTTACTCAC 
ATTGAAGGCA 
AGAGGACAGA 
TAACTAAAAA 
AAACAAACCA 
TATTAAATAT 
ATTTTTGCAT 
TGTATTTCAG 
TTACTCAAAT 
GTATAGATTA 
TATTTTATTT 



21 
I 

GGGCOGCCCC 
CTGGAAGGAA 
ATAGCATATT 
GTAAAAGTGA 
AAATGAAGCA 
TATGACTGCT 
GCCCCTAGAC 
CCTTACACTA 
TTCACTAGCA 
GGATTTTGTA 
TATTTCCTTT 
TTGCCTGAAT 
CTTTACAGTA 



31 
I 

CTGCAGTCCG 
CCGCTCTCGC 
GGTATATCAA 
AATTAAATAG 
GTGTTTTATC 
CTCTCTTCAG 
GTTAACTATC 
GGAATGAGAA 
TTCGTTGATC 
CTTTTAAGCA 
ACTTATGGCT 
TTCTCTAAAA 
ATTTTAATTT 



41 
1 

GAGACGAACG 
TTCGTCCTAC 
AATGAAATGC 
GAAGATCATC 
ATGTGTATTT 
AGAACTGCTC 
TGCTATTCTT 
GAAAAAACAC 
TTTTACTTTT 
TTAGGTTCAC 
TTTTGCATTA 
CAACCAAGCT 
GGATTTCAGT 



51 
I 

CACGGACOGG 
ACTTGCGCAA 
AAGGAACCAA 
AGTCAAGGAA 
CAGCAGGTCT 
TTTTCAGTAC 
GATCATACTT 
CTGTCAAAAT 
GGTAAACATT 
TAAATACCAC 
TCCAGTTTTC 
TTCATTTAAG 
CCTTGCTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



855 



WO 03/042661 
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GTTTTGGGAG ACCCAGCCAT CTACCAAAGC 
TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC 
TTTGTAGCTT TCATAACCTG TTGGGAAGAA 
ACTTCCTATA TGAATGAAAC TATCTTATAT 
GTGAGATCTA AAAAAATATT CTTATCCAAG 
CCATTTGTAC TACTTCAGGT AATCATTGTT 
GAGATGAATA TTCCCTGGTT ATACTTTGTC 
TTTAATTGTC ACAAGCTTAA TTTAAAAGAC 
TGGAAGTGCT GCTTCATTCC ACTTACAATT 
TCAATAATGA TTTGTTAATA TTATTAATTA 
TATGAACAGA AAGAACTCAG GACATATTAA 
CCCTGACTGA TAGCATTTCA GAATGTGTCT 
GTTTTATTTT AAAAACAAAA TAATTCCAAG 
TTACAAATAT TACTTTGTTA TTAACACAAA 
GATGTTTGTG TTACTCAAAA AAACTACTGG 
ACTGACAACT TTAAGATATC AACCTAAACA 
AAAAAAAAA 



CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

GTTACTACTT TGGTACAGGC TATCAGGATA 960 

TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080 

TTACTTAAAG TTCAGATTCC AGCATATATT 1140 

AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

AAAGTTACAG CTGTCATAAG ATCATAATTT 1380 

AAAATAAACT GAACTAAAAC AACTTTTGCC 1440 

TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

AAGTGATAAG AGTTAACATT TGGCTATACT 1620 

ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 



A56 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



Putative G protein- coupled receptor GPCR150 
Hs. 97101 
NP_055188 
none found 

23-45, 59-81, 97-119, 138-160, 184-206, 241-263, 276-297 
plasma membrane 



1 H 21 31 41 51 

I I I I I I 

MTALSSEKCS FQYQLRQTNQ PLDVNYLLFL IILGKILLNI LTLGMRRKNT CQNFMEYFCI 60 
SLAFVDLLLL VNISIILYFR DPVLLSIRFT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 120 
CLNFSKTTKL SPKCQKLFYF FTVILIWISV LAYVLGDPAI YQSUCAQNAY SRHCPPYVSI 180 
QSYWLSFFMV MILPVAPITC WBEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 240 
LSKLIVCPLS TWLPPVLLQV IIVLLKVQIP AYIEMNIPWL YPVNSPLIAT VYWFNCHKLN 300 
LKDIGLPLDP FVNWKCCPIP LTIPNLEQIE KPISIMIC 



A57 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha l(III) chain precursor [H. sapiens] 

Unigene number: Hs.19322 
Probeset Accession #: AA088458 

Nucleic Acid Accession #: AA088458 

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGAOGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCOCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 



856 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TGGGGOATCC 
GGTGACTTCA 
GAGACAGGCT 
AAAGAAATAG 
CACGAGGGGA 
GCAGACCCTG 
GAGCAGCGTC 
GCGTGCACAC 
CAGAAGTGTC 
TTTTGTGTTG 
CTGGAATCCC 
CCCCATCTCT 
TCATAAACAC 
TAGACCCAGA 
AGAAATAAAA 



TGGCATCTTT 
TCAGGAGACC 
GGCACCTCOG 
GTCCTCCCAO 
GAATTTAAAG 
CCTGGAGCCT 
CCTGGGCTCT 
TGTGATGACA 
CCCAGTTGAG 
ATCAAGTTCC 
AGCACTTGAG 
ACAARAAAAA 
CACAAGGAAA 
TACTAGAATT 
GAGATTTCTG 



ACTGGACTGG 
GCCCACATAG 
GAAAAACTGC 
TTTACAGCTT 
GCCCCGGCTG 
GCCCTAGGAC 
ATCCGCGAGG 
CCCGGAAATQ 
AATCTGCCCC 
AAGGAAAAGG 
GCCAGGAGTT 
AAAAAGAAAG 
CAATACACTA 
ATCAGAGAGA 
GAAACATGAA 



AAGCAGGAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 
GCAGGGTCTA 
GCTGGGCGGG 
TGCCAGTAGC 
TCTCAGGATG 
AGAGGAACAC 
AACATCTCAG 
CCAGAGCAGC 
AAAGAAAATG 
TGAGACCCAG 
ATATAAAGTA 



CAGAACAGTG 
CGCAGCTGAA 
TGGTGTTCOG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG 
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
CCGGGCGTGG 
CTGGGCAACG 
AGAQATCCAG 
CAGAAGCAAC 
ACAGTGTTTT 



TCTGTCCCGG 
GCGGAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGG 
GGCCTCAGGA 
TGGTTCACGC 
CAGTGAGAGA 
GTTTAAAAAT 
AGATTGACTC 
ATATATCTAA 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 



A5B DNA sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession fh 
Coding Sequence: 



ESTs 

Hs. 157601 

W07459 

AC005383 

32 B -2751 (underlined sequences correspond to start and stop codons) 



1 
I 

GACAGTGTTC 
TTTTATTTGC 
CCTGGOGGTA 
ACAAACAGGT 
CCCCCTGGCC 
TCGCCGCTCT 
GTTTTCCTGT 
GAAACCATCG 
ATCATGTTTC 
CACTTTGCCA 
GCATTCCAGT 
CAGGAAGTGA 
CTTGCTCTGA 
CAGATCCTCA 
CAGCTGAAGG 
GAGCTGCATG 
GAGGATGCCA 
ACGCCAGACT 
GAGTTCGCTG 
GCACACTGTC 
AGGACCACCT 
CCAGAAGGAC 
TOTGCCCTGA 
GCGGGCACCA 
GCCGTGCTGA 
CTGGTGGOGG 
GGCATTCCCT 
CGTGGCTTCG 
CTCACTGAGT 
GAGCTGCTCC 
GGCAGCCCAA 
GAGCTGCAGG 
CTCGTCTTCA 
AGCTTTGTGA 
CTGGTGGTGT 
GCTGCGATGC 
ACCGCCCTGC 
GTCCCCAAAG 
GCCCAGAAGC 
AGTGAGGGTC 
GCCGACCTGC 
CCAGTCAACC 
GGGAGCTACC 
TGGAGCTCTT 
ATGGCTCCCG 
GGCACTGAAA 
TTCCCGCCGT 
ATGCTGCTTA 
TTGATGTGTA 
CTGCCACCTT 
GGTTCCTTTG 
AGGCCTTTAC 
GCAGCTTTTC 
CTTGAGGGAC 
GGTCTCAGAC 
TGTGCATGGG 
ACCTTQAAGG 



11 
I 

GCGGCTGCAC 
AGACCTGGGC 
GTTCCTCCGA 
GTCCCACGTG 
CGAGCCGCGC 
CCTTCCGTTA 
TTTCCAGAGT 
GGAAGATTTC 
TGTTAGATGG 
TCACAGTCTG 
TCAGTTCCAC 
AGGCAAGAAT 
AATACCTTCT 
TCATCGTCAC 
AAAGGGGTGT 
CACTGGCCAG 
CCAACGGCCT 
GCAGGGTCGA 
GCAATGCCCC 
CCTTCTACAG 
GCCCAGGCCC 
TGGACGGCTA 
AGCTGAGCCT 
CTCTGGACGG 
GCGAGGACTC 
TGCCTGTGGG 
TCCGTGGTGG 
GGAGCGCCAC 
CACACTCCGA 
TGCTGGGTGT 
AGCATGTGAT 
GGAAGCTGTG 
TGTTGGACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TGCGGGCCAT 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
7GCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCTAC 
GGCCAGGACC 
GAGACAAGAA 
AGTAAATACC 
TCCCTTGAGG 
CACACAATCA 
TAGAGCATCC 
CACTTCCCCA 
GTTTGTGACT 
TGAATGTGAC 
CCCAGGTCTG 
TCTTC 



21 
I 

CGCTCGGAGG 
CGATGCCGCT 
CCTCAGCCGG 
GCAGCCGCGC 
CCGGGTCTGT 
TATCAACATG 
GCCCCCATCT 
AGCTGCCAGC 
GTCTAACAGC 
TGACGGTCTG 
TCCTCATCTG 
CAAGAGGATG 
GCACAGAGGG 
TGATGGGAAG 
CACTGTGTTT 
CGAGCCTAGA 
CTTCAGCACC 
GGCTCACCCC 
ATGCTGGAGA 
CTGGAAGAGA 
CTGTGACTCG 
CCAGTGCCTC 
GGAATGCAGG 
CTTCCTGCGG 
TCGGGCCCGA 
GGAGTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTCG 
CAGCOGGCAG 
CTCTGCCTCA 
CCTCCAGTTT 
GGTGCAGACT 
TAGCCAGGCC 
TGACAAAGTG 
GCTCACAGGC 
TGGCATCTCT 
TGCAGGTCCC 
GGACGTGCTC 
CAGCCCGTGC 
TCGGGATGGC 
TGTGAGCCAG 
CAGCAGCCGT 
CTTCTGGAAT 
ACTATTCTCA 
AGCAGCTGAT 
CACTTTCTGT 
ATAAACAAGG 
ATGCTCGCCA 
TTTGGACGGC 
GAGACATTCT 
TCTTGGCGAC 
CAATTAACCA 
GAGGGCCACG 



31 

I 

CTGGGTGACC 
TTAAAAAACG 
GTCGGGTCGT 
CCCGGGCGCC 
GAGTAGAGCC 
CCCCCTTTCC 
CTCCCTCTCC 
AAAATGATGT 
GTCGGGAAAG 
GACATCAGCC 
GAATTCCCCT 
GTTTTCAAAG 
TTGCCTGGAG 
TOCCAGGGGG 
GCTGTGGGGG 
GGGCAGCACG 
CTCAGCAGCT 
TGTGAGCACA 
GGATCGCGGC 
GTGTTCCTAA 
CAGCCCTGCC 
TGCCCGCTGG 
GTCGACCTCC 
GCCAAAGTCT 
GTGGGTGTGG 
GATGTGCCTG 
ACGGGCAGTG 
CAGGACCGGC 
GGGGGCCCAG 
GCCGTGCGGG 
GATCCTCAGG 
CGGCCAGGGT 
GTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC 
CCCTACCTAG 
ATGACCGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 
CTGAGGGAGG 
GTCACCCACA 
ACCTGCTGTG 
GGTCCTGAAG 
GAATGTTGTT 
GAAGGCCACG 
GGATGCATTT 
TGCCTTTTGT 
GCTTGGTTGA 
TAAAATCGTT 



41 
I 

CGCGTAGAAG 
CGAGGGGCTC 
GCCGCCCTCT 
CCTCCTGTGA 
GCCCGGGCAC 
TGTTGCTGGA 
AGGAAGTCCA 
GGTGCTCGGC 
GGAGCTTTGA 
CCGAGAGGGT 
TGGATTCATT 
GAGGGCGCAC 
GCAGAAATGC 
ATGTGGCACT 
TCAGGTTTCC 
TGCTGTTGGC 
CGGCCATCTG 
GGACGCTGGA 
GGACCCTTGC 
CCCACCCTGC 
AGAATGGAGG 
CCTTTGGAGG 
TCTTCCTGCT 
TCGTGAAGCG 
CCACATACAG 
ACCTGGTCTG 
CCTTGCGGCA 
CACGTAGAGT 
CGCGTCACGC 
CAGAGCTGGA 
ATCTGTTCAA 
GCCGGACACA 
AGAATTTTGC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
AGAGGGGTGC 
CAGAGGATGC 
TGGGCGTGGG 
TGATCCACGT 
TGTGTGGAGA 
GCAGCTGCGT 
CCCACTGOGA 
TTGAGACGCC 
GCAACTACAG 
CAGGTOCTTA 
AGGATGTCCC 
AACGATGTTG 
CCTTGTTGAG 
ACTTAAATTT 
GACACAGTAA 
GCCTTTCAAG 
GCATTGAGTC 
GTGTGGAAGA 
TGATGGGGGA 
CTGAGTCGTG 



51 
I 

TGAAGTACTT 
TATGCACCTC 
CCCAGGAGAG 
TCCCGTAGCG 
CGAGCGCTGG 
GGCCGTCTGT 
TGTAAGCAAA 
TGCAGTGGAC 
AAGGTCCAAG 
CAGAGTGGGA 
TTCAACCCAA 
GGAGACGGAA 
TTCTGTGCCC 
GCCATCCAAG 
CAGGTGGGAG 
TGAGCAGGTG 
CTCCAGCGCC 
GATGGTCCGG 
GGTGCTGGCT 
CACCTGCTAC 
CACATGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 
CAGGGAGCTG 
GAGCCTCGAT 
GGCGGCAGAG 
GGTGGTTTTG 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCT 
AGCCCTGGAC 
TCAGATGCAG 
ACAGGTCGGC 
ACCCACCOGG 
CTCAGCCGGC 
CCGGCCTGGT 
AGCCGTTCCT 
GCCTGTCCTA 
GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACCGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
GAATGTCTGC 
AACTGCAGCC 
TTGAAAAGTT 
GCTATGTCAT 
AGCGGOCTGA 
TGCCCAGCAG 
ATGGAAAGCA 
TGAAAGGGGG 
GACTTGGAAA 
GGGGCTGAGT 
AGCAGTGTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



857 



WO 03/042661 
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AS 9 Protein sequence 

Gene name: ESTs 

Unigene number: Hs. 157 601 

Protein Accession ft: none found 

Signal sequence: 1-17 
Transmembrane domains: none found 
VGW domains: 49-223; 341-518; 529-706 
EGP domains: 298-333; 71S-748 

Cellular Localization: secreted 

1 11 21 31 41 51 

I I I I i I 

MPPFLLLEAV CVFLPSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAPQPSSTPH LBFPLDSFST QQEVKARIKR 120 

MVFKGGRTBT ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRPPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPOCRVBAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGBA KCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRPV RAVLSEDSRA RVGVATYSRE LLVAVPVGBY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSPVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
RTPPSMYREG LGTEMVPTFW NVCAPGP 

A60 DMA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs. 63325 

Probeset Accession ft: AA411502 
Nucleic Acid Accession ft: NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I 1 1 ! I ! 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TOGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGOCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTCCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGO GGGAAGTCAC OGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

A61 Protein sequence: 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs. 63325 

Probeset Accession ft: AA411502 

Protein Accession ft: NP_057509 

Signal sequence: none found 

Transmembrane domains: 31-53 

LDLa domain: 54-94 

TrypjBPc domain: 204-429 

Cellular Localization: plasma membrane/ER 

1 11 21 31 41 51 

I I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMBTPR KVGIPIIIAL LSLASIIIW VLIKVILDKY 60 

YFLCGQPLHF IPRRQLCDGE LDCPLGEDEE HCVKSPPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQM8YSSKPT FRAVBIGPDQ OLDWEITBN SQBLRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSHPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCPRKH TDVFNWKVRA GSDKLGSPPS LAVAKIIIIE PNPWPKDND IALMKLQFPL 300 

TPSGTVRPIC LPFPDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAOIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSHGYGC GGPSTPGVYT 420 
KVSAYLNWIY NVWKABL 

858 



WO 03/042661 



PCT/US02/36810 



A62 DNA SEQUENCE 

Gene name: ESTS; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR (H. sapiens] 

Unigene number: Hs. 105484 

Probeset Accession «: AA314779 

Nucleic Acid Accession ft: none found 

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGGAGGA AGATGGCTTC CAGAAGCATG 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TCCTGGGTGA TATCATCATG 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATTGCTATGG TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACQGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTGGCTAT 360 

CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420 

TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 

AACAAGCGCC AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 600 

AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 

GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCC TTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A63 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence! 
Transmembrane domains: 
C-type lectin domain: 
Cellular Localization: 



ESTs; Weakly Similar to LITHOSTATHINE 1 BETA PRECURSOR (H. sapiens] 

Hs. 105484 

AA314779 

none found 

1-22 

none found 

47-156 

secreted 



1 11 21 31 41 51 

I I I I I I 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYPRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 
KSMGGNKHCA EKSSNNNFLT WSSNECNKRQ HFLCKYRP 



60 
120 



A64 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



Cadherin 3, P-cadherin (placental) 

HS.2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



1 
I 

GCGGAACACC 
TCCCTCGTGG 
CCTCCGAGCC 
CGGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCCA 
TCATCGTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGACCCACA 
CCAGTGGCCT 
TGGATGGGGA 
ACAATGCTCC 
GCCATGAGGT 
GTGCCACCTA 
CTGAGAGCAA 
AGCACACCCT 
CCACAGCCAC 
CCTCCAAAGT 
CTGCAGAAGA 



11 

I 

GGCCCGCCGT 
ACCTCTCGCG 
GTGCCGGGCG 
GCCCGGCCAG 
TAGCACTGAT 
ACTGAAGGAA 
GAGAGATTGG 
GAGACTGAAT 
GGGGCCGGGG 
GTTGTTGTTG 
CGCTGTGTCA 
CGACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
CGACCTCATG 
GGACCGGGAA 
CGGCTCCACC 
CATGTTTGAC 
GCAGAGGCTG 
CCTTATCATG 
CCAGGGCATC 
GTACGTTGAA 
CATAGTGGTC 
CGTTGAGGTC 
CCCTGACAAG 



21 
I 

CGCGGCAGCT 
TCTCTCCTCC 
GTCTTCAGGG 
GCGCTGGGGA 
AATGATGACT 
AGGAATCCAT 
GTGGTTGCTC 
CAGCTCAAGT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGTG 
GACCACAAGC 
CCAGGTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACGGCAG 
CCCCAGAAGT 
ACGGTCACTG 
GGCGGTGACG 
CTGACAACCA 
GTGACCAACG 
CACGTGGAGG 
CAGGAGGGCA 
GAGAATCAAA 



31 
I 

GCTTCACCCC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTGTGCG 
TGAAGATCTT 
CAATATCTGT 
CTAATAAAGA 
CCCCTGAGGG 
TGGACCGGGA 
CCTCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 
ACCGGAGCAC 
AGTACACACT 
TGGCAGTAGT 
ACGAGGCCCA 
ATCTGGACGC 
ACGGGGACCA 
GGAAGGGTTT 
AGGCCCCTTT 
ATGTGAATGA 
TCCCCACTGG 
AGATCAGCTA 



41 

i 

TCTCTCTGCA 
TTGCTGGCTG 
GACCTTGGAG 
GGGCTGCCCT 
GAATGGCGAG 
CCCATCCAAA 
CCCTGAAAAT 
TAGAGACACC 
TGTCTTCGCT 
GGAGATTGCC 
GGACCCCATG 
CCAGGACACC 
GGTGACAGCC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
CCCCAACTCA 
TTTTACCATC 
GGATTTTGAG 
TGTGCTGAAG 
GGCACCTGTG 
GGAGCCTGTG 
CCGCATCCTG 



SI 
I 

GCCATGGGGC 
CAGTGCGCGG 
GCGGGAGGCG 
GGGCAAGAGC 
ACAGTCCAGG 
CGTATCTTAC 
GGCAAGGGTC 
AAGATTTTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCCGAGGGA 
ACAGATGAGG 
CAAGAACCAA 
AGCGTCATCT 
GCCACAGACA 
GATGCCAATG 
AATGCAGTGG 
CCAGCGTGGC 
ACCACCCACC 
GCCAAAAACC 
CTCCCAACCT 
TTTGTCCCAC 
TGTGTCTACA 
AGAGACCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080- 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



859 



WO 03/042661 



CAOGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGQAACAACA TCTATQAAGT CATGGTCTTQ 6CCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGGACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 , 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 19B0 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGOGGAAGA 2100 

TGAAGGAGCC CCTOCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACOCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTGGACTATG 2400 

AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGOGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC OCTGTTGOGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATOGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A65 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domain: 

Cellular localization: 



Cadherin 3, P-cadherin (placental) 

Hs.2877 

X63629 

CAA45177 

1-24 

659-675 (second underlined sequence) 
plasma membrane 



1 11 21 

I I 1 

MGLPRGPLAS LLLLQVCWLQ CAASBPCRAV 
QBPALFSTDN DDFTVRNGET VQERRSLKER 
KGPPPQRLNQ LKSNKDRDTK IFYSITGPGA 
YELFGHAVSB NGASVEDPMN ISIIVTDQND 
DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF 
TDMDGDGSTT TAVAWEILD ANDNAFMFDP 
AWRATYLIMG GDDGDHPTIT THPESNQGIL 
PTSTATIWH VEDVNEAPVF VPPSKWBVQ 
DPAGW1AMDP DSGQVTAVGT LDREDEQFVR 
VNDHGPVPEP RQITICNQSP VRHVMIITDK 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL 
GAVLALLFLL LVLLLLVRKK RKXKBPLLLP 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN 



31 41 51 

I I I 

FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

DSPPEGVFAV EKETGWtiUiN KPLDREEIAK 180 

HKPKPTQDTF RGSVLEGVLP GTSVKQVTAT 240 

TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

TTRKGLDFEA KNQHTLYVEV TNEAPFVIiKL 420 

EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

EDDTRDNVFY YGEEGGGEBD QDYDITQLHR 720 

PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
EWGSRFKKLA DMYGGGEDD 



A66 DMA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number t Hs. 106778 

Probeset Accession #x HS1919 
Nucleic Acid Accession #* AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AA GGTGA CAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 



860 



WO 03/042661 



PCT7US02/36810 



ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

ACGCTAGCTC TTGGTGT TA T GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 9 SO 

ATTGTTGAAA CTCTGGGCTG CTOTAATGTG ATTTGTTCAQ ATAAAACTGG AACACTGACQ 1020 

AAQAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GOAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATO GTGATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320 

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CACCGAACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500 

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620 

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTOGCTACA GAAGAACGGT 1860 

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920 

ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGG CTACATTAAT GAACTTTCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220 

GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580 

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700 
CTTGAAGTAT GA 



A67 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #i 
Protein Accession #2 
Signal sequence: 
Transmembrane domains: 
Pfam domains: 
[744-889] 

Cellular Localization: 



ATPase, Ca++ transporting, 

Hs. 106778 

NS1919 

AAF27813 

none found 

60-82. 86-108, 252-274. 
Cation ATPase N [6-83] , 



type 2C, member 1 



282-304, 757-779, 827-849, 856-878 

E1-E2 ATPase [89-324] . Hydrolase [328-649] , Cation ATPase C 



not determined 



1 11 21 31 41 51 

I I I I I I 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60 

SQPKNPLIML LLASAVISVL MHQFDDAVSI TVAILIWTV APVQSYRSEK SLEELSKLVP 120 

PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS 180 

KVTAPQPAAT NGDLASRSNI AFKGTLVRCG KAKGWIGTG BNSEFGEVPK MMQAEEAPKT 240 

PLQKSMDLLG KQLSFYSPGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIWTV 300 

TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVTHIP TSDGLHAEVT 360 

GVGYNQFGEV XVDGDWHGF YHPAVSRIVE AGCVCNDAVI RNNTLMGKPT EGALIALAMK 420 

MGLDGLQQDY IRKABYPFSS EQKWMAVKCV HRTQQDRPBI CFMKGAYEQV IKYCTTYQSK 480 

GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIXD PPRTGVKEAV 540 

TTLIASGVSI KM1TGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600 

FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660 

LVDDDPQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNPP NPLNAMQILW 720 

INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780 

ELRDNVITPR DTTMTPTCPV FFDMFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840 

LVIYPPPLQK VPQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREXIQ KHVSSTSSSF 900 
LEV 



A68 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #< 
Coding sequence: 



bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 
Hs. 170195 
BE616633 
NM_0 01719 

123-1418 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGOTGCO GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCOGGCG 120 

CQATGCACGT GCGCTCACTG OGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGOCO TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACOGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 60 0 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660 



861 



WO 03/042661 



PCT/US02/36810 



ACATCCGGGA ACGCTTOGAC AATGAGACGT TCCGGATCAG CGTTTATCAG OTGCTCCAGG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTOGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCOGC 840 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGG6C CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCQGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCOGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTOQ ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 



A69 Protein sequence; 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 



Unigene number i 
Probeset Accession ft: 
Protein Accession 
Signal sequence: 
Pfam domains: 
Transmembrane domains: 
Cellular Localization: 



Hs. 170195 
BE616633 
NP 001710.1 
1-30 

TGFb_propeptide (37-281) 

none found 

secreted 



1 
I 

MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLQLSVE 



GECAFPLNSY 
RNMWRAOGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



31 
I 



LFLLRSALAD FSLDNEVHSS 
MFMLDLYNAM AVEEGGGPGG 
EHDKEFFHPR YHHREFRFDI* 
HLGRESDLFL LDSRTLWASE 
LAGLIGRBGP QNKQPFMVAF 
NSSSDQRQAC KKHBLYVSFR 
TLVHFINPET VPKPCCAPTQ 



41 
I 



QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
PKATEVHFRS 
DLGWQDWIIA 
LNAI SVLYFD 



51 
I 

RREMQREILS 60 

FSTQGPPLAS 120 

AAEFRIYKDY 180 

TSNHWWNPR 240 

IRSTGSKQRS 300 

PEGYAAYYCE 360 

DSSNVILKKY 420 



Cervical 



A70 DMA sequence 
Gene name: 
Unigene number » 
Probeset Accession ft: 
Nucleic Acid Accession #: 
Coding sequence: 



bone morphogenetic protein receptor IB (ALK-6) 

Hs. 87223 

AA250737 

NM_001203 

274*- 1782 (underlined sequences correspond to start and stop codons) 



l 
I 

CGCGGGGCGC 
GAGGACGCGG 
GTGAAAGGAA 
CATAACCATT 
TGCCATAAGT 
AATGTGGGCA 
TTGCGTTGTA 
GACGGATATT 
GGTTGCCTAG 
AGAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTCCTCCTG 
TCAGGCCTCC 
ATTGGAAAAG 
GTGAAAGTGT 
ACAGTGTTGA 
GGGTCCTGGA 
TATCTGAAGT 
AGTGGCTTAT 
CATCGAGATC 
GACCTGGGCC 
ACTCGAGTTG 
AATCACTTCC 
GTTGCTAGGA 
CTAGTGCCCA 



11 
I 

GGAGTCGGCG 
GAGCCGGGAG 
AGGAAGATCA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCACGAT 
GACTAGAAGG 
TTGAATGCTG 
TGAAAAACAG 
CTGTCTGTAG 
AAGAAACCAG 
GAGAATCCCT 
CTCTGCTGGT 
GTCGCTATGG 
TCTTCACCAC 
TGAGGCATGA 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCTGTTAA 
GCACCAAACG 
AGTCTTACAT 
GATGTGTATC 
GTGACCCCTC 



21 
I 

GGGCCTCGCG 
CGCACGCGCG 
TTTCATGCCT 
CTATGACAAG 
CTTCCTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAGATTTT 
CACAGAAAGG 
AGATTTTGTT 
TTTGCTCTTG 
ACCTCGATAC 
GAGAGACTTA 
CCAAAGGACT 
GGAAGTTTGG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATCACA 
AGACGCTAAA 
CACAGAAATC 
AAACATTCTG 
ATTTATTAGT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 



31 
I 

GGACGCGGGC 
GGGTGGAGTT 
TGTTGATAAA 
AGAGGAAACA 
AACATGCTTT 
AGTACAGCCC 
GAAGACTCAG 
GATGACTCTG 
CAGTGTCGGG 
AACGAATGTA 
GATGGACCTA 
GTCCTTATCA 
AGCATTGGGT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGGGAAAGT 
AGCTGGTTCA 
GGTTTCATTG 
GACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATGAGGGAGA 



41 
I 

AGTGCGGAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
TGCGAAGTGC 
CCACCCCCCG 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TATTATTTTG 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGCGTGGCGA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TGGACGAGAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTGTGTGCAT 



51 
I 

CCGCGGQGCT 
TTTCTTAGAT 
TCTGCTGATT 
CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATCAA 
ACACCCTACA 
GGCTTTACTT 
TTACTTCCGG 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGGACA 
CCTTTATGAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTTGGGAG 
TTATCATGAC 
CAAGAAGTTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



862 
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OGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACT CT GATAQGAGAG GAAAAGTAAG 1800 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



A71 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
PFAM domains: 
Cellular Localization: 



bone morphogenetic protein receptor IB (ALK-6) 
Hs. 72472 / Hs. 87223 
AA2S0737 / U89326 
NP_001194 

1-13 
128-144 

activin_receptor (30-111), protein kinase [204-491] 
plasma membrane 



1 U 21 31 41 51 

I I I ! I 1 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 

DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGBSLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVPPTTEEAS 240 

WFRETBIYQT VLMRHENILG PIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKKGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIHADM YSFGLILWEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 



Bladder 

A72 DNA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs. 63325 

Probeset Accession #: AA4U502 

Nucleic Acid Accession #: NMJU6425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



A73 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession Jh 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
LDLa domain: 
Tryp_SPc domain: 
Cellular Localization: 



Homo sapiens type II 

HS.63325 

AA411502 

NPJ)S7S09 

none found 

31-53 

54-94 

204-429 

plasma membrane /ER 



serine protease mRNA 



1 11 21 31 41 51 

I I 1 I I 1 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW VLIKVILDKY 60 
YFLCGQPLHF IPRKQLCDGB LDCPLGEDBE HCVKSPPEGP AVAVRLSKDR STLQVLDSAT 120 



863 



WO 03/042661 



PCT/US02/36810 



GNWPSACFDN FTEALAETAC RQMGYSSKPT PRAVEIGPDQ DLDWEITEN SQELHMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TPSGTVRPIC LPPFDEELTP ATPLWIIGWG PTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 
KVSAYLNWIY NVWKAEL 

A74 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha l(III) chain precursor [H. sapiens] 
Unigene number: Hs.19322 
Probeset Accession 8: AA08B456 
Nucleic Acid Accession 0: AA0884S8 

Coding sequence: 862-1995 (underlined sequences correspond to start and .stop codons) 



1 11 21 31 41 51 



GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGOG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTCCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATOGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCOGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC OGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280 

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2560 

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG COGGGCGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TGATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 

A75 DNA SEQUENCE 

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase, inhibitor 

Unigene number: Ha. 227948 

Probeset Accession #: AB0350B9 
Nucleic Acid Accession fh AB035089 

Coding sequence: 9845-10219 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 



GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 
CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGG ATGT TTC TTAGCAATAG AAAAAGAAGG 
CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 
TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 



60 
120 
180 
240 



864 



WO 03/042661 



AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 
GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 
AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 
GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 
AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA S40 
AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 
AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 
GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 
GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 
CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 
TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 
TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 
CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 
TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 
CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 
CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 
TGGTACCOGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 
CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 
TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 
CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 
ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 
GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 
CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 
AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 
TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 
TGGAGAAGAG TCTGGCATTT CCTCAAAATO TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 
TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCGATGCAA AAAAATGTAC 1860 
ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 
ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TGCTGACAGG TCAAGCAATT TATTTTTCGG 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTAGATGGTG GCTGCAAGAG 2760 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 
TACAGAAAAA TTACTAA GGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TT GG T TT TGT TTTTGTTTTA CAGCIGGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATG CTTTT CCAGGAGTTC CAGATGACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4S60 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 
CCTCAGAGCA TAGGTCTGOA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 
AAATGATOAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 
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ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG S3 40 
CCTTAATAGT GCAAAATGCT TTGCTGQAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
. TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
5 TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATQACCATC ATGAGGAAAG S580 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT S760 
1 A AACCCTTTAA ACGAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 
1U AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
15 ATATGAGCTG AAGATCGCCA ACAAGCTCTT OGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCGAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAAOGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 
ZU TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCXGC 6720 
25 CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 
3U AAGTCGAAAG AAGATTAACT GCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
35 TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
40 TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
AC GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
45 TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
5U GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
55 TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAQAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
^ CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 

OU CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGOAATG 8880 
ATGACrCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCOCTOC CCCCTCCCCC 9120 
65 CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
70 GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATAOGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
75 CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATOGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTOGATT TACACTTACC 9900 
TOGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACAGG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
oU ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACOGCT GTAGTAGTAG TOGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
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CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCT CTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA COQTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTGATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTT G TTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAQAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 



A76 Protein sequence ; 

Gene name: SCCA2 or 2b, 

Unigene number: Hs. 227948 

Probeset Accession ft: AB035089 

Protein Accession #» BAB21525 

Signal sequence: none found 

Transmembrane domains : none found 
Serine Proteinase Inhibitor domain: 

Cellular Localization: secreted 



serine (or cysteine) proteinase inhibitor 



13-390 



1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK PMFDLFQQFR KSKENNIFYS PIS1TSALGM VLLGAKDNTA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 

DAIKKPYOTS VESTDFANAP EESRKKINSW VESQTNEKIK NLPPDGTIGN DTTLVLVNAI 180 

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 

LSMIVLLPNE IDGLQKLEEK LTABKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300 

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VBAAAATAW WELSSPSTN 360 
EEFCCNHPPL PPIRQNKTNS ILFYGRFSSP 



A77 DMA SEQUENCE 

Gene name: hypothetical protein FLJ13459 

unigene number: none found 

Probeset Accession #: XM_04 72 66 
Nucleic Acid Accession ft: XM_047266 

Coding sequence: 485-1471 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG CTGGGATTAC AGGTGTGAGC 60 

CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC TCATCTGCAA AACAGAGGTA 120 

ATAAGATCAG TATCTTCTTA ATGGAAGCAC CTGGACTACA TTTTTTTCAT TCATTGTTAT 180 

CATAAATGAG GACTAACCTG TCTCCCGTTG GGAGTTTTGA ACCTAGACCT CATGTCTTCA 240 

TGACGTCATC ACTGCCCCAG GCCCAGCTGT GTCCCTACAC CAGCCCCAGC TGACGCATCT 300 

TCTTTTTCTG CCTGTAGAGA TGGTTACAAT GCCTGGCGTG ATGCATTCTG GCCTTCGCAG 360 

ATCCTGGCGG GGCTGTGCCA ACGCTGTGGC CTCCCTGCCC CTGAATACCG AGCCGGTGCT 420 

GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACOGGAGA CCCTGCCCCC AGGGATCTCT 480 

TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT CCACCCCCAG TTGACATCAA 540 

GCCTCGGCAG CCAATCAGCT ATGAGCTCAG AGTTGTCATC TGGAACACGG AGGATGTGGT 600 

TCTGGATGAC GAGAATCCAC TCACCGGAGA GATGTCGAGT GACATCTATG TGAAGAGCTG 660 

GGTGAAGGGG TTGGAGCATG ACAAGCAGGA GACAGACGTT CACTTCAACT CCCTGACTGG 720 

GGAGGGGAAC TTCAATTGGC GCTTTGTGTT CCGCTTTGAC TACCTGCCCA CGGAGCGGGA 780 

GGTGAGCGTC TGGCGCAGGT CTGGACCCTT TGCCCTGGAG GAGGCGGAGT TCCGGCAGCC 840 

TGCAGTGCTG GTCCTGCAGG TCTGGGACTA TGACCGCATC TCTGCCAATG ACTTCCTTGG 900 

ATCCCTGGAG TTGCAGCTAC CAGACATGGT GCGTGGGGCC CGGGGCCCCG AGCTCTGCTC 960 

TGTGCAGCTG GCCCGCAATG GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT 1020 

GAGGGGCTGG TGGCCGGTAG TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA 1080 

GGAGGCTCAG GCTGGCAAGA AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA 1140 

CCTGGAGTTC ACAGACATGG GTGGCAATGT GTACATCCTC AOGGGCAAGG TGGAGGCAGA 1200 

GTTTGAGCTG CTGACTGTGG AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCGGAAGCA 1260 

GCCAGAGCCT CTGGAGAAAC OCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC 1320 

GCTGAAGACC TTTGTCTTCT TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT 1380 

ACTGGTGCTG CTCACCGTCT TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG 1440 

CCAGGTCATC TTCCGTCCCC TCCACAA GTG A CTCTCGCTG ACCTTGGACA CTCACCCAGG 1500 

GTGCCAACCC TTCAATGCCT GCTCCTGGAA GTCTTTCTTA CCCATGTGAG CTACCCCAGA 1560 
GTCTAGTGCT TCCTCTGAAT AAACCTATCA CAGCCACTG 



A78 Protein sequence: 

Gene name: hypothetical protein PLJ134S9 
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Uhigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
C2 domain: 

Cellular Localization: 



none found 
XM_047266 
XPJ>47266 
none found 
291-313 



27-86 

plasma membrane / ER 



l 



11 



21 



31 



41 



51 



KWIDIFPQDV PAPPPVDIKP RQPISYELRV VIWNTEDWL DDENPLTGEM SSDIYVKSWV 60 

KGLEHDKQET DVHFNSLTGE GNFNWRFVFR FDYLPTEREV SVWRRSGPFA LEEAEFRQPA 120 

VLVLQVWDYD RISANDPLGS LELQLPDMVR GARGPELCSV QLARNGAGPR CNLPRCRRLR 180 

GWWPWKLKE AEDVEREAQE AQAGKKKRKQ RRRKGRPEDL EPTDMGGNVY ILTGKVEAEF 240 

ELLTVEEAEK RPVGKGRKQP EPLEKPSRPK TSFNWFVNPL KTFVFFIWRR YWRTLVLLLL 300 
VLLTVFLLLV FYTIPGQISQ VIFRPLHK 



A79 DNA SEQUENCE 

Gene name: Homo sapiens mRNA; cDNA DKPZp434K0322 (from clone DKFZp434K0322) 

Unigene number: Hs. 161031 

Probeset Accession #: AL137708 

Nucleic Acid Accession 8: AL137708 

Coding sequence i 1315-1791 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 



GGCATTGATG CTGTGTGCGC GTGCGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 

GTGTGTGTGT GTCTGGAGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCCC 120 

AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCTGCTTAT TCCCGATGGG 180 

ATGGCCTGGG CTGGGCCCCT GAGGCCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 240 

CTCTTGTGGG CAAAGCAGGG GAGGCGCCAA TGTGGAGGAA CAGAGTCTCC TGGCTGGCTG 300 

CTGCTCTCTG GAGCGGGTGG AGTCAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 360 

GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTCCCCTGC CTGTAGGTGA CAGGAGCCAG 420 

CCCAGCCAGG TGTGCTCCCT CCCCAGGCCC TAGGCAGGCG GGTACAGGGG CCAGCAGCTG 480 

CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGT GGCCAGGCAG GCAGGTGGAC 540 

GAGTCCAGGC AGCGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCCGCTCC CAGCTGCACC 600 

CTGCCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAGGCG CTGCGGCCCT GAGCCCCTGC 660 

CAGGAATGCA CCTTTAGCCC AGGCCTGCTC AGTGAGCTCC GCCOACAGCC AGCCCTGCTC 720 

CTCCCGCCAT GACCCTGCAG ACCCCTCTGG GCTTCCAAGT TCCTGGGGGC TGCAGTGAAC 780 

ATGCTCCACC TGCATGGCTG GCAAACCATG GTGGGCCCCA GCTGTGGTGC GTGCTGGGGT 840 

AGAGGCAAGG AAGTGATGGG ACOGCAGAGA TGAGACCCCC AGGGATGAGA TGGGACCCCC 900 

AGGCAGGGCC CAGGGTCCAG GGCCCAGGAG AGAGAAGCAG GGAGGGAGAG AGCTTCCTGG 960 

TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 1020 

AGGCTGCCCA GGCCCTGCCT GCTTGGCTGG GGCTGGGGGC TGCTGGGAGG TGGCTGGGAG 1080 

GCTGGGCCTG GGCAGCTAAG CTGGAGCTTT GGCCAGGGTC CAGAGCCTCC CTCCCTTCAG 1140 

CTTCCTGCTG CACAGAACCC TCGCCCCTGG CCACCCCGTG CTGCCTCCTT GCCCTGGCAG 1200 

ACCCAGCACT GGCTGCTGCT AGTCAGATGG GGTAGCGGGC AGGGGCCGGA GGGGCCACCC 1260 

TCCCAGCTGA CCCAGCCTCC TGGGCCGCTT CTTCCAAACC AGCAGGGTAG AAAGATGGGG 1320 

CACOCACCAG TCTCTCCCAG TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATTCCAGACC TTGTCGCCGG GACCCCCTGT GAGTTGTGGG ATTCCCAAGA GGGGTGTGGG 1440 

GATAACCCAG CCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCCT GGCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTCGTCTCC 1560 

TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC OGCCGCCACA GGAAGAAGCC CAGGGACAAG 1620 

GAGTCCGTGG GTCTGGGCAG TGCCCGCGGC ACCACCACCA CCCACCTGGT GAGGAGCGGC 1680 

TCCTTGCTCA CTCAGTCCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGOGA 1740 

GGCGAGTTCA GCCCCAGGGA TGGTTTAACC CCCACAGAGG CAGGGCG TTG AGGACCTTCC 1800 

TGGCAGGGAA AGTGGGTGAA CAGAGGTGAG AAGGAGGCCA TGCAACAGGG GCTGCCCCAT 1860 

GGGCCCGAGG GAGCCACAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 

TTTGGGTGGG TTTGGCCGGT CTCACAGAGC GAAGCCGACG ATTTGTGCCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGCAAG GGCTGCCCGG GAGCCCAGGG 2040 

CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT CCAGGTGCAA 2100 

CCTGATGTGG ATGGCCTGGA GTCCAGCCCG GGGGATGCTC AGCAATGGGG GTGCCTGCAG 2160 

CTCTCCCTGG AGTTCGACTT TGGAAGCCAG GAGGTGAAGG GCCCCGCTGC GCAGGACCAG 2220 

CGGTTCTGCG AGTTTCCGGA AAGGGTGACG GGGGAAGGGC AGACCCCATG CCCTGGGTGG 2280 

TGGGGAGCTG ACAGGGCAGG GGCCCTTGGC TGAGCCCACC CCGCTGGCTC CCAGATCAGG 2340 

GTGGGCCTGA GGCAGGCAGC CGACCTGAGG CCTGGGGGCA CCGTGGACCC CTATGCCCGG 2400 

GTCAGCGTCT CCACCCAGGC CGGACACAGA CATGAGACAA AAGTGCACCG AGGCAOGCTC 2460 

TGCCCCGTGT TTGACGAGAC CTGCTGCTTC CACGTGAGTC AGGGATGGTC GGCTGGGTGG 2520 

GCCTGGACGG CTGGATGGGC CTGGGCTGGG TGOGCCTGGG CAGCTGGGTG GGCCTGGGCA 2580 

GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTCAC GCCGCTGCCT CAGATCCCGC 2640 

AGGCGGAGCT GCCAGGGGCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGCTTCTCGG 2700 

GGCATGAGCC CCTGGGTGAG CTCCGTCTGC CACTGGGCAC CGTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTG GGCOCGCCGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 

GCCACAGCCC AAGGCAGAGC TGGCAGGGAC CCTGCCCTAT GGGCCATOGG AAAGACAGGC 2880 

CTGATGGGCA GCATTTTCGG GGGTCTGAGC CCCAACTCGG CCAGAATCAC CCTCCCGGGC 2940 

TGAAGCCCCT CTTGCTGCCC ACAGCCCGAG CAGGTCGGGG AGCTGTGCTT CTCTCTCCGG 3000 

TACGTGCCCA GCTCAGGCCG GCTGACOGTG GTGGTGCTGG AGGCTCGAGG CCTGOGTCCA 3060 

GGACTTGCAG AGCCCTACGT GAAGGTCCAG CTCATGCTGA ACCAGAGGAA GTGGAAGAAG. 3120 

AGAAAGACAG CCACCAAAAA GGGCACGGCG GCCCOCTACT TCAATGAGGC CTTCACCTTC 3180 

CTGGTGCCCT TCAGCCAGGT CCAGAATGTG GACCTGGTGC TGGCTGTCTG GGACCGCAGC 3240 

CTGCCGCTCC GAACTGAGCC CGTAGGCAAG GTGCACCTGG GTGCOCGGGC CTCGGGGCAG 3300 
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CCCCTGCAGC ACTGGGCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

CCCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3480 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC GCCTTCTCCT GCC 



A80 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 

1 11 



Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKF2p434K0322) 

Hs. 161031 

AL137708 

CAB7088S 

none found 

69-85 

plasma membrane 



21 31 41 51 

I I I I I I 

MGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQEG CGDNPAKWGL QLSTDALSLA 
STPGPRWALI AGALAAGVLL VSCLLCAACC CCRRHRKKPR DKESVGLGSA RGTTTTHLVR 
SGSLLTQSRE GLKSRLQSPG QRGEFSPRDG LTPTEAGR 



60 
120 



A81 DNA SEQUENCE 

Gene name: Cadherin 3, P-cadherin (placental) 

Unigene number: Ha. 2 877 

Probeset Accession #: X63629 

Nucleic Acid Accession #: X63629 

Coding sequence: 54-2543 (start and stop codons are underlined) 



1 11 21 31 41 51 

I 1 I I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 

TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCOGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 

GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

A GACA GGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCAOGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGA7TTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCOCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CT G CT G GTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATO ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGOGGCTAA CACAGACCCC ACAGCCCCGC OCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCGG GGAGGAOGAC TAGG CGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTG TCC T GG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGOCG 3000 

TCCTGCATTT CTG G TTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGGGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 
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M2 Protein sequence t 
Gene name: 
Unigene number: 
Probeget Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domain: 



Cadberin 3, P-cadherin (placental) 

Hs.2877 

X63629 

CAA45177 

1-24 

659-675 (second underlined sequence) 



Cellular localization: plasma membrane 



MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 
I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVP 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 



21 

I 

CAASEPCRAV 
VQERRSLKER 
IPYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
ANDNAPMPDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRHVLNITDK 



RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TXHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRPKKLA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 
YGBEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



51 

I 

LGKVFMGCPG 
VAPISVFENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFIItPVL 
QDYDITQLHR 
APPYDTLLVF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A83 DNA SEQUENCE 

Gene name: putative G-protein coupled receptor 
Unigene number: Hs. 16085 

Probeset Accession ft: F07953 
nucleic Acid Accession ft: NM_016334 

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I 1 I ! I 

AGCACCTGGG AAAAGGCAGA CCGTGTGAGG GGGCCTGTGG CCCCAGCGTG CTGTGGCCTC 60 

GGGGAGTGGG AAGTGGAGGC AGGAGCCTTC CTTACACTTC GCCATGAGTT TCCTGATCGA 120 

CTCCAGCATC ATGATTACCT CCCAAATACT ATTTTTTGGA TTTGGGTGGC TTTTCTTCAT 180 

GCGCCAATTG TTTAAAGACT ATGAGATACG TCAGTATGTT GTACAGGTGA TCTTCTCCGT 240 

GACGTTTGCA TTTTCTTGCA CCATGTTTGA GCTCATCATC TTTGAAATCT TAGGAGTATT 300 

GAATAGCAGC TCCCGTTATT TTCACTGGAA AATGAACCTG TGCGTAATTC TGCTGATCCT 360 

GGTTTTCATG GTGCCTTTTT ACATTGGCTA TTTTATTGTG AGCAATATCC GACTACTGCA 420 

TAAACAACGA CTGCTTTTTT CCTGTCTCTT ATGGCTGACC TTTATGTATT TCTTCTGGAA 480 

ACTAGGAGAT CCCTTTCCCA TTCTCAGCCC AAAACATGGG ATCTTATCCA TAGAACAGCT 540 

CATCAGCCGG GTTGGTGTGA TTGGAGTGAC TCTCATGGCT CTTCTTTCTG GATTTGGTGC 600 

TGTCAACTGC CCATACACTT ACATGTCTTA CTTCCTCAGG AATGTGACTG ACACAGATAT 660 

TCTAGCCCTG GAACGGCGAC TGCTGCAAAC CATGGATATG ATCATAAGCA AAAAGAAAAG 720 

GATGGCAATG GCACGGAGAA CAATGTTCCA GAAGGGGGAA GTGCATAACA AACCATCAGG 780 

TTTCTGGGGA ATGATAAAAA GTGTTACCAC TTCAGCATCA GGAAGTGAAA ATCTTACTCT 840 

TATTCAACAG GAAGTGGATG CTTTGGAAGA ATTAAGCAGG CAGCTTTTTC TGGAAACAGC 900 

TGATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTACCAT 1020 

CAATATTGTT TTTGATCGAG TTGGGAAAAC GGATCCTGTC ACAAGAGGCA TTGAGATCAC 1080 

TGTGAATTAT CTGGGAATCC AATTTGATGT GAAGTTTTGG TCCCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATOG TCACATCCAT CAGAGGATTG CTGATCACTC TTACCAAGTT 1200 

CTTTTATGCC ATCTCTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAGCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATGAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTGAAG TCCTTGGAGA ACTGCAGTTC AACTTCTATC ACCGTTGGTT 1380 

TGATGTGATC TTCCTGGTCA GCGCTCTCTC TAGCATACTC TTCCTCTATT TGGCTCACAA 1440 

ACAGGCACCA GAGAAGCAAA TGGCACCTTG AACTTAAGCC TACTACAGAC TGTTAGAGGC 1500 

CAGTGGTTTC AAAATTTAGA TATAAGAGGG GGGAAAAATG GAACCAGGGC CTGACATTTT 1560 

ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCACCTTCA TAGCATACTC CTTCCCCCTC 1620 

AGGTGATACT ATGACCATGA GTAGCATCAG CCAGAACATG AGAGGGAGAA CTAACTCAAG 1680 

ACAATACTCA GCAGAGAGCA TCCCGTGTGG ATATGAGGCT GGTGTAGAGG CGGAGAGGAG 1740 

CCAAGAAACT AAAGGTGAAA AATACACTGG AACTCTGGGG CAAGACATGT CTATGGTAGC 1800 

TGAGCCAAAC ACGTAGGATT TCCGTTTTAA GGTTCACATG GAAAAGGTTA TAGCTTTGCC I860 
TTGAGATTGA CTCATTAAAA TCAGAGACTG T 



A84 Protein sequence 
Gene name: 
75 Unigene number: 

Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
424-446 

OK) Cellular Localization: 

1 11 21 



31 



putative G-protein coupled receptor 
Hs. 16085 

NP_0 5741 8.1 
none found 

S-27, 42-64, 76-98, 110-132, 144-166, 289-311. 342-364, 380-402, 
plasma membrane 
41 51 

870 



WO 03/042661 



PCT/US02/36810 



I I I I I I 

MSPLIDSSIM ITSQILFFGF GWLFFMRQLF KDYEIRQYW QVIPSVTPAP SCTNFELIIF 60 

EILGVLNSSS RYFHWKMNLC VILLILVFMV PFYIGYFIVS NIRLLHKQRL LFSCLLWLTF 120 

MYFFWKLGDP PPILSPICHGI LSIEQLISRV GVIGVTLMAL LSGFGAVNCP YTYMSYPLRN 180 

VTDTDILALE RRLLQTMDMI ISKKKRMAMA RRTMFQKGEV HNKPSGFWGM IKSVTTSASG 240 

SENLTLIQQE VDALEELSRQ LFLETADLYA TKERIEYSKT PKGKYFNPLG YFFSIYCVWK 300 

IFMATINrVP DRVGKTDPVT RQIEITVNYL GIQFDVKFWS QHISFILVGI IIVTSIRGLL 360 

ITLTKPPYA1 SSSKSSNVIV LLLAQIKGMY FVSSVLLIRM SMPLEYRTI I TEVLGELQFN 420 
FYHRWFDVIF LVSALSSILF LYLAHKQAPE KQMAP 

ASS DNA SEQUENCE; 

Gene name: " TTK protein kinase 

Unigene number: Hs. 169840 

Probeset Accession ft: M86699 

Nucleic Acid Accession if: NM_003318 

Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 Si 

I I 1 I I I I 

GGAATTCCTT TTTTTTT r rT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCQ GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAA6TGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGG? ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC IS 60 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 

GCAGCAATAC CTTGGATGAT T ACATGAG CT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 24 00 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGOCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 28B0 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAA ATG A TTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 



871 
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TAAATAAAGT TTTGTGGCTA AAATGA 
A86 Protein sequence: 

Gene name: TTK protein kinase 

Unigene number: Hs. 169840 

Probeset Accession 8: M86699 

Protein Accession ft: NP_003309 

Signal sequence: none found 

Transmembrane domains: none found 

Protein Kinase Domain: 510-775 

Cellular Localization: cytoplasmic and nuclear 

1 11 21 31 41 51 

I I I I I I 

MNKVRDIKNK FKNEDLTDEL SLNKXSADTT DNSGTVNQIM MMANNPEDWL SLLLKLEKNS 60 

VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR 120 

ANCKKFAFVH IS FAQ PEL SQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 180 

EBEKXNLSAS TVLTAQBSFS GSLGHLQNRN NSCDSRGQTT KARFLYGENM PPQDAEIGYR 240 

NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSWPCF MKRQTSRSEC RDLWPGSKP 300 

SGNDSCELRN LKSVQNSHPK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 360 

PEVPBSNQKQ WQAKRKSECI NQNPAASSNH WQIPBLARKV NTEQKHTTFE QPVFSVSKQS 420 

PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPWKNDFP PAOQLSTPYG QPACFQQQQH 480 

QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 540 

LEEADNQTLD SYRNEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVMECGN IDLNSWLKKK 600 

KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT 660 

SWKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 720 

IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 780 

PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL KAAKTLYEHY SGGESHNSSS SKTFEKKRGK 840 
K 

A87 DNA SEQUENCE 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 

Unigene number: Hs. 170195 

Probeset Accession #: BE616633 
Nucleic Acid Accession #: NM_001719 

Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons) 



1 11 21 

1 I I 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG 
CTGCCACCTG GGGCGGTGCG GGCCCGGAGC 
C GATG CACGT GCGCTCACTG CGAGCTGCGG 
CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG 
GCTTCATCCA CCGGCGCCTC CGCAGCCAGG 
CCATTTTGGG CTTGCCCCAC CGCCCGCGCC 
CCATGTTCAT GCTGGACCTG TACAACGCCA 
GCCAGGGCTT CTCCTACCCC TACAAGGCCG 
GCCTGCAAGA TAGCCATTTC CTCACCGACG 
TGGAACATGA CAAGGAATTC TTCCAOCCAC 
TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA 
ACATCCGGGA ACGCTTCGAC AATGAGACGT 
AGCACTTGGG CAGGGAATCG GATCTCTTCC 
AGGAGGGCTG GCTGGTGTTT GACATCACAG 
GGCACAACCT GGGCCTGCAG CTCTCGGTGG 
AGTTGGCGGG CCTGATTGGG OGGCACGGGC 
TCTTCAAGGC CACGGAGGTC CACTTCCGCA 
GCCAGAACCG CTCCAAGACG CCCAAGAACC 
AGAACAGCAG CAGCGACCAG AGGCAGGCCT 
GAGACCTGGG CTGGCAGGAC TGGATCATCG 
AGGGGGAGTG TGCCTTCCCT CTGAACTCCT 
AGACGCTGGT CCACTTCATC AACCCGGAAA 
AGCTCAATGC CATCTCCGTC CTCTACTTCG 
ACAGAAACAT GGTGGTCCOG GCCTGTGGCT 
TTGGGGCCAA GTTTTTCTGO ATCCTCCATT 
CTGCCTTTTG TGAGACCTTC CCCTCCCTAT 
AAACATGAGC AGCATATGGC TTTTGATCAG 
TCCTACAAGC TGTGCAGGCA AAACCTAGCA 
GCCGGGCCAG GTCATTGGCT GGGAAGTCTC 
TTATGAGCGC CTACCAGCCA GGCCACCCAG 
GGGCACATTG GTGTCTGTGC GAAAGGAAAA 
CAATAAAACG AATGAATG 



31 
I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCCCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCOGGA 



41 



51 

■I 



GGGACGGCGG CCTGCCCCCT ' 60 

GGTAGCGCGT AGAGCCGGCG 120 

CTTOGTGGCG CTCTGGGCAC 180 

GGACAACGAG GTGCACTCGA 240 

GATGCAGCGC GAGATCCTCT 300 

GGGCAAGCAC AACTCGGCAC 360 

GGAGGGCGGC GGGCCCGGCG 420 

CCAGGGCCCC CCTCTGGCCA 480 

CATGAGCTTC GTCAACCTCG 540 

TCGAGAGTTC CGGTTTGATC 600 

ATTCCGGATC TACAAGGACT 660 

CGTTTATCAG GTGCTCCAGG 720 

CCGTACCCTC TGGGCCTCGG 780 

CCACTGGGTG GTCAATCCGC 840 
TGGGCAGAGC ATCAACOCCA 900 
GCAGCCCTTC ATGGTGGCTT 960 

CAOGGGGAGC AAACAGCGCA 1020 

GCGGATGGCC AACGTGGCAG 1080 

CGAGCTGTAT GTCAGCTTCC 1140 

CTACGCCGCC TACTACTGTG 1200 

CACCAACCAC GCCATCGTGC 1260 

GCCCTGCTGT GCGCCCAOGC 1320 

CAACGTCATC CTGAAGAAAT 1380 

CCTCOGAGAA TTCAGACCCT 1440 

GCCAGGAACC AGCAGACCAA IS 00 

AAAGGTGTGA GAGTATTAGG 1560 

GCAGCATCCA ATGAACAAGA 1620 

ACAACGCATA AAGAAAAATG 1680 

GGACTCGTTT CCAGAGGTAA 1740 

AAGGGGGCGT GGCAAGGGGT 1800 

AGTTCCTGTA ATAAATGTCA 1860 



A8B Protein sequence: 

Gene name: " bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 



Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Pfara domains: 
Transmembrane domains: 
Cellular Localization! 



Hs. 170195 
BB616633 
NP_001710.1 
1-30 

TGFb_propeptide [37-281] 

none found 

secreted 



872 



WO 03/042661 



PCT/US02/36810 



MHVRSIiRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLQLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RNMWRACGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSPVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 

I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFPHPR 
HIiGRBSDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHPINPET 



31 
I 

FSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDI* 
LDSRTLWASE 
QNKQPFMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



41 

I 

FIHRRLRSQE 
QGFSYPYKAV 
SKIPEOEAVT 
EGWLVFDITA 
FKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



51 
I 

RREMQREILS 60 

FSTQGPPIAS 120 

AAEFRIYKDY 180 

TSNHWWNPR 240 

IRSTGSKQRS 300 

PEGYAAYYCE 360 

DSSNVILKKY 420 



A89 DMA SEQUENCE; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



Homo sapiens mRNA; cDNA DKFZp56401763 {from clone DKPZp56401763> 
Hs. 27373 
F13036 
AC0 12478 

1-1696 (underlined sequences correspond to start and stop codons) 



ATGCGCGCCG 
GCTCCCGCCG 
CGCGAGTCGC 
GCGGCGGGCG 
ATTTCCATCC 
TTTACCACCC 
AAGAAGACAC 
CTAAATGAAG 
TCCTTGCCGG 
GTGCCCCCAC 
GGTGGAATCA 
TCAGCTGCAA 
GAAACCAAAA 
TCAGACTGTC 
TTTGGGCATC 
CTGAATCTCA 
GCCAGGTGCT 
CCTTGGTGGC 
AGTACCTTGG 
AGAGCAGTGA 
GCCCTAGAGA 
ACCCATCCTG 
AGGGTGTTTC 
TGCCTTTTGG 
ATCTGTCTCC 
GTCCTTGCGT 
AAACGAGGGC 
GCGAACCTGC 
CACCCTGGAG 
GATGGCAGAT 
TGTGAGAGGC 
TCCCCCCGAC 



11 
I 

TGCOGCTGCC 
CCOGCGCCAG 
GGCCACCGCC 
GCAGCGGCAG 
TCCTCCGCGA 
TCCTCATCGC 
GCAAGTATGA 
AGGATGATGA 
CTGCACTGAG 
CCTTCATCCT 
GACCTGGTAA 

CGAACGTTAT 
ACTGGCAAGC 
CCTTTAAAGT 
TGGAAAAGCT 
TGCCACTGGT 
ACTTCAGCGC 
GCTTGGATGT 
CTAAGGTTCT 
GTGGGTTTCC 
TCAGGTTGGC 
GGCGTCCGCG 
TTTTGAAGAT 
CCTGCTGTGC 
CATTTGAGCA 
CCAGTGGGCA 
AGACAATTCC 
GAGACTTGGA 
GCCAGAAGAT 
TCACAGGTTC 
AGCCCCTGTT 



21 
I 

CGCCCCGCTC 
CAGAGCCGAG 
CGGCCCGGGG 
CTCCAGCTCC 
CCTACCCACC 
CTGCCTGCTG 
TATCATCACC 
AGATGAGGAC 
ACGTCAGCTG 
CGACATTGAC 
AACCTGTTTC 
GAAGGACTGG 
GTATAAAACC 
TCGTTTCCAC 
GCCCCCTACT 
GGATTCCTCT 
CCTGGCAGAA 
CACAGGCTCT 
TTTCTGTGGT 
CCAGGGTAGC 
CCATCATCTC 
TCGTTCAGAT 
GCAGTCTCTG 
TCTGTTGAGG 
CGTGGAACAC 
GAGCCCACAA 
ATTGACCAGA 
AGATACCCAA 
TGGGGTGGCA 
GGTCCTGATG 
CCATCACTTC 
TCTGTCCAGG 



31 
I 

CTGCCGCTGC 
TCCGTCTCCG 
CCCGGGAACA 
AACAGCAGTG 
CTCAAGGCAG 
CTGCGCGTCT 
ACTCCAGCAG 
TCCACAGTAT 
CCAGGGTGCC 
CTTCCAGCAA 
CCAGCCTGGT 
ACCTGGAAGC 
CCAGCTCCAT 
GTCACCACAA 
TCTACTCCCC 
GCCTTACGCA 
ATGGCGGCTG 
CCAATAAAAA 
GCCGGCCAGC 
TCTTTCTCCA 
AGGCTTCTCA 
GCCCGGGGAC 
CATGGCGGAG 
CGCCATCCTC 
CTACGGGAAG 
AAGGCAGCTG 
CACACATGCC 
GGCCAGGAAG 
AATTTCTATT 
TCTGAGGAAG 
TCCAGCCATT 
CCCTGA 



41 
I 

TGCTGCTCGC 
CGCCGTGGCC 
CCACCCGGTT 
GCGACGCCTT 
CCGTGATCGT 
TCAGGTCGGG 
AGCGAGTGGA 
TCGACATCAA 
AGACGCTACT 
GATGCAGTGG 
GGCATCCTGT 
CCTCTTGCGT 
CGTGCGTGTC 
TGGAGTTGCT 
ATGGTTTTCG 
GAAACACCCG 
CTGAAAGTGA 
CCCTTTACAC 
GGGGCACCTT 
AACAGCTGOG 
GAGAGTGTCC 
AAGCCAGCCT 
GGTCAGCGGG 
ACCTTGACCT 
CCAAGAGAAG 
CTGCCCACGG 
CTGGCTGGGG 
GCCCACGTGA 
TGGAGGAAGA 
GGCCACCTAG 
CCAAGTCTTG 



51 

I 

GCTCCTGGCC 
CGAACCCGAG 
TGGGTCTGGG 
GGTGACCCGC 
GGCGTTCGCC 
AAAGAGGTTA 
AATGGCGCCA 
ATACAGAGTG 
GACAGTTCCT 
AAGGCCTGAT 
GGAAAGTTGG 
CGGAGGTGTT 
AGGCATCTGC 
TCTGCCACCC 
ACAACTGCAG 
GGCTCCATCT 
CCTTCCAAAT 
ACAAACCATG 
TTGTGAAGAC 
CTGGAAGCCA 
TCCGCTGAGC 
GACGGGGAGG 
TACCGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCCTGTC 
GATCACACAT 
GGATGTCACT 
GGGTTTCCAG 
TTTGACAGGA 
GTCCTTCCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A90 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



Homo sapiens mRNA; 

Hs. 27373 

F13036 

FGENESH predicted 
1-27 

94-115, 448-469 
not determined 



CDNA DKFZp56401763 (from clone DKFZp5 6401763) 



1 11 21 31 41 51 

I I I I I I 

MRAVPLPAPL LPLLLiLALIiA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60 

AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120 

KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180 

VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHPVESW SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPPKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SPSKQLRWKP ALESGFPHHL RLLRECPPLS 420 

THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 

ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEBGFQ DGROQKMVLM 8EEGPPSLTG 600 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 

EWING 



A91 DNA SEQUENCE 

Gene name: G protein-coupled receptor 64 
Unigene number: Hs. 184942 



873 



WO 03/042661 



PCT/US02/36810 



Probeset Accession 8: 
Nucleic Acid Accession 8: 
Coding sequence: 



AA43S577 
NM 005756 

73-3117 (underlined sequences 



5 

1 11 21 31 41 51 

1 i i 1 I I 

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 
10 GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 
TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTAT CTGT T 
GTCAGTTTTG CCCXXTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 
GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 
15 GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 
CAACATATAA OGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 
20 TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 
CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 
TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 
GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 
25 CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 
CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 
AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 
GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 
30 GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 
ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 
TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 
CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 
CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 
35 TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 
GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAAOGTGA CAGTCACATT AAAGCACATC 
AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 
GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 
ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 
40 GTGCTGCCTG CTCAAATGAT GGCTCTGAOG TTCATTACAT ATATTGGTTG TGGGCTTTCA 
TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 
TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 
CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 
45 TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 
ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 
ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 
CTGAACGTCA GCATGTTCAT TGTGGTGCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 
50 CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCGC TGGCCTTACA 
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 
TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 
TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGGGGT ATCTTTGTTG TGGAAAGTTA 
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 
55 GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 
TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGOGGGAA TGGAAATGCT 
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 
ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 
60 CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 
CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 
AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 
ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 
65 TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 
TTGGTGAGTT TAGTTGTGCA TGOCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 
TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTOGTTCCGC 
AAATCCTAAA GCTC CTTT TG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 
70 1 TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 
TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 
GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 
75 AGAGGGATGA ACTGTCATCC AGAOCATGTG TCAGGAAAAT TGTGAAOGTA GATGAGGTAC 
ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 
CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCOGTTCT TTAAAAGGCA 
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 
80 GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 
TATAGAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 
TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 
ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 
CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 

874 



spend to start and stop codons) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4060 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 



WO 03/042661 



PCT/US02/36810 



AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



A92 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession ft: 
Signal sequence: 
GPS domain: 

Transmembrane domains: 
Cellular Localization: 



G protein- coupled receptor 64 

Hs. 184942 

NP 005747.1 

1-38 

564-615 

624-646, 660-682. 688-710, 733-755, 
plasma membrane 



783-805, 828-850, 858-880 



1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PAKLSWSFA 60 

PSSNEVETTS LNDVTLSLLP SNETBKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 120 

GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSETYP IMCATAEAQS 180 

TLNCTPTIKLi NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEBLG KLQCDLQDPI 240 

VCLADHPRGP PPSSSQSIPV VPRATVLSQV PKATSPAEPP DYSPVTHNVP SPIGEIQPLS 300 

PQPSAPIASS PAIDMPPQSB TISSPMPQTH VSGTPPPVKA SPSSPTVSAP ANVNTTSAPP 360 

VQTDIVNTSS XSDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 420 

LKWDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 480 

NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL 540 

TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 600 

SHLTSPGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIPL SVTLVTYIAF EKIRRDYPSK 660 

ILIQLCAALL LLNL.VFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 720 

VKVPNTYIRK YILKFCIVGW GVPAVWTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN 780 

AVPYITWGY PCV1FLLNVS MFIWLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 840 

ITWGFAPFAW GPVNVTFWYL FAIFNTLQGF FIFIPYCVAK ENVRKQWRRY LCCGKLRLAE 900 

NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGN ASTER 960 
NGVSFSVQNG DVCLKDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQM 



Fibrosis 



A93 DNA SEQUENCE 

Gene name: EGF- like -domain; multiple 6 

Unigene number: Hs. 12844 

Probeset Accession #: N67SS1 

Nucleic Acid Accession #: NMJ)15507 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACAOG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCOCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACOGGC GAAATCGCAG TGGATGGOGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTQGATGA CT QAA TGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



875 



WO 03/042661 



PCT/US02/36810 
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A94 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
MAM domain: 
EGP domain: 

Cellular Localization: 



EGP- like -domain; multiple 6 

Hs. 12844 

N67551 

NP 056322.2 

1-21 

none found 
402-546 
80-259 
secreted 



l 

I 

MPLPWSLALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSFKCKCKQG 
PEPTRTPTPK 
SLRGDVPFPK 
WNPADRDNAI 
FVKKSNNALA 
SGLCPDSLLS 



11 
I 

LLLSWVAGGF 
PGECVGPMKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA 
VNLQPFNYEE 
VNEAGEFGLI 
GFYMAVPALA 
WKKTTSEDEK 
VDD 



21 
I 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFELQY 
IPENSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQLYQ 



31 
I 

LLASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI 
RAPGTIKDRI 



41 

I 

CHYGTKLACC 
KPRPCQHRCV 
SGLRLAFNGR 
NECTMDSHTC 
KKLLAHKNSM 



LEHKDLNISV 
LLLPDLQPQS 
GTDATKSIIP 



DCSFNHGICD 
NFCLLFDYRL 
EAERGKGKTG 



51 
I 

YGWRRNSKGV 
NTHGSYKCFC 
DCLDIDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KALKNDIEER 
WKQDREDDFD 
AGDKVGKLRV 
EIAVDGVLLV 



60 
120 
180 
240 
300 
360 
420 
480 
540 



GLIOBLASTOMA 



A95 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession 



ESTs 

Hs. 293616 
AW043782 
none found 



Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



1 
I 

AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 
GTGTGACGGG 
GTCGAAATGT 
CTTCCGGTGC 
AAACCCTCTG 
GAGCTTCATC 
AAGTTCTCAA 
TTACCCCAGC 
CCTGCTGGCA 
GCACCGGCTG 
CTGCAACGTC 
GAATGCGTCG 
TGCGTGGTAT 
CGACCTGCCC 
CAGCAGCCTC 
GGGCACTGCT 
AGTTATTCCA 
TGCTCATGGG 
AACTATCTCT 
TGACATGATC 
CACCCTCATT 
AAATAGGCTG 
CGCTGGACCC 
ATGATCTAAC 
ATCAAAACCT 
AAGAAAACTT 
AAGGACTCTG 
CTCATTCTGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTGCCCGT 
GTATGTCCCT 
CTCCAAAGTT 
ACTGGTTTCT 
CTGCACTGTG 
GGTCAGGGTC 
AGACAATTTG 
TGAAACAGTG 
AGCTGTCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGTGT 
AGGTGTTGTT 
CCACTCOGGG 
AACCTGTTTG 



11 
I 

CCGGGCAGCG 
AGCAGCGCCG 
CCAGGCAACT 
CTGCCTGACT 
GGCCCAACCT 
AATGGGTTTG 
CTTTGCTCCA 
TGCGATGGAC 
GAACCCGGCA 
ATCACCTATG 
CTGGTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 
CCCTACCGCT 
CTGAGCGTGG 
GAGCCCAGGG 
AAGTCCATAT 
AAGCTCTTTA 
GCATTCCCCT 
TGTTGTGCGT 
TTTCACATTA 
GGAGAGAGCA 
AATTCTCTCT 
C AGGA GGCCA 
GCTTTGCACA 
TGGACGTGAG 
AAACCATCTA 
GAGCTTTCCT 
CATGAGTTTA 
CTGGCTCTAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGGTG 
CACGCTCCTC 
AGGCCTCTCC 
GAGTCAAGAT 
TGTTTGTTTT 
TTTT T TSTTT 
CCCGCTGAGC 
ATTGTTGCAC 
CTCTCTCCCT 
CCAGTCAGCC 
TGGCAAGAAA 
CAGCTGTCAC 
ACGCTAATTA 



21 
I 

GGAGCGGCGG 
CGGAGAGCCA 
TCATGTGCAG 
GCTTCGACAA 
TCTTCCCCTG 
AGGACTGTCC 
CCGCCCGCTA 
AGAATAACTG 
GTGGGCAGGT 
CCATCATCGG 
ACCACCAGCG 
TGCTGCTGTC 
TCAATAATGG 
CCCCACCCTC 
CACCGCCCTA 
CCCGGTCCGG 
AAGACACCAG 
ACTCTGAGCC 
GGGTTAATCT 
AGCACCTGTA 
CCTCCCCCAG 
CTTTTCTGTC 
TTCTGTTTCT 
ATGTTTCTGT 
GCTGGGTAGT 
TCACTGGATG 
ATCCTATTTG 
TAACACCCTT 
CCCTGTATAA 
CAGCAGCATA 
TCCAAGTTCT 
AGCCACTTAC 
TGAGGACCTA 
CCCAGCCTGT 
TTGCAAAGTC 
AGAGCCATGT 
TTCGCAAGGT 
CAACATCCCA 
TTTCCATTTG 
TTCCCTTCTA 
TTCCTTTAAC 
CCOGTGATAA 
TTTGAGGTTA 
CCGTGTATAG 
ACAGGGCCCG 
CCACACTGAC 
CCATTCAOAA 
AAACAGAGCC 



31 
I 

CCGCGCCATG 
GCTGCTCCCC 
CAATGGACGG 
GAGTGATGAG 
TGCCAGCGGC 
CGATGGCAGC 
CCACTGCAAG 
TCAAGACAAC 
GTTTGTGACT 
CAGCTCCGTC 
GAAGCGGAAC 
CCGCCTGGTG 
CATCCAGTAT 
CTACTCCGAG 
CTCTTCTGAC 
GAGTGCCAAC 
CCACAGCCCG 
CAGCCAGGGC 
GCTCTGACTT 
AGGATGTCTC 
ACTTCAGAGA 
AGGTCACTCT 
GTTGGAGAGA 
GCTATATTGG 
TACCTTATAG 
GTCACCCCCC 
ATGCCCCCAG 
CAGCAGTCGC 
ATTCTGGCTT 
TATCATCAGC 
CAGCTCCTAA 
CTGGTTTCTG 
ACTTGAGTTG 
CTTGCTCATT 
CTTTTTACCT 
TCAATACCTC 
CCCAATAjCCA 
GTAGTTTCTC 
GATCTATTTT 
GTTAAGGGAC 
AAGGTCCAAA 
CAAGTCACTC 
TTATTTATCA 
TCTCTATGTT 
CCTCCCTGCA 
TGATGAGGGG 
CTTCTTTCCG 
TGCAGGAAGT 



41 

1 

TGGCTGCTGG 
GGGAACAACT 
TGCATCCCGG 
AAGGAGTGCC 
ATCCATTGCA 
GATGAAGAGA 
AACGGCCTCT 
AGTGATGAGG 
TCAGAGAACC 
ATTTTTGTGC 
AACCTCATGA 
GTCCTGGACC 
GTGGCCAGCC 
GCCTTGCTGG 
ACGGAATCTC 
AGTGCCAGCT 
GGGCAGCCTG 
ACTGAAGAAG 
GTTGCCATTC 
AAGTTACAGT 
TGTTTTTCTG 
TCCCTTGGGA 
CAGCATATAA 
ATGCTCAGAA 
CATTTGGGGA 
CAAAAAAATT 
TTCAGCAGAG 
AACGTTATTT 
TAGAAATTTG 
CTCATCCTAA 
AATGCAGGCT 
GACTGTCACC 
GCCCAAAGTC 
CATGCAGCCT 
GTGCATTTGG 
CAGCAAGCTC 
GCACCTCTAG 
CTCTGAGACA 
AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 
CAGACTAACC 
AGTTCTTGAA 
TGTGCTAGTT 
GGAATAAGGG 
TAAAATGGAA 
CAGCTGAAGA 
GGGGCTAAAG 



51 
I 

GGCCGCTGTG 
TCACCAATGA 
GCGCCTGGCA 
CCAAGGCTAA 
TCATTGGTCG 
ACTGCACAGC 
GTATTGACAA 
AAAGCTGTGA 
AACTTGTGTA 
TGGTGGTGGC 
CGCTGCCCGT 
ACCCCCACCA 
AGGCGGAGCA 
ACCAGAGGCC 
TGAACCAAGC 
CCCAGGCAGC 
GCCCCCAGGA 
TATAAGTCCC 
TAACAATTTG 
TTGGGATATT 
GCGTCTCAGT 
CCCGAGATCA 
AACAGTATTG 
GTGCAGGAGA 
TTTGGGTTAG 
CCATTTGAGC 
TCAGTGGCCA 
TGGTTTTGTG 
CCCAAGAATG 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGACCTGGCT 
CAACACTGGC 
ACTTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 
CATGGGCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGGAGATC 
TGTGTGCCAG 
GGAAGCAGAA 
TTTCTTTTTT 
GTAAAACGTT 
CCAGGTAGAG 
AATGTTCAGT 
TGGCATTCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2B20 
2880 



876 



WO 03/042661 



PCT/US02/36810 



10 



TGATCCTGTT CTGTAGACTT TTCTTTC TTT TTTTAACCAA ATCCAAAGGA TOTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA QAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTG TATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGA AOTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTOCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGA I AAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

3g A ^^" A " 1 " 1 ' WTGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3 540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

ID GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TQTGCAAAWG GWMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGOG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



20 



25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A96 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
LDLa domains: 
Cellular Localization: 



ESTs 

Hs. 293616 
AW043782 
none found 
1-17 
169-191 

28-66, 70-108, 112-149 
plasma membrane 



1 
I 

MWLLGPLCLL 
EKECPKAKSK 
KNGLCIDKSF 
VI FVLWALL 
YVASQAEQNA 
NSASSQAASS 



21 



31 



41 



51 



11 
I 

LSSAAESQLL PGNNFTNECN 1PGNFMCSNG RCIPGAWQCD GLPDCPDKSD 60 
CGPTPFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 
ICDGONNCQD NSDEBSCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 
ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL WLDHPHHCN VTYNVNNGIQ 240 
SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 



LUNG 



A97 DNA SEQUENCE 

Gene name: putative GPCR, Weakly similar to dJ365012.1 
Unigene number: Hs. 2 5 68 97 

Probeset Accession #: BE001836 
Nucleic Acid Accession #: BE001836 

Coding sequence: 1-29SS (underlined sequences correspond to start and stop codons) 



ATGGCCGTCA 
ATGTTAAGTA 
, GGTTTAGGCC 
GGGAATATTG 
AACATAGATG 
GAGAAGTCAA 
GAATATCAGC 
AATTTTCTGA 
AGAGCAAAGG 
GACAGCTACA 
GCTGGAGCAC 
TGTGAGAGAA 
TTGAATTCAT 
AAAGCATATG 
AGCATCGTTG 
ATTGAACATG 
GGCTCTTTCA 
AAGGATGATG 
TGTGAGTCCT 
CTGAACAAGA 
GTGCAAAATC 
TCGGTGGTGT 
TCCAATTCAA 
GTAACCAACT 
ACATTAGAAA 
AAATTCATTG 
TATCAGATTA 
GGGTCAGACC 
CTGGGGAACA 



11 
I 

AATCTTTTTC 
TGATAGGGAT 
CACATAGCAC 
TTCACTCTAA 
CTTCCTCTTT 
ATTCTGACAC 
TGCTGCTTCA 
AGCTCTTGAA 
CTACCACAGA 
CCTGGTTTCC 
TCCCAAGCTG 
CAAAGATTTG 
CTTCTGCTAT 
AAAGAATTCA 
CTGGGTATGA 
TTGCOGAGAA 
GAGTGTTCGG 
AATATACCCT 
CTGGGTGGCA 
ATTTCAGTAT 
TTTCTGTCAT 
CGATTCTGAG 
CAATGGAGGA 
GGACAGTCTT 
ACATCAGCAC 
ACTGGAAAGG 
AAATGTGTCC 
AATTCCAGAG 
TTCTACCCGT 



21 

I 

CTTCAACACT 
CAGTGGCCCA 
TAAAGCTCTT 
AAAAGATGTT 
OCTGATTAGT 
AATCCAGCAA 
GGTGACCTAT 
GCCTCCATTA 
CTGCAACAGC 
TCCCTCATGC 
TGAATGTCAT 
GGGCACTTTC 
ATACTCCAAA 
AGGTTTTGAG 
AGTTGTTGGC 
GGCTAAGACA 
AAAAGCCCAG 
GCCCTGCAGC 
GGTCATCAGG 
GATTGTAGGC 
CATTCGGCAA 
CAATATTTCA 
TGTCATCAGT 
ACTGCGGGAA 
TCTGGTGCCT 
GATTCCAGTG 
CCAAAATACA 
ATCCCTTOCA 
TTCCAAAAAT 



31 
I 

GGGAATGATG 
GCTGGTGTGG 
CAACATGCAA 
TCTATTAGAA 
GATGGCCATC 
GTAACTATAA 
AGAGATTCCA 
TTATGGTCAC 
CTGAATGGAG 
CTTGATCCCC 
CTCAACAACC 
AAAATTAATG 
TATGCAAATG 
TCGGTTCAGG 
TCCAGCAGTG 
GCCCTTCACA 
TGTAATGACA 
AGTGGCTACA 
GAGACTTGTG 
AATGCCACTG 
AACCCATCAA 
TCTCTGTCAC 
ATAGCTGACA 
GAAAAGTATG 
CCGACAGCTC 
AACAAAAGCC 
TCTATTCCCA 
GAAACTATTA 
GGAAATGCTC 



41 
I 

GGCTAGACCC 
AAGACAAG7G 
TTAGAAGCTC 
TTTACTTACT 
AGTTAACCCA 
AAACTGATGG 
AGGAGAAAAG 
ATGGGCTAAT 
TCCTGCAGTG 
AGAACTGCTA 
TCAGCCAGAG 
AAAGGTTTAC 
GAATTGAAAT 
TCACCCAATT 
CATCTGAACT 
AGCTGTTTCC 
TTGTCTTTGG 
GGGGAAACAT 
TGCTCTCTCT 
AGGCAGCTGT 
CCACAGTGGG 
TGGCCAGCCA 
ATATCCTTAA 
CCAGCTCACG 
TTCCTCTGAA 
AACTCAAAAG 
TCAGAGGCCG 
TCAGCATGGC 
AGGTCAATGG 



51 
I 

AGACCCAATC 
GGTCATACTG 
AAATGTTGAC 
CTTTCATGAA 
AGTGCACTCA 
CCCAGTCGAA 
AGATTTGAGA 
TAGAATTATC 
TACCXGTGAA 
CCTTCACACG 
TGTCAATTTC 
AAATGACCTT 
TCAACTTAAA 
TCGAAATGGA 
GCTGTCAGCC 
ATTAGAAGAC 
ATTTGGGTCC 
CACAGCCAAG 
GCTTGAAGAA 
GTCATCCTTC 
GAATCTGGCT 
TTTCAGGGTG 
TTCAGCCTCA 
GTTACTAGAG 
TTTTTCTCGG 
GGGTTACAGC 
TGTGTTAATT 
CTCGTTGACT 
ACCTGTGATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



877 



WO 03/042661 



PCT/US02/36810 



10 
15 
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TCCACGGTTA 
TCAAACCTGA 
GCAGGCTGCC 
ACCTCCTTCT 
TGGATCACCT 
GAGGCTTTGT 
ATGGTGAACA 
GTGGACACCA 
TTCTACCTCT 
ATCCTCGTGT 
TATGGGTGCC 
TACAAAAGGA 
TTTGTTGTCC 
ACAAAGCTCT 
ATCOGCGTGG 
GGAATAGGAA 
AATGCATTCC 
CAACTTCTGT 
TCATCAGATT 
GGCCATTATG 
GTCTCAAATG 



TTCAAAACTA 
GCCAGCCTCA 
ACCTAGTGAA 
CCATATTGAT 
ATGTGGGACT 
TTTGGAAGCA 
TAGCCCTGTC 
CGGTGAACCC 
CTTTGTTCTT 
TCCATCACAT 
CTCTCATTAT 
AAGATGTGTG 
CTGCACTGGC 
GGAGGCCGAC 
GGAAGAGCCT 

CAATAGTGGA 
AGGGATTTTT 

TCAACAAGTT 
TATCTGCCAA 
CATTTTCTCA 
AATAA 



TTCCATAAAT 
TTGTGTGTTT 
TGAAACTCAA 
GTCACCTTTT 
GGGTATCTCC 
GATTAAAAAA 
CCTCTTGATT 
TTCTGGAGTC 
CTGGATGCTC 
GGCCCAGCAT 
ATCTGTCATT 
TTGGCTTAAC 
TATTGTGGCT 
TGTTGGGGAA 
CCTCATTCTG 
CAGCCAGAAT 
TATCTTATGC 
GTCTGCCTTA 
ACCCAAATTC 
TACTGGAGAT 



GAAGTTTTCC 
TGGGATTTCA 
GACATOGTGA 
GTCCCCTCTA 
ATTGGAAGTC 
AGOCAAACCT 
GCTGATGTCT 
TGCACAGYTG 
ATGCTTGGCA 
TTGATGATGG 
ACCATTGCTG 
TGGTCCAATG 
GTGAACTTCG 
AGACTGAGTC 
ACCCCTCTGC 
CTGGCTTGGC 
TTTGGAATAC 
AGTTCTTGGA 
TCAAAGCCTT 
TCCTCCGACA 



TA TTTTTTT C 
GTCATTTGCA 
CGTGCCAATG 
CAATCTTCCC 
TCATTTTATG 
CTCACACACG 
GGTTTATTGT 
CTGTGTTCTT 
TCCTGCTGGC 
CTGTTGGATT 
TCACGCAACC 
GAAGCAAACC 
TTGTGGTGCT 
GGGATGACAA 
TAGGGCTCAC 
ATGTTATTTT 
TCTTGGACAG 
AGCAAACAGA 
TCAACCCACT 
ACATCATGCT 



CAAGATAGAG 
GTGGAACGAT 
TACTCACTTG 
CGTTGTAAAA 
CCTGATCATC 
TCGTATTTGC 
TGGTGCCACA 
TACACACTTC 
TTACCGGATC 
TTGCCTGGGT 
TAGCAATACC 
ACTCCTGGCT 
GCTAGTTCTC 
GGCCACCATC 
CTGGGGCTTT 
TGCTTTACTC 
TAAGCTGCGA 
AAAGCAAAAC 
GCAAAACAAA 
AACTCAGTTT 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



A98 PROTEIN SEQUENCE 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Pfam domain: 
Transmembrane domains: 
Cellular Localization: 



putative GPCR, Weakly similar to dJ36S012.1 
Hs. 256897 

none found 

1-17 

7tra_2 {561-820) 

545-567, 571-593, 605-627, 641-663, 677-699, 721-743, 
plasma membrane 



767-789 



MHALLLCFSV 
LIRIIRAKAT 
QSVNFCERTK 
QFRNGSIVAG 
PGFGSKDDEY 
AVSSFVQNLS 
LNSASVTNWT 
KRGYSYQIKM 
NGPVISTVIQ 
QCTHLTSFSI 
TRRICMVNIA 
LAYRIILVPH 
KPLLAFWPA 
LTWGFGIGTI 



11 

I 

LNGASGLSLL 
TDCNSLNGVL 
IWGTFKINER 
YEWGSSSAS 
TLPCSSGYRG 
VIIRQNPSTT 
VLLREEKYAS 
CPQNTSIPIR 
NYSINEVFLP 
LMSPPVPSTI 
LSLLIADVWP 
HMAQHLMMAV 
LAIVAVNFW 
VDSQNLAWHV 



21 
I 

QSPVEEYQLL 
QCTCEDSYTW 
PTNDLLNSSS 
ELLSAIEHVA 
NITAKCESSG 
VGNLASWSI 
SRLLETLENI 
GRVLIGSDQF 
FSKIBSNLSQ 
FPWKWITYV 
rVGATVDTTV 
GPCLGYGCPL 
VLLVLTKLWR 
IFALLNAFQV 



31 
I 

LQVTYRDSKE 
FPPSCLDPQN 
AIYSKYANGI 
EKAKTALHKL 
WQVIRETCVL 
LSNISSLSLA 
STLVPPTALP 
QRSLPETIIS 
PHCVFWDFSH 
GLGISIGSLI 
NPSGVCTAAV 
IISVITIAVT 
PTVGERLSRD 
RTVTITYCIV 



41 
I 

KRDLRNFLKL 
CYLHTAGALP 
EIQLKKAYER 
FPLEDGSPRV 
SLLEELNKNF 
SHFRVSNSTM 
LNFSRKPIDW 
MA8LTLGKIL 
LQWNDAGCHL 
LCLIIEALFW 
FFTHFFYLSL 
QPSNTYKRKD 
DKATIIRVGK 
K 



51 
I 

LKPPLLWSKG 
SCECHLNNLS 
IQGFESVQVT 
FGKAQCNDIV 
SMIVGNATEA 
EDVISIADNI 
KGIPVNKSQL 
PVSKNGNAQV 
VNETQDIVTC 
KQIKKSQTSH 
FFWMLMLGIL 
VCWLNWSNGS 
SLLILTPLLG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A99 DNA SEQUENCE 

Gene name: putative G-protein coupled receptor 
Unigene number: Hs. 16085 

Probeset Accession ft: P07953 
Nucleic Acid Accession 8: NM_016334 

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons)' 



AGCACCTGGG 
GGGGAGTGGG 
CTCCAGCATC 
GCGCCAATTG 
GACGTTTGCA 
GAATAGCAGC 
GGTTTTCATG 
TAAACAACGA 
ACTAGGAGAT 
CATCAGCCGG 
TGTCAACTGC 
TCTAGCCCTG 
GATGGCAATG 
TTTCTGGGQA 
TATTCAACAG 
TGATCTATAT 
TAATTTTCTT 
CAATATTGTT 
TGTGAATTAT 
TCTTGTTGGA 
CTTTTATGCC 
AATGGGCATG 
COGCACCATA 
TGATGTGATC 



11 
I 

AAAAGGCAGA 
AAGTGGAGGC 
ATGATTACCT 
TTTAAAGACT 
TTTTCTTGCA 
TCCCGTTATT 
GTGCCTTTTT 
CTGCTTTTTT 
CCCTTTCCCA 
GTTGGTGTOA 
CCATACACTT 
GAACGGCGAC 
GCACGGAGAA 
ATGATAAAAA 
GAAGTGGATG 
GCTACCAAGG 
GGTTACTTTT 
TTTGATCGAG 
CTGGGAATCC 
ATAATCATCG 
ATCTCTAGCA 
TACTTTGTCT 
ATCACTGAAG 
TTCCTGGTCA 



21 
I 

CCGTGTGAGG 
AGGAGCCTTC 
CCCAAATACT 
ATGAGATACG 
CCATGTTTGA 
TTCACTGGAA 
ACATTGGCTA 
OCTGTCTCTT 
TTCTCAGCCC 
TTGGAGTGAC 
ACATGTCTTA 
TGCTGCAAAC 
CAATGTTCCA 
GTGTTACCAC 
CTTTGGAAGA 
AGAGAATAGA 
TCTCTATTTA 
TTGGGAAAAC 
AATTTGATGT 
TCACATCCAT 
GTAAGTCCTC 
CCTCTGTGCT 
TCCTTGGAGA 
GCGCTCTCTC 



31 
I 

GGGCCTGTGG 
CTTACACTTC 
ATTTTTTGGA 
TCAGTATGTT 
GCTCATCATC 
AATGAACCTG 
TTTTATTGTG 
ATGGCTGACC 
AAAACATGGG 
TCTCATGGCT 
CTTCCTCAGG 
CATGGATATG 
GAAGGGGGAA 
TTCAGCATCA 
ATTAAGCAGG 
ATACTCCAAA 
CTGTGTTTGG 
GGATCCTGTC 
GAAGTTTTGG 
CAGAGGATTG 
CAATGTCATT 
GCTGATCCGA 
ACTGCAGTTC 
TAGCATACTC 



41 

I 

CCCCAGCGTG 
GCCATGAGTT 
TTTGGGTGGC 
GTACAGGTGA 
TTTGAAATCT 
TGCGTAATTC 
AGCAATATCC 
TTTATGTATT 
ATCTTATCCA 
CTTCTTTCTG 
AATGTGACTG 
ATCATAAGCA 
GTGCATAACA 
GGAAGTGAAA 
CAGCTTTTTC 
ACCTTCAAGG 
AAAATTTTCA 
ACAAGAGGCA 
TCCCAACACA 
CTGATCACTC 
GTCCTGCTAT 
ATGAGTATGC 
AACTTCTATC 
TTCCTCTATT 



51 
I 

CTGTGGOCTC 
TCCTGATCGA 
TTTTCTTCAT 
TCTTCTCCGT 
TAGGAGTATT 
TGCTGATCCT 
GACTACTGCA 
TCTTCTGGAA 
TAGAACAGCT 
GATTTGGTGC 
ACACAGATAT 
AAAAGAAAAG 
AACCATCAGG 
ATCTTACTCT 
TGGAAACAGC 
GGAAATATTT 
TGGCTACCAT 
TTGAGATCAC 
TTTCCTTCAT 
TTACCAAGTT 
TAGCACAGAT 
CTTTAGAATA 
ACCGTTGGTT 
TGGCTCACAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



878 



WO 03/042661 



PCT/US02/36810 



ACAGGCACCA 
CAGTGGTTTC 
ATAAACAAAC 
AGGTGATACT 
ACAATACTCA 
CCAAGAAACT 
TGAGCCAAAC 
TTGAGATTGA 



GAGAAGCAAA 
AAAATTTAGA 
AAAATGCTAT 
ATGACCATGA 
GCAGAGAGCA 
AAAGGTGAAA 
AOGTAGGATT 
CTCATTAAAA 



TGGCACCTTG 
TATAAGAGGG 
GGTAGCATTT 
GTAGCATCAG 
TCCOGTGTGG 
AATACACTGG 
TCCGTTTTAA 
TCAGAGACTG 



AACTTAAGCC 
GGGAAAAATG 
TTCACCTTCA 
CCAGAACATG 
ATATGAGGCT 
AACTCTGGGG 
GGTTCACATG 
T 



TACTACAGAC 
GAACCAGGGC 
TAGCATACTC 
AGAGGGAGAA 
GGTGTAGAGG 
CAAGACATGT 
GAAAAGGTTA 



TCTTAGAGGC 
CTGACATTTT 
CTTCCCCCTC 
CTAACTCAAG 
CGGAGAGGAG 
CTATGGTAGC 
TAGCTTTGCC 



1500 
1560 
1620 
1680 
1740 
1800 
1860 



A100 Protein sequence 



Unigene number j 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



putative G-protein coupled receptor 
Hs. 16085 
NPJ>57418.1 
none found 

5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 
plasma membrane 



342-364, 380-402, 424-446 



I 

MSFLIDSSIM 
EILGVLNSSS 
MYFFWKLGDP 
VTDTDILALE 
SENLTLIQQB 
1FMATINIVF 
ITLTKFFYAI 
FYHRWFDVIF 



11 
I 

ITSQILFFGF 
RYFHWKMNLC 
FPILSPKHGI 
RRLLQTMDMI 
VDALEELSRQ 
DRVGKTDPVT 
SSSKSSNVIV 
LVSALSSILF 



21 
I 

GWLFFMRQLF 
VILLI LVFMV 
LSIEQLISRV 
ISKKKRMAMA 
LFLETADLYA 
RGIEITVNYL 
LLLAQIMGMY 
LYLAHKQAPE 



31 
I 

KDYEIRQYW 
PFYIGYFIVS 
GVIGVTLMAL 
RRTMFQKGEV 
TKBRIEYSKT 
GIQFDVKFWS 
FVSSVLLIRM 
KQMAP 



41 
I 

QVIFSVTFAF 
NIRLLHKQRL 
LSGFGAVNCP 



FKGKYFNFLG 
QHISFILVGI 
SMPLEYRTII 



51 
I 

SCTMFELIIF 60 

LFSCLLWLTF 120 

YTYMSYFLRN 180 

IKSVTTSASG 240 

YFFSIYCVWK 300 

IIVTSIRGLL 360 

TEVLGELQFN 420 



A101 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha l(III) chain precursor [H. sapiens] 
Unigene number: Hs.19322 
Probeset Accession #: AA088458 
Nucleic Acid Accession #: AA088458 

Coding sequence: . . 862-1995 (underlined sequences correspond to start and stop codons) 



1 
i 

GCCCTTGGAC 
CTGAAGAAAA 
GCGCGGGGCC 
CTGGGCCAGA 
CGGCTACTGC 
TGTGCCAGCC 
ACCTCACOCC 
CTCACCCAGG 
GCGCTCATTA 
GATTCCACCT 
AGCCCTTCGA 
GCCCAGGCAC 
GCCTGCCCCC 
ACATGGGCTG 
TGGACAGTGG 
GGTCCCATCT 
AGAGGGCGCG 
CAGGACGAGG 
GTAAGOGGGG 
CTGGCCAAGG 
GGCCTGCATG 
TTGCCCCACG 
GACAGCTCCC 
CTGGGGTCCT 
GAGAGGCCAC 
GGCAGGTCCC 
GGAGTACGCA 
GAACCAGGGG 
TCAGTGTGTG 
CCCGATGOGG 
ACACTGTCCC 
CCTTCCGGAG 
TGCTGCACCT 
GCCCTCCTAC 
ACCTCCTGGG 
AGGTGGACTG 
GGCTGGGGTC 
TGGGGGATCC 
GGTGACTTCA 
GAGACAGGCT 
AAAGAAATAG 
CACGAGGGGA 
GCAGACCCTG 
GAGCAGCGTC 
GCGTGCACAC 
CAGAAGTGTC 



n 
I 

ACTGACATGG 
AGGAGCTGGA 
GCGACTGGTA 
GCAGAGCCAG 
CCAAGGTACA 
GGGCCCTGCC 
CGGTCTGGCA 
AGGTGACCGA 
AGCAGCTGTT 
TCATCTAGTC 
GGGTGGGCGC 
AGTCCCGGAG 
GGCTGGTCCC 
GGGGCTCTCT 
GGTACCCCTC 
TCAGGGAAAG 
GGGCGGCTCC 
TGGCTGTAGC 
GGTGCCTGCC 
CTGAGGGACC 
TGCCTCCCAC 
TTGAGTCCCA 
AGGCACGTCA 
GCTCACCCCC 
CTCCCTCAGC 
CTTGGGTGTC 
CTGGTGGGGG 
CACGGCAACA 
TGGGGCGCAG 
GGTCAGTGOG 
ACAAGGCACC 
CCCAGCTCCA 
GGTCTGCAGG 
CCTGAAGATG 
CAGGAAAGGG 
CAGCGCAGTG 
TGCCCACCAG 
TGGCATCTTT 
TCAGGAGACC 
GGCACCTCOG 
GTCCTCCCAG 
GAATTTAAAG 
CCTGGAGCCT 
CCTGGGCTCT 
TGTGATGACA 
CCCAGTTGAG 
ATCAAGTTCC 



21 
I 

ACTGAAGGAG 
GCAGGAGAAG 
CCAGCAGCAG 
CGCCGACTTT 
AGAGGTGGCC 
CCCGTCCTCC 
GCAGCAGACC 
GAAGAGTGAG 
TGAGGCCCGC 
CTTGTGGGCC 
CCCATCGCAC 
TGGGCGCCTT 
CGCACCGAGC 
TGAGTCCGCA 
CATGAGTTAG 
GCACTGCCCA 
GACGCGGGTC 
TCGGACGGAC 
TGGCTGGGGA 
CTGGCTGCAG 
AGACCCTGGG 
CACAACATCC 
TAGGCAAAGC 
CTTTGCTCTC 
CAAGGAAAAC 
ACTCCCTCAG 
GGCCCTGCTC 
GCATCGATGG 
GGCCTCOGAT 
TGGGGGGCGC 
TGTCTCAGAG 
TGCTAACCTG 
GGTGTCCCAG 
GGAGTGGGCT 
TGCAGGTCCT 
GGTGGGCCAG 
GGCCTCCCCA 
ACTGGACTGG 
GCCCACATAG 
GAAAAACTGC 
TTTACAGCTT 
GCCCCGGCTG 
GCCCTAGGAC 
ATCCGCGAGG 
CCCGGAAATG 
AATCTGCCCC 
AAGGAAAAGG 



31 
I 

TAGAATGGAG 
GAGGTGCTGC 
CTGCAACGAG 
GGGGCTGCAG 
CGGTGCCTGG 
TCCGGGCCCC 
ATCCTCATGC 
CGCATCACGC 
GCCCTGAGCC 
GCGTGGGCCC 
CCACCCTCTC 
CCTGCCGCCC 
GCTTGACTCC 
TAGTCCGCAG 
CGTCCCCCCG 
CGCCAGGCTG 
CAAGGGCAGC 
GGAAGTAGAT 
GCCCCAGGGA 
CGGATCGGCA 
GTGATGGCCT 
TGTGAGCCTG 
CTGTTTCCCC 
ACGCCCAGCC 
GAGAACCCCC 
CCCCTGCCCA 
AGCCCAACCT 
GTTCTGCAGC 
GCGGGGTCAG 
AGGGCCCCCT 
GAGGGGCCCT 
CCCACAGCAA 
GACAGGCCCA 
TTCCAGGGGA 
GAGGGCCTGT 
TGGCAGCCAG 
CGTCTGCCTT 
AAGCAGGAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 
GCAGGGTCTA 
GCTGGGCGGG 
TGCCAGTAGC 
TCTCAGGATG 
AGAGGAACAC 
AACATCTCAG 



41 

I 

CACGAGGACA 
TGCAGGGTTT 
TGCAGGAGCG 
GGAGCCCCCG 
GGGAGCTGCT 
CCTGCCCTGC 
TGAAGGAGCA 
AGCTGGAGCA 
AGCAGGAOGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCCAGATG 
GTTTKGGCTC 
CTACTACTGG 
TTTCCAGCGG 
CACTTCCAAC 
TTCCCGCTCA 
GGAGGGGGTG 
TAGCGGTCGG 
CGCCGGGTGG 
TCCCCCTCTT 
GCTCCCCAGG 
CGACTCAGGA 
TGTCCCCAGG 
AGGGTACAGG 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGCCCC 
TGCGTGGGGG 
CGTGTCCAGG 
GGCAGGCAGC 
CCCCACAGAG 
AGTCAGCCCA 
CATAAGGATG 
GCCCCACAGC 
GGAGAAGCCC 
TGAGGGTGCC 
CAGAACAGTG 
CGCAGCTGAA 
TGGTGTTCGG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG 
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
OCGGGCGTGG 



51 
I 

CTGACATGGA 
GGAGATGATG 
CCAGCGCCGC 
CCCACTGGGG 
GGCTGCAGCC 
CCTGACGTCC 
GAACCGACTC 
GGAGAAGTCG 
GGGACCTCTG 
CCTGGCACTC 
CCCCCGGCAG 
GGCTCCCCAG 
CTGGTTGYTG 
CCGCTGTCAG 
TGCCGCCCTG 
AACGGGCAGC 
ACCAGGGCAC 
GGGAOGGCCT 
ACTTCAGGTT 
GCGAGAGCTT 
GGCCGGGACG 
AGGGCCCCCA 
TTTCCAAGGC 
TTTCAGCTGG 
AGGAGGCTGG 
CGCTGGTGCT 
AGTGTCACCA 
CGATGCGGGG 
GCGCAGGGCC 
GCACTTTGGT 
GTGGCAACTC 
CCACATTCCC 
GCATGCAGCT 
TCAGGCCTGG 
CCCAGCACCC 
CCCGTCAGCA 
TGCCATGCCC 
TCTGTCCCGG 
GCGGAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGG 
GGCCTCAGGA 
TGGTTCACGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



879 



WO 03/042661 



PCT/US02/36810 



CTGGAATCCC AGCACTTGAQ GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 



A1Q2 DNA SEQUENCE 

Gene name: TMPRSS3a 

Unigene number: Hs. 2 982 41 

Probeset Accession #: AI538613 

Nucleic Acid Accession #: AB038157 

Coding sequence: 202-1566 (underlined sequences correspond to start and .stop codons) 



} 11 21 31 41 51 

I I I I I I 

ACCGGGCACC GG ACGG CTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTOG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT. GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA IS 60 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCOGCTT CCCTGGTTCA AGCGATTCTC 1860 

TTGCC ? CAGC ^CCCCAQTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



A103 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession &: 
Signal sequence: 
Transmembrane domains » 
Tryp_SPc domain: 
Cellular Localization: 



TMPRSS3a 

HS. 298241 

AI538613 

BAB20077 

none found 

43-65, 239-261 

216-444 

plasma membrane 



1 11 21 31 41 51 

I I I I I | 

MGENDPPAVE APFSPRSLFG LDDLKISPVA PDADAVAAQI liSLLPLKFPP IIVIGIIALI 60 

1ALAIGLGIH FDCSGKYRCR SSPKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVP 120 

TAASWKTMCS DDWKGHYANV ACAQLGPPSY VSSDNLRVSS LEGQFRBBFV SIDKLLPDDX 180 

VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFOGYH 240 

LOGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVBKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTPNEMI QPVCLPNSEE NPPDGKVCWT SGWGATEDGA GDASPVLNHA 360 

AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVOQER RLWKLVGATS 420 
PGIGCAEVNK PGVYTRVTSF LDWIHEQMER DUCT 



880 



WO 03/042661 



PCT/US02/36810 



A104 DMA SEQUENCE 

Gene name: Homo sapiens G protein- coupled receptor (HOT7T175) 
Unigene number: Hs. 2 0822 9 

Probeset Accession ft: AI819198 
Nucleic Acid Accession #: nm 032551.1 

Coding sequence: 1 .i 197 (underlined sequences correspond to start and stop codons) 



I 



ATGCACACCG 
GGCTGCCCGG 
GACGCCPGGC 
TCGCTGGTCA 
ATCGCCAACC 
CTGCTOTACC 
ATCCAGCAGG 
TGGTACGTGA 
GCTGTCAGCC 
CACCGCCTGT 
GAGCGCGCCT 
TGOGCCTGCT 
GCCGATAGCG 
GTCTCGCGGC 
CTGTTCCTGG 
GCCTACGCGC 
CTGCTCTACG 
GCGCCGCGCC 
GCGGAGCTGC 
AGTGGGCTGG 



11 
I 

TGGCTACGTC 
GCTGTGGCGC 
TCGTGCCGCT 
TCTACGTCAT 
TGGCGGCCAC 
GGCTGCCCGG 
TCTCGGTGCA 
CGGTGTTCCC 
TCAGCATCTG 
CACCCGGGCC 
TCGCACTGTA 
ATGCGGCCAT 
CCCTGCAGGG 
TGGTGGCGGC 
TGCTGCAGGC 
TTAAGACCTG 
CCTTCCTGGG 
GCCCCCGCCG 
ACCGCCTGGG 
CCGCGCGCGG 



21 
I 

CGGACCCAAC 
CAACGCCTCG 
CTTCTTCGCG 
CTGCCGCCAC 
GGACGTGACC 
CTGGGTGCTG 
GGCCACGTGT 
GTTGCGCGCC 
GGTAGGCTCT 
GCGCGCCTAC 
CAACCTGCTG 
GCTGCGCCAC 
GCAGGTGCTG 
CGTGGTCCTG 
GCTGGGCCCC 
GGCTCACTGC 
CTCQCACTTC 
CCCCCGCCGG 
GTCCCACCOG 
GCTGTGCGTC 



31 
I 

GCGTCCTGGG 
GACGGCCCAG 
GCGCTGATGC 
AAGCCGATGC 

GGCGACTTCA 
GCCACTCTGA 
CTGCACCGCC 
GCGGCGGTGT 
TGCAGTGAGG 
GCGCTGTACC 
CTGGGCCGGG 
GCAGAGCGCG 
CTCTTCGCCG 
GCGGGCTCCT 
ATGTCCTACA 
CGACAGGCCT 
CCCGGACCCT 
GCCCCCGCCA 
CTGGGGGAGG 



41 

I 

GGGCACCGGC 
TCCCTTCGCC 
TGCTGGGCCT 
GGACCGTGAC 
GCTGCGTCGC 
TGTGCAAGTT 
CCGCCATGAG 
GCACGCCCCG 
CTGCGCCGGT 
CCTTCCCCAG 
TGCTGCCGCT 
TCGCCGTGCG 
CAGGCGCCGT 
CCTGCTGGGG 
GGCACCCACG 
GCAACTCCGC 
TCCGCCGCGT 
CGGACCCCGC 
GGGCGCAGAA 
ACAACGCCCC 



51 
I 

CAACGCCTCC 
GCGGGCCGTG 
GGTGGGGAAC 
CAACTTCTAC 
CTTCACGGCC 
CGTCAACTAC 
TGTGGACCGC 
CCTGGCGCTG 
GCTCGCCCTG 
CCGCGCCCTG 
GCTCGCCACC 
CCCCGCGCCC 
GCGGGCCAAG 
CCCCATCCAG 
CAGCTACGCC 
GCTGAACCCG 
CTGCCCCTGC 
AGCCCCACAC 
GCCAGGGAGC 
TCTCTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



A105 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession # : 
Signal sequence: 
Pfam domains: 
Transmembrane domains t 
Cellular Localization: 



Homo sapiens G protein- coupled receptor (H0T7T175) , mRHA 
Hs. 208229 

AI819198 
none found 
7tm_l [59-323] 

43-6S, 86-108, 122-144, 159-181, 203-225, 260-282 
plasma membrane 



1 11 21 31 41 SI 

I I t I | ! 

MHTVATSGPN ASWGAPANAS GCPGCGANAS DGPVPSPRAV DAWLVPLPFA ALKLLGLVGN 60 

SLVIYVICRH KPMRTVTNPY IANIAATDVT FLLCCVPFTA LLYPLPGWVL GDFMCKFVNY 120 

IQQVSVQATC ATLTAMSVDR WYVTVPPLRA LHRRTPRLAL AVSLSIWVGS AAVSAPVLAL 180 

HRLSPGPRAY CSEAFPSRAL ERAPALYNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240 

tSfftSS?™ J^^V^ VSRLVAAWL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 300 

AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR PGPSDPAAPH 360 
AELHRLGSHP APARAQKPGS SGLAARGLCV LGEDNAPL 

A106 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



integrin, beta 8 
Hs. 52620 
AA479726 
NM_002214 

680-2990 {underlined sequences correspond to start and stop codons) 



1 
I 

CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGOG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGT0G 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATOTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 



11 

I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGOGGAG 
GCCGAGOGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATOTGCGGCT 
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAAOGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGATCTC 



41 
I 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCTTACCT6C 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 



51 



AGCCAGGACG 


60 


AACTAATGGT 


120 


CAGACTTTTT 


180 


GAATGTACAT 


240 


TGGCTCTTCG 


300 


CTCTTTTCTT 


360 


GCCGGGCCCT 


420 


GAGCCGGGAG 


480 


ACCGCTTGCT 


540 


CTCGCCCGCG 


600 


CGGCGGGCGC 


660 


GCTGCATTTG 


720 


TGGGTGTTTT 


780 


AATGCAGCAT 


840 


GAGGATTTCA 


900 


AGCAAAGGCT 


960 


GAAAATGAAA 


1020 


GCCGAAGCTA 


1080 


TATCTTGTTG 


1140 


AACGATTTAT 


1200 


TACGTTGATA 


1260 


TGCAGTGACT 


1320 


ACAGAGAACA 


1380 


GATACACCAG 


1440 



881 



WO 03/042661 



PCT/US02/36810 



AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG AOGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTCCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATOGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2S20 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2S80 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A107 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains t 
EOF domain: 
1KB domain: 

Cellular Localization: 



Integrin, beta 8 

Hs.52620 

AA479726 

NPJ>02205 . 1 

1-39 

682-704 

552-584 

54-469 

plasma membrane 



11 21 31 

> I I 



41 


51 




1 

OGEDNRCASS 


1 

NAASCARCLA 


60 


YPSVHVIIPT 


ENEINTQVTP 


120 


NNIBKXNSVG 


NDLSRKMAFF 


180 


PHGYIHVLSL 


TBNITEFEKA 


240 


LVMTDQTSHL 


ALDSKLAGIV 


300 


FAVQGKQFHW 


YKDLLPLLPG 


360 


FNITAICPDG 


SRKPGMEGCR 


420 


IHIHRNCSOQ 


CEDNRGPKGK 


480 


SGRGVCVCGK 


CSCHKIKLGK 


540 


DRGQCPSAAA 


QHCVNSKGQV 


600 


MQCLHPHNLS 


QAILOQCKTS 


660 


VLIIRQVILQ 


WNSNKIKSSS 


720 


AHETFRCNF 







A108 DNA sequence 

Gene names ESTs 

Unigene number: Rs. 128899 

Probeset Accession 8: AA983251 

Nucleic Acid Accession ft: AA983251 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 * I I I I 1 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 

882 



WO 03/042661 



PCT/US02/36810 



CCGCGGCCGC 
CGCCTGCGTC 
TCCGGACGAC 
ACTCATAAGG 
GCTCCTGGAC 
CCGCGCGGAA 
CTTCTCGGAG 
TGTGGGGCGC 
TGCTGCTGGC 
TGGCTGGACG 
GGCGACGCCA 
GCGGGCCTGG 
CGGGCGGACA 
GGAGACGGCG 
TCCCCGAAAG 
CGCGGATTCC 
ATCTACGTGC 
CTGGTGGCAG 
GCCCCAGGGG 
CGGGGGTCGT 
GGGGCCCGGG 
AACAACGTGT 
CAGATTGTGC 
CACGACTCTG 
AGGCAGATTC 
ACTGTATAAC 
GATTCTCGAG 
GCTTCATTTG 
CATCCAGGGT 
TTGCTGATGG 
AAGAATTTTA 
TTGTTGTTGA 
TTTTTTTTTT 
TGGTGOGATC 
AGCCTCCCAC 
TGTAGAGATG 
TCTGCCTGTC 
TGAGCCTTTT 
ATTCTAAAAG 
GGCACACCTT 
GGGCCTATTT 
TTAAAAAGAG 
TGGGATGATT 
TTGATTGTAT 
TATATTGAAA 
TGGTTATGGT 
GAGTGCAAGT 
AAAAATTTTC 
TAGCTCCTCA 
GCAGATCATA 
GATTGTTAAG 
ATTTCCTCTC 
CAAATTCAAG 
ATGTGCAAGT 
GTATGGTATG 
TGTTTCATGA 
TTTTTTAATA 
TGAAAAAAAA 



OGCCAGCTGG 
CTGGACGTTC 
AGCCCCGGGG 
CAGTCCCTAA 
CTAGGGCCCG 
AGCGCOGCGG 
ACCGTCCTGC 
TCGCCGCTCG 
TGOGCTGCTG 
CGCAGGGCGT 
CCATCTGCTG 
ACCAGGGCGG 
AAGACGGGCC 
AGGGTGCGCC 
GAAGGCAGCT 
CATCTTCTCC 
CGTTCCTCAT 
CCrGTTGCTG 
GTAACCGCTT 
CCTCACGCCA 
CGCCCCCAAC 
ATGTCAACAT 
CACATCAAGG 
TGCCCATGAC 
AGTCCCCCTT 
OGAGAGTCAC 
GTGGAAGTCC 
CCCCCAGACT 
ATCATTGATT 
GTGTATAACA 
TAAACTGATA 
CTGCACAGGA 
TTTTTTTTAA 
TOGGCTCACT 
GTGGCTGGGA 
GGGTTTCACC 
TCAGCCTCCC 
TTTTTTTCTA 
GAAACCTGTT 
AATTTCATTG 
TGTGCTTTTT 
ACTGAAATAA 
TTAATCTTGA 
CTTAATTTTC 
TCATAAACTA 
TTGGOGTTTC 
AATTACTATA 
TTGTCTTAGT 
GCCATAACCT 
AGGCTTTTTA 
AGAAAAGCTT 
TCTTTCAGGC 
TGAATTTATT 
ACATTTCCAA 
GTCTTCTACA 
CTGAATAATG 
ACCAAAGGCA 
AAAAAAAAAA 



GCAGGCTCCC 
CGGGGGGAGG 
GCCTTCTGAC 
GGGGACCGGG 
GCGTOGTCGC 
GACAGTCAGT 
GCTCTCTGGA 
TCCGTCTCCT 
GCGGCGGGGG 
CTGGCGCATC 
CGGCAGCTGC 
CTGCGACAAT 
CCGACGGCTC 
CCCACCCGTG 
CCTCAGGGCT 
ACGCGGCGGC 
TGTTGGCTCC 
CAGATGTCTC 
GATGGAGACC 
GTCCAGCACA 
AAGGTCACAG 
GCCCACGAAT 
GCAGTATCTG 
AGCTGTGCCA 
CCCTCACACC 
TGGTGGGTTC 
GCACATGTCG 
GTATGAAAAC 
TATGATGGAA 
AATGCTTGAG 
AATTAAGGAT 
TCAAAATGCC 
TCAGACAGGG 
GCAACTTCAG 
TTACAGGTGC 
ATGTTGGCTG 
AAAGTGCTGG 
ATGCATCCAA 
TGAACTCTGT 
TAAAAAGATA 
TACCTTATGT 
ATTGTATAGT 
AGTAGTAGGT 
TGGCTTTAAG 
TCACCOGCTG 
CTTCTGTTTG 
CTTGTAAATG 
TATAAAAATT 
GAGACTTGGG 
TACTCTTGTT 
TTCAAOGAAG 
CTCAACAGGC 
TGTGTGTTCT 
CTGCTAGCAC 
CATTTATGTC 
TAAAACCAGT 
GGGGAAAATC 
AAAAAAAAAA 



GGGACTGCGQ 
GTOCGGTTQC 
TGCATCCOGA 
CCACCGGCTG 
CTCCTGGGCG 
GAOGAGGCCC 
GAOGCGCTGT 
CATCCTGGAA 
CGAGGGCCCA 
GGCTTCCAGT 
GCGTTGCGCT 
GACCGCCAGC 
GGCAGGGCTT 
AGGGCCTGGC 
TTCCOGGGGC 
CCCTCTCCCC 
GTGTTTGTCG 
CGGCCTAAGC 
ATCOCCATGA 
GCTGCCAGTT 
ACCAACTGTT 
TTCTCTGTGC 
CATCCCCCAT 
CCTTTCATGG 
AACAGTGAAC 
CTTTACTGAA 
GTGGTATTTA 
ATCTCCGAAT 
AACGGGCCTC 
TCCGAAGTGC 
TTTTATTATG 
TGTTATCTCC 
TCTTGCTCTG 
CCTCCTGGAT 
CTGCCCCCAT 
GGCTGGTCTC 
GATTACAGGC 
GGTTAAGGGG 
GAGATCAGTC 
TATATATTTT 
AGAGATCTTA 
TACTTAACTA 
GGTATAGTCA 
GTGACATCTG 
CTTCTCTGAG 
GTTTTCAGAG 
AAGATCAGTA 
CAAAGAAATG 
ATGAAATTTA 
ATCAAAATGG 
GATTGCCTTT 
ACTGTATTCA 
TTACTTATAT 
AACCAGTATT 
TATAGATATC 
GTTGGCAATT 
ATTTTACTTA 



CTGGGGGCGC 
CAGTGAAACC 
GATTTCCATC 
AGGAGGGGGA 
TOGCGGCAGA 
GGGGGTOGCC 
CCGCGCCCAG 
OGCCGCTTOG 
GCGGOGAGTA 
GTCCOGAGCG 
ACTGCTGCTC 
AGGGOGCTGG 
CATGTCTTAG 
AGOGGTGCTC 

TGCAGCGGCC 
CCTTTATCAT 
AGGATCCCCA 
TCCCCAGTGC 
CCAGCTCCAG 
GCTTGCCGGA 
TGAACTGTCA 
ACGTGGGGTA 
ACGGCCTGCA 
AGAAGATGTA 
GGGAGACGAA 
TGGCACGATT 
TAGCATTTCT 
AGCTGGAGAT 
CCTTGAGATA 
TTGTTATTAT 
CTTTTACTGG 
TTGCCCAGGC 
TCAGGCAACA 
GGCTAATTTT 
ACTCTCCTGA 
GTGAGCCACC 
AAGACGGAAA 
ATCAGTCTCA 
GTCTATTTTT 
TTACAAAGTG 
ATGAAGACAT 
TAAAACCATT 
AGAGGTAATG 
TTACTTTTAA 
CCCCATGTCT 
TTTCTGCCTA 
TGTTACAAAG 
AACCAGATAC 
CTTATTTTTC 
CTTCTCCCAC 
TTGCCAATGT 
AAAAAAAGAT 
T7GTAATTAA 
TATCGATCAT 
GGTATCATCA 
TTAATAAATA 



GCAGGACCCT 
TCCAGAGGCT 
AGOGAGTGCA 
TGGCTTAGGA 
GGGGAGTGGC 
GGGGCCACGA 
GGTGGTGCCA 
CTCCTGCAGC 
CTGCCAOGGC 
CTTCGAOGGC 
CAGCGCCGAG 
CGAGCCTGGC 
GGGTACCCAA 
CCCTGAAGGC 
TGCCAGACGC 
CGCCTTGCCC 
CTTGGGGTCC 
GCAGAGCCGA 
CAGCACCTCC 
CGCCAACTCC 
AGGGACCATG 
GCAGGCCACC 
CACGGTGCAG 
GCCTGGCTAC 
CCCAGCGGTG 
G GCAGG GGTG 
CCTTTGGATG 
GGATATGTTT 
GACTGTGATG 
TGGTTGACGA 
TATTTCTTTT 
GACTTTTTTT 
TGGAGTGCAG 
CTCCTGCCTC 
TTGTATTTTT 
CCTCAAGCAA 
GCCCCCAGCC 
TAACAGGACT 
GTATTCCACA 
GTGCTTTTGG 
ATTTTCTACA 
TTCAGAACTC 
CATCCCCTTC 
CATTCTTTTT 
TTTTGCCTTG 
ATATAGTCCT 
GATCTGATAA 
ATACTTAGTA 
GATTTACTTT 
AGGCACTAAG 
ACTGTTCTTG 
TCCAAATTAT 
AACTTTAAGG 
ACAAATCGCT 
CTTTCTATTC 
ATGATACTCA 
TTTTATGATG 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



A109 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



BSTs 
Hs. 128899 

none found 

1-11 

402-424 

not determined 



1 
I 

MLSGFLKSPS 
GEAEKGNRGE 
SGRQPRGPSD 
PRGKRRGTVS 
CCWLRCWRRG 
ARLDQGGGDN 
SPKGRQLLRA 
LVAACCCRCL 
GARAPPTRSQ 
HDSVPMTAVP 



11 

I 

TQHRAQYTPG 
PPAWIRAQQQ 
C1PRFPSASA 
DEARGSPGPR 
RGPSGBYCHG 
DRQQGAGEPG 
PPGLLPRARR 
RPKQDPQQSR 
TNCCLPEGTM 
PPMDGLQPGY 



21 
I 

GXKLPWEASI 
PRPPPAGQAP 
THKAVPKGTG 
LLGDRPALSG 
WLDAQGVWRI 
RADKDGPRRL 
RGFPSSPRGG 
APGGNRLMET 
NNVYVNMPTN 
RQIQSPPPHT 



31 
I 

GAETSRGRGS 
GTAAGGAQDP 
PPAEDGDGLG 
DALSAFRWP 
GFQCFERFDG 
GRASCLRGTQ 
PSPLQRPAZiP 
IPMIPSASTS 
FSVLNCQQAT 
NSEQKMYPAV 



41 
I 

DRERESRPEA 
RLRPGRSRGR 
APGPRARRRR 
OGALAARPSP 
GDATICCGSC 
GDGEGAPPPV 
IYVPPLIVGS 



51 
I 

AGLLWDRAAA 
VRLPVKPPEA 
LLGVAAEGSG 
RPGTPLRSCS 
ALRYCCSSAE 
RAWQRCSPEG 
VFVAPIILGS 



QIVPHQGQYL 
TV 



HPPYVGYTVQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 



A110 DMA SEQUENCE: 
Gene name: 



ESTs, Weakly similar to JE03S0 Anterior gradient-2 [R . sapiens] 

883 



WO 03/042661 



PCT/US02/36810 



Unigene number: Rs. 100606 

Probeset Accession Hi AA487468 

Nucleic Acid Accession 0: AA487468 

Coding sequence: S5-55S (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAQ GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



Alll Protein sequence i 
Gene name: 
Unigene number: 
Probeset Accession #i 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs, Weakly similar to JE0350 Anterior gradient -2 IH. sapiens] 

Hs. 100686 

AA4B7466 

none found 

1-23 

none found 
secreted 



1 11 21 31 41 51 

I I t I I I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEB GLFYAQKSKK 60 
PLMVIHHIiBD CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHBTTDKNLS PDGQYVPRIM 120 
PVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 

All 2 DMA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs.63325 
Probeset Accession #i AA411502 
Nucleic Acid Accession #: NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I 1 I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCAOOTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCOCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



All 3 Protein sequence: 
Gene name:. 
Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
LDLa domain: 
Tryp_SPc domain: 
Cellular Localization: 



Homo sapiens type II 

Hs.63325 

AA411S02 

NP_0 57509 

none found 

31-53 

54-94 

204-429 

plasma membrane/ ER 



serine protease mRNA 



884 



WO 03/042661 



PCT/US02/36810 



} 11 21 31 41 

I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW 
YFLCGQPLHF IPRKQLCDGB LDCPLGEDEE HCVKSFPEGP AVAVRLSXDR 
GNWPSACFDN FTEALAETAC RQMGYSSKPT PRAVEIGPDQ DLDWEITEN 
GPCLSGSLVS LHCLACGKSL KTPRWGGBE ASVDSWPWQV SIQYDKQHVC 
LTAAHCFRKH TDVFNWKVRA GSDKLGSPPS LAVAKIIIIE FNPMYPKDND 
TFSGTVRPIC LPFPDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV 
AYQGBVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC 
KVSAYLNWIY NVWKABL 

A114 DNA SEQUENCE : 

Gene name : TTK protein kinase 

Unigene number: Hs.l69B40 

Probeset Accession #: M86699 

Nucleic Acid Accession #: NM_003318 

Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons) 



51 
I 

VLIKVILDKY 60 

STLQVLDSAT 120 

SQELRMRNSS 180 

GGSILDPHWV 240 

IALKKLQFPL 300 

IOSTRCNADD 360 

GGPSTPGVYT 420 



1 11 21 31 41 51 

I J I I I | 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGT AAGAC T GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTAOGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCIAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 

GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3S40 

QAAAAAA ATG A TTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 



885 



WO 03/042661 



PCT/US02/36810 



TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 
TTGACATCAT TTTACTCTTQ GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 



A115 Protein sequence i 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Protein Kinase Domain t 
Cellular Localization: 



TTK protein kinase 

Hs. 169840 

M86699 

NP_003309 

none found 

none found 

510-775 

cytoplasmic and nuclear 



l 
I 

MNKVRDIKNK 
VPLSDALLNK 
ANCKKFAFVH 
EEEKKNLSAS 
NSLROTNKTK 



PEVPESKQKQ 
PPISTSKWFD 
QILATPLQNL 
LEEADNQTLD 
KSIDPWERKS 
SWKDSQVGT 
IINQISKLHA 
PVNQMAKGTT 
K 



11 

I 

FKNEDLTDEL 
LIGRYSQAIE 
ISPAQFELSQ 
TVLTAQESFS 
QSCPFGRVPV 
LKSVQNSHPK 
WQAKRKSECI 
PKSICKTPSS 
QVLASSSANE 
SYRNEIAYLN 
YWKNMLEAVH 
VNYMPPEAIK 
IIDPNHEIEF 
EEMKYVLGQL 



21 
I 

SLNKISADTT 
ALPPDKYGQN 
GNVKKSKQLL 
GSLGHLQNRN 
NliLNSPDCDV 



NTLDDYMSCF 
CISVKGRIYS 
KLQQHSDKII 
TIHQHGIVHS 



31 
I 

DNSGTVKQIM 
ESFARIQVRF 
QKAVERGAVP 
NSCDSRGQTT 
KTDDSWPCF 
ELIITDSITL 
WQIPELARKV 
RTPWKNDFP 
ILKQIGSGGS 
RLYDYEITDQ 
DLKPANFLIV 



PDIPEKDLQD 
VGLNSPNSIL 



VLKCCLKRDP 
KAAKTLYEHY 



41 

I 

MMANNPEDWL 
AELKAIQEPD 
LEMLEIALRN 
KARFLYGENM 
MKRQTSRSEC 
KNKTESSLLA 
NTEQKHTTFE 
PACQLSTPYG 
SKVFQVLNEK 
YIYMVMECGN 
DGMLKLIDFG 
VWSLGCILYY 
KQRI9IPELL 



51 

I 

S LLLKLEKNS 
DARDYFQMAR 
LNIjQKKQLLS 
PPQDAEIGYR 
RDLWPGSKP 
KLEETKEYQE 
QPVFSVSKQS 
QPACFQQQQB 
KQIYAIKYVN 
IDLNSWLKKK 
IANQMQPDTT 
MTYGKTPFQQ 
AHPYVQIQTH 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 



OVARIAN 



A116 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession tit 
Coding sequence i 



1 
I 

ATGGCTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATTCGGAGGA 
GCGTACATGA 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAQTCCC 
AATTCTGCTG 



G protein- coupled receptor 39 

Hs. 85339 

AA349893 

NMJ>01508 

1-1362 (underlined sequences correspond to start and stop codons) 




ACACGGTGTC 
CGCTGCAGCA 
GCGCCOGCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 




51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A117 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 8i 
Signal sequence: 
Pram domains: 
Transmembrane domains: 
Cellular Localization: 



G protein-coupled receptor 39 

HS. 85339 

AA349893 

NM_001508, NP_001409 
none found 

7tm 1 [72-172, 224-344] 

32-54, 68-90. 111-133, 151-173, 

plasma membrane 



221-243, 280-301, 320-342 



1 11 21 31 41 51 

I I 1 I I I 

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII PVMGLLGNSA TIRVTQVLQK 

886 



WO 03/042661 



PCT/US02/36810 



KGYLQKEVTD 
ATLLHVLTLS 
VNVPSHRGLT 
MCWNMMQVLM 
IRRIMAAAKP 
CRLSLQHANH 



HMVSLACSDI 
FBRYIAICHP 
CKRSSTRHHE 
KSQKGSLA6G 
KHDWTRSYFR 
EKRLRVHAHS 
LEPNSGAKPA 



LVPLIGMPME 
FRYKAVSGPC 
QPETSNMSIC 
TRPPQLRKSE 
AYMILLPPSE 
TTDSARFVQR 
NSAAENGFQE 



All 8 DNA sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



FYSIIWNPLT TSSYTLSCKL HTPLFEACSY 120 

QVKLLIGPVW VTSALVALPL LFAMGTEYPL 180 

TNLSSRWTVF QSSIFGAPW YLWLLSVAF 240 

SEESRTARRQ TIIFLRLIW TLAVCWMPNQ 300 

TPPYLSSVIN PLLYTVSSQQ PRRVFVQVLC 360 

PLLFASRRQS SARRTEKJFL STPQSEAEPQ 420 
HEV 



bone tnorphogenetic protein receptor IB (ALK-6) 

Hs. 87223 

AA250737 

NM_001203 

274-1782 (underlined sequences correspond to start and stop codons) 



1 
I 

CGCGGGGCGC 
GAGGACGCGG 
GTGAAAGGAA 
CATAACCATT 
TGCCATAAGT 
AATGTGGGCA 
TTGCGTTGTA 
GAQGGATATT 
GGTTGCCTAG 
AGAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTCCTCCTG 
TCAGGCCTCC 
ATTGGAAAAG 
GTGAAAGTGT 
ACAGTGTTGA 
GGGTCCTGGA 
TATCTGAAGT 
AGTGGCTTAT 
CATCGAGATC 
GACCTGGGCC 
ACTCGAGTTG 
AATCACTTCC 
GTTGCTAGGA 
CTAGTGCCCA 
CGCCCCTCAT 
ATGACAGAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTTTGTA 



11 
I 

GGAGTCGGCG 
GAGCCGGGAG 
AGGAAGA7CA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCACGAT 
GACTAGAAGG 
TTGAATGCTG 
TGAAAAACAG 
CTGTCTGTAG 
AAGAAACCAG 
GAGAATCCCT 
CTCTGCTGGT 
GTCGCTATGG 
TCTTCACCAC 
TGAGGCATGA 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCTGTTAA 
GCACGAAACG 
AGTCTTACAT 
GATGTGTATC 
GTGACCCCTC 
TCCCAAACCG 
GCTGGGCTCA 
AAATGTCAGA 
GAAAGCCAAC 
CAGTACAAGC 
GCGACCTGGG 
GGCGGAGAAA 



21 
I 

GGGCCTCGCG 
CGCACGCGCG 
TTTCATGCCT 
CTATGACAAG 
CTTCCTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAGATTTT 
CACAGAAAGG 
AGATTTTGTT 
TTTGCTCTTG 
ACCTCGATAC 
GAQAGACTTA 
CCAAAGGACT 
GGAAGTTTGG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATCACA 
AGACGCTAAA 
CACAGAAATC 
AAACATTCTG 
ATTTATTAGT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 
GTGGAGCAGT 
CAATCCTGCA 
GTCCCAGGAC 
AGGTACTCTT 
CTTGAACATC 
CAAAGACAGA 
CCGTTGGGTA 



31 
I 

GGACGCGGGC 
GGGTGGAGTT 
TGTTQATAAA 
AGAGGAAACA 
AA CATGC TTT 
AGTACAGCCC 
GAAGACTCAG 
GATGACTCTG 
CAGTGTCGGG 
AACGAATGTA 
GATGGACCTA 
GTCCTTATCA 
AGCATTGGGT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGGGAAAGT 
AGCTGGTTCA 
GGTTTCATTG 
GACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATGAGGGAGA 
GATGAGTGTC 
TCAAGGCTGA 
ATTAAACTCT^ 
CTGTTTGTGG 
GTOCTGCTTC 
GAAGCTCCCA 
ACTTGTTCAA 



41 
t 

AGTGCGGAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
TGCGAAGTGC 
CCACCOCCCG 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TATTATTTTG 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGCGTGGCGA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TGGACGAGAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTGTGTGCAT 
TAAGGCAGAT 
CAGCCCTGCG 
GATAGGAGAG 
GCAGAGCAAA 
CCAGTGGGTT 
GAAGGAGAGA 
GATATGATGC 



51 




1 

CCGCGGCGCT 


60 


TTTCTTAGAT 


120 


TCTGCTGATT 


180 


CTTACAAGCC 


240 


AGGAAAATTA 


300 


TCCAAAGGTC 


360 


TTGCAGCACA 


420 


GGTCACTTCT 


460 


TCCTCATCAA 


S40 


ACACCCTACA 


600 


GGCTTTACTT 


660 


TTACTTCCGG 


720 


TGAAACTTAC 


780 


AGGAAGTGGA 


840 


GGTGAAACAG 


900 


AAAGGTAGCT 


960 


AATATATCAG 


1020 


CAAAGGGACA 


1080 


CCTTTATGAT 


1140 


CTCTTCTGTC 


1200 


AGCAATTGCC 


1260 


CTGTATTGCT 


1320 


ACCACCTAAC 


1380 


CTTGAACAGA 


1440 


CCTTTGGGAG 


1500 


TTATCATGAC 


1560 


CAAGAAGTTA 


1620 


GGGAAAACTC 


1680 


GGTTAAGAAA 


1740 


GAAAAGTAAG 


1800 


AGACATCAAA 


1860 


CAGACCTCAC 


1920 


TTGATOCGTG 


1980 


AT 





All 9 Protein sequence 
Gene name; 
Unigene numbers 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
PFAM domains: 
Cellular Localization: 



bone tnorphogenetic protein receptor IB (ALK-6) 
HS. 72472 / HS. 87223 
AA250737 / U89326 
NP 001194 

1-13 
128-144 

activin_receptor (30-111) , protein kinase (204-491] 
plaBtna membrane 



1 11 

I I 
MLLRSAGKLN 

DSGLPWTSG 
GPIHHRALLI 



WFRETEIYQT 
MLKLAYSSVS 
TNEVDIPPNT 
EEYQLPYHDL 
RLTALRVKKT 



CLGLEGSOFQ 
SVTVCSLLLV 
GLPLLVQRTI 
VLMRHENILG 
GLCHLHTEIF 
RVGTKRYMPP 
VPSDPSYEDM 
LAKMSESQDI 



21 
I 

TAPTPRPKVL 
CRDTPIPHQR 
LIILFCYFRY 
AXQIQMVKQI 
FIAADIKGTG 
STQGKPAIAH 
EVLDESLNRN 
RETVCIKKLR 
KL 



31 
I 

RCKCHHHCPE 
RSIECCTERN 
KRQETRPRYS 
GKGRYGEVWM 
SWTQLYLITD 
RDLKSKNILV 
HFQSYIKADM 



41 
I 

DSVNNICSTD 
ECNKDLHPTL 
IGLEQDETYI 
GKWRGEKVAV 
YHENGSLYDY 
KKNGTCCIAD 
YSFGLILWBV 
ECLRQKGKLM 



51 
I 

GYCFTMIEED 60 

PPLKNRDFVD 120 

PPGEELRDLI 180 

KVFFTTEBAS 240 

LKSTTLDAKS 300 

LGLAVKPISD 360 

ARRCVSGGTV 420 

TECWAHNPAS 480 



A120 DNA SEQUENCE 

Gene name: LIV-l protein, estrogen regulated 
Unigene number: Hs. 79136 

Probeset Accession #: U41060 
Nucleic Acid Accession #: NM_012319.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 

887 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



I I I I I I 

CTCGTGCOGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCQAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCOGTGGGA CAACGAGGCC 120 

GCGGAGAOGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAQ ACCACTGAGA 240 

AAATTAGTCC QAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAQACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTOGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGG ATGACOGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATOGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTAOGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC COGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 



A121 PROTEIN SEQUENCE 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Pfam domain: 
Transmembrane domains: 
Cellular Localization: 



1 
I 

MARKLSVXIjI 
FYRYGENNSL 



SVSASBVTST 



RSCLIHTSEK 
XiVALAVGTLS 
TWKGLTALGG 
EEKVDTDDRT 



11 
I 

LTFALSVTNP 
SVEGFRKLLQ 
HHNHAASGKN 
VYNTVSEGTH 
FMYSHNTNEN 
KAEIPPKTYS 
GOAFLHLLPB 
LYFMFLVEHV 
EGYLRADSQE 



21 
I 

LHELKAAAFP 
NIGIDKIKRI 
KRKALCPDHD 
PLETZBTPRP 



LIV-1 protein, estrogen regulated 
Hs. 79136 

NP 0364S1 

1-21 

Zip [591-743] 

330-346, 352-368, 427-444, 663-679, 688-703, 730-745 
plasma membrane 



31 
I 



LQIAWVGGPI 



LTLIKQPKDK 
PSHFDSQQPA 



GKLPPKDVSS 
LTSHGMGIQV 
AISIISFLSI* 
HEEPAMEMKR 
KKKNQXKPEN 
VLEEEBVMZA 



MALSAMLAYL 



GLAIGAAFTE 
GMATG1PIGH 



GLSSGLSTSV 
YAEKVSMWIP 



AVFCHELPHE 
ALTAGLFMYV 



41 


51 




1 

ESGXNVDLAI 


1 

STRQYHLQQL 


60 


DHEHHSDHER 


HSPHKHHSDH 


120 


SQGKGAHRPE 


HASGRRNVKD 


180 


STPPSVTSKS 


RVSRLAGRKT 


240 


PLNATBPNYL 


CPAIINQIDA 


300 


LGVILVPLMN 


RVPFKPLLSP 


360 


GPLPSHLSSQ 


NIEESAYFDS 


420 


DDDVBIKKQL 


SKYESQLSTN 


480 


HAHPQEVYNE 


YVPRGCKNKC 


540 


RYSREELKDA 


GVATLAWMVI 


600 


LGDFAVLLKA 


GMTVKQAVLY 


660 


ALVDMVPEML 


HNDASDRGCS 


720 



888 



WO 03/042661 



PCT/US02/36810 



RWGYFFLQNA GMLLGFGIML LISIPEHKIV FRINF 



A122 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession 8: 
Nucleic Acid Accession ft: 
Coding sequence: 



EGP- like-domain; multiple 6 

Hs. 12844 

N67551 

NMJUSS07 

241-1902 (underlined sequences correspond to start and stop codons) 



1 
I 

CCGCAGAGGA 
CGCCCCTGCC 
CGAGTGGAGC 
GGGTCCGGCC 
ATGCCTCTGC 
GGGAACGCGG 
TGTCACTATG 
TGTGAAGCTA 
AGATGCTTTC 
AAACCCOGGC 
CTCAGTGGCC 
ATAAACTGTC 
TCAGGACTCC 
GGTAAAGTCA 
AAATGTCACA 
AATGAATGTA 
GGGTCCTTCA 
ATCCCTGAAA 
AAGAAGTTGC 
CCAGAACCCA 
ATAGTTTCCA 
GAGGGGCTTG 
AGCCTGCGAG 
CTGGTCCAAA 
GACTGCAGCT 
TGGAATCCTG 
GGTCACAAGA 
AACTTCTGTT 
TTTGTGAAAA 
TGGAAGACAG 
GAAGCAGAAC 
TCAGGCTTAT 
TTGACTTTGT 
TTAGAATTAC 
TCTTGTATAA 
TTTCTGAATC 
CAGTATATCT 
TAGAAAAAAA 
TATGACATCA 
TGTATATTTA 



11 
I 

GCCTCGGCCA 
GCGGTGCCTG 
GGAGGACCCG 
GGCGCCCTCC 
CCTGGAGCCT 
CCAGTGCAAG 
GAACTAAACT 
CATGCGAACC 
CAGGATACAC 
CATGCCAACA 
ACATGCTCAT 
AGTACAGCTG 
GCCTGGCCCC 
TCTGTCCCTA 
TTGGTTTCGA 
CTATGGATAG 
AGTGTAAATG 
ATTCTGTGAA 
TTGCTCACAA 
CCAGGACTCC 
GAGGCGGGAA 
AGGATGAGAA 
GAGATGTGTT 
GGAAAGCGCT 
TCAATCATGG 
CTGATCGAGA 
AAGACATTGG 
TGCTCTTTGA 
ACAGTAACAA 
GGAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
ATGTCAGTTC 
TAGCTGAAAA 
GATATGCCAA 
TTTCCACATT 
GATTTGTATA 
AGCACAGAGA 
AAGATAGACT 
AATTCTTTGT 



21 
I 

GGCTAGCCAG 
GCCTCCCCTC 
AGCGGCTGAG 
OGAGGGGGGC 
TGCGCTCCCG 
GCATCACGGG 
GGCCTGCTGC 
TGGATGTAAG 
CGGGAAAACC 
CAGATGTGTG 
GCCAGATGCT 
TGAAGACACA 
AAATGGAAGA 
CAATCGAAGA 
ACTGCAATAT 
CCATACGTGC 
CAAGCAGGGA 
GGAAGTCCTC 
AAACAGCATG 
TACCCCTAAG 
CTCTCATGGA 
AAGAGAAGAG 
TTTCCCTAAG 
AACTTCCAAA 
GATCTGTGAC 
TAATGCTATT 
CCGATTGAAA 
TTACCGGCTG 
TGCCCTGGCA 
GTTGTATCAA 
CAAAACCGGC 
CCTTTTATCT 
CCTGGTTTTT 
ATTGTAATGT 
TATTTGCTTT 
ATATTATAAA 
AGTAAGTTGA 
AATGTTTAAC 
TTTGCCTAAG 
AATAATAATA 



31 
I 

GGCGCCCCCA 
CCAGACTGCA 
GAGAGAGGAG 
TCAGGAGGAG 
CTGCTGCTCT 
TTGTTAGCAT 
TACGGCTGGA 
TTTGGTGAGT 
TGCAGTCAAG 
AATACACACG 
ACGTGTGTGA 
GAAGAAGGGC 
GACTGTCTAG 
TGTGTGAACA 
ATCAGTGGAC 
AGCCACCATG 
TATAAAGGCA 
AGAGCACCTG 
AAAAAGAAGG 
GTGAACTTGC 
GGTAAAAAAG 
AAAGCCCTGA 
GTGAATGAAG 
CTGGAACATA 
TGGAAACAGG 
GGCTTCTATA 
CTTCTCCTAC 
GCCGGAGACA 
TGGGAGAAGA 
GGAACTGATG 
GAAATCGCAG 
GTGGATGACT_ 
TTGATATTGC 
ACCAACAGAA 
AAATATCATA 
ATATGGAAAT 
TGAGCTTCTC 
TGTTTGACTC 
TGGCTTAGCT 
TCCAAATCAT 



41 

i 

GCCCCTCCCC 
GGGACAGCAC 
GCGGCGGCTT 
GAAGGAGGAC 
CCTGGGTGGC 
CGGCACGTCA 
GAAGAAACAG 
GCGTGGGACC 
ATGTGAATGA 
GAAGCTACAA 
ACTCTAGGAC 
CACAGTGCCT 
ATATTGATGA 
CATTTGGAAG 
GATATGACTG 
CCAATTGCTT 
ATGGACTTCG 
GTACCATCAA 
CAAAAATTAA 
AGCCCTTCAA 
GGAATGAAGA 
AGAATGACAT 
CAGGTGAATT 
AAGATTTAAA 
ATAGAGAAGA 
TGGCAGTTCC 
CTGACCTGCA 
AAGTCGGGAA 
CCACGAGTGA 
CTACCAAAAG 
TGGATGGCGT 
GAATGTTACT 
ATCATAGGAC 
ATATTATTGT 
TCACTGTATC 
GTCAGTTTAT 
TCTACAACAT 
TTATGATACT 
GGGTCTTTCA 
CAAAAAAAAA 



51 
I 

AGGCCGCGAG 
CCGGTAACTG 
AGCTGCTACG 
CCGTGCGAGA 
AGGTGGTTTC 
GCCTGGGGTC 
CAAGGGAGTC 
AAACAAATGC 
GTGTGGAATG 
GTGCTTTTGC 
ATGTGCCATG 
GTGTCCATCC 
ATGTGCCTCT 
CTACTACTGC 
TATAGATATA 
CAATACCCAA 
GTGTTCTGCT 
AGACAGAATC 
AAATGTTACC 
CTATGAAGAG 
GAAAATGAAA 
AGAGGAGCGA 
CGGCCTGATT 
TATCTCGGTT 
TGATTTTGAC 
GGCCTTGGCA 
ACCCCAAAGC 
ACTTCGAGTG 
GGATGAAAAG 
CATCATTTTT 
CTTGCTTGTT 
ATCTTTATAT 
CTCTGGCATT 
AAGATGCCTT 
TTCTCAGTCA 
CTCCCCTCCT 
TTCTAGAAAA 
TCTTGGAAAC 
TAGCCAAACT 



A123 Protein sequence t 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains i 
MAM domain: 
EGP domain: 

Cellular Localization: 



BGP- like-domain; multiple 6 

Hs. 12844 

N675S1 

NP_056322.2 

1-21 

none found 
402-546 
80-259 
secreted 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



1 11 21 31 41 51 

I I I | I | 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRKNSKGV 60 

CEATCBPGCK PGECVGPNKC RCPPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 

LSGHMLMPDA TCVNSRTCAM IKCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTPGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSPKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEXMK EGLEDEKREB KALKNDIEER 360 

SLRGDVPPPK VNEAGBFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

FVKNSNNALA WBKTTSEDBK WKTGKIQLYQ GTDATKSIIP EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSIiLS VDD 



A124 DMA SEQUENCE 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 

unigene number: Hs. 170195 

Probeset Accession ft: BE616633 
Nucleic Acid Accession ft: NM_001719 

889 



WO 03/042661 



PCT/US02/36810 



Coding sequence: 



123-1418 (underlined sequences correspond to start and stop codons) 



I 

GGGCGCAGCG 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 

1 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGOCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTOG 
GCCTGTGGCT 
ATCCTCCATT 
CCCTCOCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 

GAAAGGAAAA 



31 

1 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCCCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



41 
I 

GGGACGGCCG 
GGTAGCGOGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



51 
I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGGCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCAGGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA 
GGCAAGGGGT 
ATAAATGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A125 Protein sequence i 

Gene name: bone raorphogenetic protein 7 (osteogenic protein 1) 



(BMP7) 



Unigene numbers 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Pfam domains: 
Transmembrane domains: 
Cellular Localization: 



Hs. 170195 
BE616633 
NP__00 1710.1 
1-30 

TGPb_propeptide (37-281] 

none found 

secreted 



1 
I 

MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETP 
HNLGLQLSVE 
QNRSKTPKNQ 
GECAFPLHSY 
RNMWRACGC 



11 
I 

PHSPVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 
I 

LFLLRSALAD 
MFMLDLYNAM 



HLGRESDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



31 
I 

FSLDNEVHSS 
AVEEGGGPGG 
YHHRBFRFDL 
LDSRTLHASE 
QNKQPFMVAP 
KKHELYVSPR 
VPKPCCAPTQ 



41 

1 

FIRRRLRSQE 
QGPSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
PKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



51 
I 

RREMQREILS 60 

FSTQGPPLAS 120 

AAEPRIYKDY 180 

TSNHWWNPR 240 

IRSTGSKQRS 300 

PEGYAAYYCE 360 

DSSNVXLKKY 420 



A126 UNA 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession ft: 
Coding sequence i 



in t eg r in, beta 8 
Hs. 52620 
AA479726 
NN_002214 

680-2990 (underlined sequences correspond to start and stop codons) 



1 
I 

CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 



11 
I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGOGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCG7CCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 



41 

1 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGOC 
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGG6CCCT 
GAGCOGGGAG 
AOCGCTTGCT 
CTCGCCCGOG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



890 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 

AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GT6TGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A127 Protein sequence i 

Gene name: Integrin, beta 8 



Unigene number: Hs. 52620 

Probeset Accession #: AA479726 

Protein Accession «: NPJ>02205.1 

Signal sequence: 1-39 

Transmembrane domains: 682-704 

EGP domain: 552-584 

INB domain: 54-469 

Cellular Localization: plasma membrane 



1 11 21 31 41 SI 

I I I I I I 

MCGSALAPPT AAFVCLQNDR RGPASPLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDPISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENBINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLRKYPVDLY YLVDVSASMH NN1BKLMSVG NDLSRKMAFP 180 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 240 

VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGXV 300 

VPHDGNCHLK NNVYVK9TTM EHPSLGQLSB KLIDNNINVI PAVQGKQFHW YKDLLPLLPG 360 

TIAGEIESKA ANLNNLWEA YQKLISEVKV QVEKQVQGIY FNITAICPDG SRKPGMEGCR 420 

NVTSNDEVLP NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 

CVDETPLDSK CFQCDENKCH PDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG ROQCPSGWEG DRCQCPSAAA QHCVHSKGQV 600 

CSGRGTCVCG RCBCTDPRSI GRPCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALKEQQHYV DQTSECFSSP SYLRIFFIIF IVTPLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNP 



A128 DMA SEQUENCE 

Gene name: G protein- coupled receptor 64 



891 



WO 03/042661 



PCT/US02/36810 



Unigene number: 
Probeset Accession #; 
Nucleic Acid Accession 8: 
Coding sequence: 



Hs. 184942 

AA435577 

NMJ)0S756 

73*3117 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I I I I 

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 

CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 

GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTOGT TCTGGTAACA 180 

TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 240 

GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300 

AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 

GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 

GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480 ' 

CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA S40 

GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600 

GAGGCCGAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720 

TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780 

CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840 

TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG .TCCCCAAAGC TACCTCTTTT 900 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 

CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020 

CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080 

CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 

AGCGCACCTC CTGTCCAGAC AGACATOGTC AACAOCAGCA GTATTTCTGA TCTTGAGAAC 1200 

CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260 

GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 

GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 

ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 

TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 

CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 

CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 

TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680 

GTTGCAAACC TCACGGTCAG GAACTTGACA AGAAAOGTGA CAGTCACATT AAAGCACATC 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGTCAOA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 

ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 

GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT COGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 

CTCCTGGACT CGTQGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTAGG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 

CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATOGC TGGCCTTACA 2580 

TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 

TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 

CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 

GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 

TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGOGGGAA TGGAAATGCT 2940 

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCAOGAT 3000 

TTCACTGGAA AACAGCACAT GTTTAAOGAG AAGGAAGATT CCTGCAATGG GAAAGGCOGT 3060 

ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AAT GTGAT TC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 

CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 

ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

TTGTAGAATC ATGGTTGTTT TGTCTCAOGT GATAATTCAG AAAATCCTTG CTOGTTCOGC 3660 

AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAGAAATA ATGATOCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 

TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGGCCCACTC ACCCTCTTGG 3900 

GTGCAGGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 

AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGOC AGAGCTATAA 4260 

TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 

GGGAACTGTC CTACACTGCT A TTGTT G CTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTAOCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

892 



WO 03/042661 



PCTYUS02/36810 



CTGACTTGTC TTTGCAATAT TTCTTTTCTO ATTTATTTAA TTTTCTTGTA TTTATATGTT 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAOA 



4620 



A129 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession ft: 
Signal sequence: 
GPS domain: 

Transmembrane domains: 
Cellular Localization: 



564-615 



G protein- coupled receptor 64 
HS. 184942 

NP_0 0574 7.1 

1-38 

624-646, 660-682, 688-710, 733-755, 783-805, 828-850, 858-880 
plasma membrane 



MVPSVRQCGH 
PSSNEVETTS 
GEIMFQYDKE 
TLKCTFTIKL 
VCLADHPRGP 
PQPSAPIASS 
VQTDIVNTSS 
LKWDDIGLQ 
NSIGTITLPS 
TVRNLTRNVT 
SHLTSFGVLL 
ILIQLCAALL 
VKVFNTYIRK 
AVFYITWGY 
ITWGPAFFAW 
NSDWSKTATN 
NGVSPSVQNG 



11 
I 

VGRTEEVLLT 
LNDVTLSLLP 
STVPQNQHIT 
NNTMWACAAI 
PFSSSQS1PV 
PAIDMPPQSE 
ISDLENQVLQ 
LNFSNTTISL 
SLMNNLPAHD 
VTLKHINPSQ 
DLSRTSVLPA 
LLNLVFLLDS 
YILKFCIVGW 
PCVIPLLNVS 
GPVNVTPMYL 
GLKKQTVNQG 
DVCLHDPTGK 



21 
I 

FKIFLVIICL 
SNETEKTKIT 
NGTLTGVLSL 
AALERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
MEKAItSLGSL 
TSPSLA1AVI 
MELASRVQFN 
DELTVRCVFW 
QMMALTPITY 
WIALYKMQGL 
GVPAVWTII 
MFIWLVQLC 
FAI FNTLQGF 
VSSSSNSLQS 



31 
I 

HWLVTSLEE 
IVKTFNASGV 
SELKRSELKK 
MEHCCCSVRI 
PKATSFAEPP 
VSGTPPPVKA 
EPNLAGEMIN 
RVNASSFNTT 
FFETPALFQD 
DLGRNGGRGG 
ZGCGLSSIPL 
CISVAVFLHY 
LTISPDNYGL 
RIKKKKQLGA 
FIFIFYCVAK 
SSNSTNSTTli 
CNGKGRMALR 



41 

I 

DTDNSSLSPP 
KPQRNICNLS 
TLQTLSETYF 
PCPSSPEELG 
DYSPVTHNVP 
SPSSPTVSAP 
QVSRLLHSPP 
TFVAQDPANL 
PSLENLSLIS 
WSLNGCSVKD 
SVTLVTYIAF 
FLLVSFTWMG 
GSYGKFPNGS 
QRKTS1QDLR 
ENVRKQWRRY 
LVNNDCSVHA 
RTSKRGSLHF 



51 




1 

PAKLSWSFA 


60 


SICNDSAFFR 


120 


IMCATAEAQS 


180 


KLQCDLQDPI 


240 


SPIGBIQPLS 


300 


ANVNTTSAPP 


360 


DMLAPLAQRL 


420 


QVSLETQAPE 


480 


YVISSSVANL 


540 


RRLHBTICTC 


600 


EKTRRDYPSK 


660 


LEAFHMYLAL 


720 


PDDFCWINNN 


780 


SIAGLTFLLG 


840 


LCCGKLRLAE 


900 


SGNGNASTER 


960 


IEQM 





A130 DNA SEQUENCE 

Gene namei G protein-coupled receptor 49 (GPR49) (HG38) (LGRS) 
Unigene number: Hs. 2 85 52 9 

Probeset Accession #: AA460530 
Nucleic Acid Accession #: NM_003667 

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons) 



1 

I 

GTGGCGGCAA 
TCAGGAACGC 
CGCCCGGCGC 
GGCCCCCTAC 
TGCTGCTGCA 
GCCCCACACA 
ACCTGGGGCT 
GTATGAACAA 
AGTTACGTCT 
ACAGTCTTAA 
TGCAGAATTT 
CCCCAAGCTG 
TAACAGAAAT 
CCCTGAACAA 
TTCTACATCT 
ACAGCCTAGA 
GGACACTCTC 
AGAAAGCATT 
AATTTGTTGG 
GTGCCTCACA 
CTTTAACTGG 
TCCAAOTGCT 
AAAAGCTTCA 
TCCAGCAGTT 
ACCCCAATGC 
TGTCGTCTTT 
ATGCCTTACA 
CTTATGCTTA 
AATGGAATAA 
TTCAGGCTCA 
AAGCCCTTCA 
TGCTTGATGG 
GTAATGCTTT 
TGTTAATTGG 
CTGGTGTGGA 
GGGTTGGTTG 
1GCTTACTCT 
CGAAAGCTCC 
CCATGGCCGC 
CTTTGCCTTT 
CCCTTTGCTT 



11 
I 

CCGGCACCTC 
GGCGTCTGGC 
GCCACGGCCC 
TTCGGGCACC 
GCTGGCGACC 
CTGTCATTGC 
CTCGGAGCTG 
CATCAGTCAG 
TGCGGGAAAC 
AGTTCTTATG 
GCGAAGCCTT 
TTTCAGTGGC 
CCCGGTCCAG 
AATACACCAC 
CCATAACAAT 
GACTTTAGAT 
CAACCTTAAA 
TGTAGGCAAC 
GAGATCTGCT 
AATAACTGAA 
AGCACAGATC 
AGATCTGTCT 
GAAAATTGAC 
GCTTAGCCTC 
ATTTTCCACT 
TCCTATAACT 
GAGCTTGATA 
CCAGTGCTGT 
AGGTGACAAC 
AGATGAACGT 
TTCAGTGCAG 
CTGGCTGATC 
GGTGACTTCA 
GGTCATCGCA 
TGCGTTCACT 
CCATGTCATT 
GGCAGCCCTG 
ATTTTCTAGC 
AGTTCCCCTG 
TGGGGAGCCC 
CCTCATGATG 



21 

i 

AGTCCCCGCC 
GCTGCAGACG 
GTAGCAGTCC 
ATGGACACCT 
GGGGGCAGCT 
GAGCCCGACG 
CCTTCCAACC 
CTGCTCCCGA 
GCTCTGACAT 
CTGCAGAATA 
CAATCCCTGC 
CTGCATTCCC 
GCTTTTAGAA 
ATACCAGACT 
AGAATCCACT 
TTAAATTACA 
GAACTAGGAT 
CCTTCTCTTA 
TTTCAACATT 
TTTCCTGATT 
TCATCTCTTC 
TACAACCTAT 
CTAAGACATA 
CGATCGCTGA 
TTGCCATCCC 
GGGTTACATG 
TCATCTGAAA 
GCATTTGGAG 
AGCAGTATGG 
GACCTTGAAG 
TGTTCACCTT 
AGAATTGGAG 
ACAGTTTTCA 
GCAGTGAACA 
TTTGGCAGCT 
GGTTTTTTGT 
GAGCGTGGGT 
CTGAAAGTAA 
CTGGGTGGCA 
AGCACCATGG 
ACCATTGCCT 



31 



GCGCTTCTCC 
CCCGCTGAGT 
GGTGCTGCTC 
CCCGGCTCGG 
CTCCCAGGTC 
GCAGGATGTT 
TCAGCGTCTT 
ATCCCCTGCC 
ACATTCCCAA 
ATCAGCTAAG 
GTCTGGATGC 
TGAGGCACCT 
GTTTATCGGC 
ATGCCTTTGG 
CCCTGGGAAA 
ATAACCTTGA 
TTCATAGCAA 
TTACAATACA 
TACCTGAACT 
TAACTGGAAC 
CTCAAACCGT 
TAGAAGATTT 
ATGAAATCTA 
ATTTGGCTTG 
TAATAAAGCT 
GTTTAACTCA 
ACTTTCCAGA 
TGTGTGAGAA 
ACGACCTTCA 
ATTTCCTGCT 
CCCCAGGCCC 
TGTGGACCAT 
GATCCCCTCT 
TGCTCACGGG 
TTG CACGACA 
CCATTTTTGC 
TCTCTGTGAA 
TCATTTTGCT 
GCAAGTATGG 
GCTACATGGT 
ACACCAAGCT 



41 
I 

TCGCCGCCCA 
TGCAGAAGCC 
TCCGCCCGCG 
TGTGCTCCTG 



GCTCAGGGTG 
CACCTCCTAC 
CAGTCTCCGC 
GGGAGCATTC 
ACACGTACCC 
TAACCACATC 
GTGGCTGGAT 
ATTGCAAGCC 
AAACCTCTCC 
GAAATGCTTT 
TGAATTCCCC 
CAATATCAGG 
TTTCTATGAC 
AAGAACACTG 
TGCAAACCTG 
CTGCAATCAG 
ACCCAGTTTT 
CGAAATTAAA 
GAACAAAATT 
GGACCTATCG 
CTTAAAATTA 
ACTCAAGGTT 
TGCCTATAAG 
TAAGAAAGAT 
TGACTTTGAG 
CTTCAAACCC 
AGCAGTTCTG 
GTACATTTCC 
AGTCTCCAGT 
TGGTGCCTGG 
rrCAGAATCA 
ATATTCTGCA 
CTGTGCCCTG 
CGCCTCCCCT 
OGCTCTCATC 
CTACTGCAAT 



51 
I 

GGCCGTGGGG 
CACGGAGCGG 
TCCGGCTCGT 
TCCTTGCCTG 
CTGAGGGGCT 
GACTGCTCCG 
CTAGACCTCA 
TTCCTGGAGG 
ACTGGCCTTT 
ACAGAAGCTC 
AGCTATGTGC 
GACAATGCGT 
ATGACCTTGG 
AGCTTGGTAG 
GATGGGCTCC 
ACTGCAATTA 
TCGATACCTG 
AATCCCATCC 
ACTCTGAATG 
GAGAGTCTGA 
TTACCTAATC 
TCAGTCTGCC 
GTTGACACTT 
GCTATTATTC 
TCCAACCTCC 
ACAGGAAATC 
ATAGAAATGC 
ATTTCTAATC 
GCTGGAATGT 
GAAGACCTGA 
TGTGAACACC 
GCACTTACTT 
CCCATTAAAC 
GCCGTGCTGG 
TGGGAGAATG 
TCTGTTTTCC 
AAATTTGAAA 
CTGGCCTTGA 
CTCTGCCTGC 
TTGCTCAATT 
TTGGACAAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



893 



WO 03/042661 



PCT/US02/36810 



GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTQ TTGCTCTTCA 2520 

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760 

CTGATGATGT GGAAAAACA6 TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820 

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG 2880 

AGAGCTGCCA TCTTTCCTCT GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

AATGTTTTCA AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000 
ATAAGAAGAG CTGAGGTGAA ACTCGGTTTA AA 



A131 Protein sequence 
Gene name: 
Onigene number: 
Protein Accession U 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Hs. 285529 

NP 003658.1 

1-22 

S57-579, S96-618, 641-663, 687-709, 724-746, 770-792, 803-825 
plasma membrane 



1 11 21 31 41 51 

I ! I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELGPHSNNIR SIPEKAFVGN PSI»1TIHFYD NPIQFVGRSA 300 

FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 360 

YNLLEDLPSF SVCQKLQKID LRHNEIYBIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 420 

IiPSLIKLDLS SNLLSSFPIT GLHGLTHIiKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 480 

AFGVCENAYK I SNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 540 

CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLY1S PIKLLIGVIA 600 

AVNKLTGVSS AVLAGVDAFT FGSFARHGAW WEKGVGCHVI GFLSIFASES SVFLLTLAAL 660 

ERGFSVKYSA KFBTKAPFSS LKVI XLLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 720 

STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 780 

PVAFLSFSSL INLTFISPEV IKFILLVWP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 840 

WTRSKHPSLM SINSDDVEKQ SCDSTQALVT PTSSSITYDL PPSSVPSPAY PVTESCHLSS 900 
VAFVPCL 

A132 DMA SEQUENCE 

Gene name: parathyroid hormone receptor 2 
Unigene number? Hs. 159499 

Probeset Accession #: U25128 
Nucleic Acid Accession 8: NM_005048 

Coding sequence: 143-1795 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 1 i I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTGGT TACTGGCCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840 

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGOGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200 

CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCO GCCAGCACAC .GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 
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GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAAT6 TATGGTATTT 2040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATQ AAGATCTTTT AGTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280 

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640 
T 



A133 PROTEIN SEQUENCE 
Gene name: 
Unigene number: 
Protein Accession ft: 
Signal sequence x 
Pf am domain i 
Transmembrane domains: 
Cellular Localization: 



parathyroid hormone receptor 2 
Hs. 1594 99 

NPJJ05039.1 

1-25 

7tm 2 [141-420] 

177-197, 228-250, 253-275. 280- 
plasma membrane 



, 320-342, 363-385, 398-419 



1 11 21 31 41 51 

I I I i I I 

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60 

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120 

NYSDCLRPLQ PDISIGKQEF PERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIPVKDRV VHAHIGVKEL ESLIMQDDPQ NSIKATSVDK SQYIGCKIAV 240 

VMFIYFLATN YYWILVEGLY LHNLIFVAFP SDTKYLWGFI LIGWGPPAAP VAAWAVARAT 300 

LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360 

LAKSTLVLVL VFGVHYIVPV CLPHSFTGLG WEIRMHCELF FNSPQGFPVS IIYCYCNGBV 420 

QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480 

IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 540 
GCQGETEDVL 



A134 DNA sequence 

Gene name: ESTs 

Unigene number: Hs. 128899 

Probeset Accession #: AA9832S1. 

Nucleic Acid Accession #: AA983251 

Coding sequence t 1-1749 (underlined sequences correspond to start and atop codons) 



1 11 21 31 41 51 

I I ! I I I 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 6.0 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 

CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGOGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCAOGGC 780 

TGGCTGGACG CGCAGGGOGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGG0GCTGG CGAGCCTGGC 960 

OGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGOAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTG'i'TGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

m T nTm . TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
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TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTAT6GT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A135 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession Mi 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs. 128899 

none found 

l-ll 
402-424 

not determined 



1 11 21 31 41 51 

I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGBYCHG WLDAQGWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA PPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFIiIVGS VFVAFIIIX5S 420 

LVAACCCRCL RPKQDPQQSR APGGNRLKET IPMIPSASTS RG9SSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNGQQAT QIVPHQGQYI* HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 



A136 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: H8.89S46 

Probeset Accession #: M24736 

Nucleic Acid Accession #: NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 

GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 

CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 

TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 

TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 

GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 

CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 

GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC OCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 
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TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A137 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
C- lectin domain: 
Cellular Localization: 



Select in B (endothelial 

HS.89S46 

M24736 

NP 000441 

1-22 

555-573 

23-139 

plasma membrane 



adhesion molecule 1) 



MIASQFLSAL 
SILSYSPSYY 
DVGMWNDERC 



BCDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCBAVRC 
WTEBVPSCQV 
SGLLPTCEAP 
GSYQKPSYIL 



11 

I 

TLVLLIKESG 
WIGIRJCVNNV 
SKKKLALCYT 
LVCSHPLGNF 
GFVECFQNPG 
QNGSVRCSHS 
SNPERGYMNC 
DAVHQPPKGL 
VKCSSLAVPG 
TESNIPLVAG 



21 
1 

AWSYNTSTEA 
WVWVGTQKPL 
AACTNTSCSG 



SFPWNTTCTF 
PAGEFTFKSS 
LFSASGSFRY 
VRCAHSPIGE 
KIKMSCSGEP 
LSAAGLSLLT 



31 
I 

KTYDEASAYC 
TEEAKNWAPG 
HGBCVETINN 
DRGYIiPSSME 
DCEEGFELMG 
CNFTCEEGFM 
GSSCEFSCEQ 
FTYKSSCAFS 
VFGTVCKFAC 
LAPFLLWLRK 



41 

I 

QQRYTHLVAI 
EPNNRQKDED 
YTCKCDPGFS 
TMQCMSSGEW 
AQSLQCTSSG 
LQGPAQVECT 
GFVLKGSKRL 
CEEGFELYGS 
PEGWTLNGSA 
CLRKAKKFVP 



51 

I 

QNKEEIEYLN 
CVEIYIKREK 
GLKCEQIVNC 
SAPIPACNW 
NWDNEKPTCK 
TQGQWTQQ1P 
QCGPTGEWDN 
TQLECTSQGQ 
ARTCGATGHW 
ASSCQSLESD 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 



A138 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession 8: 
Nucleic Acid Accession 8: 
Coding sequence: 



ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens) 
hs. 100686 
AA4 87468 
AA487468 

55-555 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 I I I 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A139 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Hs. 100686 

AA467468 

none found 

1-23 

none found 
secreted 



1 11 21 31 41 51 

I I I I I I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLPYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 

A140 DNA SEQUENCE 

Gene name : — ■ TMPRSS3a 
Unigene number: Hs. 2 9 82 41 

Probeset Accession ft: AI538613 
Nucleic Acid Accession ft: AB03B157 

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I 1 I I 

ACCGGGCACC GGAOGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGQAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 
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TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATCCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATOCTTT AAGTGTATOG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA OGGGGAGGAC GAGTACOGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTQAGCTC GCTGGAGGGG CAGTTCOGGG AGGAGTTTGT GTCCATCGAT 720 

CACXTTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TGGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCOGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG OGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCAOGAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860 

TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 



A141 Protein sequence: 

Gene name: TMPRSS3a 

Uhigene number: Hs. 298241 

Probeset Accession 8: AI536613 

Protein Accession St BAB20077 

Signal sequence: none found 

Transmembrane domains: 43-65, 239-261 

Tryp_SPc domain: 216-444 

Cellular Localization: plasma membrane 



1 . 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APFSPRSLPG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSPKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVP 120 

TAASWKTMCS DDWKGHYANV ACAQLGPPSY VSSDKLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QA8LQFQGYR 240 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGKDIALMRL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA 360 

AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 420 
FGIGCAEVNK PGVYTRVTSP LDWIHEQKER DLKT 

A142 DNA SEQUENCE 

Gene name: BSTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens] 

Unigene number: Hs. 105484 

Probeset Accession #: AA314779 
Nucleic Acid Accession 8: none found 

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

! I I I I 1 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGGAGGA AGATGGCTTC CAGAAGCATG 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TCCTGGGTGA TATCATCATG 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATTGCTATGG TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTGGCTAT 360 

CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420 

TGGATTGATQ GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 

AACAAGOGCC AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 600 

AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 



898 
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GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A143 Protein sequence: 
Gene name: 
Uhigene number: 
Probeset Accession ft: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
C-type lectin domain: 
Cellular Localization: 



ESTs; Weakly similar to LITHOSTATHIHE 1 BETA PRECURSOR (H. sapiens] 

HS. 1054 84 

AA314779 

none found 

1-22 

none found 

47-156 
secreted 



1 11 21 31 41 51 

I I I I 1 I 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60 
YGNGAHLASI LSLKEAST1A EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 



A144 DNA SEQUENCE 

Gene name: " Cadherin 3, P-cadherin (placental) 

Unigene number: Hs.2S77 

Probeset Accession ft: X63629 

Nucleic Acid Accession ft: X63629 

Coding sequence: 54-2543 (start and stop codons are underlined) 



I 11 21 31 41 51 

II III! 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 

TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCOGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 

GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATOCATAGC CAAGAACCAA B40 

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA COCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TOCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGAGGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGAOGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCAOC AACCATCATC COGACACCCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGCGG CTCOGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
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TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCA6CCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCOCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A145 Protein sequence: 

Gene name: ~" Cadherin 3, 

Unigene number: Hs.2877 

Probeset Accession tf: X63629 

Protein Accession 8 : CAA45177 

Signal sequence: 1-24 

Transmembrane domain: 659-675 



Cellular localization: plasma membrane 



P- cadherin (placental) 



1 11 21 31 41 51 

I I I 1 S I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV PREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

CEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK I FYS I TG PGA DSPPEGVFAV EKETGWLLLN KPLDREBIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYBAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHPTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVP VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEBD QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



A146 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



TTK protein kinase 
Hs. 169840 
M86699 
NM_0 03318 

1026-3551 (underlined 



sequences correspond to 



start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGQATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG IS 00 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ' ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT I860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA T TTQ OTT QTO CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 
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GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2 520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA COGGAAOGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAA ATG ATTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 



A147 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession tf: 
Signal sequence: 
Transmembrane domains: 
Protein Kinase Domain: 
Cellular Localixation: 



TTK protein kinase 

Hs. 169840 

M86699 

NPJJ03309 

none found 

none found 

510-775 

cytoplasmic and nuclear 



MNKVRDIKNK 
VPLSDALUJK 
ANCKKPAFVH 



NSLRQTNKTK 
SGNDSCELRN 
PEVPESKQKQ 
PPISTSKWFD 
QILATPLQNL 
LEEADNQTLD 
KS1DPWERKS 
SWKDSQVGT 
IINQISKLHA 
PVNQMAKGTT 
K 



11 

I 

FKNEDLTDEL 
IilGRYSQAIE 
ISPAQFELSQ 
TVLTAQESPS 
QSCPFGRVPV 
LKSVQNSHFK 
WQAKRKSECI 
PKSICKTPSS 
QVLASSSANE 
SYRNEIAYLN 
YWKNMLEAVH 
VNYMPPEAIK 
IIDPNHEIEF 
EEMKYVLGQI* 



21 
I 

SLNKISADTT 
ALPPDKYGQN 
GNVKKSKQLL 
GSLGHLQNRN 
NLLNSPDCDV 
EPLVSDEKSS 



NTLDDYMSCF 
CISVKGRIYS 
KLQQHSDKII 
TIHQHGIVHS 
DMSSSRENGK 
PDIPEKDLQD 
VGLNSPNSIL 



31 

I 

DNSGTVNQIM 
ESFARIQVRF 
QKAVERGAVP 
NSCDSRGGTT 
KTDDSWPCF 
ELIITDSITL 
WQIPELARKV 
RTPWKNDFP 
ILKQIGSGGS 
RLYDYEITDQ 
DIiKPANFLIV 
SKSKISPKSD 
VLKCCLKRDP 
KAAKTLYEHY 



41 
I 

MMANNPEDWL 
AELKAIQEPD 
LEMLEIALRN 
KARFLYGENM 
MKRQTSRSEC 
KNKTESSLLA 
NTEQKHTTFE 
PAOQLSTPYG 
SKVFQVLNEK 
YIYMVMECGN 
DGMLKLIDFG 
VWSLGCILYY 
KQRISIPELL 



51 
I 

SLLLKLBKNS 
DARDYFQMAR 
LHLQRKQLLS 
PPQDAEIGYR 
RDLWPGSKP 
KLEETKBYQE 
QPVPSVSKQS 
QPACFQQQQH 
KQIYAIKYVN 
IDLNSWLKKK 
IANQMQPDTT 
MTYGKTPFQQ 
AHPYVQIQTH 
SKTFEKKRGK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



A148 DMA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession fi: 
Nucleic Acid Accession #: 
Coding sequence: 



endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 
HS. 258583 
NM 012152 
NMJU2152 

43-1104 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 1 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TA CCTG TTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTO 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGOGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGT GG TGT A C 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG *ATAGTATTAG CCAAGGTGCA 1080 

901 



WO 03/042661 
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GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



10 
15 
20 
25 
30 



A149 Protein sequence i 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



endothelial differentiation, lysophosphatidic acid G-protein- coupled receptor, 

HS. 258583 

NM 012152 

NPJ)36284 

none found 

31-53, 66-88, 150-172, 190-211, 239-261, 277-295 
plasma membrane 



1 
I 

MNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFW 
YGTMKKMICC 



11 21 

I I 
DFPYNRSNTD TVDDWTGTKL 
NLAAADFFAG IAYVPLMFNT 
IMRMRVHSNL TKKRVTLLIL 
WTVSNLMAFL IMVWYLRIY 
CWTPGLWLli LDGI2JCRQCG 
FSQENPERRP SRIPSTVLSR 



Prostate 



A150 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 



31 41 SI 

I I I 

VTVLCVGTFF CLFIPFSNSL VIAAVIKNRK 
GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 
LVWAIAIFMG AVPTLGWHCL CN I SACS SLA 
VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 
VQHVKRWFLL LALLNSWNP IIYSYKDEDM 
SDTGSQYIED SISQGAVCKK STS 



ESTS 

Hs. 2 93 616 
AW043782 
none found 



60 
120 
180 
240 
300 



Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



i 
1 

AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 
GTGTGACGGG 
GTCGAAATGT 
CTTCCGGTGC 
AAACCCTCTG 
GAGCTTCATC 
AAGTTCTCAA 
TTACCCCAGC 
CCTGCTGGCA 
GCACCGGCTG 
CTGCAACGTC 
GAATGCGTCG 
TGCGTGGTAT 
CGACCTGCCC 
CAGCAGCCTC 
GGGCACTGCT 
AGTTATTCCA 
TGCTCATGGG 
AACTATCTCT 
TGACATGATC 
CACCCTCATT 
AAATAGGCTG 
CGCTGGACCC 
ATGATCTAAC 
ATCAAAACCT 
AAGAAAACTT 
AAGGACTCTG 
CTCATTCTGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTGCCCGT 
GTATGTCCCT 
CTCCAAAGTT 
ACTGGTTTCT 
CTGCACTGTG 
GGTCAGGGTC 
AGACAATTTG 
TGAAACAGTG 
AGCTGTCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGTGT 
AGGTGTTGTT 
CCACTCCGGG 



11 

i 

CCGGGCAGCG 
AGCAGCGCCG 
CCAGGCAACT 
CTGCCTGACT 
GGCCCAACCT 
AATGGGTTTG 
CTTTGCTCCA 
TGCGATGGAC 
GAACCCGGCA 
ATCACCTATG 
CTGGTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 
CCCTACCGCT 
CTGAGCGTGG 
GAGCCCAGGG 
AAGTCCATAT 
AAGCTCTTTA 
GCATTCCCCT 
TGTTGTGCGT 
TTTCACATTA 
GGAGAGAGCA 
AATTCTCTCT 
CAGGAGGCCA 
GCTTTGCACA 
TGGACGTGAG 
AAACCATCTA 
GAGCTTTCCT 
CATGAGTTTA 
CTGGCTCTAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGGTG 
CACGCTCCTC 
AGGCCTCTCC 
GAGTCAAGAT 
TGTTTGTTTT 
TTTTTTGTTT 
CCCGCTGAGC 
ATTGTTGCAC 
CTCTCTCCCT 
CCAGTCAGCC 
TGGCAAGAAA 
CAGCTGTCAC 



21 
I 

GGAGCGGCGG 
CGGAQAGCCA 
TCATGTGCAG 
GCTTCGACAA 
TCTTCCCCTG 
AGGACTGTCC 
CCGCCCGCTA 
AGAATAACTG 
GTGGGCAGGT 
CCATCATCGG 
ACCACCAGCG 
TGCTGCTGTC 
TCAATAATGG 
CCCCACCCTC 
CACCGCCCTA 

AAGACACCAG 
ACTCTGAGCC 
GGGTTAATCT 
AGCACCTGTA 
CCTCCCCCAG 
CTTTTCTGTC 
TTCTGTTTCT 
ATGTTTCTGT 
GCTGGGTAGT 
TCACTGGATG 
ATCCTATTTG 
TAACACCCTT 
CCCTGTATAA 
CAGCAGCATA 
TCCAAGTTCT 
AGCCACTTAC 
TGAGGACCTA 
CCCAGCCTGT 
TTGCAAAGTC 
AGAGCCATGT 
TTCCCAAGGT 
CAACATCCCA 
TTTCCATTTG 
TTCCCTTCTA 
TTCCTTTAAC 
CCCGTGATAA 
TTTGAGGTTA 
CCGTGTATAG 
ACAGGGCCOG 
CCACACTGAC 
CCATTCAGAA 



31 

I 

CCGCGCCATG 
GCTGCTCCCC 
CAATGGACGG 
GAGTGATGAG 
TGCCAGCGGC 
CGATGGCAGC 
CCACTGCAAG 
TCAAGACAAC 
GTTTGTGACT 
CAGCTCCGTC 
GAAGCGGAAC 
CCGCCTGGTG 
CATCCAGTAT 
CTACTCCGAG 
CTCTTCTGAC 
GAGTGCCAAC 
CCACAGCCCG 
CAGCCAGGGC 
GCTCTGACTT 
AGGATGTCTC 
ACTTCAGAGA 
AGGTCACTCT 
GTTGGAGAGA 
GCTATATTGG 
TACCTTATAG 
GTCACCCCCC 
ATGCCCCCAG 
CAGCAGTCGC 
ATTCTGGCTT 
TATCATCAGC 
CAGCTCCTAA 
CTGGTTTCTG 
ACTTGAGTTG 
CTTGCTCATT 
CTTTTTACCT 
TCAATACCTC 
CCCAATACCA 
GTAGTTTCTC 
GATCTATTTT 
GTTAAGGGAC 
AAGGTCCAAA 
CAAGTCACTC 
TTATTTATCA 
TCTCTATGTT 
CCTCCCTGCA 
TGATGAGGGG 
CTTCTTTCCG 



41 
I 

TGGCTGCTGG 
GGGAACAACT 
TGCATCCCGG 
AAGGAGTGCC 
ATCCATTGCA 
GATGAAGAGA 
AACGGCCTCT 
AGTGATGAGG 
TCAGAGAACC 
ATTTTTGTGC 
AACCTCATGA 
GTCCTGGACC 
GTGGCCAGCC 
GCCTTGCTGG 
ACGGAATCTC 
AGTGCCAGCT 
GGGCAGCCTG 
ACTGAAGAAG 
GTTGCCATTC 
AAGTTACAGT 
TGTTTTTCTG 
TCCCTTGGGA 
CAGCATATAA 
ATQCTCAGAA 
CATTTGGGGA 
CAAAAAAATT 
TTCAGCAGAG 
AACGTTATTT 
TAGAAATTTG 
CTCATCCTAA 
AATGCAGGCT 
GACTGTCACC 
GCCCAAAGTC 
CATGCAGCCT 
GTGCATTTGG 
CAGCAAGCTC 
GCACCTCTAG 
CTCTGAGACA 
AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 
CAGACTAAOC 
AGTTCTTGAA 
TGTGCTAGTT 
GGAATAAGGG 
TAAAATGGAA 
CAGCTGAAGA 



51 
I 

GGCCGCTGTG 
TCACCAATGA 
GCGCCTGGCA 
CCAAGGCTAA 
TCATTGGTCG 
ACTGCACAGC 
GTATTGACAA 
AAAGCTGTGA 
AACTTGTGTA 
TGGTGGTGGC 
CGCTGCCCGT 
ACCCCCACCA 
AGGCGGAGCA 
ACCAGAGGCC 
TGAACCAAGC 
CCCAGGCAGC 
GCCCCCAGGA 
TATAAGTCCC 
TAACAATTTG 
TTGGGATATT 
GCGTCTCAGT 
CCCGAGATCA 
AACAGTATTG 
GTGCAGGAGA 
TTTGGGTTAG 
CCATTTGAGC 
TCAGTGGCCA 
TGGTTTTGTG 
CCCAAGAATG 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGACCTGGCT 
CAACACTGGC 
ACTTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 
CATGGGCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGGAGATC 
TGTGTGCCAG 
GGAAGCAGAA 

mcirmT 

GTAAAACGTT 
CCAGGTAGAG 
AATGTTCAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



902 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 



AACCTGTTTG 
TGATCCTGTT 
AAGCTAGCCA 
AACGGAAAGG 
CTGAGTAATC 
CATTTCATCT 
TTTCTGGTGC 
GAGTTAATCT 
AGATAAGGGA 
TGATTTTTTT 
TTTTTGGGGO 
TTTTTAACTC 
AAAATAGTCT 
AGTTACACTG 
TTTTTAAAAA 
GAAAGGTTGT 
TTATACTTTC 
AAMMAAAAAC 
TGGGGCGGCG 
GAAACCCTGG 
GGAAACCGCA 
GGACTCAACC 



AOGCTAATTA 
CTGTAGACTT 
CTGGTATTTT 
AACCTAGCTG 
CAATAAAGAA 
CCTGTGAGTC 
TCTGGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GAGGGGGTTT 
ATTCCAACCA 
CATCTCTTTT 
TGATGACTGG 
ATGCAACTAA 
GTGTCGTTGC 
TAATAAATTT 
AWYWTTGGGG 
GGGCCCACGT 
CCAAGAAAAA 
GAGTGTTGGG 
AGGAGGACCC 



AAACAGAGCC 
TTCTTTCTTT 
GTTTTGTTTA 
CCTGTATCTT 
CTTTTGATGA 
AGAAGGGCTT 
GTTTAGAGGA 
TCTGCTTCCA 
TGCTTTTTTA 
TTTAAAAATA 
GTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
CCTATTACCT 
GTGGTTAATA 
TTTTTGTGTT 
GCAGTTTCAT 
GGGCTTGGGC 
AGGTACGGCG 
GGTGGCGAGA 
TAAACCACAC 
AAGGGAACCC 



TGCAGGAAGT 
TTTTAACCAA 
AAAAAAAAAA 
TCATTTTTAA 
CAGCCAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GGCATCTTAG 
AAACAAACAG 
TATAAATAGG 
TCAAGATGGC 
TTATACATTG 
GAGATCCGTG 
GACTCAGCTC 
GTGTGTGACG 
TTGGTTAGGC 
TCTTTCTGTT 
CTCGGAAAAA 
ACCACGCGGG 
ATTCTCCACA 
CCGAAGAGAG 
GATAGAGTAC 



GGGGCTAAAG 
ATCCAAAGGA 
GAAAGAAAGA 
AATAGCACTT 
GTGTTAGAAC 
TTTGATGGGG 
ATTTTAATTA 
GAAAAACAAA 
GGACATTTTT 
ACACCAAAGC 
ACATTAGTGG 
CCTAAATCTA 
TTTTATTTTA 
CCTCTACCTT 
CTCAAAGTTA 
TTGOTTTTGT 
TGTGCAAAWG 
GTTTTTAACA 
CCCAAACGGG 
CCAGAAAAAA 
AACTCAGAAG 
G 



TGGCATTCAG 
TGTTACAGAA 
AAGAAAGAAA 
GAGTTATTTT 
TCTGGCTGAA 
CCCCTTCTTC 
ATTGCGCAGT 
TGGTTTTAGT 
ATTATAGATT 
GGCAGGGTTT 
CCAGCAATAT 
CGCCAACCAG 
GCATTAAATT 
GAAATTGACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMARM 
CCACTTCGGG 
ACCCCAGAAG 
ACGCGCCGGG 
CACACAAGCG 



A1S1 Protein sequence: 



Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence t 
Transmembrane domains: 
LDLa domains: 
Cellular Localization: 



1 
I 

MWLLGPLCLL 
EKECPKAKSK 
KNGLCIDKSF 
VIFVLWALL 
YVASQAEQNA 
NSASSQAASS 



11 21 

I I 
LSSAAESQLL PGNNFTNECN 
CGPTPFPCAS GIHCIIGRPR 
ICDGQNNCQD NSDEESCBSS 
ALVLHHQRKR NNLMTLPVHR 
SEVGSPPSYS EALLDQRPAW 
LLSVEDTSHS PGQPGPQEGT 



2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



ESTs 

Hs. 293616 
AW043782 
none found 
1-17 
169-191 

28-66, 70-108, 112-149 
plasma membrane 



31 41 51 

I ! I 

IPGNFMCSNG RCIPGAWQCD GLPDCPDKSD 
CKGPEDCPDG SDEENCTANP LLC STAR YHC 
QBPGSGQVFV TSENQLVYYP SITYAIIGSS 
LQHPVLLSRL WLDHPHHCN VTYNVNNGIQ 
YDLPPPPYSS DTESLNQADL PPYRSRSGSA 
AEPRDSEPSQ GTEEV 



60 
120 
180 
240 
300 



A152 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession fl: 



Transmembrane protease, 
Hs. 105807 
T48536 
NM 005656.1 



serine 2 (TMPRSS2 ) 



Coding sequence: 57-1535 ( under lined~sequences correspond to start and stop codons) 



55 
60 
65 
70 
75 
80 



GTCATATTGA 
CTTTGAACTC 
CGGAAAACCC 
CTCAGTACTA 
ACCCCGTCGT 
AGAAAGCACT 
CTGGCCTACT 
CCTCAGGTAC 
GGGAGGACGA 
CATCTCAGAG 
GGGCGGCCTG 
ATGACAGCGG 
ATAAAAAACT 
TAGCCTGCGG 
CGCTCCCGGG 
GAGGCTCCAT 
TTAACAATCC 
ATGGAGCCGG 
AGAACAATGA 
AACCAGTGTG 
CCGGGTGGGG 
TGCTTCTCAT 
CAGCCATGAT 
GAGGGCCTCT 
GTTCTGGCTG 
ACTGGATTTA 
CGTTTTACAA 
ATGATTCAGA 
TGCCATACTG 



11 

1 

ACATTCCAGA 
AGGGTCACCA 
CTATCCCGCA 
CCCGTCCCCC 
CTGCACGCAG 
GTGCATCACC 
CTGGAAGTTC 
CTGCATCAAC 
GAATCGGTGT 
GAAGTCCTGG 
CAGGGACATG 
ATCCACCAGC 
GTACCACAGT 
GGTCAACTTG 
GGCCTGGCCC 
CATCACCCCC 
ATGGCATTGG 
ATACCAAGTA 
CATTGCGCTG 
TCTGCCCAAC 
GGCCACCGAG 
TGAGACACAG 
CTGTGCCGGC 
GGTCACTTCG 
TGCCAAAGCT 
TCGACAAATG 
GAAAACAATG 
GGTCACTTCA 
TGCAGGCTGC 



21 
I 

TACCTATCAT 
CCAGCTATTG 
CAGCCCACTG 
GTGCCCCAGT 
CCCAAATCCC 
TTGACCCTGG 
ATGGGCAGCA 
CCCTCTAACT 
GTTCGCCTCT 
CACCCTGTGT 
GGCTATAAGA 
TTTATGAAAC 
GATGCCTGTT 
AACTCAAGCC 
TGGCAGGTCA 
GAGTGGATCG 
ACGGCATTTG 
CAAAAAGTGA 
ATGAAGCTGC 
CCAGGCATGA 
GAGAAAGGGA 
AGATGCAACA 
TTCCTGCAGG 
AACAACAATA 
TACAGACCAG 
AAGGCAAACG 
GGGCTGGTTT 
TTTTTATTAA 
AGTGGCTCCC 



31 
I 

TACTCGATGC 
GACCTTACTA 
TGGTCCCCAC 
ACGCCCCGAG 
CATCCGGGAC 
GGACCTTCCT 
AGTGCTCCAA 
GGTGTGATGG 
ACGGACCAAA 
GCCAAGACGA 
ATAATTTTTA 
TGAACACAAG 
CTTCAAAAGC 
GCCAGAGCAG 
GCCTGCACGT 
TGACAGCCGC 
CGGGGATTTT 
TTTCTCATCC 
AGAAGCCTCT 
TGCTGCAGCC 
AGACCTCAGA 
GCAGATATGT 
GGAACGTCGA 
TCTGGTGGCT 
GAGTGTACGG 
GCTAATCCAC 
TGCTTCCCCG 
ACAGTGAACT 
CTGCCCAGCC 



41 
I 

TGTTGATAAC 
TGAAAACCAT 
TGTCTACGAG 
GGTCCTGACG 
AGTGTGCACC 
CGTGGGAGCT 
CTCTGGGATA 
CGTGTCACAC 
CTTCATCCTT 
CTGGAACGAG 
CTCTAGCCAA 
TGCCGGCAAT 
AGTGGTTTCT 
GATCGTGGGC 
CCAGAACGTC 
CCACTGCGTG 
GAGACAATCT 
AAATTATGAC 
GACTTTCAAC 
AGAACAGCTC 
AGTGCTGAAC 
CTATGACAAC 
TTCTTGCCAG 
GATAGGGGAT 
GAATGTGATG 
ATGGTCTTCG 
TGCATGATTT 
TGTCTGGCTT 
TGCTCTCCCT 



51 
I 

AGCAAGATGG 
GGATACCAAC 
GTGCATCCGG 
CAGGCTTCCA 
TCAAAGACTA 
GCGCTGGCOG 
GAGTGCGACT 
TGCCCCGGCG 
CAGATOTACT 
AACTACGGGC 
GGAATAGTGG 
GTCGATATCT 
TTACGCTGTT 
GGTGAGAGCG 
CACGTGTGCG 
GAAAAACCTC 
TTCATGTTCT 
TCCAAGACCA 
GACCTAGTGA 
TGCTGGATTT 
GCTGCCAAGG 
CTGATCACAC 
GGTGACAGTG 
ACAAGCTGGG 
GTATTCACGG 
TCCTTGACGT 
ACTCTTAGAG 
TGGCACTCTC 
AACCCCTTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



903 



WO 03/042661 



PCT/US02/36810 



COGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1B00 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGAOGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
CTGAGTTCAA AGCCATCTT 



A153 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



Transmembrane protease, 

Hs. 105807 

T48536 

NP_00S647.1 

none found 

85-107 

plasma membrane 



serine 2 (TMPRSS2) 



l 
I 

MALNSGSPPA 
SNPWCTQPK 
DSSGTCINPS 
GRAACRDMGY 
CLACGVNLNS 
PLNNPWHWTA 
VKPVCLPNPG 
TPAMICAGFL 
TDWIYRQMKA 



11 
I 

IGPYYENHGY 
SPSGTVCTSK 
NWCDGVSHCP 
KNNFYSSQGI 
SRQSRIVGGE 
FAGILRQSFM 
MMLQPEQLCW 
QGNVDSCQGD 
NG 



21 
I 

QPENPYPAQP 
TKKAbCITLT 
GGBDENRCVR 
VDDSGSTSFM 
SALPGAWPWQ 
PYGAGYQVQK 
I SGWGATEEK 
SGGPLVTSNN 



31 
I 

TWPTVYBVH 
LGTFLVGAAL 
LYGPNFILQM 
KLiNTSAGNVD 
VSLKVQNVHV 
VISHPNYDSK 
GKTSEVLNAA 
NIWWLIGDTS 



41 
I 

PAQYYPSPVP 
AAGLLWKFMG 
YSSQRKSWHP 
IYKKLYHSDA 
CGGSIITPEW 
TKNNDIALMK 
KVLLIETQRC 
WGSGCAKAYR 



51 
1 

QYAPRVLTQA 60 

SKCSNSGIEC 120 

VCQDDWNBNY 160 

CSSKAWSLR 240 

IVTAAHCVEK 300 

LQKPLTFNDL 360 

NSRYVYDNLI 420 

PGVYGNVMVP 480 



A1S4 DNA SEQUENCE 

Gene name: Homo sapiens cDNA FLJ13581 fis, clone PLACE1009039 
Unigene number: Hs. 129179 

Probeset Accession ft: AI694767 
Nucleic Acid Accession ft: AI694767 

Coding sequence: 130-1086 (underlined sequences correspond to start and stop codons) 



1 

I 

CAGAGAGGCT 
GGGGTCACAC 
AGCTTCTTCA_ 
ATAGGCCTCC 
TACCTTATTG 
CTGCATGAGC 
ACCTCATCCA 
GATGCTTGTC 
CTGCTGGCCA 
GTACTTACGT 
CTGATGGCAC 
TCCCATTCCT 
AATGTCGTCT 
TOCTTCTCAT 
AAGGCATTTG 
ATTGGATTGT 
TTGGCCAATA 
ACAAAGGAGA 
CC CTAGG TGT 
GTTAACATTT 
ATCCTTCAAA 
GTTTTCTTGC 
TTTTCATTTT 
GAGATAAGAA 
TAAACACAGA 
ACTCCCAACC 
AAATAATTTT 
AGAGTACATT 
ATGGACCCTG 
TTAGTACCCT 
GGGGTCATAC 
GGAAGAACTG 
TTCTARAGGA 
GCAACAGAAC 
AATTACCTGT 
AGAAAGTCTG 
TGATAGGCAG 
TGAAGATAAC 



11 

I 

GTATTTCAGT 
ATTCCTTCCA 
TGATGGTGGA 
CTGGTTTAGA 
CTGTGCTAGG 
CCATGTATAT 
TGCCCAAAAT 
TGCTACAGAT 
TGGCTTTTGA 
TGCCTCGTGT 
CCCTTCCTGT 
ACTGCCTACA 
ATGGCCTTAT 
ATCTGCTTAT 
GCACTTGCGT 
CCATGGTGCA 
TCTATCTGCT 
TTCGACAGCG 
CAGTGATCAA 
TGGAAGACAG 
TATGAAACTG 
TACATATAAT 
ACCATGCAGT 
TGGTACATCT 
ATATAATAAA 
ACATTGGATC 
TCCTCTGGAC 
TAOCTACGTT 
TTTTTCCTAT 
CATTGTAGCC 
AAGTATAAAA 
TTAAAGAGAC 
GGTATTTAAT 
TCATGGCTTT 
GTCTTGGAAG 
CATAGGGCTT 
TGAGGTTAGG 
ATTGGCCTTT 



21 
I 

GCAGCCTGCC 
TAOGGTTGAG 
TCCCAATGGC 
AGAGGCTCAG 
TAACTTGACA 
ATTTCTTTGC 
GCTGGCCATC 
GTTTGCCATC 
CCGCTATGTG 
CACCAAAATT 
CTTCATCAAG 
CCAAGATGTC 
CGTCATCATC 
TCTTAAGACT 
CTCTCATGTG 
TCGCTTTAGC 
GGTTCCTCCT 
CATCCTTCGA 
ACTTCTTTTC 
TATTCAGAAA 
GTTGGGGAAT 
TATTAATACC 
CCAAATCTAA 
AGAGAACATT 
ATGAGATAAT 
TCAGAAAAAT 
ACTAGCACTT 
AATGAAAGTT 
TTAATTTTCT 
ATGGGAAAAT 
ATTAAAAAAA 
CAACAGGGTA 
TTCTTCTCAC 
AATCCCACTA 
AAGTGATTTC 
ATAGCAAGTT 
GAGCCACCAG 
TGAGTGTGAC 




ATCATCTACA 
ATGCTTTCAG 
TTCTGGTTCA 
CACTCCTTAT 
GCCATCTGTC 
GGTGTGGCTG 
CAGCTGCCCT 
ATGAAGCTGG 
TCCGCCATTG 
GTGTTGGGCT 
TGTGCTGTGT 
AAGCGGCGTG 
GTGCTCAACC 
CTTTTCCATG 
CATTCAGAGT 
AAAAATTTCC 
CTCCATTTTT 
CTGACTAGGT 
ACTGCTTCTA 
TGCCAAAGGC 
CTAGCTTAAA 
ACTGTCTTCA 
AAGGGGAACA 
GACACACTGT 
TATCAACCCT 
TGATGTTCAG 
AAAGACTTCA 
GTGGGTTAGA 
TCATCCAGTG 
GCTATTGCTT 
TAGGTTCACC 
ATTTATTTTT 
TTATGATGGO 
TCGTAGCTGQ 



41 

I 

TGGAGGAAGA 
CCTGGTGCTG 
GTGCTACATA 
CCTTCCCATT 
TTGTGCGGAC 
GCATTGACAT 
ATTCCACTAC 
CTGGCATGGA 
ACCCACTGCG 
CTGTGGTGCG 
TCTGCCGCTC 
CCTGTGATGA 
6CCTGGACTC 
TGACACGTGA 
TCATATTCTA 
ACTCTCCACT 
CAATTGTCTA 
TGGCCACACA 
CCTCTGATTC 
TTAATAAAAA 
TCAATATTAT 
TGTGGTTGGA 
CTGATGGTTT 
CTAAGCACAG 
ACTATAACTT 
AAATGACTTC 
TTGGAAGTAA 
TCTGAGAGTT 
TTAATTAGGC 
TGGGGATCAG 
TGCCCAATCT 
GATTTCCAGA 
TTGTATTTAG 
ATTGTCCTGG 
ATTATGGAAG 
AAAAGTTCCA 
AAGTATGGAA 
AAAGTGAGGG 



51 
I 

CTGGACAAAG 
GTCACAGTTC 
CTTCATCCTA 
GTGCTCCCTC 
TGAGCACAGC 
CCTCATCTCC 
CATCCAGTTT 
ATCCACAGTG 
CCATGCCACA 
GGGGGCTGCA 
CAATATCCTT 
TATCOGGGTC 
ACTTCTCATC 
AGCCCAGGCC 
TGTACCTTTC 
GCCCGTCATC 
TGGAGTGAAG 
CGCTTCAGAG 
AGATTTTAAT 
TACAACTCAG 
TTTCTTCTTT 
GGGTTATTAC 
ACAGCATTCT 
CAAAGGAAAA 
CCTCTTCAGA 
TACAGAGAAG 
AGCCTTGAAA 
TTCACAGCAT 
AAAGATATTA 
TGAATTAAAT 
CATATGATGT 
GTCTTACATT 
GAATTTCCTG 
TCCAATTGCC 
ATTCTTATTC 
TAGGTGTTTC 
TGGCAGGTGT 
AATCTTCAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



904 



WO 03/042661 



PCT/US02/36810 



ACCATGCTTT ATTTGGGGCT TTGTGCAGTA TGGAACAGGG ACTTTGAGAC CGGGAAAGCA 2340 

ATCTGACTTA GGCATGGGAA TCAGGCATTT T T GCTTC T G A QGGGCTATTA CCAAGGGTTA 2400 

ATAGGTTTCA TCTTCAACAG GATATGACAA CAGTCTTAAC CAAGAAACTC AAATTACATA 2460 

TACTAAAACA TGTGATCATA TATGTGGTAA GTTTCATTTT CTTTTTCAAT CCTCAGGTTC 2S20 

CCTGATATGG ATTCCTATNA CATGCTTTCA TCCCCTTTTG TAATGGATAT CATATTTGGA 2 580 

AATGCCTATT TAATACTTGT ATTTGCTGCT GGACTGTAAG CCCATGAGGG CACTGTTTAT 2640 

TATTGAATGT CATCTCTGTT CATCATTOAC TGCTCTTTGC TCATCATTGA ATCCCCCAGC 2700 

AAAGTGCCTA GAACATAATA GTGCTTATGC TTGACACCGG TTATTTTTCA TCAAACCTGA 2760 

TTCCTTCTGT GCTGAACACA TAGCCAGGCA ATTTTCCAGC CTTCTTTGAG TTGGGTATTA 2820 

TTAAATTTTA GCCATTACTT CCAATGTGAG TGGAAGTGAC ATGTGCAATT TTTATACCTG 2880 

GCTCATAAAA CCCTCCCATG TGCAGOCTTT CATGTTGACA TTAAATGTGA CTTGGGAAGC 2940 

TATGTGTTAC ACAGAGTTAA TTAACCNGAA AGGCCTGGNA ATTTTTTGNN AANNAAACTG 3000 

TGGCCNNGAG GCCCNCAACC CTTTTTNNNA ATTTGGCAAN NTCCCACTTT GTANTTTGGT 3060 
AAGGAGGCCA GTTGGATAAG TGAAAAATAA AGTACTATTG TGTC 



A1S5 PROTEIN SEQUENCE 
Gene name: 
Unigene number: 
Protein Accession ft: 
Signal sequence: 
Pfam domain: 
Transmembrane domains: 
Cellular Localization: 



Homo sapiens cDNA FLJ13581 fis, clone PLACE100903 9 
Hs. 129179 

not available 
none found 
7ttn 1 [43-293] 

29-51, 57-79, 82-104, 203-225, 239-261, 273-295 
not determined 



1 11 21 31 41 51 

I I I I I I 

MVDPNGNESS ATYFXLIGLP GLEEAQFWLA PPLCSLYLIA VLGKLTIIYI VRTEHSLHEP 60 
MYIFLCMLSG IDILISTSSM PKMLAIPWFN STTIQFDACL LQMFAXHSLS GMESTVLLAM 120 
AFDRYVAICH PLRHATVLTI* PRVTKIGVAA WRGAALMAP LPVFIKQLFF CRSNILSHSY 180 
CLHQDVMKLA CDDIRVNWY GLIVIISAIG LDSLLISFSY LLILKTVLGL TREAQAKAFG 240 
TCVSHVCAVF IFYVPFIGLS MVHRFSKRRD SPLPVILANI YLLVPPVLNP IVYGVKTKEI 300 
RQRILRLFHV ATHASEP 

A156 DMA SEQUENCE 

Gene name: vasoactive intestinal peptide receptor 1 

Unigene number: Hs. 198726 

Probeset Accession ft: X77777 

Nucleic Acid Accession #: NMJ>04 624.1 

Coding sequence: S7~1544 (underlined sequences correspond to start and stop codons) 



TCGGAGCCTG CGGAGGGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACTCATGC 60 

CTCCTCCTCC TCTGCTCTCG CTCAGGCGCC TCGGTGGCGG TTGGTCGGCG GTTACGCGGC 120 

TGGTGGTCGC GGCGGCCGGG GCTCGCTCTC GGGGAGGCCG GGGCGGATCT CGCGGCGCAG 180 

GCGGCGGCGG CCGAGGTGGG GTCGCGCGGC GGAGGCGGCT CGAGCTTCGT GCTGCGCGCT 240 

CGCTCTTGGG CTCCTCGCTG CAGGAGGAGT GTGACTATGT GCAGATGATC GAGGTGCAGC 300 

ACAAGCAGTG CCTGGAGGAG GCCCAGCTGG AGAATGAGAC AATAGGCTGC AGCAAGATGT 360 

GGGACAACCT CACCTGCTGG CCAGCCACCC CTCGGGGCCA GGTAGTTGTC TTGGCCTGTC 420 

CCCTCATCTT CAAGCTCTTC TCCTCCATTC AAGGCCGCAA TGTAAGCCGC AGCTGCACCG 480 

ACGAAGGCTG GACGCACCTG GAGCCTGGCC CGTACCCCAT TGCCTGTGGT TTGGATGACA 540 

AGGCAGOGAG TTTGGATGAG CAGCAGACCA TGTTCTACGG TTCTGTGAAG ACCGGCTACA 600 

CCATTGGCTA CGGCCTGTCC CTCGCCACCC TTCTGGTCGC CACAGCTATC CTGAGCCTGT 660 

TCAGGAAGCT CCACTGCACG CGGAACTACA TCCACATGCA CCTCTTCATA TCCTTCATCC 720 

TGAGGGCTGC OGCTGTCTTC ATCAAAGACT TGGCCCTCTT CGACAGCGGG GAGTCGGACC 780 

AGTGCTCCGA GGGCTCGGTG GGCTGTAAGG CAGCCATGGT CTTTTTCCAA TATTGTGTCA 840 

TGGCTAACTT CTTCTGGCTG CTGGTGGAGG GCCTCTACCT GTACACCCTG CTTGCCGTCT 900 

CCTTCTTCTC TGAGCGGAAG TACTTCTGGG GGTACATACT CATCGGCTGG GGGGTACCCA 960 

GCACATTCAC CATGGTGTGG ACCATCGCCA GGATCCATTT TGAGGATTAT GGTCTGCTCA 1020 

GGTGCTGGGA CACCATCAAC TCCTCACTGT GGTGGATCAT AAAGGGCCCC ATCCTCACCT 1080 

CCATCTTGGT AAACTTCATC CTGTTTATTT GCATCATCCG AATCCTGCTT CAGAAACTGC 1140 

GGCCCCCAGA TATCAGGAAG AGTGACAGCA GTCCATACTC AAGGCTAGCC AGGTCCACAC 1200 

TCCTGCTGAT CCCCCTGTTT GGAGTACACT ACATCATGTT CGCCTTCTTT CCGGACAATT 1260 

TTAAGCCTGA AGTGAAGATG GTCTTTGAGC TCGTCGTGGG GTCTTTCCAG GGTTTTGTGG 1320 

TGGCTATCCT CTACTGCTTC CTCAATGGTG AGGTGCAGGC GGAGCTGAGG CGGAAGTGGC 1380 

GGCGCTGGCA CCTGCAGGGC GTCCTGGGCT GGAACCCCAA ATACCGGCAC CCGTCGGGAG 1440 

GCAGCAACGG CGCCACGTGC AGCACGCAGG TTTCCATGCT GAOCCGCGTC AGCCCAGGTG 1500 

CCCGCCGCTC CTCCAGCTTC CAAGCCGAAG TCTCCCTGGT CTGACCACCA GOATCCCAGC 1560 

CCAAGCGGCC CCTCCCGCCC CTTCCCACTC GCAGCAGACG CCGGGGACAG AGGCCTGCCC 1620 

GGGCGCGCCA GCCCCGGCCC TGGGCTCGGA GGCTGCCCCC GGCCCCCTGG TCTCTGGTCC 1680 

GGACACTCCT AGAGAACGCA GCCCTAGAGC CTGCCTGGAG CGTTTCTAGC AAGTGAGAGA 1740 

GATGGGAGCT CCTCTCCTGG AGGATGCAGG TGGAACTCAG TCATTAGACT CCTCCTCCAA 1800 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG TCTACATACT TTCATCCTGA CTCTGCCCCC 1860 

TGCTGGCTCT TCTGCCCAAT TGGAGGAAAG CAACCGGTGG ATCCTCAAAC AACACTGGTG 1920 

TGACCTGAGG GCAGAAAGGT TCTGCCCGGG AAGGTCACCA GCACCAACAC CACGGTAGTG 1980 

CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGGGTTAA GCATTACCAC TCAGGCATTT 2040 

GACTGAAGAT GCAGCTCACT ACCCTATTCT CTCTTTACGC TTAGTTATCA GCTTTTTAAA 2100 

GTGGGTTATT CTGGAGTTTT TGTTTGGAGA GCACACCTAT CTTAGTGGTT CCCCACCGAA 2160 

GTGGACTGGC CCCTGGGTCA GTCTGGTGGG AGGACGGTGC AACCCAAGGA CTGAGGGACT 2220 

CTGAAGCCTC TGGGAAATGA GAAGGCAGCC ACCAGCGAAT GCTAGGTCTC GGACTAAGCC 2280 

TACCTGCTCT CCAAGTCTCA GTGGCTTCAT CTGTCAAGTG GGACTCTGTC ACACCAGCCA 2340 

TTCTTATCTC TCTGTGCTGT GGAAGCAACA GGAATCAAGA GACTGCCCTC CTTGTCCACC 2400 

CACCTATGTG CCAACTGTTQ TAACTAGGCT CAGAGATGTG CACCCATGGG CTCTGACAGA 2460 



905 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 1 

35 



AAGCAGATCC TCACCCTGCT ACACATACAG GATTTGAACT CAGATCTGTC TGATAGGAAT 2520 

GTGAAAGCAC GGACTCTTAC TGCTAACTTT TGTGTATCGT AACCAGCCAG ATCCTCTTGG 2580 

TTATTTGTTT ACCACTTGTA TTATTAATGC CATTATCCCT GAATTCCCCT TGCCACCCCA 2640 

CCCTCCCTGG AGTGTGGCTG AGGAGGCCTC CATCTCATGT ATCATCTGGA TAGGAGCCTG 2700 

CTGGTCACAG CCTCCTCTGT CTGCCCTTCA CCCCAGTGGC CACTCAGCTT CCTACCCACA 2760 

CCTCTGCCAG AAGATCCCCT CAGGACTGCA ACAGGCTTGT GCAACAATAA ATGTTGGCTT 2820 
GGAAAAAAAA AAAA 



A157 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



vasoactive intestinal peptide receptor 1 

Hs. 198726 

X77777 

JC2195 

none found 

181-202, 214-236, 255-277. 290-311, 332-354, 
plasma membrane 



377-399, 408-430 



MPPPPLLSLR 
RSLLGSSLQB 
CPLIFKLFSS 
YTIGYGLSLA 
DQCSEGSVGC 
PSTFTMVWTI 
LRPPDIRKSD 
WAILYCPLN 
GARRSSSFQA 

A1S8 DNA 



11 
I 

RLGGGWSAVT 
ECDYVQMIEV 
IQGRNVSRSC 
TLLVATAILS 
KAAMVFFQYC 
ARIHFEDYGL 
SSPYSRLARS 
GEVQAELRRK 
EVSLV 



21 
I 

RLWAAAGAR 
QHKQCLEEAQ 
TDEGWTHLEP 
LFRKLHCTRN 
VMANFFWLLV 
LRCWDTINSS 
TLLLIPLFGV 
WRRWHLQGVL 



31 

I 

SRGGRGGSRG 
LENETIGCSK 
GPYPIACGLD 
YIHMHLFISF 
EGLYLYTLLA 
LWHIIKGPIL 
HYIMFAFFPD 
GWNPKYRHPS 



41 

1 

AGGGGRGGVA 
KWDNLTCWPA 
OKAASLDEQQ 
ILRAAAVFIK 
VSFFSERKYF 
TSILVNFILF 
NFKPEVKMVF 
GGSNGATCST 



51 
I 

RRRRLELRAA 60 

TPRGQVWLA 120 

TMFYGSVKTG 180 

DLALFDSGES 240 

WGYILIGWGV 300 

ICIIRILLQK 360 

ELWGSFQGF 420 

QVSMLTRVSP 480 



Gene name: ESTs 
unigene number: Hs ,29383 

Probeset Accession ft: AW207206 
Nucleic Acid Accession ft: AL133619 

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 



40 
45 
50 
55 
60 
65 
70 
75 
80 



1 11 21 31 41 51 

I I 1 I I I 

ATGAGCGGTG OGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 

CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 

CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 

GAAAACAAGG GTGAGCCGGC GOGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 

CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGOAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCA6 GAGGACATCT GACTGGTGGA 960 

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGOACAC GGGAAGAGGC CATQCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGQACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GOGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 

CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 

AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 

ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 

ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860 

CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 

CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAAGGGCGCC TGCATCGCTC AGTGCTTTGA 



A1S9 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs.29383 

AW207206 

T434S7 

none found 

303-322 

not determined 



906 



WO 03/042661 



PCTAJS02/36810 



1 11 21 

, I I I 

J MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP 
GGTGDGEPLQ TVLAHLAALA PVGQPSGYRF 
GPEVIAGRQV ATGCSPDLPP PSRAENGRNP 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS 

U AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD 
EKAEASNAGA AOlGNSQHQG RQMGAGAHPP 

5 TQELRHLKSL LEGSQRPQAA PEEASPPRDQ 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 



31 41 51 

I I i 

VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 

RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 

WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180 

WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 

RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 

AVPRALPSQG DMEKGVBGGP FPSRCGNSSE 360 

LGTCCSMCPK PSCPPDGPSG NHLSRASAPL 420 

RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 

VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 

MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 

EATHFPKVST KSLSKKCLSP PVAERAILPA 660 



20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



A160 DNA SEQUENCE 

Gene name: Liv-i protein, 

Unigene number: 

Probeset Accession #: 

Nucleic Acid Accession #: 

Coding sequence: 



estrogen regulated 
Hs. 79136 
U41060 
NM_012319.2 

138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTCGTGCCGA ATTOGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 
CCAGTGGGCC OGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC . TTCTCAAAAC ATAGAAGAAA 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 
CTCACAGTCA CAGCCAGCGC TACTCTGGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATQTG GATATTTGCA CTTACTGCTG 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGGAT AGATTCTTAT 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 
TTCGTGOGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 

907 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



WO 03/042661 



PCT/US02/36810 



GAGCAATTGT CTTTATATAC GGTACTOTAG CCATACTAGG CCTG TC TGT G GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 



A161 PROTEIN SEQUENCE 
Gene name: 
Unigene number: 
Protein Accession ft : 
Signal sequence: 
Pfara domain* 
Transmembrane domains: 
Cellular Localization: 



LIV-1 protein, estrogen regulated 
Hs. 79136 

NP 036451 

1-21 

Zip 1591-743) 

330-346, 352-368, 427-444, 663-679, 688-703, 730-745 
plasma membrane 



1 
I 

MARKLSVILI 
FYRYGENNSL 
EHHSDHDHHS 
SVSASEVTST 
NESVSEPHKG 
RSCLIHTSEK 
LVALAVGTLS 
TWKGLTALGG 
EEKVDTDDRT 
HSHFHDTLGQ 
MGDGLHNFSD 
NALSAMLAYL 
RWGYFFLQNA 



11 
! 

LTFALSVTNP 
SVEGFRKLLQ 
HHNHAASGKN 
VYNTVSEGTH 
FMYSRNTNEN 
KAEIPPKTYS 
GDAFLHUjPH 
LYFMPLVEHV 
EGYLRADSQE 
SDDLIHHHHD 
GLAIGAAFTE 
GMATGIFIGH 
GMLLGFGIML 



21 
I 

LHELKAAAFP 
NIGIDKIKRI 
KRKALCPDHD 
PLETIETPRP 
PQECFNASKL 
LQIAWVGGFI 
SHASHHHSHS 
LTLIKQFKDK 
PSHFDSQQPA 
YHHILHHHHH 
GLSSGLSTSV 
YAENVSMWIP 
LISIFEHKIV 



31 

i 

QTTEKISPNW 
HIHHDHDHHS 
SDSSGKDPRN 
GKLFPKDVSS 
LTSHGMGIOV 
AISIISPLSL 



41 


51 




1 

ESGINVDLAI 


1 

STRQYHLQQL 


60 


DHEHHSOHER 


HSDHEHHSDH 


120 


SQGKGAHRPE 


HASGRRNVKD 


180 


STPPSVTSKS 


RVSRIiAGRKT 


240 


PLNATEPNYL 


CPAIINQIDA 


300 


LGVILVPLMN 


RVFFKPLLSF 


360 


GPLPSHLSSQ 


NIEESAYFDS 


420 


DDDVEIKKQL 


SKYESQLSTN 


480 


HAHPQEVYNE 


YVPRGCKNKC 


540 


RYSREELKDA 


GVATLAWMVI 


600 


LGDFAVLLKA 


GMTVKQAVLY 


660 


ALVDMVPEML 


HNDASDHGCS 


720 



A162 DNA sequence 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 
Coding sequence: 



KKKNQKKPEN 
VLEBEEVMIA 
QNHHPHSHSQ 
AVFCHEIiPHE 
ALTAGLFMYV 
PR INF 



bone morphogenetic protein receptor IB (ALK-6) 

Hs. 87223 

AA250737 

NM_001203 

274-1782 (underlined sequences correspond to start and stop codons) 



CGCGGGGCGC 
GAGGACGCGG 
GTGAAAGGAA 
CATAACCATT 
TGCCATAAGT 
AATGTGGGCA 
TTGCGTTGTA 
GACGGATATT 
GGTTGCCTAG 
AGAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTCCTCCTG 
TCAGGCCTCC 
ATTGGAAAAG 
GTGAAAGTGT 
ACAGTGTTGA 
GGGTCCTGGA 
TATCTGAAGT 
AGTGGCTTAT 
CATCGAGATC 
GACCTGGGCC 
ACTCGAGTTG 
AATCACTTCC 
GTTGCTAGGA 
CTAGTGCCCA 
CGCCCCTCAT 
ATGACAGAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTTTGTA 



11 
I 

GGAGTCGGCG 
GAGCCGGGAG 
AGGAAGATCA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCAOGAT 
GACTAGAAGG 
TTGAATGCTG 
TGAAAAACAG 
CTGTCTGTAG 
AAGAAACCAG 
GAGAATCCCT 
CTCTGCTGGT 
GTCGCTATGG 
TCTTCACCAC 
TGAGGCATGA 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCTGTTAA 
GCACCAAACG 
AGTCTTACAT 
GATGTGTATC 
GTGACCCCTC 
TCCCAAACCG 
GCTGGGCTCA 
AAATGTCAGA 
GAAAGCCAAC 
CAGTACAAGC 
GCGACCTGGG 
GGCGGAGAAA 



21 
I 

GGGCCTCGCG 
CGCACGCGCG 
TTTCATGCCT 
CTATGACAAG 
CTTCCTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAGATTTT 
CACAGAAAGG 
AGATTTTGTT 
TTTGCTCTTG 
ACCTCGATAC 
GAOAGACTTA 
CCAAAGGACT 
GGAAGTTTGG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATCACA 
AGACGCTAAA 
CACAGAAATC 
AAACATTCTG 
ATTTATTAGT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 
GTGGAGCAGT 
CAATCCTGCA 
GTCCCAGGAC 
AGGTACTCTT 
CTTGAACATC 
CAAAGACAGA 
COGTTGGGTA 



31 
I 

GGACGCGGGC 
GGGTGGAGTT 
TGTTGATAAA 
AGAGGAAACA 
AACATGCTTT 
AGTACAGCCC 
GAAGACTCAG 
GATGACTCTG 
CAGTGTCGGG 
AACGAATGTA 
GATGGACCTA 
GTCCTTATCA 
AGCATTGGGT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGGGAAAGT 
AGCTGGTTCA 
GGTTTCATTG 
GACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATGAGGGAGA 
GATGAGTGTC 
TCAAGGCTGA 
ATTAAACTCT 
CTGTTTGTGG 
GTCCTGCTTC 
GAAGCTCCCA 
ACTTGTTCAA 



41 
I 

AGTGCGGAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
TGCGAAGTGC 
CCACCCCCCG 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TATTATTTTG 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGCGTGGCGA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TGGACGAGAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTGTGTGCAT 
TAAGGCAGAT 
CAGCCCTGCG 
GATAGGAGAG 
GCAGAGCAAA 
CCAGTGGGTT 
GAAGGAGAGA 
GATATGATGC 



51 
I 

CCGCGGCGCT 
TTTCTTAGAT 
TCTGCTGATT 
CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATCAA 
ACACCCTACA 
GGCTTTACTT 
TTACTTCCGG 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGGACA 
CCTTTATGAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTTGGGAG 
TTATCATGAC 
CAAGAAGTTA 
GGGAAAACTC 
GGTTAAGAAA 
GAAAAGTAAG 
AGACATCAAA 
CAGACCTCAC 
TTGATCCGTG 
AT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



A163 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
PFAM domains: 
Cellular Localization: 



bone morphogenetic protein receptor IB (ALK-6) 

HS. 72472 / Ks. 87223 

AA250737 / U89326 

NP_0 01194 

1-13 

128-144 

activin_receptor [30-111] , protein kinase [204- 491 J 
plasma membrane 



908 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

I I I I I I 

MLIiRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSWNICSTD GYCFTMI EED 60 

DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGBSLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIP STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWBV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHKPAS 480 
RIiTALRVKKT LAKMSESQDI KL 



A164 DNA sequence 

Gene name: ESTs 

Unigene number: Hs. 157601 

Probeset Accession #: W07459 

NUcleic Acid Accession AC005383 

Coding Sequence: 328-2751 (underlined 



sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I i I I 

GACAGTGTTC GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120 

CCTGGCGGTA GTTCCTCCGA CCTCAGCCGG GTCGGGTCG7 GCCGCCCTCT CCCAGGAGAG 180 

ACAAACAGGT GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240 

CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300 

TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360 

GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 4B0 

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540 

CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600 

GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720 

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900 

GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960 

GAGGATGCCA CCAAOGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020 

ACGCGAGACT GCAGGGTCGA GGCTCACCCC TGTGAQCACA GGACGCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATQCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200 

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260 

CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 

TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 

GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 

GCCGTGCTGA GCGAGGACTC TO0GGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500 

CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCT G GTCTG GAGCCTCGAT 1560 

GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGGA GGCGGCAGAG 1620 

CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTCACTGAGT CACACTCCGA GGATGAGGTT G06GGCCCAG CGOGTCACGC AAGGGOGCGA 1740 

GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GGCGGACACA AGCCCTGGAC 1920 

CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100 

GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 

GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC OCCACTGOGA GAACCGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640 

ATGGCTCCGG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCC TTA GA ATGTCTGC 2760 

TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880 

TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTOTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGOGGCCTGA 3000 

CGTT CCTT TG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

AGGC CTTTA C TAGAGCATCC TTTGGACGGC GAAGGCCACG GO CTTTC AAQ ATGGAAAGCA 3120 

GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 

CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

TG TGCA TGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360 
ACCTTGAAGG TCTTC 



909 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 



A165 Protein sequence 
Gene name: 
Unigene number t 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
vgw domains: 
EGP domains: 
Cellular Localization: 



ESTs 

Hs. 157601 

none found 
1-17 

none found 

49-223; 341-518; 529-706 
298-333; 715-748 
secreted 



MPPFLLLEAV 
SVGK6SFERS 
MVFKGGRTET 
FAVGVRPPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNBGSCVLQ 
RTPPSNYREG 



11 

I 

CVFLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REPAGNAPCW 
VPBGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLH1YDK 
LSEGLRRLAG 
NGSYRCKCRD 
LGTEMVPTFW 



21 
I 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRWV 
SDPQDIiFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 
NVCAPGP 



31 

I 

KETIGKISAA 
GAFQFSSTPH 
POILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSIiEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
VADLRYHQDV 
EWSSCSVCVS 



41 
I 

SKMKWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVHSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 
QGWILBTPLR 



51 
I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 
HMAPVQEGSS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A166 DNA sequence 

Gene name: 
Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 
Coding Sequence: 



Homo sapiens beta-1 adrenergic receptor mRNA, 3* UTR 

Hs. 37744 

AA011176 

AF272890 

87-1520 (underlined sequences correspond to start and stop codons) 



TGCTACCCGC 
CCCGCCCCCG 
AGCCCGGTAA 
TGCTGGTGCC 
CGCTGTCTCA 
TCGTGGCGGG 
TCACCAACCT 
TGCCGTTCGG 
AGCTGTGGAC 
TTGCCCTGGA 
GCGCGCGGGC 
TGCCCATCCT 
ACCCCAAGTG 
CCTTCTACGT 
AGAAGCAGGT 
CGCCCTCGCC 
CCGCCGCCGC 
CGCGCCTCGT 
TCTTCACGCT 
AGCTGGTGCC 
TCAACCCCAT 
GCTGCGCGCG 
CGGGCTGTCT 
ACGACGATGT 
ACGGCGGGGC 
CCTCGGAATC 
GGGAACGAGG 
CCTCGTCTGA 
TTTGGGAAGG 



11 

I 

GCCCGGGCTT 
GCCTCCGCAG 
CCTGTCGTCG 
CGCGTCGCCG 
GCAGTGGACA 
CAATGTGCTG 
CTTCATCATG 
GGCCACCATC 
CTCAGTGGAC 
CCGCTACCTC 
GCGGGGCCTC 
CATGCACTGG 
CTGCGACTTC 
GCCCCTGTGC 
GAAGAAGATC 
CTCGCCCTCG 
CGCCGCCACC 
GGCCCTACGC 
CTGCTGGCTG 
CGACCGCCTC 
CATCTACTGC 
CAGGGCTGCC 
GGCCCGGCCC 
CGTCGGGGCC 
GGCGGCGGAC 
CAAGGTGTAG 
AGATCTGTGT 
ATCATCCGAG 
GATGGGAGAG 



21 
I 

CTCGGCATGG 
GCCGCACCGC 
CCCGCCTCGT 
GCGGGCATGG 
GTGATCGTGG 
TCCCTGGCCA 
GTGGTGTGGG 
GTGCTGTGCG 
GCCATCACCT 
GTGTGGAGGG 
TGGCGGGCGG 
GTCACCAACC 
ATCATGGCCT 
GACAGCTGCG 
CCCGTCCCCG 
QCCCCGCTGG 
GAGCAGAAGG 
CCCTTCTTCC 
TTCGTCTTCT 
CGCAGCCCCG 
CGCCGGCGCC 
GGACCCCCGC 
ACGCCGCCCG 
AGCGACTCGA 
GGCCCGGCGC 
TTACTTAAGA 
GCAAAGAGAA 
TGGCTTGCTG 




GTCTGCTGAT 
CCATCGCCAA 
GCGCCGACCT 
GCCGCTGGGA 
TGACGGCCAG 
CGCCCTTCCG 
TGTGGGCCAT 
AGAGCGACGA 
GGGCCTACGC 
TCGTGTACCT 
AGCGCCGTTT 
CGCCCGCGCC 
CCAACGGGCG 
CGCTCAAGAC 
TGGCCAACGT 
TCAACTGGCT 
ACTTCCGCAA 
ACGCGACCCA 
CATCGCCCGG 
CGCGCCTGCT 
GCCTGGACGA 
GGGGCGCGGA 
CCGATAGCAG 
AAGCCACGGA 
ATGTTCCTTG 



41 
I 

GGCCCAGCCC 
GCTCGTCCTG 
CGCGGCCACC 
CGCCAGCGAA 
GGCGCTCATC 
GACGCCGCGG 
GGTCATGGGG 
GTACGGCTCC 
CATCGAGACC 
CTACCAGAGC 
CTCGGCCCTG 
GGCGCGCOGC 
CATCGCCTCG 
GOGGGTGTTC 
CCTCGGCGGC 
GCCGCCCGGA 
TGCGGGTAAG 
GCTGGGCATC 
GGTGAAGGCC 
GGGCTAOGCC 
GGCCTTCCAG 
CGGAGACCGG 
GGCCGCCTCG 
GGAGCCCTGG 
GCCGTGCCGC 
CTCCGGGCAC 
GTGAACTCGA 
CCGTTGCACA 
TTG 



51 
I 

TGCCACACCC 
GGCGCCTCCG 
GCGGCGCGGC 
AGCCCCGAGC 
GTGCTGCTCA 
CTGCAGACGC 
CTGCTGGTGG 
TTCTTCTGCG 
CTGTGTGTCA 
CTGCTGACGC 

TGCTACAACG 
TCCGTAGTCT 
CGCGAGGCCC 
CCAGCGCGGC 
CCCCCGCGCC 
CGGCGGCCCT 
ATCATGGGCG 
TTCCACCGCG 
AACTCGGCCT 
GGACTGCTCT 
CCGCGCGCCT 
GACGACGACG 
GCCGGCTGCA 
CCCGGCTTCG 
GGCT7CCCAG 
AGCCCACAAT 
AAAAGGAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



A167 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Pfam domian: 
Cellular Localization: 



Homo sapiens beta-1 adrenergic receptor 
HS. 37744 
AA011176 
none found 

62-84, 95-117, 135-157, 177-198, 226-248 
7tm_l [75-377] 
plasma membrane 



mRNA, 3» UTR 



11 

! 



21 
I 



31 
I 



41 
I 



51 
I 



MGAGVLVLGA SEPGNLSSAA PLPDGAATAA RLLVPASPPA S LLP PAS ESP EPLSQQWTAG 
MGLLMALIVL LTVAGNVLVI VAZAKTPRLQ TLTNLF1MSL ASADLVKGLL WPFGATIW 



60 
120 



910 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



WGRWEYGSPF CELWTSVDVL CVTASIBTLC VIALDRYLAI TSPFRYQSLL TRARARGLVC 
TVWAISALVS" FLPILMHWWR AESDEARRCY NDPKCCDPVT NRAYAIASSV VSFYVPLCIM 
AFVYLRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPF FLANWKAFH RBLVPDRLPV 
FPNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 
PPSPGAASDD DDDDWGATP PARLLEPWAG CNGGAAADSD SSLDEPCRPG PASESKV 



180 
240 
300 
360 
420 



A168 DMA, sequence 
Gene name : 
Unigene number: 
Probeset Accession 8: 
Nucleic Acid Accession 8: 



CEGP1 
Hs. 222399 
AA256485 
AJ400877 



Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 



GGCGTCOGCG 
CCGCAACCGC 
CGGTGCTGCT 
CGGGTCGGGG 
ATGACTGCCA 
AGCCTGGCTA 
TCAATGGAGG 
TTGATGGCTT 
AGAACAATGG 
GCAAGGAGGG 
GCCTGAGCTG 
GCAGCGTCGC 
TCTTGACCTG 
GCCCAGAGTG 
AGCGAGAGGA 
ATAAACGGGT 
ACCGCACCTG 
TCCAGTTGGA 
GTGATCATTT 
AATTATTAAC 
GTGACCACAG 
CCCTGTATGG 
GTCAGCAGGT 
AGCTCCACTG 
CACCCCGTGT 
GTCACTCTGG 
AGCTAAATGA 
CAGCACTACC 
GCAGCTCTGG 
TTATCACTGT 
TGAGCTGCAT 
AGGCCGTCCA 
AAAAGCCTCC 
CAGAAAACCA 
GCATTTTATG 
GCCCAAGACC 
GTGGAGGTCT 
GTGCCCTGGG 
GCCTTGCCAC 
GTTCACCTGG 
CATACCAGCC 
ACTTTGATGG 
GAGATTTCAC 
AGTGTAOGTG 
TCTTCCTGCC 
CCAATTCTGT 
CCAGGTCAAA 
TCCAGGTCCC 
GAGATGGCAG 
TCAAGGCTCT 
AGTCCCGAGA 
TTTTGAGACC 
GGTTGGTGGG 
CCGTATCAGT 
GAACTTGGTT 
CAGCTTCTCA 
TTGGTCAGCC 
TGTAGTGGAA 
CCGGCCCTCT 
CAAGAGGGGA 
ACTCAGTTTC 
AGTTCTAAGC 
AGCACTTCTG 



11 21 

1 I 
CACACCTCCC CGCGCCGCCG 
TGAGCCATCC ATGGGGGTCG 
GCTGCTGCTG CTGCTGCCGC 
CCGTGCCGCG GGGCCGCAGG 
TGCGGACGCC CTGTGTCAGA 
CCAAGGGGAA GGCAGGCAGT 
CTGTGTCCAT GACTGTTTGA 
CATGTTGGCT CATGACGGTC 
CGGCTGCCAG CATACCTGTG 
GTTTTTCCTG AGTGACAATC 
CATGAATAAG GATCAOGGCT 
CTGTGAGTGC AGGCCTGGTT 
TAACCATGGG AACGGTGGGT 
CAGCTGCCAT CCACAGTACA 
CACTGTCCTG GAGGTGACAG 
GAAACGGCGG CTGCTCATGG 
TAAGGATACT TCGACAGGTG 
TGGGAAGACA TGTAAAGATA 
CTGCAAAAAC ATCGTGGGCA 
AGATGAGAAG TCTTGCCAAG 
CTGCATCAAC CACCCTGGCA 
CTTCACCCAC TGTGGAGACA 
CTGTGTGAAC ACAGTGGGCA 
GAATAAAAAA GACTGTGTGG 
GTCCCTGCAC TGCGGTAAGA 
CATTCACCTC TCTTCAGATG 
AGGCAAGTGT AGTTTGAAAA 
AGAGAAGCAC AGCTCAGTAA 
CAAGCAAGTC CCAGGAGCCC 
TGAGTTTGAG CTTGAAACTA 
CGTAAAGCGA ACOGAGAAGC 
CAGGGAGCAG TTTCACCTCC 
CAGAACATCT GAACGCCAGG 
ATGTGTCAGT TGCAGGGCTG 
TCCAAATGGA ACCTTCCAAA 
AGGAAATTCT GGGGCCCTGA 
GTGTCAACCT GGTGAATATT 
CACGTTCCAG CCTGAAGCTG 
CAAACATCAG GGAGCTACTT 
ACATTTCTAC AACACCACCA 
TGAATTTGGA AAAAATAATT 
CTCCACAAAC ATAACCCAGT 
TGGGTACATT GAATCCCCAA 
GACCATCAAC CCACCCCCCA 
CATAGAGGAC GACTGTGGGG 
GACAACATAT GAAACCTGCC 
GAAGCTGTGG ATTCAGTTCA 
ATACGTGACA TATGATGAGG 
GCTCTATGCA TCTGAGAACC 
GTTTGATGTC CTGGCCCATC 
GATGTTTCCA AGATCGTTCA 
TTACAAATGA CTCAGCCCAC 
ACAGAGCTGT CTTCCTTCTG 
GACTCATTAG AGTTCAATTT 
TTTCTTTCCC AGCATCGTGG 
CTGCTGTGGG OGGATGTCTT 
TAGGTGAGAC TCACCTGTCC 
AGGAGGCCAC AGAATAAGCT 
CTAAGGGAGC CCTCTGCACT 
GGGAAGGAGA CCCCTGCAGG 
TCCACAGCCT TCTCCAGCCT 
AGTGCTCGTG AAAAAAAAAA 
GAGA CAT 



31 41 

I I 

CCGCCACCGC CCGCACTCCG 
CGGGCCGCAA CCGTCCCGGG 
CACTGCTGCT GCTGGCGGGG 
AGGATGTAGA TGAGTGTGCC 
ACACACCCAC CTCCTACAAG 
GTGAGGACAT CGATGAATGT 
ATATTCCAGG CAATTATCGT 
ATAATTGTCT TGATGTGGAC 
TCAACGTCAT GGGGAGCTAT 
AGCACACCTG CATTCACCGC 
GTAGTCACAT CTGCAAGGAG 
TTGAGCTGGC CAAGAACCAG 
GCCAGCACTC CTGTGACGAT 
AGATGCACAC AGATGGGAGG 
AGAGCAACAC CACATCAGTG 
AAACGTGTGC TGTCAACAAT 
TCCACTGCAG TTGTCCTGTT 
TTGATGAGTG CCAGACCCGC 
GTTTTGACTG CGGCTGCAAG 
ATGTGGATGA GTGCTCTTTG 
CATTTGCTTG TGCTTGCAAC 
CCAATGAGTG CAGCATCAAC 
GCTATGAATG CCAGTGCCAC 
AAGTGAAGGG GCTCCTGCCC 
GTGGTGGAGG AGACGGGTGC 
TCACCACCAT CAGGACAAGT 
ATGCTGAGCT GTTTCCCGAG 
AAGAGAGCTT CCGCTACGTA 
CTGGCCGACC AAGCACCCCT 
ACCAAAAGGA GGTGACAGCT 
GGCTCCGTAA AGCCAXCCGC 
AGCTCTCAGG CATGAACCTC 
CAGAGTCCTG TGGAGTGGGC 
GGACCTATTA TGATGGAGCA 
ATGAGGAAGG ACAAATGACT 
AGACCCCAGA AGCTTGGAAT 
CTGCAGATGG CTTTGCACCT 
GTCGAACTTC CTGCTTCCCC 
CCTTTCAGGA CTGTGAAACC 
CTCACCGATG TATTCGTTGC 
Q l W frClTC CCCAGGAAAT 
GTAAAAACAG AAGATGTGGA 
ACTACCCAGG CAATTACCCA 
AGCGCCGCAT CCTGATCGTG 
ACTATCTGGT GATGCGGAAA 
AGACCTACGA ACGCCCCATC 
AGTCCAATGA AGGGAACAGC 
ACTACCAGGA ACTCATTGAA 
ATCAGGAAAT ACTTAAGGAT 
CCCAGAACTA TTTCAAGTAC 
TCCGATTGCT ACGTTCCAAA 
GTGCCACTCA ATACAAATGT 
CATGTCAGCA CAGTCGGGTA 
TTATAGATAA TACAGATATT 
ATGTAGACTG AGAATGGCTT 
GGATAGATCA CGGGCTGGCT 
TTCTGGGGTC TTACTCCTCC 
GCTTATTCTG AAACTTCAGC 
CGTGTGCAGG CFCTGACCAG 
CTCCCTCCAC CCACCTTGAG 
GTGTGATACA AGTTTGATCC 
GCAGAAAGAA TTAGAAATAA 



51 
I 

CCGCCTCTGC 
GOGGCCTGGG 
GCCGTCCCGC 
CAAGGGCTAG 
TGCTCCTGCA 
GGAAATGAGC 
TGCACTTGTT 
GAGTGCCTGG 
GAGTGCTGCT 
TOGGAAGAGG 
GCCCCAAGGG 
AGAGACTGCA 
ACAGCCGATG 
AGCTGCCTTG 
GTGGATGGGG 
GGAGGCTGTG 
GGATTCACTC 
AATGGAGGTT 
AAAGGATTTA 
GATAGGACCT 
CGAGGGTACA 
AACGGAGGCT 
CCTGGGTACA 
ACAAGTGTGT 
TTCCTCAGAT 
GTAACCTTTA 
GGTCTGCGAC 
AACCTTACAT 
AAGGAAATGT 
TCTTGTGACC 
ACGCTCAGAA 
GACGTGGCTA 
CAGGGTCATG 
CGAGAACGCT 
TGTGAACCAT 
ATGTCTGAAT 
TGCCAGCTCT 
TGTGGAGGAG 
AGAGTTCAAT 
CCAGTGGGAA 
ACTACGACTG 
GGGGAGCTGG 
GCCAACACCG 
GTCCCTGAGA 
ACCTCTTCAT 
GCCTTCACCT 
GCTAGAGGGT 
GACATAGTTC 
AAGAAACTTA 
ACAGCCCAGG 
GTGTCCAGGT 
TCTGCTATAG 
TTGCTGCCTC 
TTGGTAAATT 
TGAGTGGCAT 
GAGCTGGACT 
TCAAGGAGTC 
TTCCTCTAGC 
GCAGAACAGG 
ACCTGGGAGG 
CAGGAACTTG 
ATAAAAACTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



A169 Protein sequence 
Gene name: 
Unigene number: 



CEGP1 
Hs. 222399 



911 



WO 03/042661 



PCT/US02/36810 



Probeset Accession 8s AA2S648S 

Protein Accession #: CAB92285 

Signal sequence: 1-31 

Transmembrane domains: none 

PPAM domains: EGF-like_domains (49-84,132-167,177-213,286-321.407-4421 CUB_domain {809-9183 
Cellular Localization: may be secreted 



1 11 21 31 41 51 

I I I I I I 

MGVAGRNRPG AAWAVLLLLL LLPPLLLLA6 AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 

LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 

HEGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNX 180 

DHGCSHICKE APRGSVACBC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 

PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

STGVHCSCPV GPTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGPKLLTDEK 360 

SCQDVDECSL DRTCDHSCIN HPGTPACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 

TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 

SSDVTTIRTS VTFKLNEGKC SLKNABLFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 

PGAPGRPSTP KEMFITVBFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 

TFQNEEGQMT CEPCPRPGKS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 

PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 

KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 

PPPKRRILIV VPEIFLPIED DOGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

IQFKSNEGNS ARGFQVPYVT YDEDYQBLIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 



A170 DMA sequence 

Gene names ~~ DEME-6 protein (KIAA04S2) 

Unigene number: Hs.125783 
Probeset Accession #: AL039402 
Nucleic Acid Accession 8: AF007170 

Coding sequence: 1-1725 (underlined sequences correspond to stop codon) 

1 11 21 31 41 51 

! 1 1 I I I 

AAGGAGGOGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 

CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 

AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 

CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 

ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 

GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 

AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 

GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 

GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 

CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 

CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 

GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 

TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 

ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 

AG CAAGG AGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 

ATGTTTGGGA AGGAGGACCA CAAGCOGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 

GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 

ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 

ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 

TCAGTGGATG AOGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 

TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 

GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 

GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 

CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 

AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

A171 Protein sequence 



912 



WO 03/042661 



PCT7US02/36810 



Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession £: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



DEME-6 protein (KIAA0452) 

Hs. 125783 

AL039402 

AAC39582 

none 

210-226 

plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 

QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT BEBIHAEVCY AECLLQRAAL TFLQDENKVS 120 

FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180 

LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 

LKPYLNRYPK GAIFLFFAGR IBVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 

TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL. 360 

KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 

ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 

LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFR1QAAT LQAKSSLENS S40 
SRSMVSSVSL 



A172 dna sequence 
Gene name: EST 
Unigene number: Hs. 200102 

Probeset Accession #: AL117406 
Nucleic Acid Accession #: none found 

Coding sequence: 1-4044 {underlined sequencee correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I 1 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 

ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 

GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 

TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 

CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 

ACCCCGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 

TCAGTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 

GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480 

ACAAGGTTGA TTTTGGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG S40 

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 

CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 

TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 

ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 

GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 

TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 

ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 

GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 

CTCCAGGAGA GOCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 

TTTGAGGAGG CX^CCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 

GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATQACCA GGCCTAGAGA TGCCCTCGGG 1440 

CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTOGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680 

GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGCGGGGCCT C^CCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 

CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 

TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TIT T CT TCT T CGTGGTGCTG 2340 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 

GGGACCAATA GCAGCCGAGA GAGCAAXGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCOCATGA GTTTCTTTGA CACCATCCCA 2640 

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 

TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 

CrGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 



913 



WO 03/042661 



PCT/US02/36810 



TTTAAAGTCA 
CGGATTGGCT 
AAGATGTGTG 
CCACAGCATG 
ACCGTGCTTC 
AGGACGGGCT 
GCAGGCCGGA 
TCCAAGCTCT 
CTAGATCCCT 
CTGACCAAGG 
GGAAACTTCT 
TCCAAGATCA 
ATCCAGCGCA 
GTCACCACTG 
TTTGATCGGC 
ACAGCCACTT 
CTCAGAGGTT 
ATGAGAACTT 
ACCCTGGAAT 
ATGGGATTCA 
AGGTAAAAGC 
TGTAAAATAT 



TGGCTGTCAA 
TGGAGACAGA 
TCTCGGAAGC 
GGGAAATCAT 
ACGGCATCAA 
CTGGGAAGTC 
TTCTCATTGA 
CAGTGATCCC 
TTGACCGTCA 
CCATCTCAAA 
CTGTGGGGGA 
TCCTTATCGA 
CAATCCGTGA 
TGCTGAACTG 
OGGAGGTACT 
CTTCACTGAG 
CACACAGGTG 
CTCCTGGAAG 
AGGCTACTTG 
GTGATCATGT 
TTATAGTTTT 
AAAACTAAGG 



CATCGTGCTG 
GGCACAGTTC 
TCCTTTACAC 
ATTTCAGGAT 
CCTGACCATC 
CTCCTTGGGC 
CGGCGTGGAC 
TCAAGATCCA 
CACTGACCAG 
GTTCCCCAAA 
GAGGCAGCTG 
TGAAGCCACA 
AGCCTTCCAG 
TGACCACATC 
GCGGAAGAAG 
ATAAGGAGAT 
CAGCTTCGAG 
CAGGGGTAAA 
ATGGCTCTCA 
GGTTCTCCTT 
CTGATCTGTG 
AAAACTCAAA 



CAGCTGGOGT 
ACGGCTGTAG 
ATGGAAGGCA 
TATCACATGA 
CGCGGCCACG 
ATGGCTCTCT 
ATTTGCAGCA 
GTGCTGCTCT 
CAGATCTGGG 
AAGCTGCATA 
CTCTGCATTG 
GCCTCCATTG 
GGCTGCACCG 
CTGGTTATGG 
CCTGGGTCAT 
GTGGAGACTT 
GCCCACAGTC 
TGTAGGGGGG 
AGACCTTAGA 
TTAACTTACA 
TTAGAAGTGY 
AAAAAAAAAA 



CCAGCTTCCA 
AGAGGATACT 
CAAGTTGTCC 
AATACAGAGA 
AAGTGGTGGG 
TCCGCCTGGT 
TOGGCCTGGA 
CAGGAACCAT 
ATGCCTTGGA 
CAGATGTGGT 
CCAGGGCTGT 
ACATGGAGAC 
TGCTCGTCAT 
GCAATGGGAA 
TGTTCGCAGC 
CATGGAGGCT 
TGOGACCTTC 
GTGGGGATTG 
ACCCCAGAAC 
TGCTGAATAA 
TGCAAATGCT 
AAAAAAA 



GGCCACTGCC 
GCAGTACATG 
CCAGGGGTGG 
CAACACACCC 
CATCGTGGGA 
GGAGCCCATG 
GGACTTGCGG 
CAGATTCAAC 
GAGGACATTC 
GGAAAAOGGT 
GCTTCGCAAC 
AGACACCCTG 
TGCCCACCGT 
GGTGGTAGAA 
CCTCATGGCC 
GGCAGCTGAG 
TTGTTTGGAG 
CTGGATGGAA 
CATCTAAGAC 
TTTTATAATA 
GTACTGACTT 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4X40 
4200 
4260 
4320 
4380 



A173 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
943,1018-1034 
PPAM domains: 

Cellular Localization: 



EST 

Hs. 200102 
AL117406 
none found 
none found 
169-185, 199-21S, 



275-291, 304-320, 387-403, 770-786, 829-845, 907-923, 927- 



ABC_transporter [502-673], ABC_membrane_reg ion [163-432, 771-1060] 
ATP~b ind ing_doroa ins [508-516, 1139-1147) 
plasma membrane 



MTRKRTYWVP 
WGKYDAALRT 
SVHDASDKNV 
PIL1IPKILE 
PAPEKLIQFK 
YTAFIAILCY 
FAKIIEGMES 
LQESPVFYVQ 
PEEEGNSLGP 
YVPQQAWIVS 
GGQKQRISLA 
LEFCGQIILL 
SQALATSLEB 
IVFLTIPSFW 
VGVCSSGIFT 
IPSBQFLVLS 
SFLFSHILKS 
LVTLAVALFV 
KMCVSEAPLH 
RTGSGKSSLG 
LDPFDRHTDQ 
SKIILIDEAT 
FDRPEVLRKK 



11 
I 

NSSGGLVNRG 
MIPPRPKPRF 
QRLHRLWEEE 
YSEBQLGNW 
SVIHITSGEA 
LLVFPLAVFM 
LTFCSKPGDG 
TLQDPSKALV 
ELHKINLWS 
GNIRENILMG 
RAVYSDRQIY 



SLNGNAVPEH 
WLSYWLEQGS 
KVTRKASTAL 
LMVIAVLLIV 
LQGLSSIHVY 
AFGISSTPYS 
MEGTSCPQGW 
MALFRLVEPM 
QIWDALERTF 
AS1DMETDTL 
PGSLFAALMA 



21 
i 

IDIGDDMVSG 
PAPQPLDNAG 
VSRRGIEKAS 
HGVGLCFALF 
ISFFTGDVNY 
TRMAVKAQHH 
MAFSMLASLN 
FEEATLSWQQ 
KGMMLGVCGN 
GAYDKARYLQ 
LLDDPLSAVD 
HSELMQKKGK 
QLTQEEEMEE 
GTNSSRESNG 
HNKLFNKVFR 
SVLSPYILLM 
GKTEDFISQF 
FKVMAVNIVL 
PQHGEIIFQD 
AGRILIDGVD 
LTKAI SKFPK 
IQRTIREAFQ 
TATSSLR 



31 
I 

MYKTYTLQD 
LFSYLTVSWL 
VLLVMLRFQR 
LSECVKSLSF 
LFEGVCYGPL 
TSEVSDQRIR 
LLRLSVFFVP 
TCPGIVNGAL 
TGSGKSSLLS 
VLHCCSLNRD 
AHVGKHIFEE 
YAQLIQKMHK 
GSLSWRVYHH 
TMADLGNIAD 
CPMSFFDTIP 
GAIIMVICFI 
KRLTDAQNNY 
QLASSFQATA 
YHMKYRDNTP 
ICSIGLEDLR 
KLHTDWENG 
GCTVLVIAHR 



41 



TPLMIQSLRS 
TRLIFDALLG 
SSSWIINQRT 
VLITCASLVI 
VTSEVLTCIK 
IAVKGLTNSK 
ELERNGHASE 
AILEEMHLLE 
LELLPFGDMT 
CIKKTLRGKT 
EATSDMLQDT 
YIQAAGGYMV 
NPQLSFYQLV 
IGRLLNCFAG 
YYMMFKKAIG 
LLLFLSSTRW 
RIGLETEAQP 
TVLHGINLTI 
SKLSVIPQDP 
GNFSVGERQL 
VTTVLNCDHI 



51 
I 

EAPGRAAVPP 
RLDENTIPPL 
ICFCIASVLG 
AIRFRAAVSS 
CSISSYFIIG 
LIKMYTWEKP 
SAVMRFKKFF 
GMTRPRDALG 
GSVGVQGSLA 
EIGERGLNLS 
WLVTHQLQY 
AKIAEKPKVE 
SCIIFFFWL 
YGLNALLLIC 
DLEQLDQLLP 
VFKRLENYSR 
MALRLEIMTN 
TAVERILQYM 
RGHEWGIVG 
VLLSGTIRFN 
LCIARAVLRN 
LVMGNGKWE 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A174 DWA sequence 



Dnigene number: 
Probeset Accession #: 
Nucleic Acid Accession #x 
Coding sequence: 



l 
I 

ATGCTGTCTG 
GGAAAGAAAC 
GACCGGGAGA 
GGGGAGGCGG 
CCGCGGCCGC 
CGCCTGCGTC 
TCCGGACGAC 
ACTCATAAGG 
GCTCCTGGAC 
CCGCGCGGAA 
CTTCTCGGAG 
TGTGGGGCGC 
TGCTGCTGGC 



ES. 128899 

AA983251 

AA983251 

1-1749 (underlined sequences correspond to start and stop codons) 




31 
I 

ACCCAGCACA 
GGTGCGCACA 
GCCGGGCTCC 
CCGCCCGCCT 
GGGACTGCGG 
GTCCGGTTGC 
TGCATCCCGA 
CCACCGGCTG 
CTCCTGGGCG 
GACGAGGCCC 
GACGCGCTGT 
CATCCTGGAA 
CGAGGGCCCA 



41 
I 

GAGCACAGTA 
CCTCCCGAGG 
TGTGGGACCG 
GGATCCGCGC 
CTGGGGGCGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 
TCGOGGCAGA 
GGGGGTCGCC 
CCGCGCCCAG 
CGCCGCTTCG 
GCGGCGAGTA 



51 
I 

CACTCCCGGA 
GCGAGGCAGC 
CGCTGCAGCC 
CCAGCAGCAG 
GCAGGACCCT 
TCCAGAGGCT 
AGCGAGTGCA 
TGGCTTAGGA 
GGGGAGTGGC 
GGGGCCACGA 
GGTGGTGCCA 
CTCCTGCAGC 
CTGCCACGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



914 



WO 03/042661 



PCT/US02/36810 



TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG OGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG OGAGCCTGGC 960 

CGGGCGGACA AAGAOGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGOGGTGCTC CCCTGAAGGC 1080 

TCCCOGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC OGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCOGGA AGGGACCATG 1S00 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG GTGGAAGTCC GCACATGTOG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACOGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GAC T 1 T TTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2 8 B0 

TATATTGAAA TCATAAACTA TCAOCOGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATOGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A175 Protein sequence 
Gene name; 
Unigene number: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs. 128899 
none found 
1-11 
402-424 

not determined 



1 11 21 31 41 51 

I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASX GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPBG 360 

SPKGRQLLRA FPGLLPRARR RGPPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPEQGQYL HPPYVGYTVQ 540 
HDSVPHTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

A176 DMA SEQUENCE 

Gene name: ESTs, Weakly similar to CGH07L collagen alpha Kill) chain precursor tH. sapiens] 
Unigene number: Hs. 19322 

Probeset Accession ft: AA0884S8 
Nucleic Acid Accession ft: AA0884S8 

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

III!)! 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CAOGAGGACA CTGACATGGA 60 

915 



WO 03/042661 



PCT/US02/36810 



10 

15 
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65 

70 
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80 



CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC 
GCGCGGGGCC GOGACTGGTA CCAGCAGCAG CTGCAACGAG 
CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG 
CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG 
TGTGCCAGCC GGGCCCTGCC CCGGTCCTCC TCCGGGCCCC 
ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC 
CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC 
GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC 
GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC 
AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC 
GCCCAGGCAC AGTCCCGGAG TGGGOGCCTT CCTGCCGCCC 
GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC 
ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG 
TGGACAGTGG GGTACCCCTC CATGAGTTAG OQTCCCCCCG 
GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG 
AGAGGGCGCG GGGOGGCTCC GACGCGGGTC CAAGGGCAGC 
CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT 
GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA 
CTGGCCAAGG CTGAGGGACC CTGGCTGCAG GGGATCGGCA 
GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT 
TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG 
GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC 
CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC 
GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC 
GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA 
GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT 
GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC 
TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG 
CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT 
ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT 
CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA 
TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA 
GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA 
ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT 
AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG 
GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT 
TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA 
GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC 
GAGACAGGCT GGCACCTCGG GAAAAACTGC CTTTCAGCCT 
AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC 
CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA 
GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG 
GAGCAGOGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC 
GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG 
CAGAAGTGTC OCCAGTTGAG AATCTGCCCC AGAGGAACAC 
TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG 
CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC 
CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG 
TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG 
TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA 
AGAAATAAAA GAGATTTCTG GAAACATGAA 



A177 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession 8: 
Nucleic Acid Accession 8: 
Coding sequence: 



TGCAGGGTTT 
TGCAGGAGCG 
GGAGCCCCCG 
GGGAGCTGCT 
CCTGCCCTGC 
TGAAGGAGCA 
AGCTGGAGCA 
AGCAGGACGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCCAGATG 
GTTTKGGCTC 
CTACTACTGG 
TTTCCAGCGG 
CACTTCCAAC 
TTCCCGCTCA 
GGAGGGGGTG 
TAGCGGTCGG 
CGCCGGGTGG 
TCCCCCTCTT 
GCTCCCCAGG 
CGACTCAGGA 
TGTCCCCAGG 
AGGGTACAGG 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGCCCC 
TGCGTGGGGG 
CGTGTCCAGG 
GGCAGGCAGC 
CCCCACAGAG 
AGTCAGCCCA 
CATAAGGATG 
GCCCCACAGC 
GGAGAAGCCC 
TGAGGGTGCC 
CAGAACAGTG 
GGCAGCTGAA 
TGGTGTTCCG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG 
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
CCGGGCGTGG 
CTGGGCAACG 
AGAGATCCAG 
CAGAAGCAAC 
ACAGTGTTTT 



GGAGATGATG 
CCAGCGCCGC 
CCCACTGGGG 
GGCTGCAGCC 
CCTGAOGTCC 
GAACCGACTC 
GGAGAAGTCG 
GGGACCTCTG 
CCTGGCACTC 
CCCCCGGCAG 
GGCTCCCCAG 
CTGGTTGYTG 
CCGCTGTCAG 
TGCCGCCCTG 
AACGGGCAGC 
ACCAGGGCAC 
GGGACGGCCT 
ACTTCAGGTT 
GCGAGAGCTT 
GGCCGGGACG 
AGGGCCCCCA 
TTTCCAAGGC 
TTTCAGCTGG 
AGGAGGCTGG 
CGCTGGTGCT 
AGTGTCACCA 
CGATGCGGGG 
GCGCAGGGCC 
GCACTTTGGT 
GTGGCAACTC 
CCACATTCCC 
GCATGCAGCT 
TCAGGCCTGG 
CCCAGCACCC 
CCCGTCAGCA 
TGCCATGCCC 
TCTGTCCCGG 
GGGGAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGG 
GGCCTCAGGA 
TGGTTCACGC 
CAGTGAGAGA 
GTTTAAAAAT 
AGATTGACTC 
ATATATCTAA 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 B0 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 



Cadherin 3, P-cadherin (placental) 

Hs.2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



GCGGAACACC 
TCCCTCGTGG 
CCTCCOAGCC 
CGGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCCA 
TCATCGTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGACCCACA 
CCAGTGGCCT 
TGGATGGGGA 
ACAATGCTCC 
GCCATGAGGT 
GTGCCACCTA 
CTGAGAGCAA 
AGCACACCCT 
CCACAGCCAC 



11 
I 

GGCCCGCCGT 
ACCTCTCGCG 
GTGCCGGGCG 
GCCCGGCCAG 
TAGCACTGAT 
ACTGAAGGAA 
GAGAGATTGG 
GAGACTGAAT 
GGGGCOGGGG 
GTTGTTGTTG 
CGCTGTGTCA 
CGACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
CGACCTCATG 
GGACCGGGAA 
CGGCTCCACC 
CATGTTTGAC 
GCAGAGGCTG 
CCTTATCATG 
CCAGGGCATC 
GTACGTTGAA 
CATAGTGGTC 



21 
I 

CGCGGCAGCT 
TCTCTCCTCC 
GTCTTCAGGG 
GCGCTGGGGA 
AATGATGACT 
AGGAATCCAT 
GTGGTTGCTC 
CAGCTCAAGT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGTG 
GACCACAAGC 
CCAGGTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACGGCAG 
CCCCAGAAGT 
ACGGTCACTG 
GGCGGTGACG 
CTGACAACCA 
GTGACCAACG 
CACGTGGAGG 



31 
I 

GCTTCACCCC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTGTGCG 
TGAAGATCTT 
CAATATCTGT 
CTAATAAAGA 
CCCCTGAGGG 
TGGACCGGGA 
CCTCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 
ACCGGAGCAC 
AGTACACACT 
TGGCAGTAGT 
ACGAGGCCCA 
ATCTGGAOGC 
AOGGGGACCA 
GGAAGGGTTT 
AGGCCCCTTT 
ATGTGAATGA 



41 
I 

TCTCTCTGCA 
TTGCTGGCTG 
GACCTTGGAG 
GGGCTGCCCT 
GAATGGCGAG 
CCCATCCAAA 
CCCTGAAAAT 
TAGAGACACC 
TGTCTTCGCT 
GGAGATTGCC 
GGACCCCATG 
CCAGGACACC 
GGTGACAGCC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
CCCCAACTCA 
TTTTACCATC 
GGATTTTGAQ 
TGTGCTGAAG 
GGCACCTGTG 



51 
I 

GCCATGGGGC 
CAGTGCGCGG 
GCGGGAGGOG 
GGGCAAGAGC 
ACAGTCCAGG 
CGTATCTTAC 
GGCAAGGGTC 
AAGATTTTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCCGAGGGA 
ACAGATGAGG 
CAAGAACCAA 
AGGGTCATCT 
GCCACAGACA 
GATGCCAATG 
AATGCAGTGG 
CCAGCGTGGC 
ACCACCCACC 
GCCAAAAACC 
CTCCCAACCT 
TTTGTCCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



916 



WO 03/042661 



PCT/US02/36810 



CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGOATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCAOGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCOGCAATG AQGTGGCACC AACCATCATC CCGACACCCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCOGC CCTACGACAC CCTCTTGGTG TTOGACTATG 2400 

AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCGG GGAGGACGAC TAGGOGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A178 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domain: 



Cadherin 3, P-cadherin (placental) 

Hs.2877 

X63629 

CAA45177 

1-24 

659-675 



Cellular localization: 



plasma membrane 



1 11 21 31 41 51 

I I I I t I 

MGLPRGPLAS LLLLQVCWLQ CAASBPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDPTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IPYSITGPGA DSPPEGVFAV EKBTGWLTtT«N KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQKD HKPKPTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDPEA KNQHTLYVEV TNEAPPVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAKDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TV1RATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLXj LVLLLLVRKX RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA EMYGGGEDD 

A179 DNA SEQUENCE 

Gene name: ~^ " endothelial differentiation, lysophosphatidic acid G-protein- coupled receptor, 

Unigene number: Hs. 258583 

Probeset Accession #: NM_012152 
Nucleic Acid Accession ft: NM_0121S2 

Coding sequence: 43~1104 (underlined sequencea correspond to start and stop codons) 



1 11 21 31 41 51 

I I I ! I I 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT rri mtrr g i T GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATOGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 
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GCGTTTGTGG TATGCTGQAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



A1B0 Protein sequence! 
Gene name: 
Unigene number: 
Probeset Accession 8: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 7 

HS. 258583 

NM 012152 

NPJ>36284 

none found 

31-53, 66-88, 150-172, 190-211, 239-261, 277-295 
plasma membrane 



1 11 21 31 41 51 

I I I I I 1 

MNECHYDKHM DFPYNRSNTD TVDDWTGTKL VIVLCVGTFP CLFIFFSNSL VIAAVIKNRK 60 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFKG AVPTLGWNCL CNISACSSLA 180 
PIYSRSYLVF WTVSNLMAPL IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240 
VMTVLGAFW CWTPGLWLL LDGLNCRQCG VOHVKRWFLL LALLNSWNP IIYSYKDEDM 300 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 

A181 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs. 162 85 9 

Probeset Accession #: AA569531 

Nucleic Acid Accession #: AA569531 

Coding sequence: 1-504 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 I I I I I 

ATGACCTACA GTTACTCATT TTTCAGGCCT GAGTTGATCG TTAATCATCT TAATTATGTT 60 

CATTCTGAAG CCAACAGGAG AACCAAGACC AAAACTTTAT TGTCTCTGCT TTCATTTCTT 120 

GATGAAACCT CTGGACTAAG CACACATCTT CCTTGTTTAT CTCTCTCAAA GGAGTGTGGA 180 

GTGCTTCATC TGGACATCCA CGGGAAGAAG GAAGACATGA GAATCACCCA ACAGTCTTCC 240 

CAGCTATACC TGTGGGACAT GGGTGGTTTT ACAATATTTA AGAACCTGTG QATGAGCCTC 300 

ATACCCAGAG GGAACAAACG CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 360 

AAGCAGAAGC AAAGTTCAAA GATCCAAGAG GAGAGACGAA GAGAGTCTGC AGGACCAAAC 420 

CTCTCTTCAT TCTGGTTTGT GGGGAATGCT GGAAGAGGAG ACAGGCCCCA GATTTGGGCA 480 

GGAAGTAAAC AGTTTTCAGG CTGAGGCCAA TCTGAGCAGG AACATTCCAA TATTT C TTCA S40 

GCTACGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 600 

AGCTACTTGT CAATGGTGAA TATTGATCAT CATCATTATC TACTGAGCTG CTACCATATC 660 

CCAGCTACTC CTTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 720 

TATGTAATAT CACAGACAAG GAAACTGAAC GCAGAAATGT TTTATTTCTT GCCAAACATC 780 

ACAT GAGGA T GAACAATGAA ACCGATTTGA AACCAGOATT GTCTGATTCC AACATCTCTG 840 
GGTCCTTTTT CACTCTGATA TGCTGCAATT AAAAAGCCAT TTCTAAGACT GT 



A182 Protein sequence i 

Gene name: ESTs 

Unigene number: Hs. 162 859 

Probeset Accession #: AA569531 

Protein Accession #: none found 

Signal sequence: 1-46 

Transmembrane domains: none found 

Cellular Localization: not determined 



1 11 21 31 41 51 

III! i 1 

MTYSYSFFRP BLIVNHLNYV HSEANRRTKT KTLLSLLSFL DETSGLSTHL PCLSLSKECG 60 
VLHLDIHGKK EDMRITQQSS QLYLWDMGGF TIFKNLWMSL IPRGNKRSPK RVTETILRDF 120 
KGKQSSKIGE ERRRESAGPN LSSFWFVGNA GRGDRPQIWA GSKQFSG 

A183 DNA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs. 179809 

Probeset Accession ft: N95796 
Nucleic Acid Accession 8: XM_OS0197 

Coding sequence: 310-1971 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I t 

TCACACGTGC CAAGGGGCTG GCTCAGCGGA ACCAGCCTGC ACGCGCTGGC TCCGGGTGAC 60 

AGCCGCGCGC CTCGGCCAGG ATCTGAGTGA TGAGACGTGT CCCCACTGAG GTGCCCCACA 120 

GCAGCAGGTG TTGAGCATGG GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 180 

GGCGCCTGGC TGATTCCTAG GCAGTTGGCG GCAGCAAGGA GGAGAGGCCG CAGCTTCTGG 240 

AGCAGAGCCG AGACGAAGCA GTTCTGGAGT GCCTGAACGG CCCCCTGAGC CCTACCCGCC 300 
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TGGCCCAC TA TGG TCCAGAG GCTGTGGGTG AGCOGCCTGC TGCGGCACCG GAAAGCCCAG 360 

CTCTTGCTGG TCAACCTGCT AACCTTTGGC CTGGAGGTGT GTTTGGCCGC AGGCATCACC 420 

TATGTGCCGC CTCTGCTGCT GGAAGTGGGG GTAGAGGAGA AGTTCATGAC CATGGTGCTG 480 

GGCATTGGTC CAGTGCTGGG CCTGGTCTGT GTCCCGCTCC TAGGCTCAGC CAGTGACCAC 540 

TGGCGTGGAC GCTATGGCCG CCGCCGGCCC TTCATCTGGG CACTGTCCTT GGGCATCCTG 600 

CTGAGCCTCT TTCTCATCCC AAGGGCCGGC TGGCTAGCAG GGCTGCTGTG CCCGGATCCC 660 

AGGCCCCTGG AGCTGGCACT GCTCATCCTG GGCGTGGGGC TGCTGGACTT CTGTGGCCAG 720 

GTGTGCTTCA CTCCACTGGA GGCCCTGCTC TCTGACCTCT TCCGGGACCC GGACCACTGT 780 

CGCCAGGCCT ACTCTGTCTA TGCCTTCATG ATCAGTCTTG GGGGCTGCCT GGGCTACCTC 840 

CTGCCTGCCA TTGACTGGGA CACCAGTGCC CTGGCCCCCT ACCTGGGCAC OCAGGAGGAG 900 

TGCCTCTTTG GCCTGCTCAC CCTCATCTTC CTCACCTGCG TAGCAGCCAC ACTGCTGGTG 960 

GCTGAGGAGG CAGGGCTGGG CCCCAGCGAG CCAGCAGAAG GGCTGTCGGC CCCCTCCTTG 1020 

TCGCCCCACT GCTGTCCATG COGGGCCCGC TTGGCTTTCC GGAACCTGGG OGCCCTGCTT 1080 

CCCCGGCTGC ACCAGCTGTG CTGCCGCATG CCOCGCACCC TGCGCCGGCT CTTOGTGGCT 1140 

GAGCTGTGCA GCTGGATGGC ACTCATGACC TTCACGCTGT TTTACACGGA TTTCGTGGGC 1200 

GAGGGGCTGT ACCAGGGCGT GCCCAGAGCT GAGCCGGGCA CCGAGGCCOG GAGACACTAT 1260 

GATGAAGGCG TTCGGATGGG CAGCCTGGGG CTGTTCCTGC AGTGCGCCAT CTCCCTGGTC 1320 

TTCTCTCTGG TCATGQACCG GCTGGTGCAG CGATTCGGCA CTCGAGCAGT CTATTTGGCC 1380 

AGTGTGGCAG CTTTCCCTGT GGCTGCCGGT GCCACATGCC TGTCCCACAG TGTGGCCGTG 1440 

GTGACAGCTT CAGCCGCCCT CACCGGGTTC ACCTTCTCAG CCCTGCAGAT CCTGCCCTAC 1500 

ACACTGGCCT CCCTCTACCA CGGGGAGAAG CAGGTGTTCC TGCCCAAATA CCGAGGGGAC 1560 

ACTGGAGGTG CTAGCAGTGA GGACAGCCTG ATGACCAGCT TCCTGCCAGG CCCTAAGCCT 1620 

GGAGCTCCCT TCCCTAATGG ACACGTGGGT GCTGGAGGCA GTGGCCTGCT CCCACCTCCA 1680 

CCCGCGCTCT GCGGGGCCTC TGOCTGTGAT GTCTCCGTAC GTGTGGTGGT GGGTGAGCCC 1740 

ACCGAGGCCA GGGTGGTTCC GGGCCGGGGC ATCTGCCTGG ACCTCGCCAT CCTGGATAGT 1800 

GCCTTCCTGC TGTCCCAGGT GGCCCCATCC CTGTTTATGG GCTCCATTGT CCAGCTCAGC 1860 

CAGTCTGTCA CTGCCTATAT GGTGTCTGCC GCAGGCCTGG GTCTGGTCGC CATTTACTTT 1920 

GCTACACAGG TAGTATTTGA CAAGAGCGAC TTGGCCAAAT ACTCAGC GTA GA AAACTTCC 1980 

AGCACATTGG GGTGGAGGGC CTGCCTCACT GGGTCCCAGC TCCCOGCTCC TGTTAGCCCC 2040 

ATGGGGCTGC CX3GGCTGGCC GCCAGTTTCT GTTGCTGCCA AAGTAATGTG GCTCTCTGCT 2100 

GCCACCCTGT GCTGCTGAGG TGCGTAGCTG CACAGCTGGG GGCTGGGGCG TCCCTCTCCT 2160 

CTCTCCCCAG TCTCTAGGGC TGCCTGACTG GAGGCCTTCC AAGGGGGTTT CAGTCTGGAC 2220 

TTATACAGGG AGGCCAGAAG GGCTCCATGC ACTGGAATGC GGGGACTCTG CAGGTGGATT 2280 

ACCCAGGCTC AGGGTTAACA GCTAGCCTCC TAGTTGAGAC ACACCTAGAG AAGGGTTTTT 2340 

GGGAGCTGAA TAAACTCAGT CACCTGGTTT CCCATCTCTA AGCCCCTTAA CCTGCAGCTT 2400 

CGTTTAATGT AGCTCTTGCA TGGGAGTTTC TAGGATGAAA CACTCCTCCA TGGGATTTGA 2460 

ACATATGAAA GTTATTTGTA GGGGAAGAGT CCTGAGGGGC AACACACAAG AACCAGGTCC 2520 

CCTCAGCCCC ACAGGCACTG GTCTTTTTTG CTNGANTCCA CCCCCCCCCT CTTTACCCTT 2580 
TT 



A164 Protein sequence i 
Gene name: 
Unigene number: 
Probeaet Accession 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs 

Hs. 179809 
H95796 
XPJJ50197 
none 

21-43, 53-75, 90-112, 125-147, 160-182, 
plasma membrane 



199-221, 323-345, 350-372, 379-401 



1 11 21 31 41 51 

I I I I i I 

MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLBVGVEE KFMTMVLGIG 60 

PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGIUiSIi FLIPRAGWIA GLLCPDPRPL 120 

ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180 

1DWDTSALAP YLGTQEECLP GLLTLIPLTC VAATLLVABB AALGPTEPAE GLSAPSLSPH 240 

CCPCRARLAP RNLGALLPRIi BQLCCRMPRT LRRLPVAELC SWMALMTFTL FYTDFVGEGIi 300 

YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360 

AFPVAAGATC LSHSVAWTA SAALTGFTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RWVGEPTEA 480 

RWPGRGICL DLAILDSAPL LSQVAPSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540 
WFDKSDLAK YSA 

A185 DMA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs. 11260 

Probeset Accession ft: R73640 
Nucleic Acid Accession #: AK002126 

Coding sequence: 1-1593 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGTTCGCC GGGGGCTGCT TGCGTGGATT TCCCGGGTGG TGGTTTTGCT GGTGCTCCTC 60 

TGCTGTGCTA TCTCTGTCCT GTACATGTTG GCCTGCACCC CAAAAGGTGA CGAGGAGCAG 120 

CTGGCACTGC CCAGGGCCAA CAGCCCCACG GGGAAGGAGG GGTACCAGGC CGTCCTTCAG 180 

GAGTGGGAGG AGCAGCACCG CAACTACGTG AGCAGCCTGA AGCGGCAGAT CGCACAGCTC 240 

AAGGAGGAGC TGCAGGAGAG GAGTGAGCAG CTCAGGAATG GGCAGTACCA AGCCAGCGAT 300 

GCTGCTGGCC TGGGTCTGGA CAGGAGCCCC CCAGAGAAAA CCCAGGCCGA CCTCCTGGCC 360 

TTCCTGCACT CGCAGGTGGA CAAGGCAGAG GTGAATGCTG GCGTCAAGCT GGCCACAGAG 420 

TATGCAGCAG TGCCTTTCGA TAGCTTTACT CTACAGAAGG TGTACCAGCT GGAGACTGGC 480 

CTTACCCGCC ACCCCGAGGA GAAGCCTGTG AGGAAGGACA AGCGGGATGA GTTGGTGGAA 540 

GCCATTGAAT CAGCCTTGGA GACCCTGAAC AATCCTGCAG AGAACAGCCC CAATCACCGT 600 
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CCTTACAOGG CCTCTGATTT CATAGAAGGG 
TTGTATGAGC TCACCTTCAA AGGGGACCAC 
CGACCATTCG GCCCCATCAT GAAAGTGAAA 
ATCAATGTTA TCGTGCCTCT AGCAAAAAGG 
TTCAGGGAGA TGTGCATTGA GCAGGATGGG 
AAAGAAGAAA TAAATGAAGT CAAAGGAATA 
AGGAACTTTA CCTTCATCCA GCTGAATGGA 
GGAGCCCGCT TCTGGAAGGG AAGCAACGTC 
TTCACATCTG AATTCCTCAA TACGTGTAGG 
TATCCAGTTC TTTTCAGTCA GTACAATCCT 
CCTCCCTTGG AACAGCAGCT GGTCATAAAG 
TTTGGGATGA CGTGTCAGTA TCGGTCAGAC 
ATCAAAGGCT GGGGCGGAGA GGATGTGCAC 
ATAGTGGTAC GGACGCCTGT GCGAGGACTC 
GAOGAGCTGA CCCCCGAGCA GTACAAGATG 
TCCCACGGCC AGCTGGGCAT GCTGGTGTTC 
CAGAAACAGA AGACAAGTAG CAAAAAAACA 



ATCTACCGAA CAGAAAGGGA CAAAGGGACA 660 

AAACAOGAAT TCAAACGGCT CATCTTATTT 720 

AATGAAAAGC TCAACATGGC CAACACGCTT 780 

GTGGACAAGT TCCGGCAGTT CATGCAGAAT 840 

AGAGTCCATC TCACTGTTGT TTACTTTGGG 900 

CTTGAAAACA CTTCCAAAGC TGCCAACTTC 960 

GAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

CTTCTCTTTT TCTGTGATGT GGACATCTAC 1080 

CTGAATACAC AGCCAGGGAA GAAGGTATTT 1140 

GGCATAATAT ACGGCCACCA TGATGCAGTC 1200 

AAGGAAACTG GATTTTGGAG AGACTTTGGA 1260 

TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

CTTTATOGCA AGTATCTCCA CAGCAACCTC 1380 

TTCCACCTCT GGCATGAGAA GCGCTGCATG 1440 

TGCATGCAGT CCAAGGCCAT GAACGAGGCA 1500 

AGGCAOGAGA TAGAGGCTCA CCTTCGCAAA 1S60 
TGA 



A186 Protein sequence: 
Gene name: " ESTs 
Unigene number: He. 11260 

Probeset Accession #: R73640 
Protein Accession #: NP_06084l 
Signal sequence: 1-26 
Transmembrane domains: none found 
Cellular Localization: not determined 

1 11 21 31 41 51 

1 1 I I I I 

MVRRGUAWI SRWVLLVLL CCAISVLYML ACTPKGDEEQ LALPRANSPT GKEGYQAVLQ 60 
EWEEQHRNYV SSLKRQIAQL KBELQERSEQ LRNGQYQASD AAGLGLDRSP PEKTQADLLA 120 
FLHSQVDKAE VNAGVKLATE YAAVPFDSFT LQKVYQLETG LTRHPEBKPV RKDKRDELVE 180 
AIESALETLN NPAENSPNHR PYTASDFIEG IYRTERDKGT LYELTFKGDH KHBFKRLILF 240 
RPFGPIMKVK NBKLNMANTL INVIVPIAKR VDKFRQFMQN PREMCIEQDG RVHLTWYFG 300 
KEEINEVKGI LENTSKAANF RNFTFIQLNG EFSRGKGLDV GARFWKGSNV LLFFCDVDIY 360 
FTSEFLNTCR LNTQPGKKVF YPVLFSQYNP GIIYGHHDAV PPLEQQLVIK KETGFWRDFG 420 
FGMTCQYRSD FINIGGFDLD XKGWGGEDVH LYRKYLHSNL IWRTPVRGL FHLWHEKRCM 480 
DELTPEQYKM CMQSKAMNEA SHGQLGMLVF RHEIEAHLRK QKQKTSSKKT 

A187DNA 



Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs. 106778 

Probeset Accession 8: N51919 
Nucleic Acid Accession #: AP189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



* 1 11 21 31 41 51 

I I I I I I 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAOTTC CTGCTQACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320 

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CACCGAACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500 

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620 

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TOTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860 

TCAGTTGTAG CCATGACAGG AGATGQAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920 

ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 



920 



WO 03/042661 



PCT/US02/36810 



AATAACATTA AAAATTTCGT TAGATTCCAG 
ATCTCATTGG CTACATTAAT GAACTTTCCT 
ATCAATATTA TTATGGATGG ACCCCCAGCT 
GATGTCATTC GTAAACCTCC TCGCAACTGG 
CTTAAAATAC TTGTTTCATC AATAATCATT 
GAGCTACQAG ACAATOTGAT TACACCTCGA 
TTTTTTGACA TGTTCAATGC ACTAAGTTCC 
GGACTCTGCA GTAATAGAAT GTTTTGCTAT 
CTAGTTATTT ACTTTCCTCC GCTTCAGAAG 
GATCTGTTGT TTCTTTTGGG TCTCACCTCA 
AAGGTTGAAA GGAGCAGGGA AAAGATCCAG 
CTTGAAGTAT GA 



CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

CAGAGCCTTG GAGTAQAACC AGTGGATAAA 2220 

AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2S80 

TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGCATGTTA GTTCGACATC ATCATCTTTT 2700 



A188 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Pfam domains: 
[744-889] 

Cellular Localization: 



ATPase, Ca++ transporting, type 2C, member 1 

Hs. 106778 

N51919 

AAF27813 

none found 

60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878 

Cation ATPase N 16-83] , E1-B2 ATPase t89-324] . Hydrolase [328-649] , 

not determined 



Cation ATPase C 



I 

MIPVLTSKKA 
SQFKNPLIML 
PECHCVREGK 
KVTAPQPAAT 
PLQKSHDLLG 
TLALGVMRMV 
GVGYNQFGEV 
MGLDGLQQDY 
GQTLTLTQQQ 
TTLIASGVSI 
FYRASPRHKM 
LVDDDFQTIM 
INIIMDGPPA 
ELRDNVITPR 
LVIYFPPLQK 
LEV 



11 
I 

SELPVSEVAS 
LLASAVISVL 
LEHTLARDLV 
NGDLASRSNI 
KQLSFYSFGI 
KKRAIVKKLP 
rVDGDWHGF 
IRKAEYPFSS 
RDVYQQEKAR 
KMITGDSQET 
KIIKSLQKNG 
SAIEEGKGIY 
QSLGVEPVDK 
DTTMTFTCFV 
VFQTESLSIL 



21 
I 

ILQADLQNGL 
MHQFDDAVSI 
PGDTVCLSVG 
AFMGTLVRCG 
IGIIMLVGWL 
IVETLGCCNV 
YNPAVSRIVE 
EQKWMAVKCV 
MGSAGLRVLA 
AVAIASRLGL 
SWAMTGDGV 
NNIKNFVRPQ 
DVIRKPPRNW 
FFDMFNALSS 
DLLFLLGLTS 



31 
I 

NKCEVSHRRA 
TVAILIWTV 
DRVPADLRLF 
KAKGWIGTG 
LGKDILEMFT 
ICSDKXGTLT 
AGCVCNDAVI 
HRTQQDRPEI 
LA5GPELGQL 
YSKTSQSVSG 
NDAVALKAAD 
LSTSIAALTL 
KDSILTKNLI 
RSQTKSVFEI 
SVCIVAEIIK 



41 
I 

FHGWNEFDIS 
AFVQEYRSEK 
EAVDLSIDES 
ENSEFGEVFK 
ISVSLAVAAX 
KNEMTVTHIF 
HNKTLMGKPT 
CFMKGAYEQV 
TFLGLVGIID 
EEIDAMDVQQ 
IGVAMGQTGT 
ISLATLMNFP 
LKILVSSIII 



KVERSREKIQ 



51 
I 

EDEPLWKKYI 
SLEELSKLVP 
SLTGETTPCS 
MMQAEEAPKT 
PEGLPIWTV 
TSDGLHAEVT 
EGALIALAMK 
IKYCTTYQSK 
PPRTGVKEAV 
LSQIVPKVAV 
DVCKEAADMI 
NPLNAMQILW 
VCGTLFVFWR 
AVLGSIMGQL 
KHVSSTSSSF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



A189 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



ESTs 

Hs. 293185 

N62096 

N62096 

1-1284 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I t 1 I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

AAGCAAGCTG GGTTTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 240 

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 420 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATG CCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 
TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 660 
ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 720 
TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 
ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 
GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 
ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 960 

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 1080 

TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 

ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTCGA GTAA 



A190 Protein sequence: 

Gene name : "* ~ ESTs 

Unigene number: Hs. 293185 

Probeset Accession ft: N62096 

Protein Accession 8: none found 

Signal sequence: none found 

Transmembrane domains i 28-S0, 66-88-112-134, 142-164. 217-239, 260-282, 298-320, 327-348, 359-381 



921 



WO 03/042661 



PC17US02/36810 



Cellular localization: plasma membrane 



1 
I 

MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTPTLP 
PNAIQAVGVM 
FTGPTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 

I 

PPQRGLPYSM 
YLLLSVLQFL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGFVMAI 



21 
I 

KQAGFPLGIL 
YPFIAMISYN 
OKVSLISTGL 
FXiVYSSLEEP 
TFGRFCYGVT 
U3IVLSLNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTLSKV 
TTLILQIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 
I 

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHZP 
MSIVISVFIC 
VTRBVIANVF 
PSACYLKLSE 
SLTNTSESHV 



51 
i 

ALSGTDTYQS 60 

NVFIGRHFII 120 

KTEDAWVPAK 180 

IFFATCGYLT 240 

FGGNLSSVFH 300 

EPRTHSDKIM 360 

QQTTQLSTLN 420 



A191 DNA SEQUENCE 
Gene name; 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



ESTs 

HS. 293185 

N62096 

N62096 

1-1203 {underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC 
AAAGGAGGGG CCCTCTCTGG AACAGATACC 
TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT 
AGTTACAATA TAATAGCTGG AGATACTTTG 
GATCCTGAAA ACGTGTTTAT TGGTCGCCAC 
ACTCTGCCTT TATCCTTGTA CCGAAATATA 
ACAGGTTTAA CAACTCTGAT TCTTGGAATT 
CACATACCAA AAACAGAAGA CGCTTGGGTA 
GGGGTTATGT CTTTTGCATT TATTTGCCAC 
GAAGAACCCA CAGTAGCTAA GTGGTCCCGC 
TTTATCTGTA TATTCTTTGC TACATGTGGA 
GACTTATTTG AAAATTACTG CAGAAATGAT 
GGTGTCACTG TCATTTTGAC ATACCCTATG 
AATGTGTTTT TTGGTGGGAA TCTTTCATCG 
ATCACTGTAG CCACGCTTGT GTCATTGCTG 
AATGGTGTGC TCTGTGCAAC TCCCCTCATT 
CTGTCTGAAG AACCAAGGAC ACACTCCGAT 
GGTGCTGTGG TGATGGTTTT TGGATTCGTC 
CATGGGCAGG AAATGTTCTA CTGCTTTCCT 
TCTCATGTTC AGCAGACAAC ACAACTTTCT 
TAA 



31 41 51 

I I I ' 

CCGCCGCAGT TTTCCCTTGT TTTATTGATA 60 

TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 

AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 

TTCATTATTG GACTTTCCAC AGTTACCTTT 300 

GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 360 

GTAATGGCAA GGGCAATTTC ACT GGGTCCA 420 

TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 

CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TACTTGACAT TTACTGGCTT CACCCAAGGG 660 

GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 

TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

ATGGCTATTA CAAATACTCA AGACTGCACC 1080 

GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 



A192 Protein sequence: 

Gene name: ESTs 

Unigene number: Hs. 293 185 

Probeset Accession #: N62096 

Protein Accession #: none found 

Signal sequence: 1-26 

Transmembrane domains: 45-61, 92-108, 117-132, 191-207, 274-290, 297-313, 335-351 

Cellular Localization! plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVT PPQFSLVLLI KGGALSGTDT YQSLVNKTFG PPGYLLLSVL QFLYPFIAMI 60 

SYNIIAGDTL SKVFQRIPGV DPENVFIGRH PIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 120 

TGLTTLILGI VMARAI SLOP HIPKTEDAWV PAKPNAIQAV GVMSFAFICH HNSFLVYSSL 180 

EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240 

GVTVILTYPM ECFVTREVIA NVFFGGNLSS VPHIWTVMV ITVATLVSLL IDCLGIVLEL 300 

NGVLCATPLI PIIPSACYLK LSEEPRTRSD KIMSCVMLPI GAWMVFGFV MAITNTQDCT 360 
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 

A193 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs. 2 93 185 

Probeset Accession #: N62096 

Nucleic Acid Accession <h N62096 

Coding sequence: 1-1140 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 



922 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 



GTTATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA 
GTCACTGTCA 
GTGTTTTTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GCTGTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



TTGCATTTAT 
TAGCTAAGTG 
TCTTTGCTAC 
ATTACTGCAG 
TTTTGACATA 
GTGGGAATCT 
CGCTTGTGTC 
GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



TTGCCACCAT 
GTCCOGCCTT 
ATGTGGATAC 
AAATGATGAC 
CCCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATTTTT 
CTCOGATAAG 
ATTOGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTOG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT 
CAAGAGAGGT 
TTGTAACAGT 
GGATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



TTCTCTAGAA 
TTCTGTATTT 
CCAAGGGGAC 
TTGTTATGGT 
AATTGCCAAT 
GATGGTCATC 
AGAACTCAAT 
TCTGAAACTG 
TCCCATTGGT 
CTGCACCCAT 
CTCAGAGTCT 
ACTCGAGTAA 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



A194 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 
Signal sequence : 
Transmembrane domains : 
Cellular Localization: 



EST6 

Hs. 293185 
N62096 
none found 
none found 

24-40, 70-86, 95-111, 
plasma membrane 



171-186, 253-269, 276-292, 314-330 



I 

MGYQRQBPVI 
PENVPIGRHF 
IPKTBOAWVF 
ICIPFATOGY 
VPFGGNLSSV 



HVQQTTQLST 



11 
I 

PPQVNKTPGP 
IIGLSTVTFT 
AKPNAIQAVG 
LTPTGFTQGD 
FHIWTVMVI 
IMSCVMLPIG 
LNISIFQLE 



21 
I 

PGYLLLSVLQ 
LPLSLYRNIA 
VMSPAFICHH 
LFENYCRNDD 
TVATLVSI*LI 
AWMVFGFVM 



31 
I 

FLYPFIAMIS 
KLGKVSLIST 
NSFLVYSSLE 
LVTFGRPCYG 
DCLGIVLELN 
AITNTQDCTH 



41 



51 

! 



YNI I AGDTLS KVFQRIPGVD 
GLTTLILGIV KARAISLGPH 
EPTVAKWSRL IHKSIVISVP 
VTVILTYPME CPVTREVIAN 
GVLCATPLIP IIPSACYLKL 
GQEMPYCFPD NFSLTNTSES 



60 
120 
180 
240 
300 
360 



A195 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession # t 
Coding sequence: 



ESTb 

Hs. 293185 

N62096 

N62096 

1-1389 (underlined sequences correspond to start and stop codons) 



45 
50 
55 
60 
65 
70 
75 
80 



i 

I 

ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTTCCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT 
TCCCTCATCT 
TCACTGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 
CAAGACTGCA 
AATACCTCAG 
TTTCAATGA 



11 
I 

AGAGGCAGGA 
CTGAACATGA 
CGATTATAGG 
TGGGAATATT 
TAAAAGGAGG 
GCTTTCCAGG 
TAAGTTACAA 
TTGATCCTGA 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 
CCCATGGGCA 
AGTCTCATGT 



21 
I 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCTTTTATTC 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACTCTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 
GGAAATGTTC 
TCAGCAGACA 



31 

I 

CCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTTCAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
ATTGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTOCCCTCA 
ACACACTCCG 
TTTGGATTCG 
TACTGCTTTC 
ACACAACTTT 



41 

I 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAT 
ATGTTACAGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGT 
ACTTCATTAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 
CTGACAATTT 
CTACTTTAAA 



51 
I 

TGACAGAGAA 
TCTTTTTAAT 
GAAGCAAGCT 
CTTTTCCCTT 
TTTGGTCAAT 
GTATCCTTTT 
TTTTCAAAGA 
TGGACTTTCC 
TGGAAAGGTC 
AAGGGCAATT 
GCCCAATGCC 
CTTCTTAGTT 
TATGTCCATC 
ATTTACTGGC 
AACATTTGGA 
TGTGACAAGA 
CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 
CTCTCTCACA 
TATTAGTATC 



A196 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession 8: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTS 

HS. 293185 
N62096 
none found 
none found 

37-53, 66-82, 109-125. 
plasma membrane 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



155-172, 180-196, 2SS-271, 338-354, 361-377, 399-415 



I 



11 
I 



21 
I 



31 
I 



41 
I 



51 
I 



MGYQRQBPVI PPQRDLDDRE TLVSEHEYKB KTCQSAALFN WNSIIGSGI IGLPYSKKQA 

923 



60 



WO 03/042661 



PCT/US02/36810 



GFPLGrLLLP WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPF 120 

IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPI*SL YRNIAKLGKV 180 

SLISTGLTTL ILGIVMARAI SU3PHIPKTE DAWVPAKPNA IQAVGVMSFA FICHHNSFLV 240 

YSSLEEPTVA KWSRLIHMSI VISVPICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 300 

RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFH1W TVMVITVATL VSLLIDdiGI 360 

VLELNGVLCA TPLIPIIPSA CYLKLSEEPR THSDKIMSCV MLPIGAWMV FGFVMAITNT 420 
QDCTHGQEMF YCFPDNFSLT NTSESHVQQT TQLSTLNISI FQ 



A197 DNA SEQUENCE 
Gene name : 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



ESTs 

Ks. 31608 

H18836 

NM_017636 

1-3501 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGAGGATG CCTTCGGGGC AGCCGTGGTG ACCGTGTG6G ACAGCGATGC ACACACCACG 60 

GAGAAGCCCA CCGATGCCTA CGQAGAGCTG GACTTCACGG GGGCCGGCCG CAAGCACAGC 120 

AATTTCCTCC GGCTCTCTGA CCGAACGGAT CCAGCTGCAG TTTATAGTCT GGTCACACGC 180 

ACATGGGGCT TCCGTGCCCC GAACCTGGTG GTGTCAGTGC TGGGGGGATC GGGGGGCCCC 240 

GTCCTCCAGA CCTGGCTGCA GGACCTGCTG CGTCGTGGGC TGGTGCGGGC TGCCCAGAGC 300 

ACAGGAGCCT GGATTGTCAC TGGGGGTCTG CACACGGGCA TCGGCCGGCA TGTTGGTGTG 360 

GCTGTACGGG ACCATCAGAT GGCCAGCACT GGGGGCACCA AGGTGGTGGC CATGGGTGTG 420 

GCCCCCTGGG GTGTGGTCCG GAATAGAGAC ACCCTCATCA ACCCCAAGGG CTCGTTCCCT 480 

GCGAGGTACC GGTGGCGCGG TGACCCGGAG GACGGGGTCC AGTTTCCCCT GGACTACAAC 540 

TACTCGGCCT TCTTCCTGGT GGACGACGGC ACACACGGCT GCCTGGGGGG CGAGAACCGC 600 

TTCCGCTTGC GCCTGGAGTC CTACATCTCA CAGCAGAAGA CGGGCGTGGG AGGGACTGGA 660 

ATTGACATCC CTGTCCTGCT CCTCCTGATT GATGGTGATG AGAAGATGTT GACGCGAATA 720 

GAGAACGCCA CCCAGGCTCA GCTCCCATGT CTCCTCGTGG CTGGCTCAGG GGGAGCTGCG 780 

GACTGCCTGG CGGAGACCCT GGAAGACACT CTGGCCCCAG GGAGTGGGGG AGCCAGGCAA 840 

GGCGAAGCCC GAGATCGAAT CAGGCGTTTC TTTCCCAAAG GGGACCTTGA GGTCCTGCAG 900 

GCCCAGGTGG AGAGGATTAT GACCCGGAAG GAGCTCCTGA CAGTCTATTC TTCTGAGGAT 960 

GGGTCTGAGG AATTCGAGAC CATAGTTTTG AAGGCCCTTG TGAAGGCCTG TGGGAGCTCG 1020 

GAGGCCTCAG CCTACCTGGA TGAGCTGCGT TTGGCTGTGG CTTGGAACCG CGTGGACATT 1080 

GCCCAGAGTG AACTCTTTOG GGGGGACATC CAATGGCGGT CCTTCCATCT CGAAGCTTCC 1140 

CTCATGGACG CCCTGCTGAA TGACCGGCCT GAGTTCGTGC GCTTGCTCAT TTCCCACGGC 1200 

CTCAGCCTGG GCCACTTCCT GACCCCGATG CGCCTGGCCC AACTCTACAG CGCGGCGCCC 1260 

TCCAACTCGC TCATCCGCAA CCTTTTGGAC CAGGCGTCCC ACAGCGCAGG CACCAAAGCC 1320 

CCAGCCCTAA AAGGGGGAGC TGCGGAGCTC OGGCCCCCTG ACGTGGGGCA TGTGCTGAGG 1380 

ATGCTGCTGG GGAAGATGTG CGCGCCGAGG TACCCCTCCG GGGGCGCCTG GGACCCTCAC 1440 

CCAGGCCAGG GCTTCGGGGA GAGCATGTAT CTGCTCTCGG ACAAGGCCAC CTCGCCGCTC 1500 

TCGCTGGATG CTGGCCTCGG GCAGGCCCCC TGGAGCGACC TGCTTCTTTG GGCACTGTTG 1560 

CTGAACAGGG CACAGATGGC CATGTACTTC TGGGAGATGG GTTCCAATGC AGTTTCCTCA 1620 

GCTCTTGGGG CCTGTTTGCT GCTCCGGGTG ATGGCAOGCC TGGAGCCTGA CGCTGAGGAG 1680 

GCAGCACGGA GGAAAGACCT GGCGTTCAAG TTTGAGGGGA TGGGCGTTGA CCTCTTTGGC 1740 

GAGTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTCC TCCTCCGTCG CTGCCCGCTC 1800 

TGGGGGGATG CCACTTGCCT CCAGCTGGCC ATGCAAGCTG ACGCCCGTGC CTTCTTTGCC 1860 

CAGGATGGGG TACAGTCTCT GCTGACACAG AAGTGGTGGG GAGATATGGC CAGCACTACA 1920 

CCCATCTGGG CCCTGGTTCT CGCCTTCTTT TGCCCTCCAC TCATCTACAC CCGCCTCATC 1980 

ACCTTCAGGA AATCAGAAGA GGAGCCCACA CGGGAGGAGC TAGAGTTTGA CATGGATAGT 2040 

GTCATTAATG GGGAAGGGCC TGTCGGGACG GCGGACCCAG CCGAGAAGAC GCCGCTGGGG 2100 

GTCCCGCGCC AGTCGGGCCG TCCGGGTTGC TGCGGGGGCC GCTGCGGGGG GCGCCGGTGC 2160 

CTACGCCGCT GGTTCCACTT CTGGGGCGCG CCGGTGACCA TCTTCATGGG CAACGTGGTC 2220 

AGCTACCTGC TGTTCCTGCT GCTTTTCTOG CGGGTGCTGC TCGTGGATTT CCAGCCGGCG 2280 

CCGCCCGGCT CCCTGGAGCT GCTGCTCTAT TTCTGGGCTT TCACGCTGCT GTGCGAGGAA 2340 

CTGCGCCAGG GCCTGAGCGG AGGCGGGGGC AGCCTCGCCA GCGGGGGCCC CGGGCCTGGC 2400 

CATGCCTCAC TGAGCCAGCG CCTGCGCCTC TACCTCGCCG ACAGCTGGAA CCAGTGCGAC 2460 

CTAGTGGCTC TCACCTGCTT CCTOCTGGGC GTGGGCTGCC GGCTGACCCC GGGTTTGTAC 2520 

CACCTGGGCC GCACTGTCCT CTGCATCGAC TTCATGGTTT TCACGGTGCG GCTGCTTCAC 2580 

ATCTTCACGG TCAACAAACA GCTGGGGCCC AAGATCGTCA TCGTGAGCAA GATGATGAAG 2640 

GACGTGTTCT TCTTCCTCTT CTTCCTCGGC GTGTGGCTGG TAGCCTATGG CGTGGCCACG 2700 

GAGGGGCTCC TGAGGCCACG GGACAGTGAC TTCCCAAGTA TCCTGOGCCG CGTCTTCTAC 2760 

CGTCCCTACC TGCAGATCTT CGGGCAGATT CCCCAGGAGG ACATGGACGT GGCCCTCATG 2820 

GAGCACAGCA ACTGCTOGTC GGAGCCCGGC TTCTGGGCAC ACCCTCCTGG GGCCCAGGOG 2880 

GGCACCTGCG TCTCCCAGTA TGCCAACTGG CTGGTGGTGC TGCTCCTCGT CATCTTCCTG 2940 

CTCGTGGCCA ACATCCTGCT GGTCAACTTG CTCATTGCCA TGTTCAGTTA CACATTCGGC 3000 

AAAGTACAGG GCAACAGCGA TCTCTACTGG AAGGCGCAGC GTTACCGCCT CATCOGGGAA 3060 

TTCCACTCTC GGCCCGGGCT GGCCCCGCCC TTTATCGTCA TCTCCCACTT GCGCCTCCTG 3120 

CTCAGGCAAT TGTGCAGGCG ACCCCGGAGC CCCCAGCCGT CCTCCCCGGC CCTCGAGCAT 3180 

TTCCGGGTTT ACCTTTCTAA GGAAGCCGAG CGGAAGCTGC TAACGTGGGA ATCGGTGCAT 3240 

AAGGAGAACT TTCTGCTGGC ACGCGCTAGG GACAAGCGGG AGAGCGACTC CGAGCGTCTG 3300 

AAGCGCACGT CCCAGAAGGT GGACTTGGCA CTGAAACAGC TGGGACACAT CCGCGAGTAC 3360 

GAACAGCGCC TGAAAGTGCT GGAGCGGGAG GTCCAGCAGT GTAGCCGCGT CCTGGGGTGG 3420 

GTGGCCGAGG CCCTGAGCOG CTCTGCCTTG CTGCCCCCAG GTGGGCCGCC ACCCCCTGAC 3480 
CTGCCTGGGT CCAAAGACTG A 



A198 Protein sequence: 

Gene names ESTs 

Unigene number: Hs.31608 

Probeset Accession #: H18836 

Protein Accession #: none found 



924 



WO 03/042661 



PCT/US02/36810 



Signal sequence: none found 

Transmembrane domains: 214-230, S37-5S6, 642-662, 730-752, 760-782, 815-837, 642-864, 877-899, 973- 

Cellular Localization: plasma membrane 

1 11 21 31 41 51 

I I I I I I 

MEDAFGAAW TVWDSDAHTT EKPTDAYGEL DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60 

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 120 

AVRDKQMAST GGTKWAMGV APWGWRNRD TLINPKGSPP ARYRWRGDPE DGVQFPLDYN 180 

YSAFFLVDDG THGCLGGENR FRLRLBSYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240 

ENATQAQLPC LLVAGSGGAA DCLABTLEDT LAPGSGGARQ GEARDRIRRP FPKGDLBVLQ 300 

AQVERIMTRK ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 360 

AQSELFRGDI QWRSFHLEAS LMDALLNDRP BFVRLLISHG LSLGHFLTPM RLAQLYSAAP 420 

SNSLIRNLLD QASHSAGTKA PALKGGAAKL RPPDVGHVLR MLLGKMCAPR YPSGGAWDPH 480 

PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAMYF WEMGSNAVSS 540 

ALGACLLLRV MARLEPDAEE AARRKDLAFK FEGKGVDLFG ECYRSSEVRA ARLLLRRCPL 600 

WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRLI 660 

TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 720 

LRRWFHFWGA PVTIPMGNW SYLLFLLLFS RVLLVDFQPA PPGSLELLLY FWAFTLLCEE 780 

LRQGLSGGGG SLASGGPGPG HASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840 

HLGRTVLCID FMVFTVRLLH I FTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900 

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQBDMDVALM EHSNCSSEPG FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLVNL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020 

FHSRPALAPP FIVISHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH 1080 

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSKVLGW 1140 
VAEALSRSAL LPPGGPPPPD LPGSKD 

A199 DNA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs.40808 
Probeset Accession ft: AA0S4237 
Nucleic Acid Accession $: AA054237 

Coding sequence: 1-894 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I 1 I I I I 

ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG 120 

GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 180 

CCGCTGTOGC ACCTGCCGCT GCGGGACTCG CCCCOGCTGG GGCGCCGGCT GCTCCOGGGC 240 

GGCCCGGGGC GCGCCGACXC CQAGTCCTGG CGCTC3GCTCC TGGGGCTCGG CGGGCTGGAC 300 

GCCGAGTGCQ GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 360 

CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 420 

GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 540 

CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 660 

TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 780 

GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 840 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 

A200 Protein sequence: 

Gene name: ESTs 

Unigene number: Hs.40808 

Probeset Accession #: AA054237 

Protein Accession ft: none found 

Signal sequence: 1-18 

Transmembrane domains: 179-201, 209-231, 257-279 

Cellular Localization: plasma membrane 

1 11 21 31 41 51 

I I I I I I 

• MEPRALVTAL SLGLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 60 

PLSHLPLRDS PPLGRRLLPG GPGRADPESM RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 120 

LGIDRDIDTL XLKGIAQRCT AIKYHFSQPI RLRNI PFNLT KTIQQDEWHL LELRRITAGF 180 

LGMAVAVLLC GCIVATVSFF WBESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 240 
KLIYSLPADV EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV 

A201 DNA 



Gene name: COA14 
Unigene number: He. 26813 

Probeset Accession ft: N32912 
Nucleic Acid Accession fts NMJJ16S70 

Coding sequence: 1- 1134 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGAGGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 60 
AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 
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TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTOGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



A202 Protein sequences 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



CDA14 
HS. 26813 
N32912 
NP_057654 
none found 
none found 
nuclear 



1 11 21 31 41 51 

I I I I I I 

MRRLNRKKTL SLVKELDAFP KVPESYVETS ASGGTVSLIA FTTMALLTIM EFSVYQDTWM 60 

KYEYEVDKDP SSKLRINIDI TVAMKCQYVG ADVLDIAETM VASADGLVYE PTVFDLSPQQ 120 

KEWQRMLQLI QSRLGEEHSL QDVIPKSAFK STSTALPPRE DDSSQSPNAC RIHGHLYWK 180 

VAGKFHITVG KAIPHPRGHA HLAALVNHES YNPSHRIDHL SFGELVPAII NPLDGTEKIA 240 

IDHNQMFQYF ITWPTKLHT YKISADTHQP SVTERERI IN HAAGSHGVSG IFMKYDLSSL 300 

MVTVTEEHMP FWQFFVRJjCG IVGGIFSTTG MLHGIGKPIV EIICCRFRLG SYKPVNSVPF 360 
EDGHTDNHLP LLENNTH 



A203 DNA SEQUENCE 

Gene name: dipeptidylpeptidase IV (CD26, adenosine deaminase completing protein 2) 

Unigene number: Hs.44926 
Probeset Accession ft: S79876 
Nucleic Acid Accession ft; NMJ>01935.1 

Coding sequence: 76-2301 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I 1 I 1 ! 

CGCGCGTCTC CGCCGCCCGC GTGACTTCTG CCTGCGCTCC TTCTCTGAAC GCTCACTTCC 60 

GAGGAGACGC OGACGATGAA GACACCGTGG AAGATTCTTC TGGGACTGCT GGGTGCTGCT 120 

GCGCTTGTCA CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAGGCAC AGATGATGCT 180 

ACAGCTGACA GTCGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAGACTG 240 

AAGTTATACT CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT 300 

ATCTTGGTAT TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 360 

GATGAGTTTG GACATTCTAT CAATGATTAT TCAATATCTC CTGATGGGCA GTTTATTCTC 420 

TTAGAATACA ACTACGTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 480 

GATTTAAATA AAAGGCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC 540 

ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTATGTTAAA 600 

ATTGAACCAA ATTTACCAAG TTACAGAATC ACATGGAOGG GGAAAGAAGA TATAATATAT 660 

AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG 720 

TGGTCTCCAA ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT 780 

ATTGAATACT CCTTCTACTC TGATGAGTCA CTGCAGTACC CAAAGACTGT ACGGGTTCCA 840 

TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 900 

CTCAGCTCAG TCACCAATGC AACTTCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA 960 

GGGGATCACT ACTTGTGTGA TGTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTGG 1020 

CTCAGGAGGA TTCAGAACTA TTCGGTCATG GATATTTGTG ACTATGATGA ATCCAGTGGA 1080 

AGATGGAACT GCTTAGTGGC ACGGCAACAC ATTGAAATGA GTACTACTGG CTGGGTTGGA 1140 

AGATTTAGGC CTTCAGAACC TCATTTTACC CTTGATGGTA ATAGCTTCTA CAAGATCATC 1200 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA 1260 

TTTATTACAA AAGGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA 1320 

TACTACATTA GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA 1380 

CTTATTGACT ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG 1440 

TACTATTCTG TGTCATTCAG TAAAGAGGCG AAGTATTATC AGCTGAGATG TTCCGGTCCT 1500 

GGTCTGCCCC TCTATACTCT ACACAGCAGC GTGAATGATA AAGGGCTGAG AGTCCTGGAA 1560 

GACAATTCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCCAA AAAACTGGAC 1620 

TTCATTATTT TGAATGAAAC AAAATTTTGG TATCAGATGA TCTTGCCTCC TC ATTTTG AT 1680 

AAATCCAAGA AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA 1740 

GACACTGTCT TCAGACTGAA CTGGGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA 1800 

GCTAGCTTTG ATGGCAGAGG AAGTGGTTAC CAAGGAGATA AGATCATGCA TGCAATCAAC 1860 

AGAAGACTGG GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920 

ATGGGATTTG TGGACAACAA ACGAATTGCA ATTTGGGGCT GGTCATATGG AGGGTACGTA 1980 

ACCTCAATGG TCCTGGGATC GGGAAGTGGC GTGTTCAAGT GTGGAATAGC CGTGGCGCCT 2040 
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GTATCCOGGT GGGAGTACTA TGACTCAGTG TACACAGAAC GTTACATGGG TCTCCCAACT 2100 

CCAGAAGACA ACCTTGACCA TTACAGAAAT TCAACAGTCA TGAGCAGAGC TGAAAATTTT 2160 

AAACAAGTTG AGTACCTCCT TATTCATGGA ACAGCAGATQ ATAACGTTCA CTTTCAGCAG 2220 

TCAGCTCAGA TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAGGC AATGTGGTAT 2280 

ACTGATGAAG ACCATGGA AT AGC TAGCAGC ACAGCACACC AA CAT AT AT A TACCCACATG 2340 

AGCCACTTCA TAAAACAATG TTTCTCTTTA CCTTAGCACC TCAAAATACC ATGCCATTTA 2400 

AAGCTTATTA AAACTCATTT TTGTTTTCAT TATCTCAAAA CTGCACTGTC AAGATGATGA 2460 

TGATCTTTAA AATACACACT CAAATCAAGA AACTTAAGGT TACCTTTGTT CCCAAATTTC 2S20 

ATACCTATCA TCTTAAGTAG GGACTTCTGT CTTCACAACA GATTATTACC TTACAGAAGT 25 BO 

TTGAATTATC CGGTCGGGTT TTATTGTTTA AAATCATTTC TGCATCAGCT GCTGAAACAA 2640 

CAAATAGGAA TTGTTTTTAT GGAGGCTTTG CATAGATTCC CTGAGCAGGA TTTTAATCTT 2700 

TTTCTAACTG GACTGGTTCA AATGTTGTTC TCTTCTTTAA AGGGATGGCA AGATGTGGGC 2760 

AGTGATGTCA CTAGGGCAGG GACAGGATAA GAGGGATTAG GGAGAGAAGA TAGCAGGGCA 2820 

TGGCTGGGAA CCCAAGTCCA AGCATACCAA CACGAGCAGG CTACTGTCAG CTCCCCTCGG 2880 

AGAAGAGCTG TTCACCACGA GACTGGCACA GTTTTCTGAG AAAGACTATT CAAACAGTCT 2940 

CAGGAAATCA AATATCGAAA GCACTGACTT CTAAGTAAAC CACAGCAGTT GAAAGACTCC 3000 

AAAGAAATGT AAGGGAAACT GCCAGCAACG CAGCCCCCAG GTGCCAGTTA TGGCTATAGG 3060 

TGCTACAAAA ACACAGCAAG GGTGATGGGA AAGCATTGTA AATGTGCTTT TAAAAAAAAA 3120 

TACTGATGTT CCTAGTGAAA GAGGCAGCTT GAAACTGAGA TGTGAACACA TCAGCTTGCC 3180 

CTGTTAAAAG ATGAAAATAT TTGTATCACA AATCTTAACT TGAAGGAGTC CTTGCATCAA 3240 

TTTTTCTTAT TTCATTTCTT TGAGTGTCTT AATTAAAAGA ATATTTTAAC TTCCTTGGAC 3300 

TCATTTTAAA AAATGGAACA TAAAATACAA TGTTATGTAT TATTATTCCC ATTCTACATA 3360 
CTATGGAATT TCTCCCAGTC ATTTAATAAA TGTGCCTTCA TTTTTTC 



A204 Protein sequence: 
Gene name: 
Unigene number; 
Probeset Accession #: 
Protein Accession fts 
Signal sequence: 
Transmembrane domains: 
DPPiy_N_term domain: 
Peptidase_S9 domain: 
Cellular Localization: 



dipeptidylpeptidase IV 

Hs. 44926 

S79876 

NPJ> 0192 6.1 

none found 

6-28 

43-S57 

558-635 

plasma membrane 



(CD26, adenosine deaminase 



complexing protein 2) 



1 
I 

MKTPWKILLG 
RWISDHEYLY 
VKQWRHSYTA 
PSYRITWTGK 
YSDESLQYPK 
CDVTWATQER 
EPHFTLDGNS 
EYKGMPGGRN 
TLHSSVNDKG 
PLLLDVYAGP 
FEVEDQIEAA 
YYDSVYTERY 
KALVDVGVDP 



11 
I 

LLGAAALVTI 
KQENNILVFN 
SYDIYDLNKR 
EDIIYNGITD 
TVRVPYPKAG 
ISLQWLRRIQ 
FYKIISNEEG 
LYKIQLIDYT 
LRVLEDNSAL 
CSQKADTVFR 
RQFSKMGFVD 
MGLPTPEDNL 
QAMWYTDEDH 



21 
I 

ITVPWLLNK 
ABYGNSSVPL 
QLITBERIPN 
WVYEEEVFSA 
AVNPTVKFFV 
NYSVMDICDY 
YRHICYPQID 
KVTCLtSCELN 
DKMLQNVQMP 
LNWATYLAST 
NKRIAIWGWS 
DHYRNSTVMS 
GIASSTAHQH 



31 

I 

GTDDATADSR 
ENSTFDEFGH 
NTQWVTWSPV 
YSALWWSPNG 
VNTDSLSSVT 
DESSGRWNCL 
KKDCTPITKG 
PERCQYYSVS 
SKKLDPIILN 
ENIIVASFDG 
YGGYVTSMVL 
RAENFKQVEY 
IYTHMSHPIK 



41 
I 

KTYTLTDYLK 
SINDYSISPD 
GHKLAYVWNN 
TFLAYAQFND 
HATS I QI TAP 
VARQRIBMST 
TWEVIGIEAL 
FSKEAKYYQL 
ETKFWYQMIL 
RGSGYQGDKI 
GSGSGVFKCG 
LLIHGTADDN 
QCPSLP 



51 
I 

NTYRLKLYSL 
GQFILLEYNY 
DIYVKIEPNL 
TBVPLIEYSP 
ASMLIGDHYL 
TGWVGRFRPS 
TSDYLYYISN 
RCSGPGLPLY 
PPHFDKSKKY 
MHAINRRLGT 
IAVAPVSRWE 
VHFQQSAQIS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



A205 DNA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Coding sequence* 



predicted exon 
none found 
none found 
none found 

1-261 (underlined sequences correspond to start and stop codons) 



1. 11 21 31 41 51 

I I I I I I 

ATGGCTCTGG CGAAGGTGAG GGAGCCAAAC GCAAATGACA ATGCCATCAG AGTTGACAAC 60 
AGAAGTGTGA TTAAAGTGCG TGCTAACCAG TGTTCCCTGC ATGAGGCAGA AAGTGAATCC 120 
AGAAACCCTC AGGAGCTCTG GATGGGCCTG CTCCTCTTGA TGGGGGTCCT AGAAGCATGT 180 
GTGGAAATGA GGCCTCTGTC AGTCTGGTCC CTGAGAGATG ACAAGGAGCA GAGCCCCCAC 240 
CAGCCCACAC TGGATGTCTA A 



A206 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession §: 
Protein Accession 6: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



predicted exon 
none found 
none found 
none found 
none found 
46-68 

not determined 



1 11 21 31 41 51 

I I I I I I 

MALAKVREPN ANDNAIRVDN RSV1KVRANQ CSLHEAESES RNPQELWMGL LLLKGVLBAC 60 
VEMRPLSVWS LRDDKEQSPH QPTLDV 
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A207 DNA SEQUENCE 

Gene name: ESTs 

Unlgene number: Hs.222B86 

Probeset Accession #; AI67222S 

Nucleic Acid Accession ft: none found 

Coding sequence: 1-462 (underlined 



sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGCCAAATG CTGAGTTAGA AGCAAAGAGC CTTGGAAGCA GTAAATGTTT AAAAACTGCT 60 

CTCATACTTG CTGTATGTTG TGGATCAGCA AATATAGTCA GCCCTCTACT TGAGCAAAAT 120 

ATTGATGTAT CTTCTCAAGA TCTGGACAGA CGGCCAGAGA GTATGCTGTT TCTAGTCATC 180 

ATCATGTGGA CCAGTTTTGT GGAAGACAAT CTTTCCATGG GCTGGGGGAA GCTAGAAGAT 240 

TTTATGGCTA TTGAAGAAGA AATGAAGAAG CACGGAAGTA CTCATGTGGG ATTCCCAGAA 300 

AACCTGACTA ATGGTGCCGC TGCTGGCAAT GGTGATGATG GATTAATTCC TCCAAGGAAG 360 

AGCAGAACAC CTGAAAGCCA GCAATTTCCT GACACTGAGA ATGAAGAGTA TCACAGGTTT 420 
GTCAAAGATC AGATAGTTGT AGATATGCGG CGTTATTT CT GA 

A206 Protein sequence: 

Gene name: ESts 

Unigene number: Hs. 222886 

Probeset Accession ft: AI672225 

Protein Accession ft: none found 

Signal sequence: none found 

Transmembrane domains: 16-38 

Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I ! I 

MPNAELEAKS LGSSKCLKTA LILAVCCGSA NIVSPLLEQN IDVSSQDLDR RPESMLFLVZ 60 
IMWTSFVEDN LSMGWGKLED FMAIEEEMKK HGSTHVGFPE NLTNGAAAGN GDDGLIPPRK 120 
SRTPESQQPP DTENEEYHRF VKDQIWDMR RYP 

A209 DNA SEQUENCE 

Gene name: odz (odd Oz/ten-m, Drosophila) homo log 1 

Unigene number: Hs.23796 

Probeset Accession ft: NM 014253 

Nucleic Acid Accession ft* ~ NMJ>14253 

Coding sequence: 65-8242 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GACTGCTTGC ATTAAAGGAC TTCCTCATCC TTTTTTTCAT GAAACTGAGC TTGCTTAATC 60 

AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAGTCA AGCATGAAAT 120 

GGATCTAGCT TACACCAGTT CTTCTGATGA GAGTGAAGAT GGAAGAAAAC CAAGACAGTC 180 

ATACAACTCC AGGGAGACCC TGCAOGAGTA TAACCAGGAG CTGAGGATGA ATTACAATAG 240 

CCAGAGTAGA AAGAGGAAAG AAGTAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300 

CTCTCACACT CTGTGCTCTG GCTACCAAAC AGACATGCAC AGCGTTTCTC GGCATGGCTA 360 

CCAGCTAGAG ATGGGATCTG ATGTGGACAC AGAGACAGAA GGTGCTGCCT CACCTGACCA 420 

TGCACTAAGA ATGTGGATAA GGGGAATGAA ATCAGAGCAT AGTTCCTGTT TGTCCAGCCG 480 

GGCCAACTCT GCATTATCCT TGACTGACAC TGACCATGAA AGGAAGTCTG ATGGGGAAAA 540 

TGGTTTCAAA TTCTCTCCTG TTTGTTGTGA CATGGAGGCT CAAGCTGGGT CTACTCAAGA 600 

TGTGCAGAGC AGCCCACACA ACCAGTTCAC CTTCAGACCC CTCCCACCGC CACCTCOGCC 660 

TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCCCCTGCA GCGGACTCTC TTCAGAGGAG 720 

ATCAATGACT ACCCGCAGCC AGCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCACGCA 780 

GGATTCAGTC CATCTGCATA ACAGCTGG6T CCTGAACAGC AACATACCAT TGGAGACCAG 840 

GCATTCCCTG TTCAAACATG GATCTGGTTC CTCTGCGATC TTCAGTGCAG CCAGTCAGAA 900 

CTACCCTCTG ACATCCAATA CCGTGTACTC GCCCCCTCCC AGGOCTCTTC CTCGAAGCAC 960 

CTTTTCCCGA CCTGCCTTTA CCTTTAACAA ACCTTACAGG TGCTGCAACT GGAAGTGCAC 1020 

AGCATTGAGC GCCACTGCAA TCACAGTGAC TTTGGCCTTG TTACTAGOCT ATGTGATTGC 1080 

AGTGCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC TGTATGCAAA 1140 

TGGAGTTAGC AAAGGGAACA GGGGGACCGA GTCCATGGAC ACTACTTACT CTCCAATTGG 1200 

AGGAAAAGTT TCTGATAAAT CAGAGAAAAA AGTGTTTCAG AAGGGACGGG CGATAGACAC 1260 

TGGAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCACCTGGTT TATTCTGGCG 1320 

TTTCCAGATT ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGGA 1380 

CTCTCTGCTG GGAATTTATG GCAGAAGAAA CATTCCACCT ACACATACTC AGTTTGATTT 1440 

TGTAAAACTA ATGGATGGCA AACAGCTGGT CAAGCAGGAC TCCAAGGGCT CTGATGATAC 1500 

ACAGCACTCC CCTCGGAACC TGATCTTAAC TTCGCTTCAG GAGACAGGTT TCATAGAGTA 1560 

TATGGATCAA GGACCTTGGT ATCTGGCGTT TTACAATGAT GGAAAAAAGA TGGAGCAAGT 1620 

ATTCGTGTTA ACTACAGCAA TTGAAATAAT GGATGACTGT TCAACCAATT GCAATGGAAA 1680 

TGGAGAGTGT ATCTCTGGCC ATTGTCATTG TTTCCCAGGA TTCCTTGGAC CTGACTGTGC 1740 

TAGAGATTCC TGCCCTGTGC TGTGTGGTGG GAATGGAGAA TACGAGAAAG GACACTGTGT 1800 

CTGCCGGCAT GGCTGGAAGG GGCCAGAGTG TGACGTTCCG GAAGAACAAT GCATTGATCC 1860 

AACATGCTTT GGCCACGGCA CCTGCATCAT GGGAGTCTGC ATCTGTGTGC CAGGATACAA 1920 

AGGAGAAATA TGCGAGGAAG AGGACTGCCT AGACCCAATG TGTTCCAACC ATGGCATCTG 1980 

TGTAAAAGGA GAATGTCACT GTTCTACTGG CTGGGGAGGA GTTAACTGTG AAACACCACT 2040 

TCCTGTATGT CAAGAGCA6T GCTCAGGACA CGGAACTTTT CTTCTGGACG CTGGAGTATG 2100 

CAGCTGTGAT CCCAAGTGGA CAGGATCTGA CTGCTCAACA GAGCTGTGTA CCATGGAGTG 2160 

TGGTAGCCAT GGAGTCTGCT CAAGAGGAAT TTGCCAGTGT GAAGAAGGCT GGGTAGGACC 2220 

AACATGTGAG GAACGCTCCT GTCATTCTCA TTGTACTGAG CATGGCCAAT GCAAAGATGG 2280 
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AAAATGTGAG TGTAGCCCTG GATGGGAGGG CGACCACTGC ACAATTGCTC ACTACTTAGA 2340 
TGCTGTCCGA GATGGCTGCC CAGG6CTCTG CTTTGQAAAT GGAOGATGTA CCCTGGATCA 2400 
AAATGGTTGG CACTGTGTGT GTCAGGTGGG TTGGAGTGGG ACAGGCTGCA ATGTTGTCAT 2460 
GGAAATGCTT TGTGGAGATA ACTTGGACAA TGATGGAGAT GOTTTAACCG ACTGTGTGGA 2520 
TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAGTCCT CTCTGCCAGG GCTCACCAGA 2580 
TCCTCTTGAC CTCATTCAGC AAAGOCAAAC TCTCTTCTCT CAGCACACTT CAAGACTTTT 2640 
TTATGATCGA ATCAAATTCC TCATTGGCAA GGACAGTACT CATGTCATTC CTCCTGAGGT 2700 
GTCATTTGAC AGCAGGCGTG CCTGTGTGAT TCGAGGCCAA GTGGTGGCCA TAGATGGAAC 2760 
TCCTCTAGTG GGAGTGAATG TCAGTTTCTT GCACCACAGT GATTATGGGT TTACCATCAG 2820 
CCGGCAAGAT GGAAGCTTTG ACCTCGTGGC CATOGGTGGC ATCTCTGTCA TCTTAATCTT 2880 
CGACCGATCC CCTTTCCTGC CTGAGAAGAG AACACTCTGG TTGCCTTGGA ATCAGTTTAT 2940 
TGTGGTAGAG AAAGTCACCA TGCAGAGAGT TGTATCAGAC COGCCATCCT GOGATATCTC 3000 
CAACTTTATC AGCCCAAACC CTATTGTGCT TCCTTCACCG CTCACATCAT TTGQAGGGTC 3060 
CTGTCCAGAG AGGGGAACTA TTGTTCCTGA GCTGCAGGTT GTACAGGAGG AAATTCCCAT 3120 
TCCCTCCAGC TTTGTGAGGC TQAGTTACCT GAGCAGCOGC ACCCCTGGGT ATAAAACCCT 3180 
GCTACGGATC CTTCTGACAC ATTCAACGAT TCCOGTAGGC ATGATAAAAG TACACCTCAC 3240 
AGTAGCTGTG GAAGGGOGAC TCACACAGAA GTGGTTTCCC GCOGCAATTA ATCTTGTCTA 3300 
CACATTTGCT TGGAACAAGA COGATATCTA TGGACAGAAG GTTTGGGGCC TGGCAGAGGC 3360 
TTTGGTATCT GTGGGATATG AATATGAAAC GTGCCCTGAC TTTATTCTCT GGGAGCAAAG 3420 
GACAGTCGTT TTACAAGGTT TTGAGATGGA TGCTTCTAAC CTAGGAGACT GGTCTTTGAA 3480 
TAAGCATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAGGGAATG GAGAAAATAT 3540 
GTTCATTTCC CAGCAGCCCC CAGTCATATC AACCATAATG GGTAATGGAC ACCAAAGGAG 3600 
TGTAGCCTGC ACCAACTGCA ATGGCCCAGC CCACAACAAC AAACT CTTTG CTCCTGTOGC 3660 
CTTAGCTTCT GGCCCTGATG GCAGTGTGTA TGTTGGCGAC TTCAATTTTG TAAGGAGAAT 3720 
ATTTCCCTOG GGAAACTCOG TTAGTATTTT GGAATTAAGC ACAAGTCCTG CTCACAAATA 3780 
CTATCTGGCT ATGGACCCTG TGTCTGAATC ACTCTATCTA TCAGACACCA ATACTOGCAA 3840 
AGTCTACAAG TTGAAATCTC TTGTGGAGAC GAAAGATCTG TCCAAGAATT TTGAAGTGGT 3900 
GGCAGGAACT GGTGATCAGT GCCTTCCCTT TGACCAGAGT CATTGTGGAG ATGGTGGGAG 3960 
AGCATCGGAA GCTTCACTGA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 4020 
TTACTTTGTG GATGGGACTA TGATTOGCAA AATTGATGAG AATGCTGTGA TCACAACTGT 4080 
AATCGGCTCA AATGGTCTGA CTTCCACACA ACCACTGAGC TGTGACTCAG GAATGGACAT 4140 
CACTCAGGTG OGATTAGAGT GGCCAACAGA CCTTGCAGTA AATCCTATGG ACAATTCATT 4200 
GTATGTCTTG GATAACAACA TTGTGCTGCA AATTTCTGAG AACAGGCGTG TTGGGATCAT 4260 
CGCAGGACGC CCCATTCACT GCCAGGTGCC AGGCATOGAT CATTTCCTGG TCAGCAAGGT 4320 
AGCAATTCAC TCCACTCTAG AGTCAGCGAG GGCCATCAGT GTCTCCCACA GCGGGCTGCT 4380 
CTTCATAGCT GAAACAGACG AGAGGAAAGT AAACCGCATT CAGCAAGTAA CCACCAATGG 4440 
GGAGATCTAC ATCATOGCTG GTGCCCCCAC TGACTGTGAC TGCAAAATTG ATCCAAACTG 4500 
TGACTGTTTT TCAGGTGATG GTGGCTATGC CAAAQATGCA AAGATGAAAG CCCCTTCCTC 4560 
CTTAGCAGTG TCGCCTGATG GAACCCTCTA TGTGGCAGAC CTCGGAAATG TTOGAATTOG 4620 
TACCATGAGC AGGAACCAAG CCCACCTGAA TGACATGAAC ATTTATGAGA TTGCTTCACC 4680 
CGCTGATCAG GAACTGTACC AGTTCACTGT AAATGGAACC CACCTACACA CCCTGAACTT 4740 
GATAACAAGG GACTATGTTT ATAACTTCAC CTACAATTCT GAAGGTGACT TGGGCGCGAT 4800- 
TACCAGCAGC AATGGCAATT CAGTGCACAT TOGCCGTGAT GCAGGCGGAA TGCCGCTATG 4860 
GCTTGTGGTG CCTGGOGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 4920 
AAGAGTGTCA GCCCAAGGCT ATAATCCGGC CTTAATGACC TATCCAGGAA ACACAGGGCT 4980 
TCTGGCTACC AAAAGTAACG AAAATGGATG GACAACOGTT TATQAGTATG ACCCOGAGGG 5040 
ACACCTGACC AATGCAACGT TTCCCACTGG AGAGGTCAGC AGCTTCCACA GTGACCTGGA 5100 
GAAGCTGACA AAAGTGGAGC TAGATACTTC CAACOGTGAA AATGTCCTCA TGTCAACCAA 5160 
CTTGACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTCAAA GTACCTATOG 5220 
GGTGAATCCA GATGGTTCCC TGOGTGTCAC TTTTGCCAGC GGGATGGAGA TCGGCCTCAG 5280 
CTCAGAGCCC CACATCCTGG CAGGGGCAGT CAACCCTACC CTGGGCAAAT GCAACATCTC 5340 
ATTGCCCGGA GAGCACAATG CAAACCTCAT CGAGTGGCGG CAGAGGAAGG AGCAAAACAA 5400 
AGGCAATGTT TOGGCTTTTG AAAGGAGGCT GAGGGCCCAC AACAGAAACC TACTCTCCAT 5460 
AGATTTTGAT CATATAACCC GCACAGGAAA GATCTATGAT GACCATCGAA AATTCACCCT 5520 
TCGAATTCTT TATGACCAGA CTGGGOGACC CATTCTGTGG TCTCCTGTAA GCAGATATAA 5580 
TGAAGTGAAC ATCACATATT CACCTTCGGG ATTGGTGACG TTXATTCAAA GAGGAACGTG 5640 
GAATGAAAAA ATGGAATATG ACCAGAGTGG GAAAATTATT TCAAGAACTT GGGCTGATGG 5700 
GAAAATTTGG AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAGOG 5760 
GCGTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TGCCTAGCAT 5820 
GGTGCGCCAC AGCTTACAAA CCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACCCC 5880 
ACCGGACAGT AGCACTTCTT TTATCCAAGA CTATAGTCGA GATGGCCGAT TGCTA CAGAC 5940 
CCTGCATCTG GGGACAGGGC GCAGAGTCTT ATACAAGTAC ACCAAGCAAG CAAGGCTTTC 6000 
TGAGGTTCTC TATGATAOCA CTCAGGTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 6060 
TAAGACAATA CACCTGATGC ATGAOGGATT CATCTGCACA ATCAGATACA GGCAAACAGG 6120 
ACCTCTTATT GGAGGCCAGA TTTTCAGATT CAGTGAAGAA GGCCTTGTGA ATGCACGGTT 6180 
CGACTACAGC TACAACAATT TCOGAGTCAC AAGCATGCAA GCTGTAATCA A TGAAA COCC 6240 
TTTGCCTATA GATCTTTACC GATATGTTGA TGTCTCTGGC AGAACAGAGC AGTTTGGAAA 6300 
ATTCAGTGTA ATTAATTACG ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 6360 
CAAAATCTTC AGTGCCAATG GACAAGTCAT TOAAGTCCAA TATGAAATCC TAAAGGCAAT 6420 
TGCCTACTGG ATGACCATTC AATATGATAA TGTGGGCGGA CATGGTAATA TGTGCATAAG 6480 
GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATGAA TAOQATGCTG ATGGGCAACT 6540 
TCAGACTGTT TCTGTAAATG ACAAAACCCA GTGGOGTTAT AGTTACGATC TGAATGGAGA 6600 
CATCAACCTC TTAAGCCATG GGAAGAGTGC TOGTCTTACT CCTCTCOGAT A TGACC TCOG 6660 
AGACOGCATC ACCAGATTAG GAGAAATTCA GTATAAAATG GATGAAGATG GCTTTCTGAG 6720 
GCAGAGGGGA AATGATATTT TTGAAXATAA TTCTAATGGC CTGCTGCAGA AAGCCTACAA 6780 
TAAGGCTTCT GGCTGGACTG TGCAGTATTA CTATGATGGG CTTGGGOGAC GTGTOGCGAG 6840 
TAAGTCCAGC CTAGGGCAGC ACCTTCAGTT CTTTGTOGAC GOGACOGCGA ACCCCATAAG 6900 
AGTTACTCAT TTGTACAACC ACACAAGCTC GGAGATTACA TCTCTGTATT ATGATCTCCA 6960 
AGGTCACCTT ATTGOCATGG AGTTAAGCAG TGGTGAAGAA TATTATGTAG CCTGTGATAA 7020 
TACAGGTACC CCACTAGCTG TGTTCAGCAG COGAGGTCAG GTCATAAAGG AGATACTATA 7080 
CACACCTTAT GGCGATATCT ATCATGACAC TTACCCTGAC TTTCAGGTCA TAATTGGTTT 7140 
TCATGGAGGA CTCTATGATT TCCTTACTAA ATTAGTGCAC CTGGGGCAAA GGGATTATGA 7200 
TGTTGTTGCT GGGAGATGGA CAAOGGCCTA TCATCACATA TGGAAACAGT TGAACCTOCT 7260 
TCCTAAACCA TTCAACCTCT ACTCCTTTGA AAATAACTAC CCAGTTGGCA AAATTCAAGA 7320 
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TGTTGCAAAG TATACCACAG ACATCAGAAG TTGGTTGGAG CTATTTGGTT TCCAATTACA 7380 
CAATGTACTA CCTGGATTTC CCAAACCTGA ATTAGAAAAT TTAGAATTAA CTTACGAGCT 7440 
TCTAOGGCTT CAOACAAAAA CTCAAGAGTG GGATCCTGGA AAGACTATCC TGGGCATTCA 7500 
GTGTGAACTC CAGAAACAGC TCAGGAATTT CATTTCCTTG GACCA ACTA C CTATGACTCC 7560 
CCGATACAAT GATGGACGGT GCCTTGAAGG AGGGAAGCAA CCAAGGTTTG CTGCTGTCCC 7620 
TTCTGTTTTT GGGAAAGGTA TAAAATTTGC CATCAAGGAT GGCATAGTAA CAGCTGATAT 7680 
TATAGGAGTA GCCAATGAAG ATAGCAGGOG GCTTGCTGCC ATTCTCAATA ATGCC CATT A 7740 
CCTGGAAAAC CTACATTTTA CCATAGAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 7800 
GTCTCTGGAG GAAGACCTGG TGCTCATCGG TAACACTGGG GGGAGGOGGA TTCTGGAGAA 7860 
TGGTGTCAAT GTCACTGTGT CCCAGATGAC TTCTCTGTTG AATGGGAGGA CTAGACGGTT 7920 
TGCAGATATT CAGCTCCAGC ATGGAGCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 7980 
CGAAGAGGAA AAGAATCACG TGTTGGAGAT TGCCAGACAG CGCGCAGTGG CCCAGGCCTG 8040 
GACTAAGGAA CAAAGAAGGC TGCAAGAGGG GGAAGAGGGG ATTAGGGCAT GGACAGAAGG 8100 
GGAAAAGCAG CAGCTTTTGA GCACTGGGCG GGTACAAGGT TACGATGGGT ATTTTGTTTT 8160 
GTCTGTTGAG CAGTATTTAG AACTTTCTGA CAGTGCCAAT AATATTCACT TTATGAGACA 8220 
GAGCGAAATA GGCAGGAGGT AA CAAAAATA TCTCTGCCTT TGOGTCACCA AAGACTGCCT 8280 
GTTTTTAAAA CATAAAATGG TTTATTGTAT TGGTTTTCTA GATCAGAACT CTGTATATGT 8340 
AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG AOGGAAGATG GTATTTTAAT 8400 
ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGCAGTATT 8460 
CAAAATAACA CAAGTAGAAC TCAAACAGCT AAAAACAGTT TTCAGAAAGC ACCACTTTCA 8520 
ATTTGCCGAG CCATGCATAT GTTCCAATAT CCAGAAAGAA CCCAAGGTTC TCTATCTCTA 8580 
TTGTGAGAAG CAGTTTCATC CTTAACTGTT GGCAGAACTT ACGGGCTATT TGAATAGGTG 8640 
GTGCAATAGT ATCTGAAACT TGGCTTTCGA AAGACTGCCA GCCCTTTGAC GTTTTCCAGA 8700 
TCTGTTATAG GAAACTTAAA AACAGGTGTA AAATGTCTTC AGCCACCATC TCCTAGAGTG 8760 
AGGACCCAAT TGCCCTTCCT TCTTGATTAT TCCTCCTTGC TTGTTAAAGT AAATGCCATA 8820 
TTGTTGTGCT GTGTTTTGGC GTGTGGTGGC TGGGTTCTGT CTACCATGCT TCCCTGTGGG 8880 
TGTGGTAACC AGACTGTATA GCCGCTATTT GCTCGTGTGT ACATGATACC AAAGCAGCTG 8940 
GCCAGCGTGA CCTCTCTCAC ACGACCTGTT TTGACTCAAT TTTTTACTAA AAGTTGTTCA 9000 
GCTGTATTGG TATCATGTAA ACATAGCTTT TATTAACCTG GGTAGGAATT TCTCATTTAT 9060 
ATATAGGATG TGTTTTGGTC ATAGTTTCAC ATTAGTGATT CAGTATCTAT ACACTGACCC 9120 
AATGGTTTTG TGCACATGAA CGGTAATTTA CTTAAAAGTA TGATTCTGGT ACAAAAACAA 9180 
ACAAAGGCTT TAGCAGGCAT ACGTGTCTGG GATCCCGATA CATACATTAA CTACTACTGC 9240 
AGAAATTCAT AAGAGCCAAA ACCTTAAAAA AATAGACCTG GTACTTAAGT GAAAGTACTA 9300 
AAGGGAAGAC CAGACCAAAC ATCACAGCAG TTGCTGCCAC ATTGTTTCAG CCCACTTAGA 9360 
TTTATCTTTC AAATGTACAA TTCTGTATTG AACATCTCCC AGCCATCTTC AGGAAATCGA 9420 
ATCAAGTAAA TCCTTTCCAA CCGAAAACAT TTCAACTAAC TATAGAGAGG CAGACTCATT 9480 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTCGTTCTC CGTACTTACC CATTTATCTT 9540 
TATTTAATCG TCTCTACTGC CTAGGAAAAT AACTATTTTC CAGGACGGGT TAT TTGTT CT 9600 
GCGATCATTT AAAATTTGGA GAAAGGTCAG GATTAGTGTT AATATCAGCT GCAGTTTCTC 9660 
AATCTCTAGG AATCCTGCAG TAAAACAAGC CCCTTGGTGA GCTGGAAGAT TTGTGCCCAG 9720 
TGACAAAGAG ATAGTTTGTA AAATGCTGTG TAATTGTAAG TTACCACAAA TGAAAATACA 9780 
TGACAGCACA ATGTGGCCCG TAGAAAATTC CCCTGAGCCA GCTTCTGCAC TTTCATCACC 9840 
GAATCTGAAC ATTTGCTATG TCTGAAGGCA AATTTATGAT GGAATGTTAG TTTG GATT CT 9900 
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGCCTT GCTTTGCGAT GACAGTTTCT 9960 
TTGAAAATAT GCAAAGTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTG CCTCCTACTA 10020 
CACAAAGGAA AGCAAGGGAA AGGAAATGAC CCTGGCAAAC AGTAGGGAAG GGTGTATTCA 10080 
AACATTTCAT TTTCAAAACC TTCGGGTTAG AATACCACTT ACACATGTAT TCTGAGAGAC 10140 
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTGGAAAC ACACCAGCTT GATATATTGC 10200 
TAATCCATAC TAAAATCATA TTATTGGGTT TTTTCTGAAT CAGGGCTGTA TTAATGGTAC 10260 
AGTATTTATT CAGAATGGAA TTCTAAAATT ACTAACAAAC TTGTTGAAAA TTTGAATACC 10320 
TCCACACCAA CCTAAAAATG GACCTTAAGT TCCTAGAACC TCTGATGTTC TTTTAAATTA 10380 
ATGGAAAAAT AATTTGTGAA CTGTATATAG AGAGTGCATT CATAAATGTG ATTATGTATT 10440 
TTATCACAAA TCCAAAATGT CAATATTAGA GTCTATTTTG CTTATATTTT AAGCAATTAT 10500 
ACGTTTTTGC AATTCATTGA TGATGTATCA TTTTCAAACT GCTTTAAATA TCCATTAGAA 10560 
ACAAATATTT GAAGCTTTTA CTTAATAGTG ATTACCTTGA ACTGTGCATT TCTAGTTTGT 10620 
AATACGTATT TGGTTGGTTC GTGCCTTTAG TTTGTTAAAG TTACATTTGT ATTATATTCA 10680 
GGAAATGCAC TTTTTATTAC TTACAGCTGT GGTTTTAATA CTGCCTTGAA CTATTATTAT 10740 
TCTTTTTACA ACTCCTAAAG CTTGAGGGAG GAAAGAAAAA AAAAACAAAA CXACTAATCA 10800 
GTAGTAAATC GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA GAAGATGGAG ATATTGAGTA 10860 
TATCACTTCC TATTCAGCTG AATAGAAAGA ATGCCTTCAT TGACTTGCAG TTCTGCAGTT 10920 
TAAATTATTG AAAGAACAAT TCGTTTGCAT TTCCTGATGA AAGTAAAAGC ATTTTTCAGA 10980 
GAAACATATG AATTTCTCAT ACCCAGCAGA CAGATGGCTG ACACTGCACA GCCACACACC 11040 
ATTCGAGTAA GTTAAAGTGA GAGCATAGTA GTTGGACTCT CCTATGAAGA ACATTCTGGG 11100 
CTGGAGGCAG GGAATACTCC ATGGTTGTTT CTTTTTCCTA CTTAAGCCCA TTTTGTTTGT 11160 
GCTTTTCTGT TTTGTTTTGT TTTCACTCTT GCACTACAGT CTAGAGATCC AAATGAACTG 11220 
AAAAGTTCAA AGTTTAACAC ATTTAAATAT GTTTACTTTT AGTTGTCATT CTAATCGTTA 11280 
TTGATTAGAA GCATGACTCC TGAAGGAAAG GGAAATAAAT CTCAATTCAT ACTAACTTGC 11340 
AACAAAACAC TTTTACCATA TAAATAAGTA TATGATTTAT TTTTAACCCA AAAAATGXAT 11400 
AAAATAAGTG TGTCCTTTAC TGTCAATTTA TCGAGAAGAT CTATAATATA TAGACTACAT 11460 
ATATATAATA TATACAACAT AGCCAAATGT ATGAAAACTT GACAATGTAT AATTTGGAAT 11520 
TCACATGCTA CCTATGTAGA CAGGTATGAA ATTAAGTTAT AATTTTCATG AGACATTTTC 11580 
ATCACTGTTG AGACAGTTTC AAGGCATTCC ATCATGTTAT TTTGACTCTT TTTCTTTTTT 11640 
TTTTCTTTAA AAATATATTT TTAACTAGAC CAGGCCCCAC TATAATATCA CTTAAGAGAG 11700 
TCAGGGCAAA GTTTTTGCAT TTATGAAGAT GTGTTCATGT AAGGGTGATT GTAATGGAGT 11760 
TCATTGGTAA TAGAAGCAAA AGTACAGTAA CGAAGTATTG AAAAGAAAAT TTTGGAGACA 11820 
TTGGAGCATA TTATATATAG CTTGTGGAAA GACATAAGGC TACAGATGGA ATGGAACATT 11880 
CCTGTTTTCT TGAAGAAATT CACATACACA TAGCTGACCT GACTAGTACT TCAGCTCTTC 11940 
CACAGCCTTC TATAAAGGTT CTTTCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGCG CAAAAAACAA AAAAACAAAA AAAAGCAAAG TAA AATTT AA 12060 
AAATACAGAA AACAAACAAC AAAAAAGAAT TCAACCATAA ATAGTGACTA TTATTTTCAG 12120 
TGTGTCCTTC ATGTGAAAGC TATTAAGGAC CAAATATACT ACTGTTCAXA AGAAGAAATT 12180 
ACTTTCTAAA CAGTAACTGA AAATACTTAG AGTTAAACTT GCTGTGGATT TT GTCT TGGC 12240 
AGTTGTCATC TTACATTATT TGTCAAAGGA AATGTGTTTG GCAGTTAAAA ATCTTTCCTT 12300 
AGATTTAGTG GTGGACTTTA ACCTCTTAAA TAAATGTTAG TATATCAGAT TGTGTCCTTG 12360 
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AAAAATATTT TACTTGTATG AATCATGACA ACGTCTAAAT CTTTACTATT CTTCTGGCAA 12420 
AAGCATCAGT AAGAAAGAAG GCGAAAAAGA GAAGTATAGC CTTTATGTCA GAAAAACATT 12480 
CTTTTTAGCT GCTTACTTTC TCATGAAAAG TAAAGATGTT TACAGTGTAT GCCAAGTTTT 12540 
CAGTTTCTGT ATAACAACAG GTAGAGGTTC TAATCATATT GAAAATTGTG TTATAATGGT 12600 
CTGAGCCATG TTGCTAGGAA ACAATAGGTT CCAATTTTGT ATTCCTGCTC TCCTGTGCTG 12660 
AAAAGTGACT GGATACTGTA CAGGTTCATG TTCTCTGGCT GCAGTTAAAT GGTCTTTTGC 12720 
ATTTTGCTCT GGCTTTCAGG OCAGAAGCAT GCATTTTTCT ACAAGAGCAT CACAACAACA 12780 
TGCTGTAAAT ATTTAAAGTT AAACATTATG TGTTGATATT TGAAAGAAAA GTACTTTGAA 12840 
TATTTCATTT TTAAAAAATA AAATTGCCAA TGAAAAAAAA 



A210 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



odz (odd Oa/ten-m, 
Hs. 23796 
NM_0142S3 
NP_055068 
none found 
318-340 

plasma membrane 



Drosophila) homolog 1 



1 11 21 31 41 51 

I I i I I I 

MEQTDCKPYQ PLPKVKHEMD LAYTSSSDES EDGRKPRQSY NSRETLHEYN QELRMNYNSQ 60 

SRKRKEVEKS TQBMEFCETS HTLCSGYQTD MHSVSRHGYQ LEMGSDVDTE TEGAASPDHA 120 

LRMWIRGMKS EHSSCLSSRA NSALSLTDTD HBRKSDGENG FKPSPVCCDM EAQAGSTQDV 180 

QSSPHNQPTP RPLPPPPPPP HACTCARKPP PAADSLQRRS MTTRSQPSPA APAPPTSTQD 240 

SVHLHNSWVL NSNIPLETRH SLFKHGSGSS AIFSAASQNY PLTSNTVYSP PPRPLPRSTP 300 

SRPAFTPNKP YRCCNWKCTA LSATAITVTL ALLLAYVIAV HLFGLTWQLQ PVEGBLYANG 360 

VSKGNRGTES MDTTYSPIGG KVSDKSEKKV FQKGRAIDTG EVDIGAQVKQ TIPPGLFWRF 420 

QITIHHPIYL KFNISLAKDS LLGIYGRRNI PPTHTQFDFV KLMDGKQLVK QDSKGSDDTQ 480 

HSPRNLILTS LQETGPIEYM DQGPWYLAFY NDGKKMEQVF VLTTAIE1MD DCSTNCNGNG 540 

ECISGHCHCP PGFLGPOCAR DSCPVLCGGN GBYEKGHCVC RHGWKGPECD VPEEQCIDPT 600 

CFGHGTCIMG VCICVPGYKG EICEEEDCLD PMCSNHGICV KGECHCSTGW GGVNCETPLP 660 

VCQEQCSGHG TFLLDAGVCS CDPKWTGSDC STELCTMECG SHGVCSRGIC QCEEGWVGPT 720 

CEERSCHSHC TEHGQCKDGK CECSPGWEGD HCTIAHYLDA VRDGCPGLCF GNGRCTLDQN 780 

GWHCVCQVGW SGTGCKWME MLCGDNLDND GDGLTDCVDP DCCQQSNCYI SPLCQGSPDP 840 

LDLIQQSQTL FSQHTSRLFY DRIKFLIGKD STHVIPPEVS FDSRRACVIR GQWAIDGTP 900 

LVGVNVSFLH HSDYGFTISR QDGSFDLVAI GGISVILIFD RSPFLPEKRT LWLPWNQPIV 960 

VEKVTMQRW SDPPSCDISN PISPNPIVLP SPLTSPGGSC PERGTIVPEL QWQEEIPIP 1020 

SSFVRLSYLS SRTPGYKTLL RILLTHSTIP VGMIKVHLTV AVEGRLTQKW FPAAINLVYT 1080 

FAWNKTDIYG QKVWGLAEAL VSVGYEYETC PDFILWEQRT WLQGFEMDA SNLGDWSLNK 1140 

HHILNPQSGI IHKGNGENMF ISQQPPVIST IMGNGHQRSV ACTNCNGPAH NNKLFAPVAL 1200 

ASGPDGSVYV GDFNFVRRIF PSGNSVSILE LSTSPAHKYY LAMDPVSESL YLSDTNTRKV 1260 

YKLKSLVETK DLSKNFEWA GTGDQCLPFD QSHGGDGGRA SEASLNSPRG ITVDRHGFIY 1320 

FVDGTMIRKI DENAVITTVI GSNGLTSTQP LSCDSGMDIT QVRLEWPTDL AVNPMDKSLY 1380 

VLDNNIVLQI SENRRVRIIA GRPIHCQVPG IDHFLVSKVA 1HSTLBSARA ISVSHSGLLF 1440 

IAETDERKVN RIQQVTTNGE IYIIAGAPTD CDCKIDPNCD CFSGDGGYAK DAKMKAPSSL 1500 

AVSPDGTLYV ADLGNVRIRT ISRNQAHLND MNIYEIASPA DQELYQFTVN GTHLHTLNLI 1560 

TRDYVYNFTY NSEGDLGAIT SSNGNSVHIR RDAGGMPLWL WPGGQVYWL TISSNGVLKR 1620 

VSAQGYNPAL MTYPGNTGLL ATKSNENGWT TVYEYDPEGH LTNATFPTGE VSSFHSDLEK 1680 

LTKVELDTSN RENVLMSTNL TATSTIYILK QENTQSTYRV NPDGSLRVTF ASGMEIGLSS 1740 

EPHILAGAVN PTLGKCNISL PGEHNANLIE WRQRKEQNKG NVSAFERRLR AHNRNLLSID 1800 

FDHITRTGKI YDDHRKFTLR ILYDQTGRPX LWSPVSRYNE VNITYSPSGL VTFIQRGTMN 1860 

EKMEYDQSGK IISRTWADGK IWSYTYLEKS VMLLLHSQRR YIFEYDQSDC LLSVTMPSMV 1920 

RHSLQTMLSV GYYRNIYTPP DSST9FIQDY SRDGRLLQTL HLGTGRRVLY KYTKQARLSE 1980 

VLYDTTQVTL TYEESSGVIK TIHLMHDGFI CTIRYRQTGP LIGRQIFRFS EEGLVNARFD 2040 

YSYNNFRVTS MQAVINETPL PIDLYRYVDV SGRTEQFGKF SVINYDLNQV ITTTVMKHTK 2100 

IFSANGQVIE VQYEZLKAIA YWMTIQYDNV GRHGNMCIRV GVDANITRYF YEYDADGQLQ 2160 

TVSVNDKTQW RYSYDLNGDI NLLSHGKSAR LTPLRYDLRD RITRLGEIQY KMDEDGFLRQ 2220 

RGNDIFEYNS NGLLQKAYNK ASGWTVQYYY DGLGRRVASK SSLGQHLQFF VDATANPIRV 2280 

THLYNHTSSE ITSLYYDLQG HLIAMELSSG EEYYVACDNT GTPLAVPSSR GQVIKEILYT 2340 

PYGDIYHDTY PDFQVI IGFH GGLYDFLTKL VHLGQRDYDV VAGRWTTAYH HIWKQLNLLP 2400 

KPFNLYSFEN NYPVGKIQDV AKYTTDIRSW LELPGFQLBN VLPGPPKPEL ENLELTYELL 2460 

RLQTKTQEWD PGKTILGIQC ELQKQLRNFI SLDQLPMTPR YNDGRCLEGG KQPRFAAVPS 2520 

VFGKGIKFAI KDGIVTADII GVANEDSRRL AAILNNAHYL BNLHFTIBGR DTHYFIKLGS 2580 

LBEDLVLIGN TGGRRILENG VNVTVSQMTS LLNGRTRRFA DIQLQHGALC FNIRYGTTVE 2640 

EEKNHVLEIA RQRAVAQAWT KEQRRLQEGE EGIRAWTEGE KQQLLSTGRV QGYDGYFVLS 2700 
VEQYLELSDS ANNIHPMRQS EIGRR 
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I I I I I I 

CTCAGCCTTC CCGGTTCGGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT TCTTTCCTGA 60 

TAGGATCGGT TGGGAAAGAC CGCAGCCTGT C TIUTUTCI TT CCCTTCGACC AAGGTGTCTG 120 

TTGCTCCGTA AATAAAACGT CCCACTGCCT TCTGAGAGCG CTATAAAGGC AGCGGAAGGG 180 

TAGTCCGCGG GGCATTCCGG GCGGGGCGCG AGCAGAGACA GGTCATGGCA GCGCCAGGCG 240 

GCAGGTCGGA GCCGCCGCAG CTCCCCGAGT ACAGCTGCAG CTACATGGTG TCGCGGCCGG 300 

TCTACAGCGA GCTCGCTTTC CAGCAACAGC ACGAGCGGCG CCTGCAGGAG CGCAAGACGC 360 
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TGCGGGAGAG CCTGGCCAAG TGCTGCAGTT GTTCAAGAAA GAGAGCCTTT GGTGTGCTAA 420 

AGACTCTTGT GCCCATCTTG GAGTGGCTCC CCAAATACCG AGTCAAGGAA TGGCTGCTTA 480 

GTGACGTCAT TTOGGGAGTT AQTACTGGGC TAGTGGCCAC QCTGCAAGGG ATGGCATATG 540 

CCCTACTAGC TGCAGTTCCT GTOGGATATG GTCTCTACTC TGCTTTTTTC CCTATCCTGA 600 

CATACTTTAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTTCCA GTGGTGAGTT 660 

TAATGGTGGG ATCTGTTGTT CTGAGCATGG CCCCCGACGA ACACTTTCTC GTATCCAGCA 720 

GCAATGGAAC TGTATTAAAT ACTACTATGA TAGACACTGC AGCTAGAGAT ACAGCTAGAG 780 

TCCTGATTGC CAGTGCCCTG ACTCTGCTGG TTGGAATTAT ACAGTTGATA TTTGGTGGCT 840 

TGCAGATTGG ATTCATAGTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 900 

CTGCTGCCTT CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT TCAACCAAAA 960 

ACTACAATGG AGTTCTCTCT ATTATCTATA CGCTGGTTGA GATTTTTCAA AATATTGGTG 1020 

ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTCGTCTGT ATGGCAGTTA 1080 

AGGAATTAAA TGATCGGTTT AGACACAAAA TCCCAGTCCC TATTCCTATA GAAGTAATTG 1140 

TGACGATAAT TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1200 

GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT GTGAGCTTGT 1260 

TCTCGGAGAT GCTGGCTGCA TCATTTTCCA TCGCTCTGGT GGCTTATGCT ATTGCAGTGT 1320 

CAGTAGGAAA AGTATATGCC ACCAAGTATG ATTACACCAT CGATGGGAAC CAGGAATTCA 1380 

TTGCCTTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GCCACCACTG 1440 

CTCTTTCCCG CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1500 

TCTCTGCTGC GATTGTGATG ATCGCCATTC TTGCCCTGGG GAAGCTTCTG GAACCCTTGC 1560 

AGAAGTCGGT CTTGGCAGCT GTTGTAATTG CCAACCTGAA AGGGATGTTT ATGCAGCTGT 1620 

GTGACATTCC TCGTCTGTGG AGACAGAATA AGATTGATGC TGTTATCTGG GTGTTTACGT 1680 

GTATAGTGTC CATCATTCTG GGGCTGGATC TCGGTTTACT AGCTGGCCTT ATATTTGGAC 1740 

TGTTGACTGT GGTCCTGAGA GTTCAGTTTC CTTCTTGGAA TGGCCTTGGA AGCATCCCTA 1800 

GCACAGATAT CTACAAAAGT ACCAAGAATT ACAAAAACAT TGAAGAACCT CAAGGAGTGA 1860 

AGATTCTTAG ATTTTCCAGT CCTATTTTCT ATGGCAATGT CGATGGTTTT AAAAAATGTA 1920 

TCAAGTCCAC AGTTGGATTT GATGCCATTA GAGTATATAA TAAGAGGCTG AAAGCGCTGA 1980 

GGAAAATACA GAAACTAATA AAAAGTGGAC AATTAAGAGC AACAAAGAAT GGCATCATAA 2040 

GTGATGCTGT TTCAACAAAT AATGCTTTTG AGCCTGATGA GGATATTGAA GATCTGGAGG 2100 

AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT GAGCTTCCAG 2160 

TCAAAGTGAA CGTTCCCAAA GTGCCAATCC ATAGCCTTGT GCTTGACTGT GGAGCTATAT 2220 

CTTTCCTGGA CGTTGTTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAGAA TTCCAAAGAA 2280 

TTGATGTGAA TGTGTATTTT GCATCACTTC AAGATTATGT GATAGAAAAG ' CTGGAGCAAT 2340 

GCGGGTTCTT TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGACGGTC CATGATGCTA 2400 

TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT TTAGAAACGA 2460 

TCACTCTCAT TCAGGATTGT AAAGATACCC TTGAATTAAT AGAAACAGAG CTGACGGAAG 2520 

AAGAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTGAAAG TGGGTTCGGG 2580 

AGGTCTCTAT GAGCAAGGAA TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA 2640 

GACTCAAAAC ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2700 

TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TCGGCAAAAT GGCTAGAATT 2760 

ATTCAGACGA TTTGGCAGCG TCCAGGGTAA GCTGGTGTTA TAATACGCTG CTGATCTACA 2820 

TCACAGATTT GCTAATAATG TTCACGTGGG CCCTGGCATA TCTCTGTTCA GTTAGAGTGA 2880 

GTG CTGACC C AACAGCCTCT GTGGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA 2940 

TCAGTTTTTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3000 

TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT GAAAATACTT 3060 

TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA GTATCTAATA ATGAAGTGTT 3120 

ATTACATATA GCCGGAATTG AGGATCTCTT TGATCCTGGA AATGGTTTAC CTAAAAGCTA 3180 

CAGAACCAGG CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240 

TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300 

AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTCTATAGA 3360 

AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG TAAAATCTGA CATAAAAACA 3420 

GTGCTATTCT GAGTGAAAAT TTTTTTGATG TGCTTACATA ACCATGGTGA TTAAAATGAG 3480 

TTTATATTTT TTCTCAAAAA TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACTC 3540 

TGACCACTTA AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3600 

ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 

ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT ATCTTTTCTT TTTATTTATT 3720 

TTTTTCCTAA GTGCCAACAA TTTTCTAGAT ATTATATACA ACACAGGCTT TGATCTTGGG 3780 

GACTTTTCCC ATATATTTCA CACTGGAGTG AATGAAGTTG TACTTCATTT CTAGAGAAAA 3840 

GTTATACCCA GGTCCCCAAT TGAGAATGTC TTGCTTGATT GAAAACGACA TCATCCCTTG 3900 

GTATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTGTG CACACTCAAG 3960 

TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 4020 

CCAGAAACAA TGTGTCTATT TCTGAAAGAA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 

CCTGAATTCT GGTTGTAAAT CTGGTTAGAG CATAACTAGG ATTATAATGC TGCCTCATTT 4140 

TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4200 

AGGCGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAC 4260 

GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA ACCCCGTCTC TACTAAAAAT 4320 

AGAAAAAAAG AAATTAGCCT AGCGTGGTGG CTGGCGGGCG CCTGTAGTCC CAGCTATTTG 4380 

GGAGGCTAAG GCAGGAGAAT GGCGTGAACC OGGGAGGCGG AGCTTGCAGT GAGCOGAGGT 4440 

CGTGCCACTG CACTCCAGCC TGGGOGACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4500 

AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC TTTGTTCAAT 4560 

AAAGGAAATA TGCACTGCTC ACTTTTTTGA AGGAAATGCC AAAGTTAOGT TTTACAACAA 4620 

GGCTAGAGTT TGTAAATTCT GGGTTCATTT GTGATGACAT AAGTCAGCAA ACTGCGGGAA 4680 

TACTGTCTCT TCTATGTATT TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT 4740 

GAAAATTTCA CTTGAAATTA AAGCTGCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 4800 

TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA TTTGAAAATA 4860 

TTTGCACACA TTTAAAAATA AATGTAAAGT TGTCTTTTAA ACTACTCGGA TGTGTCCTTT 4920 
CTGAACAAAA 

A212 Protein sequence: 
Gene name : 
Unigene number: 
Probeset Accession th 
Protein Accession #: 
Signal sequence: 

932 



Solute carrier family 26, member 4 

He .159275 

AF030880 

043511 

none found 



WO 03/042661 



PCT/US02/36810 



Transmembrane domains: 
Cellular Localization-. 



81-103, 109-131. 136-158. 185-207, 221-243, 245-267, 270-291, 295-317, 
347-369, 386-408, 420-442, 448-470, 486-508 
plasma membrane 



1 
I 

MAAPGGRSEP 
AFGVLKTLVP 
FFPILTYFIF 
RDTARVLIAS 
NVSTKNYNGV 
P1EVIVTIIA 
YAIAVSVGKV 
QVAGIISAAI 
IWVFTCIVSI 
BPQGVKILRF 
KNGIISDAVS 
DCGAXSFLDV 
TVHDAILYLQ 
QDEAMHTLAS 



11 

I 

PQWBYSCSY 
ILEWLPKYRV 
GTSRHISVGP 
ALTLLVGIIQ 
LSIIYTLVEI 
TAISYGANLE 
YATKYDYTID 
VMIAILALGK 
ILGLDLGLLA 
SSPIFYGNVD 
TNNAFEPDED 
VGVRSLRV1V 
NQVKSQEGQG 



21 
I 

MVSRPVYSEL 
KEHLLSDVIS 
FPWSLMVGS 
LIFGGLQIGF 
FQNIGDTNLA 
KNYNAGIVKS 
GNQEFIAFGI 
LLEPLQKSVL 
GLIFGUiTW 
GFKKCIKSTV 
IEDLEELDIP 
KEFQRIDVNV 
SILETITLIQ 



31 

1 

AFQQQHBRRIi 
GVSTGLVATL 
WLSMAPDSH 
IVRYLADPLV 
DFTAGLLTIV 
IPRGFLPPEI* 
SNIFSGFFSC 
AAWIANLKG 
LRVQFPSWNG 
GFDAIRVYNK 
TKEIEIGVDW 
YFASLQDYVI 
DCKDTLELIE 



41 
! 

QERKTLRESL 
QGMAYALLAA 
FLVSSSNGTV 
GGFTTAAAFQ 
VCMAVKELND 
PPVSLFSEML 
FVATTALSRT 
KFMQLCDIPR 
LGSIPSTDIY 
RLKALRKIQK 
NSELPVKVNV 
EKLEQCGFFD 
TELTEEELDV 



51 
I 

AKCCSCSRKR 
VPVGYGLYSA 
LNTTMIDTAA 
VLVSQLKIVL 
RFRHKIPVPI 
AASFSIAWA 
AVQESTGGKT 
LWRQNKIDAV 
KSTKNYKNIE 
LIKSGQLRAT 
PKVPIHSLVL 
DNIRKDTFFL 
QDEAMRTLAS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A213 DMA SEQUENCE; 
Gene name: 
Unigene number: 
ProbeBet Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



ESTs, Highly similar to calcium-activated potassium channel rSK2 [R.norvegicusl 

Hs. 98280 

AA418000 

NM_021614 

458-2197 (underlined sequences correspond to start and Btop codons) 



1 11 21 31 

III! 
CGGCGGCAGC AGCCCATGCC TCCGGTGCAA CAGCTGCGCC 
GGGGGCGGGA GATAACCTGT CCCTGCTGCT CCGCACCTCC 
GACCCGCACC TCCTCGCCGC TGTCGGGCTC GTCCTGCTGC 
CCGGGGCAGC CAGCTCAATG TGAGCGAGCT GACGCCGTCC 
GCAGCAGTAC GCGCAGCAGT CCGCGCAGCA GTCGGCGTCC 
CCACAGCCTG CAGCCCGCCG OCAGCCCCAC GGGCAGCCTC 
CCCGCTCTCG CACCACCACC ACCACCCGCA CCCGGCGCAC 
GGCGCGCCGC GAGAGCAACC CCTTCACCGA AATAGCCATG 
GGGCGTCATG CGGCCGCTCA GCAACTTGAG CGCGTCCCGC 
CTCAGAGGCG CAGCCCCTGC AGCCCCCCGC GTCTGTCGGA 
CCCGTCTGCA GACGCTGCCG CCGCCGCCGC TGTTTCGTCC 
GTCTAAGCCC GAGCACAACA ACTCCAACAA CCTGGCGCTC 
CAGCACTGGA GGAGGCGGCG GCGGTGGAGG GAGCGGGCAC 
GTCCAGCAAA AAGAAAAACC AGAACATCGG CTACAAGCTG 
CGAAAAGCGC AAGCGGCTCA GCGACTACGC GCTCATCTTC 
CATGGTCATC GAGACCGAGC TGTCGTGGGG CGCCTACGAC 
AGCTCTGAAA TGCCTTATCA GTCTCTCCAC GATCATCCTG 
CCACGCCAGG GAAATACAGT TGTTCATGGT GGACAATGGA 
CATGACTTAT GAGCGTATTT TCTTCATCTG CTTGGAAATA 
CATACCTGGG AATTATACAT TCACATGGAC GGCCCGGCTT 
CACAACCACC GCTGATGTGG ATATTATTTT ATCTATACCA 
GATTGCCAGA GTCATGCTTT TACATAGCAA ACTTTTCACT 
TGGAGCACTT AATAAGATAA ACTTCAATAC ACGTTTTGTT 
ATGCCCAGGA ACTGTACTCT TGGTTTTTAG TATCTCATTA 
TGTCCGAGCT TGTGAAAGGT ACCATGATCA ACAGGATGTT 
GATGTGGTTG ATATCAATAA CTTTTCTCTC CATTGGTTAT 
ATACTGTGGA AAAGGAGTCT GCTTACTTAC TGGAATTATG 
GGTGGTAGCT GTAGTGGCAA GGAAGCTAGA ACTTACCAAA 
TTTCATGATG GATACTCAGC TGACTAAAAG AGTAAAAAAT 
GGAAACATGG CTAATTTACA AAAATACAAA GCTAGTGAAA 
AAGAAAACAT CAACGAAAAT TCCTGCAAGC TATTCATCAA 
GCAGAGGAAA CTGAATGACC AAGCAAACAC TTTGGTGGAC 
CATGTATGAT ATGATTTCTG ACTTAAACGA AAGGAGTGAA 
TACCCTGGAA ACAAAACTAG AGACTTTGAT TGGTAGCATC 
AAGCCAGACC ATCAGGCAGC AGCAGAGAGA TTTCATTGAG 
CAAGCACGTC ACTTACAATG CTGAGCGGTC CCGGTCCTCG 
TTCCA CAGCA CCACCAACTT CATCAGAGAG TAGCTAQAAQ 
TAAGACTTTT TGCCATCATA TGGTCAATAT TTTAGCTTTT 
CTAATCAGCG TTATCCGGGT TCTGATGTCA GAATCCTGGG 
GGCCAAAATG AGTGAAAACT CTTT TTTT T T CTTTCAGATG 
TTGCTATATA GATTGTTCCT CCTGTAATTT CACTAACTTT 
AACTTTACTA CTACATTATA TGATATATAA TAAAAAAAGT 



41 51 
I I 

TCCTCCGGTG CCCCGGCGGC 60 

TCGCCCGGCG GCGCCTTCCG 120 

TGCTGCTGCT GCTCGTCGCG 180 

AGCCATGCCA GTGCGCTCCG 240 

GCCTCCCAGT ACCACCAGTG 300 

GGCAGTCTGG GCTCCGCGCC 360 

CACCAGCACC ACCAGCCCCA 420 

AGCAGCTGCA GGTACAACGG 480 

CGGAACCTCC ACGAGATGGA 540 

GGAGGTGGCG GCGCGTCCTC 600 

TCAGCeCCCG AGATCGTGGT 660 

TATGGAACCG GCGGCGGAGG 720 

GGCAGCAGCA GTGGCACCAA 780 

GGCCACCGGC GCGCCCTGTT 840 

GGCATGTTCG GCATCGTGGT 900 

AAGGCGTCGC TGTATTCCTT 960 

CTCGGTCTGA TCATCGTGTA 1020 

GCAGATGACT GGAGAATAGC 1080 

CTGGTGTGTG CTATTCATCC 1140 

GCCTTCTCCT ATGCCCCATC 1200 

ATGTTCTTAA GACTCTATCT 1260 

GATGCCTCCT CTAOAAGCAT 1320 

ATGAAGACTT TAATGACTAT 1380 

TGGATAATTG CCGCATGGAC 1440 

ACTAGCAACT TCCTTGGAGC 1500 

GGTGACATGG TACCTAACAC 1560 

GGTGCTGGTT GCACAGCCCT 1620 

GCAGAAAAAC ACGTGCACAA 1680 

GCAGCTGCCA ATGTACTCAG 1740 

AAGATAGATC ATGCAAAAGT 1800 

TTAAGAAGTG TAAAAATGGA 1860 

TTGGCAAAGA CCCAGAACAT 1920 

GACTTCGAGA AGAGGATTGT 1980 

CACGCCCTCC CTGGGCTCAT 2040 

GCTCAGATGG AGAGCTACGA 2100 

TCCAGGAGGC GGCGGTCCTC 2160 

AGAATAAGTT AACCACAAAA 2220 

ATTGTAAAGC CCCTATGGTT 2280 

AACCTGAACA CTAAGTTTTA 2340 

CACAGGGAAT GCACCTATTA 2400 

TTATTCATGC ACTTCAAACA 2460 
TAATTTCGGA 



A214 Protein sequence: 
Gene name i 
Unigene number: 
Probeset Accession §: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 



ESTs, Highly similar to calcium- activated potassium channel rSK2 [R.norvegicus 
Hs. 98280 
AA4 18000 
NP_0 67627 
none found 

135-1S7, 168-190, 208-230, 254-276, 306-328, 342-364, 373-394 

933 



WO 03/042661 



PCT/US02/36810 



Calmodulin binding domain: 
Cellular Localization: 



412-488 

plasma membrane 



1 

I 

MSSCRYNGGV 
SSAPEIWSK 
LGHRRALFEK 
LLGLI IVYHA 
LAFSYAPSTT 
VMKTLMTICP 
YGDMVPNTYC 
NAAANVLRET 
DLAKTQNIMY 
EAQMESYDKH 



11 

I 

MRPLSNLSAS 
PEHNNSNNLA 
RKRLSDYALI 
REIQLFMVDN 
TADVDIILSI 
GTVLLVPSIS 
GKGVCLLTGI 
WLIYKNTKLV 
DMISDLNERS 
VTYNAERSRS 



21 



LYGTGGGGST 
PGMPGIWMV 
GADDWRIAWT 
PMFLRLYLIA 
LWIIAAWTVR 
MGAGCTALW 
KKIDKAKVRK 
EDPEKRIVTL 



31 
I 

AQPLQPPASV 
GGGGGGGGSG 
IETELSWGAY 
YERIFFICLE 
RVMLLHSKLF 
ACERYHDQQD 
AWARKLELT 
HQRKFLQAIK 
ETKLETLIGS 
APPTSSESS 



41 
I 

GGGGGASSPS 
HGSSSGTKSS 
DKASLYSLAL 
ILVCAIHPIP 
TDASSRSIGA 
VTSNFLGAMW 
KAEKHVHNFM 
QLRSVKMEQR 
IHALPGLISQ 



51 
I 

ADAAAAAAVS 
KKKNQNIGYK 
KCLISLSTII 
GNYTFTWTAR 
LNKINFNTRF 
LISITFLSIG 
MDTQLTKRVK 
KLNDQANTLV 
TIRQQQRDFI 



60 
120 
180 
240 
300 
360 
420 
480 
540 



A215 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeaet Accession #: 
Nucleic Acid Accession #: 
Coding sequence: 



CGI- 86 protein 
He. 109201 
AW161450 
NM_016029 

228-1097 (underlined sequences correspond to start and stop codons) 



1 
1 

CTGCGATCCC 
GGGCGTGCGC 
TGCTCCTGCT 
TATGGGCCGA 
TGACTGGAGC 
TTTCTCTTGT 
TAGAGAATGG 
CTGGTTCCCA 
TGGTCAACAA 
ACAGAAAGCT 
CTCACATGAT 
TATCTGTACC 
ATGGCCTTCG 
GACCTGTGCA 
GCAATAATGG 
TCAGCATGGC 
CATATTTGTG 
AAAGGATTGA 
AGACAAAACA 
AAACATGAAA 
ACTTTTTAAT 
AGATTGCCAT 



11 

1 

GCAGGGCAGC 
GGCCGCAATG 
CTTGGTGCAG 
GTGGCAGGGA 
CTCGAGTGGA 
GCTGTCAGCC 
CAATTTAAAA 
TGAAGCGGCT 
TGGTGGAATG 
AATAGAGCTT 
CGAGAGGAAG 
TCTTTCCATT 
AACAGAACTT 
ATCAAATATT 
AGACCAGTCC 
CAATGATTTG 
GCAATACATG 
GAACTTTAAG 
TGACTGAAAA 
ACAGCAATCT 
AGATATGACT 
GAATCTTGCA 



21 
I 

GACGCGACTC 
AACTGGGAGC 
CTGCTGCGCT 
CGACGCCCAG 
ATTGGTGAGG 
AGAAGAGTGC 
GAAAAAGATA 
ACCAAAGCTG 
TCCCAGCGTT 
AACTACTTAG 
CAAGGAAAGA 
GGATACTGTG 
GCCACATACC 
GTGGAGAATT 
CACAAGATGA 
AAAGAAGTTT 
CCAACCTGGG 
AGTGGTGTGG 
GAGCACCTGT 
TCTTATGCTT 
TTGCTTCCAA 
AA 




AGTTTCTAAC 
AGAAGTCACA 
TTGTGTGCGG 
ACAACCTTTC 
AACCAACAAG 
TTCTTATTTT 
CCACTGGAGG 
AAAGACTAAT 
AATAAAAAAT 



51 
I 

CCCCCGAGCT 
CTGTGCGCGC 
CTGACGCTAC 
GTGGTGTGGG 
AAACTAGGAG 
AGAAGATGCC 
CTGACCGACA 
ATCGACATTC 
TTGGATGTCT 
TGTGTTCTGC 
CTGGGTATCA 
GGTTTTTTTA 
ATTTGCCCAG 
AAGACTATAG 
CTGATGTTAA 
TTGTTAGTAA 
ATGGGGAAGA 
AAAATCT7TA 
GAGAAATGGA 
TTGTGATTTT 
AAATAATAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



A216 Protein sequence: 
Gene name: 
Unigene number: 
, Probeset Accession # i 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



CGI- 86 protein 
Hs. 109201 
AW161450 
NP 057113 
1-26 

183-206, 221-243 
plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MNWELLLHLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MWHVTGASS 60 
GIGBELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGML KEKDILVLPL DLTDTGSHEA 120 
ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER 180 
KGGKTVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 240 
IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 300 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 



A217 DNA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession ft? 
Coding sequence: 



Homo sapiens raRNA; cDNA DKFZp56401763 (from clone DKFZpS 6401763) 

Hs. 27373 

F13036 

AC012478 

1-1896 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 



934 



WO 03/042661 



PCT/US02/36810 



ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

TTTACCACCC TCCTCATCGC CTGCCTGCTQ CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAO AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCOGO CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA OGAAOGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCOG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

AGTAOCTTGG GCTTGGATGT TTTCTGT GG T GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260 

ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



A218 Protein sequence: 
Gene name: 
Uhigene number: 
Probeset Accession ft: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



Homo sapiens mRNA; cDNA DKPZp56401763 (from clone DKFZp56401763) 

Hs. 27373 

F13036 

pgenesh predicted 
1-27 

94-115. 448-469 
not determined 



1 11 21 

1 I I 

MRAVPLPAPL LPLLLLALLA APAARASRAE SVSAPWPBPE RESRPPPGPG PGNTTRFGSG 60 

AAGGSGSSSS NSSGDALVTR ISILLRCLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120 

KKTRKYDI IT TPAERVEMAP LNBEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180 

VPPPPILDID LPARCSGRPD GGIRPGKTCF PAWWHPVBSW SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC SDCHHQARPH VTTMELLLPP FGHPPKVPPT STPHGPRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 420 

THPVRLARSD ARGQASLTGR RVFRRPRQSL EGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 

ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRGQKMVLM SEEGPPSLTG 600 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 

A219 DNA SEQUENCE 

Gene name: select in E (endothelial adhesion molecule 1) 

Unigene number: Hs. 89546 

Probeset Accession ft: M24736 

Nucleic Acid Accession ft: NM_0004S0 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 

GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 

CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 

TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 

TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 

GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 

CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGOGAGGCTA CATGAATTGT 1140 

CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 



31 
I 



935 



WO 03/042661 



PCT/US02/36810 



GAGAAGCCCA 
GTGAGGTGTG 
TGTGAGGAGG 
TGGACAGAAG 
AAGATCAACA 
CCTGAAGGAT 
TCTGGCCTGC 
CTTTCTGCTG 
TGCTTACGGA 
GGAAGCTACC 



CATGTGAAGC 
CTCATTCCCC 
GATTTGAATT 
AOGTTCCTTC 
TGAGCTGCAG 
GGACGCTCAA 
TACCTACCTG 
CTGGACTCTC 
AAGCAAAGAA 
AAAAGCCTTC 



TGTGAGATGC 
TATTGGAOAA 
ATATGGATCA 
CTGCCAAGTG 
TGGGGAGCCC 
TGGCTCTGCA 
TGAAGCTCCC 
CCTCCTGACA 
ATTTGTTCCT 
TTACATCCTT 



GATGCTGTCC 
TTCACCTACA 
ACTCAACTTG 
GTAAAATGTT 
GTGTTTGGCA 
GCTOGGACAT 
ACTGAGTCCA 
TTAGCACCAT 
GCCAGCAGCT 
TAA 



ACCAGCCCCC 
AGTCCTCTTG 
AGTGCACATC 
CAAGCCTGGC 
CTGTGTGCAA 
GTGGAGCCAC 
ACATTCCCTT 
TTCTCCTCTG 
GCCAAAGCCT 



GAAGGGTTTG 
TGCCTTCAGC 
TCAGGGACAA 
AGTTCCGGGA 
GTTCGCCTGT 
AGGACACTGG 
GGTAGCTGGA 
GCTTCGGAAA 
TGAATCAGAC 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



A220 Protein sequence: 
Gene name* 
Unigene number: 
Probeset Accession ft: 
Protein Accession ft : 
Signal sequence; 
Transmembrane domains: 
C- lectin domain: 
Cellular Localization: 



Select in E (endothelial adhesion molecule 1) 

Hs. 89546 

H24736 

NP 000441 

1-22 

555-573 

23-139 

plasma membrane 



1 11 21 31 41 SI 

I i I I | | 

. MIASQPLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEBIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 

TALESPEHGS LVCSHPLGNP SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNW 240 

ECDAVTNPAN GFVECPQNPG SPPWNTTCTF DCEEGPELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNPTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSPRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 



Taxol Prostate 
A221 DNA SEQUENCE 

Gene name: ESTs; Liprin A2 

Unigene number: Hs. 306480 

Probeset Accession ft: NS1002 

Nucleic Acid Accession ft: N51002 

Coding sequence: 1-3793 (underlined 



sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGATGTGTG AAGTGATGCC CACGATTAAT GAGGACACCC CAATGAGCCA AAGGGGGTCC 60 

CAAAGCAGTG GCTCGGACTC AGACTCCCAT TTTGAGCAGC TGATGGTGAA TATGCTAGAT 120 

GAAAGGGATC GTCTTCTAGA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACTTGCCCAG 180 

CAAAGACTTC AGGATGTCAT CTATGACCGA GACTCACTCC AGAGACAGCT CAATTCAGCC 240 

CTGCCACAGG ATATCGAATC CCTAACAGGA GGGCTGGCTG GTTCTAAGGG GGCTGATCCA 300 

CCGGAATTTG CTGCACTGAC AAAAGAATTA AATGCCTGCA GGGAACAACT TCTAGAAAAG 360 

GAAGAAGAAA TCTCTGAACT TAAAGCTGAA AGAAACAACA CAAGACTATT ACTGGAGCAT 420 

TTGGAGTGCC TTGTGTCACG ACATGAAAGA TCACTAAGAA TGACGGTGGT AAAACGGCAA 480 

GCCCAGTCTC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAAGGCACT GAAATCTTTG 540 

TTTGAGCACC ACAAGGCCTT GGATGAAAAG GTAAGGGAGC GACTGAGGGT TTCTTTAGAA 600 

AGAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AGGAGATTGT TGCCTTGCGT 660 

GAACAAAATG TTCATATACA AAGAAAAATG GCATCAAGCG AGGGATCCAC AGAGTCAGAA 720 

CATCTTGAAG GGATGGAACC TGGACAGAAA GTCCATGAGA AGCGTTTGTC CAATGGTTCT 780 

ATAGACTCAA COGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT TGAAAAGCAA 840 

AACTATGAAA TGGCCCAGAT GAAAGAACGT TTAGCAGCCC TTTCTTCCCG AGTGGGAGAG 900 

GTGGAACAGG AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAGA AATGAACACC 960 

AAGTATCAAA GGGACATTAG GGAGGCCATG GCACAAAAGG AAGATATGGA AGAAAGAATT 1020 

ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACATGACATG 1080 

AATGATAAAC TAGAAAATGA GTTAGCAAAT AAAGAAGCTA TCCTACGGCA GATGGAAGAG 1140 

AAAAACAGAC AGTTACAAGA ACGTCTTGAG CTAGCTGAAC AAAAGTTGCA GCAGACCATG 1200 

AGAAAGGCTG AAACCTTGCC TGAAGTAGAG GCTGAACTGG CTCAGAGAAT TGCAGCCCTA 1260 

ACCAAGGCTG AAGAGAGACA TGGAAATATT GAAGAACGTA TGAGACATTT AGAGGGTCAA 1320 

CTTGAAGAGA AGAATCAAGA ACTTCAAAGA GCTAGGCAAA GAGAGAAAAT GAATGAGGAG 1380 

CATAACAAGA GATTATCGGA TACGGTTGAT AGACTTCTGA CTGAATCCAA TGAACGCCTA 1440 

CAACTACACT TAAAGGAAAG AATGGCTGCT CTAGAAGAAA AGAATGTTTT AATTCAAGAA 1500 

TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATG ATAAGGAAAG ATTAGCAGAA 1560 

GAAATTGAAA AGCTGAGATC TGAACTTGAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 1620 

GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGCGGTA CTCAGTGGGA 1680 

TCCCTAGTGQ ACAGCCAGTC TGATTACAGA ACAACTAAAG TAATAAGAAG ACCAAGGAGA 1740 

GGCCGCATGG GTGTGCGAAG AGATGAGCCA AAGGTGAAAT CTCTTGGGGA TCACGAGTGG 1800 

AATAGAACTC AACAGATTGG AGTACTAAGC AGCCACCCTT TTGAAAGTGA CACTGAAATG I860 

TCTGATATTG ATGATGATGA CAGAGAAACA ATTTTTAGCT CAATGOATCT TCTCTCTCCA 1920 

AGTGGTCATT CCGATGCCCA GACGCTAGCC ATGATGCTTC AGGAACAATT GGATGCCATC 1980 

AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CAGAGTTGCG TGCTGAAGAA 2040 

ATTGAAAATA GAGTGGCTAG TGTGAGCCTC GAAGGCCTGA ATTTGGCAAG GGTCCACCCA 2100 



936 



WO 03/042661 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 



GGTACCTCCA 
GGACACTCAA 
GTCATGACAC 
GATGGTOGAG 
GCCCTCAGAA 
TCTGTCTCTC 
CTTCACAAAG 
AAAGAAAAAG 
GAGTCCCTGG 
AAGCATGAAC 
CCAACTGTGG 
TGCCGAGCCA 
AGAGAAATTG 
ATGGTTTCCC 
TGGGTGACTC 
GAAGGAAGCT 
CATGAGTGGA 
TTTATGGAAT 
GTCCATTTAA 
TTAAAGAGGT 
GAAATAAAAG 
GGACTTCGAG 
CTGGATGAAA 
ACCCAGGCAA 
AGGCGACTGG 
TTTCCTCCTC 
GCTGGATTTA 
TCATCAAGAC 
GCGGCCGCTT 



TTACTGCCTC 
CTCCAAAGCT 
TGCCAAGTGA 
AGGACAAAGC 
TGACTCACAC 
TTGAGCCAGA 
CCCCCAAGAA 
CTGGACTTGG 
GGTTAGGCAA 
TTCTTGAAGA 
TOGCATGGCT 
ACGTGAAGAG 
GAATCAGGAA 
TAACAAGTCC 
ATGAAGAAAT 
GGGCCCAGTG 
TTGGAAATGA 
GCTTGGTAGA 
AAATGGTGGA 
TGAATTATGA 
AOGTGTTGGT 
AATATGCAAA 
ACTTTGACTA 
GGCAGATTCT 
ATGAAAGTGA 
GTGAAGTACA 
GGTTAACCAC 
TGCAGAGGTT 
TAA 



TGTTACAGCT 
CACCCCTCGA 
TCTGAGGAAA 
AACAATTAAA 
TCTCCCTTCT 
AAGCCTCGGG 
GAAAGGAATC 
GCAGCTCOGA 
ACTCGGAACT 
AGCTGGGAGA 
AGAGCTTTGG 
TGGTGCCATC 
TCCACTGCAT 
TTCAGCTCCT 
GGAAAATCTT 
TCOGGTTTTT 
ATGGCTTCCC 
TGCAAGAATG 
TAGTTTCCAT 
CAGAAAAGAA 
GTGGAGCAAT 
TAATATACTT 
CAGCAGCTTA 
TGAAAGAGAA 
TGACAAGAAC 
TGGAATCAGC 
AACCTCTGGG 
AGACAACTCC 



TCATOGCTGG 
AGCCCTGCCA 
CATOGGAGAA 
TGTGAAACTT 
TCCTACCACA 
CTTGGTAGTG 
AAGTCTTCAA 
GGCTTTATGG 
CAAGCTGAGA 
AAGGGATTAC 
TTGGGAATGC 
ATGTCTGCTT 
CGCTTAAAAC 
CCAACATCTC 
GCAGCTCCAG 
CTACAGACCC 
AGCTTGGGGT 
TTAGATCACC 
OGAACAAGTT 
CTAGAAAGAA 
GACOGAATTA 
G AGAGC GGTG 
ACTTTATTAT 
TACAATAACC 
TTCAGAOGTG 
ATGATGCCTG 
CAATCAAGAA 
ACTGTTCGCA 



CCAGTTCATC 
GGGAAATGGA 
AGATTGCAGT 
CTCCTCCTCC 
ATGATGCTOG 
CCAACAGCAG 
TAGGACGTTT 
AGACTGAASC 
AGGATCGAAG 
CTTTTGCCCA 
CTGCGTGGTA 
TATCTGACAC 
TTCGATTAGC 
GAACTCCTTC 
CAAAAACGAA 
TGGCTTATGG 
TACCTCAGTA 
TAACAAAAAA 
TACAATATGG 
GACGGGAAGC 
TTCGCTGGAT 
TGCATGGCTC 
TACAGATTCC 
TCTTGGCCCT 
GATCAACCTG 
GGTCCTCAGA 
AAATGACAAC 
CATACTCATG 



TCCCCCCAGT 
TCGGATGGGA 
TGTGGAAGAA 
TACCCCTAGA 
AAGTAGTTTA 
CCAAGACTCT 
GTTTGGTAAA 
TGCAGCTCAG 
ACTAAAGAAA 
GTGGGATGGG 
CGTGGCAGCC 
TGAGATCCAG 
AATCCAGGAG 
AGGCAACGTT 
AGAATCTGAG 
AGATATGAAT 
CAGAAGTTAC 
AGATCTCCGT 
AATTATGTGC 
AAGCCAACAT 
ACAAGCAATT 
ACTTATAGCC 
AACACAGAAC 
GGQAACTGAA 
GAGAAGGCAG 
AACATTACCA 
AGATGTTGCT 
TCTCGAGTAA 



A222 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
AAA domain: 
SAM domains: 
Cellular Localization: 



ESTS; Liprin A2 
Hs. 306480 
N51002 
none found 
none found 
none found 
286-539 

895-964, 1017-1084, 1105-1177 
not determined 



1 
I 

MMCEVMPTIN 
QRLQDVIYDR 
EEEISELKAS 
FEHHKALDEK 
HLEGMEPGQK 
VEQEAETARK 
NDKLENELAN 
TKAEERHGNI 
QLHLKBRMAA 
EPTIPRTHLD 
NRTQQIGVLS 
NKEIRLIQEE 
GHSTPKLTPR 
ALRMTHTLPS 
KEXARLgQLR 
PTWAWLELW 
MVSLTSPSAP 
HEWIGNEWLP 
LKRLNYDRXE 
LDENFDYSSL 
FPPREVHGIS 



11 
I 



21 
I 



DSLQRQLNSA 
RNNTRLLLEH 
VRERLRVSLE 



DLIKTEEMNT 
KEAILRQMEE 
EERMRHLEGQ 
LEEKNVLIQE 
TSAELRYSVG 
SHPFESDTEM 
KESTELRAEE 
SPAREMDRMG 
SYHNDARSSL 
GFMETEAAAQ 
LGMPAWYVAA 
PTSRTPSGNV 
SLGLPQYRSY 
LERRREASQH 
TLLLQIPTQN 
MMPGSSETLP 



LPQDIESLTG 
LECLVSRHER 
RVSALEEELA 
IDSTDETSQI 
KYQRDIREAM 
KNRQLQERLE 



31 
I 

FEQLMVNMLD 
GLAQSKGADP 
SLRMTWKRQ 
AANQEIVALR 
VELQELLEKQ 



SETPRKNLEE 
SLVDSQSDYR 
SDIDDDDRET 
IENRVASVSIi 
VMTLPSDLRK 
SVSLEPESLG 
ESLGLGKLGT 
CRANVKSGAI 



FMECLVDARM 
EIKDVLVWSN 
TQARQIIiBRE 
AGFRLTTTSG 



LAEQKLQQTM 
ARQREKMNEE 
SLHDKERLAE 
TTKVIRRPRR 
IFSSMDLLSP 
EGLNLARVHP 
HRRKIAWEE 
LGSANSSQDS 
QAEKDRRLKK 
MSALSDTEIQ 
AAPAKTKESE 
LDHLTKKDLR 
DRIIRWIQAI 
YNKLLALGTE 
QSRKMTTDVA 



41 
I 

ERDRLLDTLR 
PEFAALTKEL 
AQSPSGVSSE 
EQNVHIQRKM 
NYEMAQMKER 
TTLEKRYLSA 
RKAETLPEVE 
HNKRLSDTVD 
EIEKLRSELD 
GRMGVRRDEP 
SGHSDAQTLA 
GTSITASVTA 
DGREDKATIK 
LHKAPKKKGI 
KHELLEEARR 
REIGISNPLH 
EGSWAQCPVP 
VHLKMVDSPH 
GLREYANNIL 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



51 
I 

ETQESLSLAQ 
HACREQLLEK 
VEVLKALKSL 
ASSEGSTESE 
LAALSSRVGE 
QRESTSIHDM 
AELAQRIAAL 
RLLTESNERL 
QLKMRTGSLI 
KVKSLGDHEW 
MMLQEQLDAI 
SSLASSSPPS 
CETSPPPTPR 
KSSIGRLFGK 
KGLPPAQWDG 
RLKLRLAIQE 
LQTIiAYGDMN 
RTSLQYGIMC 
ESGVHGSLIA 
FRRGSTWRRQ 
TVRTYSCLE 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



A223 DWA SEQUENCE 
Gene name: CDA14 
Unigene number: Hs. 2 681 3 

Probeset Accession #: N32912 
Nucleic Acid Accession 8* NMJ)16570 

Coding sequence: 1- lF34 (underlined sequences correspond to start and stop codons) 



75 



80 



1 11 21 31 41 SI 

I I I I I i 

ATGAGGCQAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCOG 60 

AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 

TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 

937 



WO 03/042661 



PCT/US02/36810 



GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 640 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



A224 Protein sequence; 

Gene name: ' CDA14 

Unigene number: Hs. 26813 

Probeset Accession #: N32912 

Protein Accession ft: NP_0576S4 

Signal sequence: none found 

Transmembrane domains: none found 

Cellular Localization: nuclear 



1 11 21 

I I I 

MRRLNRKKTL SLVKBLDAFP KVPESYVETS 
KYEYEVDKDP SSKLRINIDI TVAMKCQYVG 
KEWQRMLQLI QSRLQEEHSL QDVIPKSAPK 
VAGNPHITVG KAIPHPRGHA HLAALVNHES 
IDHNQMFQYP ITWPTKLHT YKISADTHQF 
MVTVTEEHMP FWQFPVRLGG IVGGIPSTTG 
EDGHTDNHLP LLENNTH 



31 41 51 

t I ! 

ASGGTVSLIA FTTMALLTIM EPSVYQDTWM 60 

ADVLDLAETM VASADGLVYE PTVFDLSPQQ 120 

STSTALPPRE DDSSQSPNAC RIHGHLYVNK 180 

YNFSHRIDHL SFGELVPAII NPLDGTEKIA 240 

SVTERERItN HAAGSHGVSG IFMKYDLSSL 300 

MLHGIGKFIV EIICCRFRLG SYKPVNSVPF 360 



Uterine 

A225 DMA SEQUENCE: 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession ft: 
Coding sequence: 



ESTs, Weakly similar to JE0350 Anterior gradient -2 [H. sapiens] 

Hs. 100686 

AA4874G8 

AA487468 

55-555 (underlined sequences correspond to start and stop ccdons) 



1 11 21 31 41 51 

I I I I I 1 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGXAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAAXACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAQAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A226 Protein sequence: 
Gene name: 
Uhigene number: 
Probeset Accession #: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Hs. 100686 

AA487468 

none found 

1-23 

none found 
secreted 



1 11 21 31 41 51 

I I I I I I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLPYAQKSKK 60 
PLMVIHHLED CQYSQALKKV PAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
PVDPSLTVRA DIAGRYSNRL YTYBPRDLPL LIENMKKALR LIQSEL 

A227 DNA SEQUENCE 

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Unigene number: Hs. 285529 

Probeset Accession ft: AA460S30 
Nucleic Acid Accession ft: NM_003667 

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons) 



938 



WO 03/042661 



PCT/US02/36810 
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75 

80 



1 11 21 31 41 SI 

I I I I I I 

GTGGCGGCAA CCGGCACCTC AGTCCCCGCC GOGCTTCTCC TCGCCGCCCA CGCCGTGGGG 60 

TCAGGAACGC GGOGTCTGGC GCTGCAGACQ COCGCTGAGT TGCAGAA6CC CACGGAGCGG 120 

CGCCOGGCGC GOCAOGGCCC GTAGCAGTCC GGTGCTGCTC TCCGCCCGCG TC0G6CTCGT 180 

GGCCCCCTAC TTCGGGCACC ATGGACACCT CCCGGCTOGG TGTGCTCCTG TCCTTGCCTG 240 

TGCTGCTGCA GCTGGCGACC GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT 300 

GCCCCACACA CTGTCATTGC GAGCCCGACG GCAGOATGTT GCTCAGGGTG GACTGCTCCG 360 

ACCTGGGGCT CTCGGAGCTG CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA 420 

GTATGAACAA CATCAGTCAG CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG 480 

AGTTAOGTCT TGCGGGAAAC GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT 540 

ACAGTCTTAA AGTTCTTATG CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC 600 

TGCAGAATTT GCGAAGCCTT CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC 660 

CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 720 

TAACAGAAAT CCCCGTCCAG GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG 780 

CCCTGAACAA AATACACCAC ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG 840 

TTCTACATCT CCATAACAAT AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC 900 

ACAGCCTAGA GACTTTAGAT TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA 960 

GGACACTCTC CAACCTTAAA GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG 1020 

AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC 1080 

AATTTGTTGG GAGATCTGCT TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG 1140 

GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA 1200 

CTTTAACTGG AGCACAGATC TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC 1260 

TCCAAGTGCT AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC 1320 

AAAAGCTTCA GAAAATTGAC CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT 13 B0 

TCCAGCAGTT GCTTAGCCTC CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC 1440 

ACCCCAATGC ATTTTCCACT TTGCCATCCC TAATAAAGCT GGACCTATOG TCCAACCTCC 1500 

TGTCGTCTTT TCCTATAACT GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC 1560 

ATGCCTTACA GAGCTTGATA TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC 1620 

CTTATGCTTA CCAGTGCTGT GGATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC 1680 

AATGGAATAA AGGTGACAAC AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTQGAATGT 1740 

TTCAGGCTCA AGATGAACGT GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA 1800 

AAGCCCTTCA TTCAGTGCAG TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC 1860 

TGCTTGATGG CTGGCTGATC AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT 1920 

GTAATGCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980 

TGTTAATTGG GGTCATCGCA GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG 2040 

CTGGTGTGGA TGCGTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100 

GGGTTGGTTG CCATGTCATT GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC 2160 

TGCTTACTCT GGCAGCCCTG GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA 2220 

CGAAAGCTCC ATTTTCTAGC CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA 2280 

CCATGGCCGC AGTTCCCCTG CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC 2340 

CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400 

CCCT TT GC TT CCTCATGATG ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG 2460 

GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA 2520 

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760 

CTGATGATGT CGAAAAACAG TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820 

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG 2880 

AGAGCTGCCA TCTTTCCTCT GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

AATGTTTTCA AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATGAGA GCAGTAATTA 3000 
ATAAGAAGAG CTGAGGTGAA ACTCGGTTTA AA 



A228 Protein sequence 
Gene name: 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Hs. 28552 9 
NP_003 658.1 
1-22 

557-S79, 596-618, 641-663, 687-709, 724-746, 770-792, 
plasma membrane 



803-825 



1 
I 

MDTSRLGVLL 
PSNLSVFTSY 
LQNNQLRHVP 
AFRSLSALQA 
LNYNNLDEFP 
FQHLPELRTL 
YNLLKDLPSP 
LPSLIKLDLS 
AFGVCENAYK 
CSPSPGPPKP 
AVNMLTGVSS 
ERGFSVKYSA 
STMGYMVALI 
PVAFLSFSSL 
VfTRSKHPSLM 
VAPVPCL 



11 

I 

SLPVLLQLAT 
LDLSMNNISQ 
TEALQNIiPvSL 
MTLALNKIHH 
TAIRTLSNLK 
TLNGASQITE 
SVCQKLQKID 
SNLLSSFPIT 
ISNQWNKGDN 
CBBLLDGWLI 
AVLAGVDAFT 
KFBTKAPFSS 
LLNSLCFLMM 
INLTPISPEV 
SINSDDVEKQ 



21 
I 

GGSSPRSGVL 
LLPNPLPSLR 
QSLRLDANHI 
IPDYAFGNL9 
ELGFHSNNIR 
FPDLTGTANL 
LRHNBIYEIK 
GLHGLTHLKL 
SSMDDLHKKD 
RIGVWTIAVL 
FGSFARHGAW 
LKVIILLCAL 
TIAYTKLYCN 
IKFILLVWP 
SCDSTQALVT 



31 

I 

LRGCPTHCHC 
FLEELRLAGN 
SYVPPSCFSG 
SLWLHLHNN 
SIPBKAFVGN 
BSLTLTGAQI 
VDTFQQLLSL 
TGHEALQSXil 
AGMFQAQDER 
ALTCNALVTS 
WENGVGCHVI 
LALTMAAVPL 
LDKQDLENIW 
LPACLNPLLY 
PTSSSITYDL 



41 
I 

EPDGRMLLRV 
ALTYIPKGAF 
LHSLRHLWLD 
RIHSLGKKCF 
PSLITIHPYD 
SSLPQTVCNQ 



SSEWFPELKV 
DLBDFLLDFE 
TVFRSPLYIS 
GFLSIFASBS 
LGGSKYGASP 
DCSMVKHIAL 
ILFNPHFKED 
PPSSVPSPAY 



51 




1 

DCSDLGLSBL 


60 


TGLYSLKVLM 


120 


DNALTEIPVQ 


180 


DGLKSLETLD 


240 


NPIQFVGRSA 


300 


LPNLQVLDLS 


360 


AIIHPNAFST 


420 


IEMPYAYQCC 


480 


EDLKALHSVQ 


540 


PIKLLIGVIA 


600 


SVFLLTLAAL 


660 


LdiPLPFGEP 


720 


LLFTNCILNC 


780 


LVSLRKQTYV 


840 


PVTBSCHI*SS 


900 



Table 75: See Tablet 



939 



WO 03/042661 



PCT/US02/36810 



Table 75A depicts Seq ID No; UnigenelD; UnigeneTlte; PKey; Predicted Cellular Localization; and Exemplar Accession far each of the sequences In Table 7a The Information in 
Table 76A is linked by SeqID No to Table 78. 

Seq ID No: Sequence ID No for sequences bi table ' 

Pkey: Unique Eos probeset identifier number 

ExAcat Exemplar Accession number, Genbanx accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Pred Subceil Loc: Predicted sub-cellular locafcation 

Seq ID No 

Seq ID 1 & 2 
SeqlD3&4 
SeqlD5&6 
SeqlD7&8 
SeqlD9&10 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
SeqlD23&24 
Seq ID 25 & 26 
Seq ID 27428 
SeqID29&30 
Seq ID 3H 32 
SeqlD33&34 
SeqlD35&36 
Seql037&38 
SeqlD39&40 
Seq ID 4U 42 
SeqlD43&44 
SeqlD45&46 
SeqtD47&48 
Seq ID49&50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
SeqlD57&58 
Seq ID 59 & 60 
Seq ID 6 U 62 
Seq ID 63464 
Seq ID 65466 
Seq ID 674 68 
Seq ID 69470 
Seq ID 71472 
Seq ID 73474 
Seq ID 75476 
Seq ID 77478 
Seq ID 79480 
Seq ID 81482 
Seq ID 83484 
Seq ID 85486 
Seq ID 87488 
Seq ID 89490 
Seq ID 91 & 92 
Seq ID 93494 
Seq ID 95496 
Seq ID 97488 
Seq ID 994100 
Seq ID 101 4 102 
Seq ID 1034 104 
Seq ID 1054106 
Seq ID 1074108 
Seq ID 1094110 
SeqlD1114112 
SeqID 113 4 114 
Seq ID 115 & 116 
Seq ID 117 4 118 
Seq ID 1194 120 
Seq ID 1214122 
Seq ID 1234 124 
Seq ID 125 4 126 
Seq ID 127 & 128 
Seq ID 1294130 
Seq ID 131 4 132 
Seq ID 1334134 
Seq ID 135 4 136 
Seq ID 137 4 138 

940 



Pkev 


fcxAccn 


UnigenelD 


ufugtzno hub 


rreo ouoceu loc 


425023 


AW956889 


Hs.154210 


prufalhpfbit rirffprPnfisfinn cnhlrwilrnl 
cimuuicuai uiiiciciiuduon, spninyviipi 


plasma membrane 


424503 


NMJX)2205 Hs.149609 


Integrin, alpha 5 (fibronectin receptor, 


plasma membrane 


429423 


AIM £7 19 


U g 0Q77Q7 


mfaarin hflfa 1 ffihmnorfin mrantnr 

Ultoyilll, 1 HlUlvllCvUJI IwwClJlUl, 


piasrna rnemorane 


400289 


X07820 


U c OOTQ 

ns.iioo 


llldlll* IlKHdWywtcUUlaB III |5Ifuln6iy5Bl 


secreted 


419172 




Li c 99190 


EQTtv cimtl-v In TDAMCMCMDDAMC A CI ItlCDC 

to I 5, similar la I KAJVwMtMoKANb 4 oUrfcKr 


plasma membrane 
secreted 


418007 


M 1 JOUJ 


LU OQ4CQ 

rt3.00 IDj 


matiiv iTOfattaimJoffiaeo 1 flntofeftiat 
uuftiiA 1 1 mawpi uvciiicwc i uiuei5uu3i 


418007 


M 13509 


lie 

nSiOO ids 


matrhf mplatlownfAtnac/» 1 finterctrK:*! 


secreted 


407835 


T7Q3ATI 




ft-r^tl CI 1 /fvmnhnma fi momhor R 9liw fi 


In trace II 


414577 


Aincccja 


Hc7911fi 

ns./ £ 110 


hvrvrf htAftrs?! nmtefn PI .I9AQQ9 cimtlar fo 


secreted 


418738 


AWoooooo 


Hs.6682 




plasma membrane 
secre^ 


428368 


BE440042 


ns. oooio 




415138 


VI 0000 


HS.Z95944 


Rssue factor nnfhwnv fnhfhifftf 9 


secreteo 


429276 


Acncenoc 
ArC56Uo5 


Hs. 1986 12 


G nmfpJn-cmirvlpd recfioter ^1 


plasma membrane 


418994 




n$.D?9*K> 


owevun c ^onuoinsiiai aanssion moiecui 


plasma membrane 


407975 




Ue /M71C 


BndothRfial celUsrjecific mnlefiitln 1 


secreted 


429113 


UZoZoo 


H5.1 95384 


prusiagiarKijn^naopefoxiae synmase < (p 


Intracefl 


418506 


AAuo4248 


U„ or ion 

Hs .85339 


G protein-coupled receptor 39 


plasma membrane 


423961 


ui Jooo 


nS.ioo34o 


nprfrKfin /fVsP.9oc^ 


secreted 


414812 


Y707CC 


Hs.77367 


iwiHwno iikhmm uy ganuna iniciKsun 


secreted 


417433 


BtZ7Q266 


HS.B212B 


5T4 oncofetal trophoblast Qtycopotein 


plasma membrane 


417433 


BE270266 


HS.ozizo 


d i ^ oncoreiaj iropnooiasi glycoprotein 


plasma membrane 


424399 


AIOrtCCQf 

AI905do7 




AIQ056R7*IL.RTM^1QniQ<U)1Q Rmw Mnmn 

.IL-D 1 UJJ- lju 1 13 u I U30 nOlTlO 


secreted 


422867 


L32137 


Hs.1584 


Miiuidya ojigomenc iuouia prolan \psB 


secreted 


428227 


AA321649 


Hs.2248 


cwitill Ifl/fit^iKla ^ul/iWnA gnhfaffnttlfc* Q Ff\i 

onioii uiouaoiB cytojuno suoiamuy d ^uy 


secreted 


444381 


BE387335 


Hs.283713 


co i s, wtMiiuy buuiftfl uj odwoh nypouieu 


secreted 


439569 


AW602166 


Hs.222399 


utvari prowtn 


$ecreted 


411558 


AA102670 


Hs.70725 


nammcuimfnAhiih/rir iv\fi //2ARA\ A ratentn 
QdliUIUhaiunoUUqfnc oCKj yvsMDn/ A IcCCpiD 


plasma membrane 


400303 


AA242758 


Hs.79136 


UV*I pUlCUl, tAUUgoIl IcyuldlcQ 


piasma rnemorane 
secreted 


411789 


AF245505 


Hs.72157 


nuvwii 


428698 


AA852773 


Hs.334838 


KlAAIRfifi nmlpin 


plasma membrane 


450098 


W27249 


Hs.8109 


hvmihpJral nmtnfn FI l9inRfl 


uitraceii 


421552 


AF026692 


Hs.105700 




secreted 


452747 


BE153855 


Hs.61460 


ig supenanuiy iccopior Lfiir\ 


plasma membrane 


415539 


AI733881 


Hs.72472 


DlVlr"f\ID 


plasma membrane 


416636 


N32536 


Hs.42645 


ouuiuj Corner laiimy id \\ \ luitucoiuoxyuc 


plasma membrane 


416635 


N32536 


Hs.42645 


solute carrier family 1 6 (monocarboxyllc 


plasma membrane 


409079 


W87707 


Hs.B2065 


tnieneuxin o stgnai transducer (gpioD, 


plasma membrane 
plasma membrane 




R41823 


Hs.7413 


to IS 


400297 


Al 127076 


Ks.306201 




plasma membrane 


451398 


AI793124 


Hs.144479 


CO IS 


intnaceu 

plasma membrane 


429220 


AW207206 




CO IS 


421524 


AA312082 


Hs.105445 


ouixr uuiwy laxpox sopna i 


plasma membrane 


423242 


AL039402 


Hs.125783 


rye up e nmtofrt 


plasma membrane 


423242 


AL039402 


Hs. 125783 




fntnvetl 


423242 


AL039402 


Hs.125783 


DEME-6 protein 


Intracell 


452190 


H2673S 


H&91668 


Homo sapiens done PP1498 unknown mRNA 


intracelt 


452190 


H26735 


HsS1668 


Homo sapiens done PP1498 unknown mRNA 


Intracefl 


325372 






Phase 2 & 3 Exons 


cytoplasmic 


450375 


AA009847 




a (Gsintegrin and metalloproteinasedoma 


plasma membrane 


426215 


AW9S3419 


Hs.155223 


stannbeakin 2 


secreted 


425247 


NMJJ05940 Hs.155324 


matrix metalbpfoteinase 11 (strornetysfn 


secreted 


429353 


AL117406 


Hs.335891 


ATP-bindlng cassette transporter MRP 8 


plasma membrane 


429353 


AL117406 


H&335891 


ATP-Wndlng cassette transporter MRP8 


piasma membrane 


432201 


AI538613 


KS298241 


TransnTembrane protease, serine 3 


plasma membrane 
secreted 


427585 


031152 


Hs.179729 


coHaoen, type X alpha 1 (Schmtd metaph 


446163 


AA026880 


Hs^5252 


prolactin receptor 


plasma membrane 


442117 


AW664964 


Ks.128899 


ESTs; hypothetical protein tor IMAG&447 


ptasma membrane 


428179 


AI127772 


KSJ279696 


senjm/glucocorficold regulated kinase-Ii 


intracell 


431211 


M86849 


Hs323733 


gap junction protein, beta 2, 26kD (conn 


ptasma membrane 
secreted 


447033 


AI357412 


Hs.157601 


ESTs 


447033 


AI357412 


Hs.157601 


ESTs 


secreted 


447033 


AI357412 


Hs.157601 


ESTs 


secreted 


115522 


BE614387 


Hs.333893 


c-Myc target JP01 


intracefl 


452679 


242387 


Hs.83883 


transmembrane, prostate androgen Induced 


plasma membrane 


446051 


8EO460S1 


Hi37054 


ephrin-A3 


ptasma membrane 


422048 


NNL012445 


KsJ88126 


spondin 2, extraceOiiar matrix protein 


secreted 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4 


plasma membrane 


446342 


BE2S8865 


Hs.14846 


Homo sapiens mRNA; cDNA DKFZp564D016 (fr 


ptasma membrane 


422260 


AA315993 


Ks.105484 


regeneraSng gene type IV 


secreted 
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SeqtD 139& 140 


409041 


AB 033025 


Hs. 50031 


Hypothetical protein. XP_051 860 (K1AA1 19 


secreted 


Seq 10 141 & 142 


409041 


AB033025 


Hs 50081 


HvootheScsi oraiein. XP~051860 (K1AA119 




SeqtD 143 & 144 


111929 


AF027208 


Hs. 11 2360 


promWn (mouseJ-Gke 1 


plasma membrane 


Seq ID 145 & 146 


111929 


AF027208 


Hs. 11 2360 


nmmtntn /mnii^ftVJIkft 1 


plasma membrane 
ptssma membrane 


Seq tD 147 & 148 


111929 


AF027208 


Ks.112360 


proffitnin { mouse)"©© 1 


Seq ID 149 & 150 


104888 


AW939591 


Hs.5940 


mucin 13, epithelial transmembrane 




Seq ID 151 & 152 


420159 


AI572490 


Hs.99785 


Homo sapiens cDNA: FU21245 lis, clone C 


plasma membrane 


Seq ID 153 & 154 


422330 


D30783 


Hs.115263 


epireguEn 


plasma membrane 


Seq ID 155 & 156 


452461 


N78223 


Hs.108106 


transcription factor 


in trace! 


Seq ID 157 a 158 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


secreted 


Seq ID 159 & 160 


412420 


AU035668 


HsJ3853 


bone morphogenettc protein 2 


secreted 


Seq ID 161 & 162 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural ara 


secreted 


Seq ID 163 & 164 


407811 


AW190902 


Hs 40098 


cysteine knot superfamiry 1, BMP antagon 


secreted 


SeqtD 165 


402230 






Fd&nesh aredfcted- CYTOCHROME P450 4F5 1 


mtracell 


Seq ID 166 & 167 


402230 






Fgenesh predicted: CYTOCHROME P450 4F5 ( 


intracell 


Seq ID 168 & 169 


432829 


W60377 


Hs.57772 


ESTs 


intracell 


Seq ID 170 & 171 


425721 


AC002115 


Hs.1 59309 


uroplakln 1A 


plasma membrane 


Seq ID 172 & 173 


420370 


Y13645 


Hs.97234 


uroplakin 2 


plasma membrane 


Seq ID 174 & 175 


437852 


R POO 1838 


fe!256897 


ESTs. WeaMv similar to dJ365012.1 IRsa 




Sea ID 176 & 177 


402075 






FN l ?Pnfinnn2^1fl^fi* , Pla«;ma mpmhranp rplrinm 


OCMCICU 


Sea ID 178 & 179 


421110 


AJ25Q717 


Hs.1355 


mihpnsin P 


secreted 


Seq ID 180 & 181 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


plasma membrane 


Seq ID 182 & 183 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


secreted 


Seq ID 184 & 185 


451668 


Z43948 


K&326444 


cartilage acidic protein 1 


intracell 


Seq ID 186 & 187 


408243 


Y00787 


Hs.624 


interteukin 8 


secreted 


Seq ID 188 & 189 


422282 


AF019225 


Hs. 11 4309 


apoGpoprotetn L 


secreted 


Seq ID 190 & 191 


425852 


AK001504 


Hs.159551 


death receptor 6, TNF superfamiry member 


plasma membrane 


Seq ID 192 & 193 


439738 


BE246502 


Ks.9598 


soma domain, ImmunoQlobultn domain (Ig), 


plasma membrane 


Seq ID 194 & 195 


404875 






NM_022819*:Homo sapiens phospholipase A2 


intraceH 


Seq ID 196 & 197 


425883 


AL1 37708 


Hs.161031 


Homo sapiens mRNA; cDNA DKFZp434K0322 (f 


plasma membrane 


Seq ID 198 & 199 


404977 






Insulin-like growth factor 2 (somatomedi 


secreted 


Seq ID200&201 


420876 


AA918425 


Hs.177744 


ESTs 


plasma membrane 


SeqlD202 &203 


427747 


AW411425 


Hs.180655 


serine/threonine kinase 12 


intracell 


Seq ID 204 & 205 


420281 


A1623693 


Ks.323494 


Predicted cation efflux pump 


plasma membrane 


SealD206 &207 


446673 


NM.016361 Hs.15871 


1 PAP far hrconhfisnhafktic arid nhnsnhata 


nfamfmomhnno 


Seq ID 208 


437553 


AI829935 


Hs.130497 


ESTs Weaktv similar to MATS HUMAN CHLOR 


piosnia rncniufojic 


Sea ID 209 & 210 


437553 


AI829935 


Hs.130497 


ESTs. Weaktv similar to MAT 8 HUMAN CHLOR 


plasma membrane 


Seq ID 211 &212 


437553 


A1829935 


Hs.130497 


ESTs Weaktv similar to MAT8 HUMAN CHLOR 


plasma membrane 


Sea ID 213 & 214 


426900 


AW163564 


Hs.142375 


ESTs 


nlacma mpmhranA 

UIOOIIIm IILCUiUIOilO 


Sea ID 215 & 918 


426900 


AW163564 


Hs.142375 


ESTs 


nlscma irumhrono 
pIcCM 1 la iieiiiuiaira 


Seq ID 217 & 218 


426900 


AW163564 


Hs.142375 


ESTs 


plasma membrane 


Sea ID 219 & 220 


405932 






C1^>0nMn^f^fi1??lnhlAACfiQ1Qfl 11 /AM 




Sen (D 221 & 222 


405932 






C1500Q3DS*oil380G12?lflhlAAC691^1l fARI 




Seq ID 2234224 


424008 


R02740 


Hs.137555 


putative chemoldne receptor] GTP-binding 


plasma membrane 


SeqID225&226 


424008 


R02740 


Hs.137555 


putative chemokina receptor, GTP-binding 


plasma membrane 


Seq ID 2274228 


424008 


R02740 


Hs.137555 


tMitafrvo chpmoWnfl recpntnr GTP-Mndhvi 


plasma membrane 


Seq ID 229 & 230 


444342 


NMJN4398 Hs.10687 






Seq ID 231 & 232 


421379 


Y15221 


Hs. 103382 


small inducible cytokine subfamily B (Cy 


secreted 


Seq ID 233 4234 


417079 


U65590 


Hs.81134 


intsrleukin 1 receptor antagonist 


secreted 


Seq ID 235 & 236 


412986 


X81120 


Hs.75110 


cannaUrtoid receptor 1 (brain) 


plasma membrane 


Seq ID 237 & 238 


412986 


X81120 


Hs.75110 


cancabtaoid receptor 1 (brain) 


plasma membrane 


SeqID239 &240 


412986 


X81120 


Hs.75110 


^n^^^n^ry^oMf^eA 1 


plasma membrane 


Sea ID 241 & 242 


447072 


D61594 


Hs.17279 


lyrosyipjoteiri sudooo isierasQ i 




SeolD243 &244 


419723 


AL120193 


Hs.339810 




secreted 


Seq ID 245 & 246 


419723 


AL120193 


Ks.339810 


kwwvftv a^stirsncfl H Afn1 S cftfflvislflft 


mamhrQna 


Sea ID 247 & 248 


430890 


X54232 


Hs.2699 


gryptcan 1 




Sea ID 249 & 250 


456759 


BE259150 


Hs.127792 


rlpUfl rnrrKfinhnaVJIVR 3 


ntetnw mptnhrflrcp 


Seq ID 251 


429466 


M85635 


Hs.1 2827 


ESTs 




Sea ID 252 


429466 


M85835 


Hs.12827 


ESTs 




Seq ID 253 & 254 


419721 


NM001650 Hs^88650 


aQuaponn 4 


plasma membrane 


Sea ID 255 & 256 


407034 


U84540 




ah'Human dvstrnfamvtn tsflfrym OTM-3 fDTN 


sacreled 


Sea ID 257 & 258 


413472 


8E242870 


Hs.75379 


cnlutfl carrier farnltv 1 fallal rrroh affi 


plasma membrane 


Seq ID 259 & 260 


438380 


T06430 


Hs.6194 


chondrottin sulfate proteoglycan BEHAB/b 


secreted & plasma t 


SeqlD261&262 


426271 


AF026547 


Hs.169047 


chondrouln sulfate proteoglycan 3 (neur 


secreted 


Seq ID 263 & 264 


419704 


AA429104 


Hs.45057 


ESTs 


Intracell 


SeqtD 265 & 266 


444471 


AB020684 


Hs.11217 


K1AA0877 protein 


plasma membrane 


Seq ID 267 & 268 


409395 


U46745 


Hs.336678 


dystrobievm. alpha 


secreted 


Seq ID 269 & 270 


413063 


AL035737 


Hs.75184 


cnitinase 3-Cke 1 (carfilage glycoprotB 


secreted 


Sea ID 271 & 272 


433800 


AI034361 


Hs.135150 


lung type*l ce8 membra pfr^assoclated gly 


plasma membrane 


Sea ID 273 


458435 


AM18718 


Hs.144121 


ESTs. Weakfv similar to T46916 hvootheti 




Seq ID 274 


458435 


AI418718 


Hs.144121 


ESTs, Weakly similar to T46916 hypothefi 




Seq ID 275 4276 


424343 


AW956360 


Hs.4748 


adenylate cyclase activating polypeptide 


plasma membrane 


Seq ID 2774278 


424998 


U58515 


Hs.154138 


chlSnase 3-Gke 2 


secreted 


Seq ID 2794280 


412709 


AL022327 


Hs.74518 


WAA0027 protein 


plasma membrane 


Seq ID 2814282 


435615 


Y15065 


Hs.4975 


potassium voltage-gated channel, KQT-Wc 


plasma membrane 


Seq ID 2834284 


404049 






NM_018937*:Homo sapiens protocadherin be 


plasma membrane 


SeqtD 2854286 


418932 


134059 


Ks.89484 


cadherin 4, type 1, Rcadherln (reunaf) 


ptasma membrane 


Seq ID 2874288 


404029 






NM_018936^Homo sapiens protocadherin be 


plasma membrane 


Seq ID 2894290 


436480 


AJ271643 


HsJ7469 


puta&ve acid-sensing ton channel 


fntracefl 


Seq ID 291 4292 


452401 


NM.007115 K&29352 


tumor necrosis factor, alpha-induced pro 


secreted 


Seq ID 2934294 


452401 


NM.007115 Hs393S2 


tumor necrosis factor, alpha-induced pro 


secreted 


SeqtD 2954296 


436895 


AF037335 


Hs^338 


carbonic anhydrase XII 


plasma membrane 


Seq ID 2974298 


421471 


U90545 


Hs^27179 


solute carrier family 17 (sodium phospha 


plasma membrane 
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SeqtO299&300 


428296 


NM-003Q58 Hs.183572 


solute carrier family 22 (organic cation 


plasma membrane 


Seq ID 301 & 302 


423508 


AW604297 


Ks.129711 


hepatitis A virus ceOular receptor 1 


plasma membrane 


Seq ID 303 8304 


450001 


NM_OQ1044 Hs.406 


solute carrier family 6 (neurotransrntte 


plasma membrane 


Seq ID 305 & 308 


410407 


X66839 


Hs.63287 


carbonic arthydrase IX 


plasma membrane 


Seq ID 307 & 308 


453496 


AA442103 


Ks.33084 


solute carrier family 2 {facilitated g!u 


plasma membrane 


Seq ID 309 8310 


420737 


L08096 


Hs.99899 


CD70 ; tumor necrosis factor (Bgand) s 


plasma membrane 


Seq ID 311 & 312 


309931 


AW3416B3 




gbAd13d01jc1 Soares.NFU.T_GBC.S1 Homos 


plasma membrane 


Seq ID 313 & 314 


412719 


AW016610 


Hs.816 


ESTs 


intracetl 


Seq ID 315 & 316 


417034 


NM 006183 Hs.80962 


neurotensin 


secreted 


Seq ID 317 & 318 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


plasma membrane 


SeqlD319&320 


413753 


U17760 


Hs.75517 


taminfn, beta 3 (nicein (1 25kD), kaBrrin 


secreted 


Seq ID 321 & 322 


425650 


NNL001944 Hs.1925 


desmoglein 3 (pemphigus vulgaris antigen 


plasma membrane 


Seq ID 323 & 324 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 




Seq ID 325 & 326 


418663 


AK001100 


Hs.41690 


desmocoliin 3 


plasma membrane 


Seq ID 327 & 328 


418663 


AK001100 


Hs.41690 


desmocoliin 3 


plasma membrane 


Seq ID 329 & 330 


429610 


AB024937 


Hs.211092 


LUNX protein; PLUNC (palate lung and nas 


secreted 


Seq ID 331&332 


406690 


M29540 


Hs^20529 


carcboembryonic art fig en-related cell ad 


plasma membrane 


Seq ID 333&334 


431846 


BE019924 


Hs^71580 


uroplakin IB 


plasma membrane 


Seq ID 335 & 336 


422158 


L10343 


Hs.112341 


protease Inhibitor 3, skin-derived (SKAL 




Seq ID 337 & 338 


431958 


X63629 


Hs^877 


cadherin 3, type 1, P-cadherin (placenta 


plasma membrane 


Seq ID 339 & 340 


437044 


AL035864 


Hs.69517 


differenSdly expressed in Fanconfc an 


plasma membrane 


Seq ID 341 & 342 


428484 


AF104032 


Hs.184601 


solute carrier family 7 (cationic amino 


plasma membrane 


Seq ID 343 & 344 


429211 


AF052693 


Hs.198249 


gap junction protein, beta 5 (connexin 3 


plasma membrane 


Seq ID 345 & 346 


417389 


BE260964 


Hs.82045 


midkine (neuritB growth-promoting factor 




SeqlD347&348 


431009 


BE149762 


Hs.48956 


gap junction protein, beta 6 (connexin 3 


plasma membrane 


Seq ID 349 & 350 


417542 


J04129 


Hs.82269 


progestagen-associated endometrial prote 


secreted 


Seq ID 351 & 352 


449230 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


plasma membrane 


Seq ID 353 & 354 


410555 


U92649 


Hs.64311 


a disintegrin and metalloproteinase do ma 
a disintegrin and metalloproteinase do ma 


plasma membrane 


Seq ID 355 & 35S 


410555 


U92649 


Hs.64311 


plasma membrane 


Seq ID 357 & 358 


424687 


JQ507O 


Hs.151738 


matrix metalloproteinase 9 (gelatin ase B 


secreted 


Seq ID 359 & 360 


418462 


BE001596 


Hs.85266 


integral, beta 4 


plasma membrane 


Seq ID 361 & 382 


410274 


AA381807 


Hs.61762 


hypoxia-inducible protein 2 


secreted 


Seq ID 363 & 364 


439606 


W79123 


Ks.58561 


G protein-coupled receptor 87 


plasma membrane 


Seq ID 355 & 366 


404877 






NM_005365:Homo sapiens melanoma antigen, 


intracetl 


Seq ID 3674368 


444781 


NMJH4400 Hs.11950 


GPI-anchored metastasis-associated prote 


plasma membrane 


Seq ID 369 & 370 


109424 


NM.005329 Hs.85962 


hyaturonan synthase 3 


pis ma membrane 


Seq ID 371 & 372 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 373 & 374 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 375 & 376 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 377 & 378 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 379 & 380 


415817 


U88967 


HSJ8867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 381 & 382 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


plasma membrane 


Seq ID 383 4 384 


421817 


AF146074 


Hs.108660 


ATP-blnding cassette, sub-family C (CFTR 


plasma membrane 


Seq ID 385 & 386 


418678 


NMJM1327 Hs.87225 


cancerftesfe antigen (NY-ESO-1) 


in trace 11 


Seq ID 387 & 388 


418678 


NM-001327 Hs.87225 


cancer/testis antigen (NY-ESO-1) 


intracetl 


Sea ID 369 & 390 


409420 


Z15008 


Hs.54451 


Isminin namma 9 frarp.ln t IflflkTH kntini 


nSwMfnpmhrano 


Sea ID 391 & 392 


332180 


AF134160 


Hs.7327 


daudin 1 




Sea ID 393 & 394 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


plasma membrane 


Seq ID 395 & 396 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine Kinase, receptor, 


* plasma membrane 


Seq ID 397 & 398 


439223 


AW238299 


Hs.250618 


UL16 WnrfirKi nmtein 2 




Seq ID 399 & 400 


409757 


NM.O01898 Hs.123114 


cvstafin SN 


secreted 


Sea ID 401 & 402 


428969 


AF120274 


Hs.194689 




Sea ID 403 & 404 


428969 


AF120274 


Hs.194689 




secreted 


Seq ID 405 & 406 


428969 


AF120274 


Hs.194689 


Brtsmin 


secreted 


Seq ID 407 & 408 


428969 


AF120274 


Hs.194689 




secreted 


SeqlD409 &410 


450701 


H39960 


Hs.288467 


hypothetical protein XPJ098151 (leucine- 


secreted 


SeqlD41U412 


450701 


H39960 


Hs^88467 


hypothetical protein XP_098151 (leucine- 


secreted 


Seq ID 413 & 414 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


secreted 


Seq ID 415 & 416 


407944 


R34008 


Hs.239727 


desmocoliin 2 


plasma membrane 


Seq ID 417 & 418 


407944 


R34008 


HSJ239727 


desmocoliin 2 


plasma membrane 


Seq ID 419 & 420 


428486 


AW583497 


Hs.184604 


pancreaSc polypeptide 


secreted 


SeqlD421&422 


457489 


AI693815 


Hs.127179 


cryptic gene 




Seq ID 423 8424 


432874 


W94322 


Hs^79651 


melanoma inhibitory activity 


secreted 


Seq ID 425 & 426 


445891 


AW391342 


Hs.199460 


DPCR1 protein 


plasma membrane 


Seq ID 427 & 428 


445891 


AW391342 


Hs.199460 


DPCR1 protein 


plasma membrane 


SeqlD429 &430 


404682 






C9001188*^i|1273884^ref|NP^073725.1| p 


secreted 


Seq ID 431 & 432 


429547 


AW009166 


Hs.99376 


ESTs 


secreted 


Seq ID 433 & 434 


425921 


NM.007231 Hs.162211 


solute center family 6 (neurotransrmtte 


plasma membrane 


Sea ID 435 & 436 


407242 


M18728 




gbcHuman nonspecific ciossreacti ng anOg 


plasma membrane 


Seq ID 437 & 438 


407242 


M18728 




gbcKurnan nonspecific crossreacting an tig 


plasma membrane 


Seq ID 439 & 440 


407242 


M18728 




gbcHuman nonspecific crossreacOng antig 


plasma membrane 


SeqlD441&442 


432596 


AJ224741 


Hs.278461 


malriEn 3 


secreted 


Seq ID 443 & 444 


444006 


BE395C85 


Hs.10086 


type 1 transmembrane protein Fn14 


plasma membrane 


SeqlD445 &446 


423685 


BE350494 


Hs.49753 


uveal autoantigen with coiled cod domal 


tntraceil 


Seq ID 447 & 448 


428392 


H10233 


Hs.2265 


secretory granute, neuroendocrine protei 


secreted 


SeqlD449 &450 


429597 


NMJXJ3816 Hsl2442 


a disintegrin and metaBoproteinase doma 


plasma membrane 


Seq ID 451 & 452 


448030 


N30714 


HS.32596Q 


membrane-spanning 4-domains, subfamily A 


plasma membrane 


Seq ID 453 & 454 


422109 


S73265 


Hs.1473 


gastrirweteasing peptide 


secreted 


SeqlD455 &456 


419235 


AW470411 


HsJ>88433 


neurotrimin 


plasma membrane 


SeqlD457 &458 


449048 


245051 


Hs.22920 


similar to S68401 (cattle) glucose indue 


plasma membrane 


Seq ID 459 & 460 


427333 


AP067797 


Hs.176558 


aquaporin 8 


plasma membrane 


Seq ID 461 & 462 


417931 


W95642 


H«l82981 


trefoil factor 3 (intestinal) 


secreted 


Seq ID 463 & 464 


419216 


AU076718 


Hs.164021 


small Inducible cytokine subfamOy B (Cy 
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SeqlD465&466 


431629 


AU077025 


Hs.265827 


interferon, eloha-faiduribte oratein /eta 


secreted 


Seq tD 467 & 468 


413554 


AA319146 


Hs.75426 


secretogranln II (chior nogr unln G) 


secreted 


SeqlD469 &470 


452194 


AI694413 


Hs.332649 


UbtQuHin-Oce protein FAT1077? 


plasma membrane 


SeqlD471 & 472 


452194 


AI694413 


Hs.332649 


UWquffin-Ike protein FAT1077? 


plasma membrane 


Seq ID 473 & 474 


426322 


J05068 


Ks.2012 


transoobstemin 1 (vitamin B12 binding pr 


secreted 


Seq 10475 &476 


429010 


Y18198 


Hs. 194725 


one cut domain, family member 2 


intracefl 


Seq 10 477 & 478 


431462 


AW583672 


Hs.256311 


granin-Cke neuroendocrine peptide precu 




Seq ID 479 & 480 


448243 


AW369771 


Hs^2620 


Integrin, beta 6 


plasma membrane 


Seq ID 481 & 482 


426427 


M86699 


Hs.169840 


TTK protein kinase 




Seq ID483 &484 


428167 


AI687303 


Hs .285529 


G protein-coupled receptor 49 


plasma membrane 


Seq ID 4854436 


428579 


NM_005756 Hs.184942 


Q protelivcouplod receptor 64 


plasma membrane 


Seq ID 487 & 488 


428579 


NM.C05756 Hs.184942 


G protein-coupled receptor 64 


plasma membrane 


Sea!D489 &490 


428579 


NM_005756 Hs.184942 


G protein-coupled receptor 64 


plasma membrane 


Seq ID 491 & 492 


428579 


NNL005756 Hs.184942 


G proteio^oupted receptor 64 


plasma membrane 


Sen ID 403 A 404 


445537 


AJ245671 


Hs.12844 


COTHlMMlWldJIl, HlUlupy 0 


secreted 


Sea ID 495 & 496 


422278 


AF072873 


Hs.114218 


fri77lpH { Drnsnnhila) hnmnlna 6 


ptasmd membrane 


Sea ID 497 & 498 


424620 


AA101043 


Hs.151254 


kflltkreln 7 frhvmnfnrnfir clrafum mm 


^ecrefed 

aCUKCU 


Sea ID 499 & 500 


406400 






kaOntrpin ft fnaiimnfifnftwacinl ( Kl KR\ 


<eSpfe*J 


Sea ID Ml & 509 
ot>v| iu out a 3U< 


431130 


NMJJ06103 Hs3719 


net, cpiUHjyiiByj>powip»t witoy-cvJUKy \s 




Sea ID 503 & 504 


420440 


NM_002407 Hs.97644 


rrtctii u i BQtobtn 2 


secreted 


Can m 505 A ^flfi 

\JC^ iu JuJ IX WW 


428450 


NM.014791 


Hs.184339 


W A AD 175 neno nmriiirf 


Infrtvpil 


Shi ID 507 A SOB 


446619 


AU076643 


Hs.313 


uw^mtoH nhnenhnrunlotfi 1 /ncfofuMntin 


coyybIoH 

SCvlCvCU 


Sen (D 509 & 510 


436982 


AB018305 


Hs.5378 


c nn nrtl n 1 /LfinnnH7n\ ovtrarotliitar mzA 
opuiniiii I; ^t*o||uiBiMi^ tMUiquoiiUhai nidi 


secreted 


Sea ID 511 ft 51? 


453392 


U23752 


Hs.32954 


CRY fepy rfotpimlninn mninn Yl-hnv 1 1 


intraceil 


Seq ID 513 


431989 


AW972870 


Hs.291069 


ESTs 




Seq ID 514 


439820 


AL360204 


Hs.283853 


Homo sapiens mRNA full length insert cDN 




Seq ID 515 & 516 


409178 


BE393948 


Hs.50915 


kel&krein 5 


secreted 


Seq ID 517 & 518 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


secreted 


Seq ID 519 & 520 


421478 


AI683243 


Hs.97258 


ESTs, Moderately similar to S29539 ribos 


plasma membrane 


Seq ID 521 & 522 


421478 


AI683243 


Hs.97258 


ESTs, Moderately similar to S 29 539 ribos 


plasma membrane 


Seq ID 523 & 524 


425776 


U25128 


Hs.159499 


parathyroid rtonnone receptor 2 


plasma membrane 


Seq ID 525 & 526 


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


plasma membrane 


Seq ID 527 & 528 


452097 


AB002364 


Ks.27916 


a disintegrin-like and matallo protease ( 


secreted 


Seq ID 529 & 530 


416530 


U62801 


HsJ9361 


kalSkrein 6 (neurosin, zyme) 


secreted 


Sea ID 631 & 532 


431515 


NMJM2152 


Hs^5B5B3 


endothelial differenGalioni lysopnospha 


plasma membrane 


Seq ID 533 & 534 


419452 


U33635 


Hs.90572 


PTK7 piotein fyrosine kinase 7 




Sea ID 535 & 536 

ud{ ILy wvu u JwU 


412078 


X69699 


Hs.73149 


nzXmA tmv nowa ft 
JJoil CU LAJA ^ohj O 


liuF3GGII 


Sea ID 537 A 53fl 


412078 


X69699 


Hs.73149 


rto)mfi hrtv nofla ft 
poucU UUA yclro O 




Sea ID 539 A 540 


411773 


NM.006799 Hs.72026 


pruiudoo, otaUlo, L I ^uJoUouij 


ep^fafaH 


Seq ID 541 & 542 


407792 


AI077715 


Hs.39384 


mikifhfA RPCffifeH Rremfl hnrnnlmmf^ In f 


cenreJed 


Sea ID 543 A 544 


431616 


AA508552 


Hs.195839 


pOTe lA/poVK/ dmib In I3A099 hvnnf hpfi 


plasma membrane 


Sea ID 545 & 546 


452792 


AB037765 


Hs.30652 


KIAA1344 nmSpfa 

IwVMilTf (HWCIil 


njacma memhrarte 


SealD547&548 


400294 


N95796 


Hs.278695 


Hnmn confpfrc nmdefn mRNA mmnlpfA nfc 


plasma membrane 


Sea ID 549 A 550 


432653 


N62096 


H&293185 




nlacfna mpmH ran p. 

|AoallUS KIOIIIUIOJIS 


Sen ID 551 A 559 


432653 


N62096 


Hs.293185 


PQTc WoqUv cJmilur In IP7^9ft amino or! 


plasms membrane 


Sea ID 553 A "554 


432653 


N62096 


Hs.293185 


CCTq WpnktvRtmnnrln ir719ft flminn nri 
CO 1 a, fYcaniy oBTUIca ID Jul Oc.0 oiiUllU OH 


plasma membrane 


Seq ID 555 & 556 


432653 


N62096 


Hs33185 


PST<! WpaVh/ clmBflr its JC739S nmlnn eel 


plasma membrane 


Seq ID 557 & 558 


400290 


H18835 


Ks.31608 


nvDothefkal orotein FL120041 


plasma membrane 


Sea ID 559 & 560 


410001 


AB041036 


Hs.57771 




secreted 


Seq ID 561 & 562 


418396 


AI765805 


Hs.26691 


ESTs 




Seq ID 563 & 564 


451027 


AW519204 


Hs.40808 


ESTs 


pioSslla nl6f?lwlcQi6 


Seq ID 565 & 566 


446057 


A1420227 


Hs.149358 


ESTs. Weakfv similar to A46010 X-Bnked 


p^OTamembrane 


Sea(D567&568 


433466 


AA508353 


Hs.105314 


relaxin 1 M\\ 


secieisa 


Sen tD 5fi9 &570 


453370 


AI470523 


Hs.139336 


ATP-HrrHffin raccotto ci iK-famlh/ C. (CA- \ R 
m r -uuKJitty \#ooooLU3 f ouvwiiuij \j \ui i r\ 


plasma membrane 


Sea ID 571 A 579 


453370 


AI470523 


Hs.139336 


m i r-iwtujiiy cassauoi suunamiiy \j \\jt i r\ 


rdaefna mpmhmno 
piOMFIa lllclHUloilo 


Sea ID 573 & 574 


414569 


AF109298 


Hs.118258 


prostate cancer assoctated protein 1 


nlsfims memhrane 


Seq ID 575 & 576 


413435 


X51405 


Hs.75360 


carboxypeptidase E 


secreted 


Sea ID 577 A 57fl 


426501 


AW043782 


HsJ293616 


ESTs 




Sea ID 579 & 580 

wvvj IU wiwU wUv 


448999 


AF179274 


Hs.22791 


tranvnemhrflne runfpin with f-^PJtke and 


plasma membrane 


Sea ID 581 A 589 

UCU IL/ JU 1 v w<0£ 


408369 


R38438 


Hs.182575 


ftrtfiffa Aarner famttv 15 francnivt 

oviUUJ MlillOl uUIUIj 13 \nf f 1 UdllafHJH 


plasma membrane 


Seq ID 583 & 584 


412628 


A1972402 


Hs36051 


hypothetical protein MGC2648 


secreted 


Sea ID 585 & 586 

Vw^ 14/ WW WVV 


403047 






MM nft555fi**H nmn ssnlms tmmsmpmhmnp nr 

IW^WMwU *riV4IIU «w^#fC*K> UOI IdllU^IIMCMlC pi 


plasma membrane 


Seq ID 587 & 588 


403047 






N M„005656*:Homo sapiens transmembrane pr 


plasma membrane 


Seq ID 589 & 590 


408430 


S79876 


Hs.44926 


dtpeptldyl peptidase W (CD26, adenosine 


pbsnnfl membrane 


Seq ID 591 & 592 


445413 


AA151342 


Hs.12677 


06H47 protein 


secreted 


Sea ID 593 & 594 


451982 


F13036 


Hs.27373 


Homo saniens mRNA! cDNA DKFZoS6401763 ff 


inlmcefl 


Seq ID 595 & 596 


427958 


AA418000 


Hs.98280 


potflssturn In tGrmod tsls/smsil oonductsncs 


plasfna membrane 


Sea ID 597 A598 

vwj il/ vv 9 km. wwv 


421887 


AW161450 


Hs.109201 


vwnw |iivW4ti 


pbsnQ inerobrsno 


Seq ID 599 & 500 


425071 


NM.013989 Hs.154424 


tfofawJflflSQi todothyroninO) typo 1) 




Sea ID 601 A 602 

wwj iu wv 1 q yu& 


432101 


AI918950 


Hs.123642 


EphA3 


^3aramerrtbrane 


Seq ID 603 & 604 


407786 


AA667538 


Hs.38972 


tetraspan 1 


plasma membrane 


SeqlD605&606 


416836 


D54745 


Hs.80247 


rfiotecystokinin 


secreted 


Seq ID 607 & 608 


416539 


Y07909 


Hs.79368 


epithelial membrane protein 1 
ganTma-antnobutyric add (GABA) A recepto 


plasma membrane 


Seq(D609 &610 


131083 


Y09763 


Hs.22785 


plasma membrane 


SeqlD611&612 


131083 


Y09763 


Hs^Z7B5 


gamrra^mlrrobur/ric acid (GABA) A recepto 


plasma membrane 


Seq ID 613 & 614 


131083 


Y09763 


Hs^2785 


garnrna-ermrwmrtyric add (GABA) A recepto 


plasma membrane 


Seq ID 615 & 616 


131033 


Y09763 


Hs.22785 


gamma-arrdnobutync arid (GABA) A recepto 


plasma membrane 


Seq ID 617 & 618 


422424 


AI186431 


Hs-29S638 


prostate differentiation factor 


secreted 


SeqlD619&620 


428970 


BE276891 


Hs.194691 


refinolc add Induced 3 


plasma membrane 


Seq ID 621 4622 


428330 


122524 


Hs3256 


matrix metaSoprcteinase 7 (matrflystn, 


secreted 


Seq ID 623 & 624 


439018 


AW300887 


Hs .26638 


merntyane-6 panning 4-domaIns, subfamily A 


plasma membrane 


Seq ID 625 & 626 


420610 


AI683183 


Hs.99348 


dlstaWess hocneo box 5 


intraceil 


Seq ID 6274628 


425723 


NM.014420 Hs.159311 
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SeqlD629 & 630 456662 NMJW2448 Hs.1494 

Seq ID 631 4 632 418281 U09550 Hs.1154 

Seq ID 633 4 634 429903 AL134197 Hs.93597 

Table 768: 



msh (DroscphUa) homeo box homolog 1 (fo 
oviductei glycoprotein 1 ,120kD (mudn9 
(^independent kinase 5, regulatory su 



intraceD 



Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
424399 

429220 
450375 



Table 76C 



CAT Number Accession 

238961.1 AS05687 A1905624 AI905837 AI905623 AA340069 R75793 W72837 BE07451 2 AI905633 W72838 BE092421 Al 127172 BE186013 AW070916 

A1139456AW176044AW291950 
301384J AW207206 AW341473 AA448195 AI951341 

83327 J AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151 166 BE157467 BE157601 H04384 W46291 AW663674 H04021 H01532 

M190993H03231H59605 H01642AA&2876M1W^ 



Pkey: 
Reft 

Strand: 
NLposiSon: 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gi) numbers. "Dunham I. et at' refers to the publication entitled The DNA 
sequence of human chromosome 22." Dunham I. et a!., Nature (1999) 402:489495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt^positfon 


402075 


8117407 


Phis 


121907-122035,122804-12^21.124019-124161,124455-124610,125672.126076 


402230 


9966312 


Minus 


29782-29932 


403047 


3540153 


Minus 


59793-59968 


404029 


7671252 


Plus 


108716-111112 


404049 


3688074 


Minus 


75765-78155 


404682 


9797231 


Minus 


40977-41150 


404875 


9801324 


Plus 


96588-96732,97722-97831 


404877 


1519284 


Plus 


1095-2107 


404977 


3738341 


Minus 


43081-43229 


405932 


7767812 


Minus 


123525-123713 


405400 


9256298 


Plus 


1553-1712,1878-2140,4252-4385,5922-6077 
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Table 77 provides Pkey; Seq ID No; Disease Indications: and Preferred Uffity for sequences in Table 78. Seq ID No Rnks Ihe Informalion in Table 77 to Table 78. 



SeqlDNa 
Pkey: 

Disease Indications: Diseases designated (or coverage as described in Table 1 
Preferred Utility: Preference of utility, based partly upon predicted localization (Ab is antibody; sm is smaB molecule target; CTL is vaccine target) 



Sequence ID No for sequences In table 
Unique Eos probesei Identifier number 



Seq ID no 



Pkey Disease Indications 



425023 
424503 
429423 
400289 
419172 



Seq ID 1 & 2 
Seq(D3&4 
SeqlD5&6 
SeqID7&8 
SeqlD9&10 
Seq ID 11 & 12 418007 
Seq ID 13 & 14 418007 
Seq ID 15 & 16 407638 
Seq ID 17 & 18 414577 
SeqtD19& 20 418738 
SeqtD21&22 428368 
Seq ID 23 & 24 415138 
SeqiD25&26 429276 
SeqiD27&28 418994 
SeqlD29&30 407975 
Seq ID 31 & 32 429113 
Seq ID 33 & 34 4185Q6 
SeqlD35& 36 423961 
Seq ID 37 & 38 414812 
Seq ID 39 & 40 417433 
Seq ID 41 & 42 417433 
Seq!D43& 44 424399 
SeqiD45& 46 422867 
SeqlD47& 48 428227 
Seq ID 49 & 50 444381 
Seq ID 51 & 52 439569 
SeqlD53& 54 411558 
Seq!D55& 56 400303 
Seq ID 57 & 58 411789 
Seq(D59&60 428698 
Seq ID 61 & 62 450098 
Seq ID 63 & 64 421552 
Seq ID 65 & 66 452747 
SeqlD67& 68 415539 
Seq ID 69 & 70 416636 
Seq ID 7U 72 416636 
SeqiD73& 74 409079 
SeqlD75& 76 442082 
SeqlD77& 78 400297 
SeqlD79&60 451398 
SeqID81&82 429220 
Seq ID 834 84 421524 
SeqlD85& 86 423242 
SeqtD87& 68 423242 
SeqlD89&90 423242 
SeqlD9U92 452190 
Seq ID 93 & 94 452190 
Seq ID 95 & 96 325372 
Seq ID 97 & 98 450375 
Seq ID 99 & 100 426215 
Seq ID 101 & 102 425247 
Seq ID 103 & 104 429353 
Seq ID 105 & 106 429353 
Seq ID 107 & 108 432201 
Seq ID 109 & 110 427585 
Seq ID 111 & 112 446163 
Seq ID 113 & 114 442117 
Seq ID 115 & 116 428179 
SeqID117&118 431211 
Seq ID 119 & 120 447033 
Seq ID 121 & 122 447033 
Seq ID 123 & 124 447033 
Seq ID 125 & 126 115522 
Seq ID 127 & 128 452679 
SeqtD129&130 446051 
Seq ID 131 & 132 422048 
Seq ID 133 & 134 410418 
Seq ID 135 & 136 446342 
Seq ID 137 & 138 422260 
SeqlD139&140 409041 
Seq ID 141 & 142 409041 
SeqlD143&144 111929 
SeqlD145&146 111929 
Seq ID 147 & 148 111929 



angtogenesis 

pancreas, prostate, angtogenesis, bladder, lung 



angtogenesis, bladder, lung, cervical, ovarian, head & neck 
angtogenesis, renal 

angtogenesis, rung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ovarian 
angtogenesis, rung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ovarian 
angtogenesis 
angtogenesis 

angtogenesis, lung, ovarian, bladder & stomach, pancreas, uterine 

angtogenesis, head & neck, stomach 

angtogenesis, pancreas, stomach, lung, uterine 

angtogenesis, bladder, glioblastoma 

prostate, angtogenesis 

angtogenesis, renal 

angtogenesis, bladder and stomach 

anglogenests, ovarian, glioblastoma, uterine, lung, bladder, pancreas 

breast, colon, bladder, lung, fibrosis, pancreas, head and neck, ovarian 

breast bladder, lung, Fibrosis, pancreas, colon, head and neck, cervical, stomach, renal, ovarian 

pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, renal 

pancreas, breast bladder, lung, head & neck, cervical, uterine, ovarian, stomach, renal 

breast uterine, head & neck 

breast ovarian, prostate, pancreas, lung, colon, uterine 

breast lung, bladder, ovarian, head & neck, fibrosis, colon, stomach, cervical 

breast colon, bladder, lung, pancreas, head & neck, ovarian, stomach, uterine, renal, angtogenesis 

breast androgen withdrawal prostate, prostate, bladder 

pancreas, prostate, stomach, breast uterine, cervical, ovarian 

breast ovarian, pros, stomach, uterine, bladder lung head & neck 

pancreas, lung, breast stomach, head & neck, ovarian, uterine 

breast colon, rung, pancreas, stomach, head & neck, ovarian 

breast lung, stomach, uterine 

breast ovarian, pancreas, cervical, uterine, prostate, rung, stomach, head & neck 
breast bladder, lung, head & neck, ovarian, stomach, uterine, pancreas 
breast prostate, ovarian, cervical, uterine 
breast pancreas, uterine 
breast pancreas, uterine 



breast prostate, ovarian 
breast bladder, colon, prostate 
breast ovarian 

breast prostate, benign prostatic hyperplasia 
breast 

breast renal, ovarian, prostata, colon 
breast renal, ovarian, prostate, colon 
breast renal, ovarian, prostate, colon 
breast stomach, pancreas 
breast, stomach, pancreas 



breast ovarian, head & neck, pancreas, lung, colon 
breast hmg, renal, colon, ovarian, uterine 

breast ovarian, lung, colon, pancreas, head & neck, stomach, uterine, cervical, bladder 
breast prostate 
breast prostate 

breast colon, king cherno, ovarian, stomach, pancreas, uterine, cervical 
breast lung, head & neck, pancreas, stomach, colon, ovarian, cervical 
breast cervical, uterine 

breast hmg, bladder, pancreas, head & neck, stomach, ovarian, prostate 
breast 

colon, bladder, hmg, pancreas, head & neck 
colon, prostate, fibrosis, breast lung pancreas 
colon, prostate, fibrosis, breast kmg pancreas 
colon, prostate, fibrosis, breast hmg pancreas 
colon, king, bladder, pancreas 
prostate, colon, pancreas, Taxol prostate 
colon, breast 
colon, pancreas, prostate 

colon, bladder, lung, ovarian, pancreas, head & neck 
uterine, colon, prostate 
colon, ovarian mucinous 

uterine, ovarian, hmg, colon, stomach, head & neck, breast pancreas 
uterine, ovarian, lung, cokm, stomach, head & neck, breast, pancreas 
colon, breast, fibrosis 
colon, breast fibrosis 
colon, breast fibrosis 
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PrererredUtiity 

Ab, sm, CTL, imaging 
Ab, sm, imaging 
Ab, sm 

Ab, sm, CTL, diagnosfic 
Ab, sm, CTL, imaging 
Ab, sm, diagnostic 
Ab, sm, diagnostic 
CTL 

Ab, CTL, diagnostic 
Ab. sm, CTL, imaging 
Ab, sm, diagnostic 
Ab, CTL, diagnostic 
Ab, sm, imaging 
Ab, CTl, imaging 
Ab, CTL, diagnostic 
sm, CTL 
Ab, sm, imaging 
Ab, sm, diagnostic 
Ab, CTL, diagnostic 
Ab,CTUrmaging 
Ab.CTL, imaging 
Ab, CTL, diagnostic 
Ab, CTL, diagnosis 
Ab, CTL diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, sm, imaging 
Ab.sm, CTL imaging 
Ab, CTL. diagnostic 
Ab. sm, imaging 
CTL 

Ab, sm, CTL, diagnostic 
Ab, imaging 
Ab.sm, CTL, imaging 
Ab,sm, CTL, imaging 
Ab.sm, CTL, imaging 
Ab.sm, CTL, imaging 
Ab, imaging 
Abim, CTL, Imaging 
CTL 

Ab. CTL, imaging 

Abvsm, CTL, imaging 

Ab, CTL, imaging 

CTL 

CTL 

CTL 

CTL 

CTL 

Ab,sm, CTL, Imaging 
Ab, CTL, diagnostic 
Ab, sm, CTL, diagnostic 
Ab, sm, CTL, imaging 
Ab, sm, CTL, Imaging 
Ab, sm, CTL, imaging 
CTL 

Ab, sm, imaging 
Ab, CTL, imaging 
sm,CTL 
Ab, sm, imaging 
Ab, CTL, diagnosfic 
Ab, CTL, diagnosfic 
Ab, CTL, diagnostic 
CTL 

Ab.Cn, Imaging 
Ab, sm. CTL, imaging 



Ah, sm, CTL, imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL. diagnostic 
Ab, CFL, diagnostic 
Ab, CTL, diagnostic 
Ah, sm, imaging 
Ab, sm, imaging 
Ab. sm, imaging 
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Seq t0 149 & 150 104888 
Seq ID 151 & 152 420159 
Seq ID 153 & 154 422330 
Seq ID 155 & 156 452461 
Seq ID 157 & 158 413324 
Seq ID 159 & 160 412420 
Seq ID 161 & 162 416658 
Seq ID 163 & 164 407811 
Seq ID 155 402230 
Seq ID 166 & 167 402230 
Seq ID 168 & 169 432829 
SeqlD170&171 425721 
Seq ID 172 & 173 420370 
Seq ID 174 & 175 437852 
Seq ID 176 & 177 402075 
SeqlD178&179 421110 
SeqlD180&181 451668 
Seq ID 162 & 183 451668 
Seq ID 184 & 185 451668 
Seq ID 186 & 187 408243 
Seq ID 188 & 189 422282 
Seq ID 190 & 191 425852 
Seq ID 192 & 193 439738 
Seq ID 194 & 195 404875 
Seq ID 196 & 197 425883 
Seq ID 198 & 199 404977 
Seq ID 200 & 201 420876 
Seq ID 202 & 203 427747 
Seq ID 204 & 205 420281 
Seq!D206 & 207 446673 
Seq ID 208 437553 
Seq ID 209 & 210 437553 
Seq ID 211 & 212 437553 
Seq ID 213 & 214 426900 
Seq ID 215 & 216 426900 
Seq ID 217 & 218 426900 
Seq ID 219 & 220 405932 
Seq ID 221 & 222 405932 
Seq ID 223 & 224 424008 
Seq ID 225 & 226 424008 
Seq ID 227 & 228 424008 
Seq ID 229 & 230 444342 
Seq ID 231 & 232 421379 
Seq ID 233 & 234 417079 
Seq ID 235 & 236 412986 
Seq ID 237 & 238 412986 
Seq ID 239 & 240 412986 
Seq ID 241 & 242 447072 
Seq ID 243 & 244 419723 
Seq ID 245 & 246 419723 
Seq ID 247 & 248 430890 
Seq ID 249 & 250 455759 
Seq ID 251 429466 
Seq ID 252 429466 
Seq ID 253 & 254 419721 
Seq ID 255 & 256 407034 
Seq ID 257 & 258 413472 
Seq ID 259 & 260 438380 
Seq ID 261 & 262 426271 
Seq ID 263 & 264 419704 
Seq ID 265 & 266 444471 
Seq ID 267 & 268 409395 
Seq ID 269 & 270 413063 
Seq ID 271 & 272 433800 
Seq ID 273 458435 
Seq ID 274 458435 
Seq ID 275 & 276 424343 
Seq ID 277 & 278 424998 
SeqlD279&280 412709 
Seq ID 281 & 282 435615 
Seq ID 283 & 284 404049 
Seq ID 285 & 286 418932 
Seq ID 287 & 288 404029 
Seq ID 289 & 290 436480 
Seq ID 291 & 292 452401 
Seq ID 293 & 294 452401 
SeqlD295& 296 436895 
Seq ID 297 & 298 421471 
Seq ID 299 & 300 428298 
Seq ID 301 & 302 423508 
Seq ID 303 & 304 450001 
Seq ID 305 & 306 410407 
Seq ID 307 & 308 453496 



colon, stomach, uterine 
bladder, stomach 
pancreas, colon. Madder 
bladder, lung, head & neck, ovarian, 
bladder 

bladder, glioblastoma, tung. stomach 
lung, ovarian, uterine, bladder 
bladder, pancreas, stomach, uterine, lung 
bladder 
bladder 



glioblastoma, stomach, colon, cervical 



bladder 
bladder 
bladder, tung 

bladder, tung, head & neck, cervical 

bladder, pancreas, stomach, ovarian, lung 

bladder, ovarian, lung 

bladder, ovarian, lung 

bladder, ovarian, lung 

bladder, stomach, head & neck, cervical 

bladder, tung, head & neck 

bladder, lung, head & neck 

bladder, lung, cervical 



bladder, pancreas 

bladder, ovarian 

pancreas, bladder 

bladder, rung, ovarian, stomach 

lung, bladder, ovarian, pancreas 

bladder 

bladder 

bladder 

bladder 

bladder, prostate 
bladder, prostate 



bladder, lung, head & neck, cervical 
bladder, lung, head &fleck, cervical 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
hepatitis C, tung, fibrosis, bladder 

breast pancreas, head & neck, lung, stomach, bladder, cervical, colon 

bladder, tung, head & neck, cervical 

glioblastoma 

glioblastoma 

glioblastoma 

glioblastoma, pancreas 

glioblastoma 

glioblastoma 

glioblastoma, tung, cervical, bladder 
glioblastoma 
glioblastoma, uterine 
glioblastoma, uterine 
gfiobtastoma, lung 
glioblastoma 



glioblastoma 



glioblastoma, tung, colon 
glioblastoma 

glioblastoma ovarian, bladder, lung 
glioblastoma, tung 
glioblastoma 
glioblastoma 

glioblastoma* ovarian, uterine 
glioblastoma 



glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma 



bladder, breast pancreas, head & neck, stomach, lung, arthritis, renal 

bladder, breast pancreas, head & neck, stomach, lung, arthritis, renal 

breast renal, ovarian, glioblastoma 

renal 

renal 

renal, coton 
renal, lung 

renal lung, colon, stomach, ovarian, uterine 
renal, prostate 

946 



Ab, imaging 

Ab, sm, CTU imaging 

Ab,sm, CTU imaging, diagnostic 

CTL 

Ab, sm, CTL, diagnostic 

Ab, diagnostic 

Ab, CTL, diagnostic 

Ab, sm, diagnostic 

sm, CTL 

sm,CTL 

CTL 

Ab, imaging 
Ab, CTUtrnagirtg 
Ab, sm. CTL, Imaging 



Ab, sm, diagnostic 

Ab, sm, CTL, diagnostic 

Ab, sm, CTL, diagnostic 

Ab, sm, CTU diagnostic 

Ab, diagnostic 

CTL, diagnostic 

Ab, sm, CTL, imaging 

Ab, sm, CTL, imaging 

sm,CTL 

Ab,CTL, imaging 

Ab, sm, CTL, diagnostic 

Ab^m, CTL, imaging 

sm, CTL 

Ab, sm, imaging 

sm,CTL 

Ab,CTL, imaging 

Ab.CTL imaging 

Ab.CH, imaging 

Ab.CTL imaging 

Ab.CTL Imaging 

Ab.CTL imaging 

sm 

sm 

Ab^m, CTU Imaging 
Ab,sm, CTU Imaging 
Ab^m, CTU imaging 
Ab, CTU diagnostic 
Ab, CTU ©agnostic 
Ab, diagnostic 
Ab,stn, CTU imaging 
Ab,sm, CTU Imaging 
Ab,sm, CTU imaging 
sn\CTL 

Ab, CTU diagnostic 
Ab.sm, CTU imaging 
Ab, CTU Imaging, diagnostic 
Ab.sm, CTU imaging 



Ab.sm, CTU imaging 
Ab, CTU diagnostic 
Ab.sm, CTU Imaging 
Ab, CTU diagnostic, imaging 
Ab, CTU Diagnostic 
sm,CTL 

Ab,sm, CTU Imaging 
Ab, CTU diagnostic 
Ab, CTU diagnostic 
Ab, CTU Imaging 



Ab.sm, CTU imaging 
Ab, CTU diagnostic 
Ab,sm, CTU imaging 
Ab,sm, CTU Imaging 
Ab^sm, CTU Imaging 
Ab^sra CTU Imaging 
Ab^m, CTU imaging 
sm, CTL 

Ab. CTU diagnostic 
Ab, CTU diagnostic 
Ab,sm, imaging 
Ab,sm. CTU imaging 
Ab^m, CTU imaging 
Ab, CTU Imaging 
Ab^m, CTU Imaging 
Ab^m, CTU Imaging 
Ah^m, CTU Imaging 
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Seq ID 309 & 310 420737 
Seq ID 311 4 312 309931 
Seq ID 313 & 314 412719 
Seq ID 315 & 316 417034 
Seq ID 317 & 318 430486 
Seq ID 319 & 320 413753 
Seq ID 321 & 322 425650 
Seq ID 323 & 324 423673 
Seq ID 325 & 326 418663 
Seq ID 327 & 328 418663 
Seq ID 329 & 330 429610 
Seq ID 331 & 332 406690 
Seq ID 333 & 334 431846 
Seq ID 335 & 336 4221 58 
Seq ID 337 & 338 431958 
Seq ID 339 & 340 437044 
Seq ID 341 & 342 428484 
Seq ID 343 & 344 429211 
Seq ID 345 & 346 417389 
Seq ID 347 & 348 431009 
Seq ID 349 & 350 417542 
Seq ID 351 & 352 449230 
Seq ID 353 & 354 410555 
Seq ID 355 & 356 410S55 
Seq ID 357 4 358 424687 
Seq ID 359 & 360 418462 
Seq ID 361 & 362 410274 
Seq ID 363 & 364 439606 
Seq ID 365 & 366 404877 
Seq ID 367 & 368 444781 
Seq ID 369 & 370 109424 
Seq ID 371 & 372 415817 
Seq ID 373 & 374 415817 
Seq ID 375 & 376 415817 
Seq ID 377 & 378 415817 
Seq ID 379 & 380 415817 
Seq ID 381 & 382 415817 
Seq ID 383 & 384 421817 
Seq ID 385 & 386 416678 
Seq ID 387 & 388 418678 
Seq ID 389 & 390 409420 
Seq ID 391 & 392 332180 
Seq ID 393 & 394 408790 
Seq ID 395 & 395 408790 
SeqlD397& 398 439223 
Seq ID 399 & 400 409757 
Seq ID 401 & 402 426969 
Seq ID 403 & 404 428969 
Seq ID 405 & 406 428969 
Seq ID 407 & 408 428969 
Seq ID 409 & 410 450701 
SeqtD411 4412 450701 
Seq ID 413 & 414 414774 
Seq ID 415 & 416 407944 
Seq ID 417 & 418 407944 
Seq ID 419 & 420 428486 
Seq ID 421 & 422 457489 
Seq ID 423 & 424 432874 
Seq ID 425 & 426 445891 
Seq ID 427 & 428 445891 
Seq ID 429 & 430 404682 
Seq ID 431 & 432 429547 
Seq ID 433 4 434 425921 
Seq ID 435 & 436 407242 
Seq ID 437 & 438 407242 
Seq ID 439 & 440 407242 
Seq ID 441 4442 432596 
Seq ID 443 4 444 444006 
Seq ID 445 4 446 423685 
Seq ID 447 4 448 428392 
Seq ID 449 4 450 429597 
Seq ID 451 4 452 448030 
Seq ID 453 4 454 422109 
Seq ID 455 4 456 419235 
Seq ID 457 4 458 449048 
Seq ID 459 4 460 427333 
Seq ID 461 4462 417931 
Seq ID 463 4464 419216 
Seq ID 465 4 466 431629 
Seq ID 467 4 468 413554 
Seq ID 469 4 470 452194 
Seq ID 471 4 472 452194 
Seq ID 473 4 474 426322 



renal 
lung 

lung, head 4 neck, bladder, glioblastoma, cervical 

lung, head 4 neck, a couple cervical 

lung, bladder, head 4 neck, cervical 

lung, bladder, head 4 neck, pancreas, cervical, colon 

king, head 4 neck, cervical, bladder 

bladder, king, head 4 neck, ovarian, pancreas, colon, stomach, uterine, cervical 
king, bladder, head 4 neck, cervical 
lung, bladder, head 4 neck, cervical 
tang 

lung, head 4 neck, pancreas, stomach, bladder, colon, cervical 
lung, bladder, head 4 neck, uterine, cervical, stomach, ovarian 
head 4 neck, bladder, lung, cervical, stomach 
lung, bladder, cervical, head 4 neck, ovarian, colon, prostate, pancreas, t 
head & neck, cervical, lung, bladder, breast, prostate, ovarian, stomach 
king, glioblastoma, bladder, head 4 neck, colon, cervical 
lung, bladder, head 4 neck, cervical stomach 
ovarian, hing, bladder, uterine, cervical, pancreas, stomach 
lung, bladder, head 4 neck 
lung 

lung, cervical, head 4 neck, bladder, ovarian, colon 
lung 
lung 

head 4 neck, pancreas, lung, uterine, cervical, colon, stomach 
lung, bladder 
lung, renal 

lung, bladder, head & neck, cervical . 
lung, bladder 

lung, bladder, head 4 neck, cervical 
bladder, lung 

lung, gfobJasbrna, head 4 neck, cervical, fibrosis 
lung, glioblastoma, head 4 neck, cervical, fibrosis 
lung, glioblastoma, head 4 neck, cervical, fibrosis 
lung, glioblastoma, head 4 neck, cervical, fibrosis 
lung, gfebfastoma, head 4 neck, cervical, fibrosis 
lung, gSobtastoma, head 4 neck, cervical, fibrosis 
lung, cervical, head & neck & bladder 
lung, bladder, stomach, ovarian, pancreas 4 cervical 
lung, bladder, stomach, ovarian, pancreas & cervical 
lung, head 4 neck, pancreas, stomach, cervical, bladder 
lung 
lung 
lung 

lung, head 4 neck, cervical, bladder 4 colon 
pancreas, stomach, rung, bladder, stomach 
lung, cervical 
lung, cervical 
lung, cervical 
hmg, cervical 

king, head 4 neck, pancreas, breast, stomach, ovarian, colon 
hmg, head 4 neck, pancreas, breast, stomach, ovarian, colon 
lung, bladder, head 4 neck, pancreas, stomach, ovarian 
hmg, head 4 neck 
lung, head 4 neck 
pancreas 

pancreas, prostate, lung 
pancreas, stomach 
stomach, pancreas, ovarian 
stomach, pancreas, ovarian 
pancreas 

pancreas, head & neck, lung, ovarian 
stomach, pancreas 

pancreas, colon, bladder, head 4 neck, stomach, lung, ovarian, cervical 
pancreas, colon, bladder, head 4 neck, stomach, lung, ovarian, cervical 
pancreas, colon, bladder, head 4 neck, stomach, lung, ovarian, cervical 
pancreas, breast 

pancreas, colon, lung, ovarian 4 cervical 
pancreas, uterine, colon 



pancreas, colon, stomach, hmg 

pancreas, renal and stomach, 

pancreas, hmg, colon 

pancreas, fibrosis, head & neck & tung 

pancreas, ovarian, uterine, glioblastoma, head & neck & lung 

pancreas, coton 

ovarian, pancreas, stomach, colon, uterine, prostate 

pancreas, lung, stomach, cervfcai, prostate, head & neck 

pancreas, uterine, cervical, stomach 

pancreas, glioblastoma 

stomach, pancreas, renal, coton 

stomach, pancreas, renal, colon 

pancreas, bladder, stomach 



Ab.sm, CTL. imaging 
Ab,sm, CTL imaging 

cn 

Ab, CTL, diagnostic 
Ab.sm, imaging 
CTL, diagnostic 
Ab,ffnaging 
Ab, CTL, diagnostic 
Ab. imaging 
Ab, imaging 
CTL, diagnostic 
Ab, imaging 
Ab, imaging 
diagnostic 
Ab, CTL.ii 
Ab, imaging 
Ab, sm, imaging 
Ab, imaging 
ad, diagnostic 
Ab, sm, imaging 
CTL, diagnostic 
Ab, imaging 
Ab,sm, imaging 
Ab, sm, imaging 
Ab, sm, diagnostic 
Ab, imaging 



Ab^m, imaging 
CTL 

Ab, imaging 
Ab, sm, CTL, imaging 
Abism, CTL, imaging 
Ab,sm, CTL, Imaging 
Ab,sm, CTL, imaging 
Ab,sm, CTL, imaging 
Ab,sm, CTL, imaging 
Ab.sm, CTL, imaging 
Ab^m, CTL. Imaging 
CTL 
CTl 

CTL, diagnostic 
Ab, sm, imaging 
Ab,sm, CTL, imaging 
Ab^m, CTL. imaging 
Ab, CTL, imaging 
Ab, CTL, diagnostic 
Ab, CTL diagnostic 
Ab, CTL. diagnostic 
Ab.CTL,diagnosfic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL diagnostic 
Ab, sm, diagnostic 
Ab, sm, imaging 
Ab, sm, imaging 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTl, diagnostic 
Ab, CTL, Imaging 
Ab, CTL, imaging 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab,sm, CTL, Imaging 
Ab,sm, CTL, imaging 
Ab,sm, CTl, imaging 
Ab^m, CTL, imaging 
CTL 

Ab, CTL, imaging 
CTL 

Ab, CTL, diagnostic 
Ah,sm, CTL, imaging 
Ab^m, imaging 
Ab, CTL, diagnostic 
Ab, CTL, imaging 
Ab, CTL, knaging 
Ab, sm, imaging 
Ab, diagnostic 
Ab, CTU diagnostic 
Ab, CTL, diagnostic 
Ab, CTU diagnostic 
flhjsm. CTL, imaging 
Ab^sm, CTL, imaging 
Ab, diagnostic 
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Seq 10 475 & 476 429010 
Seql0477&478 431462 
Seq ID 479 & 480 448243 
Seq 10*481 & 482 426427 
Seq ID 483 & 484 428187 
Seq ID 485 & 486 428579 
Seq ID 487 & 488 428579 
Seq ID 489 & 490 428579 
Seq ID 491 & 492 428579 
Seq ID 493 & 494 445537 
Seq ID 495 & 496 422278 
Seq ID 497 & 498 424620 
Seq ID 499 & 500 406400 
Seq ID 501 & 502 431130 
Seq ID 503 & 504 420440 
Seq ID 505 & 505 428450 
SeqlD507& 508 446619 
Seq ID 509 & 510 436982 
Seq ID511&512 453392 
Seq ID 513 431989 
Seq ID 514 439820 
Seq ID 515 & 516 409178 
Seq ID 517 & 518 426514 
Seq ID 519 & 520 421478 
Seq ID 521 & 522 421478 
Seq ID 523 & 524 425776 
Seq ID 525 & 526 425776 
Seq ID 527 & 528 452097 
Seq ED 529 & 530 416530 
SeqlD53U532 431515 
Seq ID 533 & 534 419452 
Seq ID 535 & 536 412078 
Seq ID 537 & 538 412078 
Seq ID 539 & 540 411773 
SeqlD54U 542 407792 
Seq ID 543 & 544 431616 
Seq ID 545 & 546 452792 
Seq ID 547 & 548 400294 
SeqlD549& 550 432653 
Seq ID 55U 552 432653 
Seq ID 553 & 554 432653 
Seq ID 555 & 556 432653 
Seq ID 557 & 558 400290 
Seq ID 559 & 560 410001 
Seq ID 561 & 562 418396 
Seq ID 563 & 564 451027 
Seq ID 565 & 566 446057 
Seq ID 567 &S68 433466 
Seq ID 569 & 570 453370 
Seq ID 571 & 572 453370 
Seq ID 573 & 574 414569 
Seq ID 575 & 576 413435 
Seq ID 577 & 578 426501 
Seq ID 579 & 580 448999 
Seq ID 581 & 582 408369 
Seq ID 583 & 584 412628 
SeqlD585& 586 403047 
Seq ID 587 & 588 403047 
Seq ID 589 & 590 408430 
Seq ID 591 & 592 445413 
Seq ID 593 & 594 451982 
Seq ID 595 & 596 427958 
Seq ID 597 & 598 421887 
Seq ID 599 & 600 425071 
Seq ID 601 & 602 432101 
Seq ID 603 & 604 407786 
Seq ID 605 & 60S 416836 
Seq ID 607 & 608 416539 
Seq ID 609 & 610 131083 
Seq ID 611 & 612 131083 
Seq ID 613 & 614 131083 
Seq ID 615 & 616 131083 
Seq ID 617 & 618 422424 
Seq ID 619 & 620 428970 
Seq ID 621 & 622 428330 
Seq ID 623 & 624 439018 
Seq ID 625 & 626 420610 
Seq ID 627 & 628 425723 
Seq ID 6^ & 630 456662 
Seq ID 631 & 632 418281 
Seq tO 633 & 634 429903 



pancreas, lung, glioblastoma 

ovarian, uterine, lung, stomach, head & neck, glioblastoma, pancreas 

ovarian, king, head & neck, cervical, colon, uterine, stomach 

ovarian, uterine, colon, stomach 

ovarian, Ewing, uterine, pancreas 

ovarian, Ewing, uterine, pancreas 

ovarian, Ewing, uterine, pancreas 

ovarian, Ewing, uterine, pancreas 

ovarian, bladder, uterine, breast, lung, head & neck, renal, fibrosis, pancreas, cervical 

ovarian, head & neck, bladder, cervical, lung 

ovarian 

ovarian, uterine 

ovarian, uterine 

ovarian, uterine, cervteal 

ovarian, cervical, pancreas, lung 

ovarian, fibrosis, pancreas, head & neck, rung, colon 

ovarian, fibrosis 

ovarian, lung, glioblastoma 

ovarian 

ovarian, uterine, cervical, breast, prostate 
ovanan, breast * 

ovarian, colon, bladder, lung, cervical 
ovarian, bladder 
ovarian, bladder 
ovarian, uterine, lung 
ovarian, uterine, lung 



ovarian, uterine 

ovarian, prostate, lung 

ovarian, prostate, lung, breast, uterine 

ovarian 

ovarian 

ovarian / 
ovarian, uterine, cervical, pancreas 
prostate, pancreas, colon 
prostate, uterine, breast 
prostate, taxol prostate 
prostate, lung 
prostate, lung 
prostate, lung 
prostate, lung 
prostate, colon 

ovarian, prostate, uterine, cervical, lung 
prostate 

prostate, uterine, glioblastoma 



prostate 
prostate 
prostate, glioblastoma, pancreas 



prostate, glioblastoma 

prostate, lung, fibrosis, uterine, glioblastoma, cervical, ovarian 



prostate, bladder, colon 
prostate, bladder, colon 



prostate, colon, uterine, ovarian, lung, pancreas 
prostate, bladder 
prostate, glioblastoma 



prostate, colon, stomach, uterine, cervical, head & neck, pancreas 



prostate, colon, uterine, stomach. Inflammatory bowel disease, ovarian 

Prostate, Ewing, glioblastoma 

ZD1839 resistant cancers, head & neck 

androgen withdrawal prostate 

androgen withdrawal prostate 

androgen withdrawal prostate 

androgen withdrawal prostate 

bladder, pancreas, prostate, anagenesis, colon, stomach, lung 
stomach, pancreas, colon 

uterine, ovarian, fibrosis, prostate, pancreas, lung, bladder, head & neck 

uterine, stomach, prostate 

uterine, ovarian endometrioid, lung 

ovarian enoometriokl, uterine, colon 

uterine, ovanan 

uterine, ovarian 

lung 



sm,CTL 
Ab, diagnostic 
Ab.sm, imaging 
sm,CTL 

Ab,sm, CTU imaging 
Ab,sm, CTU imaging 
Ab,sm, CTU imaging 
Abism, CTU imaging 
Ab^m, CTU imaging 
Ab, CTU diagnostic 
Ab,sm, imaging 
Ab.Cn, diagnostic 
Ab, CTU diagnostic 
AD.CTL, diagnostic 
Ab, CTU diagnostic 
sm 

Ab, diagnostic 
Ab, CTL, diagnostic 
CTL 



Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTU Imaging 
Ab, CTU imaging 
Ab,sm, CTU imaging 
Ab,£m, CTU imaging 
Ab, sm, diagnostic 
Ab, CTU diagnostic 
Ab,sm, CTU Imaging 
Abvsm, CTU imaging 
CTL 
CTL 

Ab, sm, CTU diagnostic 
A^ CTU diagnostic 
Ab,sm, CTU imaging 
Ab, CTU imaging 
AbjSm, CTU imaging 
Ab^m, CTU imaging 
Ab,sm, CTU imaging 
Ab.sm, CTU imaging 
Ab.sm, CTU imaging 
Ab,sm, CTU imaging 
Ab, CTL, diagnostic 
Ab.sm, CTU imaging 
Ab,sm, CTU imaging 
Ab,sm, CTU imaging 
Ab, CTU diagnostic 
Ab.sm, CTU imaging 
Atvsm, CTU Imaging 
Ab.sm, CTU imaging 
Ab,sm, diagnostic 
Ab, CTl, imaging 
Ab,sm, CTU imaging 
Ab,sra CTU Imaging 
Ab, CTU diagnostic 
Ab'sm, CTU imaging 
Ab,sm, CTU Imaging 
Ab,sm, CTU Imaging 
diagnostic 
CTL 

Ab^rn, CTU Imaging 
Ab, CTU imaging 
Ab, diagnostic 
Ab,sm, imaging 
Ab,sm, Imaging 
Ab.Cn, diagnostic 
Ab, sm, CTU Imaging 
Ab.sm,CTU imaging 
Ab, sm, CTU Imaging 
Ab,sm. CTU Imaging 
Ab. sm, CTU imaging 
Ab, CTU diagnostic 
Ab.sm, Imaging 
Ab, sm, CTU diagnostic 
Ab^sm, CTU imaging 
CTL 

Ab, CTU diagnostic 
CTL 

Ab, CTU diagnostic 
sm 
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Table 78 

Seq ID NO: 1 DMA sequence 

Nucleic Acid Accession #: NMJ>01400 

Coding sequence: 251.. 1399 

1 11 21 31 41 51 

1 I I I I I 

TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60 

AACGCAACTT CGCCCTGCTT GAG06AGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120 

AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180 

CTCGCCTCGC CCTCTAGCGT TGGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300 

TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360 

CAGOGCGGAC AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420 

CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480 

CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540 

CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660 

CATCGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GGAGCAATAA 720 

CTTCCGCCTC TTCCTGCTAA TCAGCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780 

GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840 

CTACCACAAG CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 

OGTCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT OGGAGCCGCC GCCTGACGTT 960 

CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAG TCGCTGGOGC TGCTCAAGAC 1020 

GGTAATTATC GTCCTGAGCG TCTTCATCGC CTGCTGGGCA CCGCTCTTCA TCCTGCTCCT 1080 

GCTGGATGTG GGCTGCAAGG TGAAGACCTG TGACATCCTC TTCAGAGCGG AGTACTTCCT 1140 

GGTGTTAGCT GTGCTCAACT CCGGCACCAA CCCCATCATT TACACTCTGA CCAACAAGGA 1200 

GATGCGTCGG GCCTTCATCC GGATCATGTC CTGCTGCAAG TGCCCGAGOG GAGACTCTGC 1260 

TGGCAAATTC AAGCGACCCA TCATCGCCGG CATGGAATTC AGCCGCAGCA AATCGGACAA 1320 

TTCCTCCCAC CCCCAGAAAG ACGAAGGGGA CAACCCAGAG ACCATTATGT CTTCTGGAAA 1380 

CGTCAACTCT TCTTCCTAGA ACTGGAAGCT GTCCACCCAC CGGAAGOGCT CTTTACTTGG 1440 

TCGCTGGCCA CCCCAGTGTT TGGAAAAAAA TCTCTGGGCT TCGACTGCTG CCAGGGAGGA 1500 

GCTGCTGCAA GCCAGAGGGA GGAAGGGGGA GAATACGAAC AGCCTGGTGG TGTCGGGTGT 1560 

TGGTGGGTAG AGTTAGTTCC TGTGAACAAT GCACTGGGAA GGGTGGAGAT CAGGTCCCGG 1620 

CCTGGAATAT ATATTCTACC CCCCTGGAGC TTTGATTTTG CACTGAGCCA AAGGTCTAGC 1680 

ATTGTCAAGC TCCTAAAGGG TTCATTTGGC CCCTCCTCAA AGACTAATGT CCCCATGTGA 1740 

AAGCGTCTCT TTGTCTGGAG CTTTGAGGAG ATGTTTTCCT TCACTTTAGT TTCAAACCCA 1800 

AGTGAGTGTG TGCACTTCTG CTTCTTTAGG GATGCCCTGT ACATCCCACA CCCCACCCTC 1860 

CCTTCCCTTC ATACCCCTCC TCAACGTTCT TTTACTTTAT ACTTTAACTA CCTGAGAGTT 1920 

ATCAGAGCTG GGGTTGTGGA ATGATCGATC ATCTATAGCA AATAGGCTAT GTTGAGTACG 1980 

TAGGCTGTGG GAAGATGAAG ATGGTTTGGA GGTGTAAAAC AATGTCCTTC GCTGAGGCCA 2040 

AAGTTTCCAT GTAAGGGGGA TCCGTTTTTT GGAATTTGGT TGAAGTCACT TTGATTTCTT 2100 

TAAAAAACAT CTTTTCAATG AAATGTGTTA CCATTTCATA TCCATTGAAG CCGAAATCTG 2160 

CATAAGGAAG CCCACTTTAT CTAAATGATA TTAGCCAGGA TCCTTGGTGT CCTAGGAGAA 2220 

ACAGACAAGC AAAACAAAGT GAAAACOGAA TGGATTAACT TTTGCAAACC AAGGGAGATT 2280 

TCTTAGCAAA TGAGTCTAAC AAATATGACA TCCGTCTTTC CCACTTTTGT TGATGTTTAT 2340 

TTCAGAATCT TGTGTGATTC ATTTCAAGCA ACAACATGTT GTATTTTGTT GTGTTAAAAG 2400 

TACTTTTCTT GATTTTTGAA TGTATTTGTT TCAGGAAGAA GTCATTTTAT GGATTTTTCT 2460 

AACCCGTGTT AACTTTTCTA GAATCCACCC TCTTGTGCCC TTAAGCATTA CTTTAACTGG 2520 

TAGGGAAOGC CAGAACTTTT AAGTCCAGCT ATTCATTAGA TAGTAATTGA AGATATGTAT 2580 

AAATATTACA AAGAATAAAA ATATATTACT GTCTCTTTAG TATGGTTTTC AGTGCAATTA 2640 

AACGGAGAGA TGTCTTGTTT TTTTAAAAAG AATAGTATTT AATAGGTTTC TGACTTTTGT 2700 
GGATCATTTT GCACATAGCT TTATCAACTT TTAAACATTA ATAAACTGAT TTTTTTAAAG 

Seq ID NO i 2 Protein sequence 
Protein Accession #* NP 001391 



1 
I 

MGPTSVPLVK 
LENIFVLLTI 
EGSMFVALSA 
GWNCISALSS 
ISKASRSSEK 
VLNSGTNPII 
PQKDBGDNPE 



11 
I 

AHRSSVSDYV 
WKTKKFHRPM 
SVFSLLAIAI 
CSTVLPLYKK 
SLALLKTVII 
YTLTNKEMRR 



21 
I 

NYDIIVRHYN 
YYFIGNLALS 
ERYITMLKKK 
HYILPCTTVP 
VLSVFIACHA 
AFIRIHSCCK 



31 
I 

YTGKLNISAD 
DLLAGVAYTA 
LHNGSNNFRL 
TLLLLSIVIL 
PliFILLLLDV 
CPSGDSAGKF 



41 SI 

I I 
KENSIKLTSV VFILICCPII 
NLLLSGATTY KLTPAQWFLR 
FLLI6ACWVI SLILGGLPIM 
YCRIYSLVRT RSRRLTFRKN 
GCKVKTCDIL FRAEYFLVLA 
KRPIIAGMEF SRSKSDNSSH 



60 
120 
180 
240 
300 
360 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NMJ>02205.1 

Coding sequence: 1..3149 



ATGGGGAGCC 
CGCCGACCCC 
GGCTTCAACT 
GGATTCTCAG 
CCCAAGGCTA 
TGGGGTGCCA 
CTGGAGTCCT 
TGGTTCGGGG 
AGCTGGCGCA 
GATAACTTCA 



11 
I 

GGACGCCAGA 
CGCTSSTGCC 
TAQACGCGGA 
TGGAGTTTTA 
ATACCAGCCA 
GCCCCACACA 
CACTGTCCAG 
CAACAGTTCG 
CAGAGAAGGA 
CCCGAATTCT 



21 
I 

GTCCCCTCTC 
GCTGCTGTTG 
GGCCCCAGCA 
COGGCCGGGA 
GCCAGGAOTO 
GTGCACCCCC 
CTCAGAGGGA 
AGCCCATGGC 
GCCACTGAGC 
GGAGTATGCA 



31 
I 

CACGCCGTGC 
CTGCTSSTGC 
GTACTCTCGG 
ACAGACGGGG 
CTGCAGQGTG 
ATTGAATTTG 
GAGGAGCCTG 
TCCTCCATCT 
GACCCCGTGG 
CCCTGCCGCT 



41 

I 

AGCTGCGCTG 
CGCCGCCACC 
GGCCCCCGGG 
TCAGTGTGCT 
GTGCTGTCTA 
ACAGCAAAGG 
TGGAGTACAA 
TGGCATGCGC 
GCACCTGCTA 
CAGATTTCAG 



51 
I 

GGGCCCCCGG 
CAGGGTCGGG 
CTCCTTCTTC 
GGTGGGAGCA 
CCTCTGTCCT 
CTCTCGGCTC 
GTCCTTGCAG 
TCCACTGTAC 
CCTCTCCACA 
CTGGGCAGCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTGGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTAOGT CTACCTGCAG CACCCAGCGG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

OGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT COGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCOG CCATGTTCAA CCCAGAGGAG OGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCOGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCGGTGGA GGCTCAGGCC CAGGTGACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTAOGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GOGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 28B0 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 



Seq ID NO: 4 Protein sequence 
Protein Accession #: NP_002196.1 

1 11 21 31 41 51 

I 1 I I I I 

MGSRTPESPL HAVQLRWGPR RRPPUjPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GPSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEXEPLS DPVGTCYLST 180 

DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEPTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTBDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RPGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVPPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLZVGSFG 480 

VDKAWYRGR PIVSASASLT IPPAMFNPEE RSCSLEGKPV ACINLSFCLN ASGKHVADSI 540 

GPTVELQLDW QKQKGGVRRA LPLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNP SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTPHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV C33LGNPMKAG ASLWGGLRPT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTLNGVSKP EAVLPPVSDW HPKDQPQKBE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT THHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTPLQREH QPPSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVOWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGPFKRSL PYGTAMEKAQ LKPPATSDA 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: NM_002211.1 

Coding sequence: 1. .2397 

1 11 21 31 41 51 

I I I I I I 

ATGAATTTAC AACCAATTTT CTGGATTGGA CTGATCAGTT CAGTTTGCTG TGTGTTTGCT 60 

CAAACAGATG AAAATAGATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 120 

GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 180 

TCTGCACGAT GTGATGATTT AGAAGCCTTA AAAAAGAAGG GTTGCCCTCC AGATGACATA 240 

GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA CCGTAGCAAA 300 

GGAACAGCAG AGAAGCTCAA GCCAGAGGAT ATTACTCAGA TCCAACCACA GCAGTTGGTT 360 

TTGCGATTAA GATCAGGGGA GCCACAGACA TTTACATTAA AATTCAAGAG AGCTGAAGAC 420 

TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 480 

AATGTAAAAA GTCTTGGAAC AGATCTGATG AATGAAATGA GGAGGATTAC TTCGGACTTC 540 
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AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600 

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGCGC 720 

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC 840 

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900 

TGTCACCTGG AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTG CAST TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCTGCAAATT CTAGCAATOT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140 

GAAGTCATTT TGGAAAACGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200 

TGCAAGAAOG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGOGAAGGC ATCCCTGAAA GTCCCAAGTG TCATGAAGGA 1440 

AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGGATG CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT X680 

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TGTGTGTGAG 1740 

TGCAACCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800 

GCCAGCAACG GACAGATCTG CAATGGCCGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 1860 

ACAGATCOGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAG AAAGTCGGGA CAAATTACCC 2040 

CAGCOGGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100 

TTCTATTTTA CGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160 

CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220 

GTTCTTATTG GCCTTGCATT ACTGCTGATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 

AGGGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC GGGTGAAAAT 2340 
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA AGTATGAGGG AAAATGA 

Seq ID NO: 6 Protein sequence 
Protein Accession fh NP_002202.1 

1 11 21 31 41 51 

I 1 I I I I 

MMLQPIPWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 60 

SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTASKLKPBD ITQIQPQQLV 120 

LRLRSGBPQT PTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSIjGTDLM NEMRRITSDF 180 

RIGFGSFVEK TVMPYISTTP AKLENPCTSE QNCTSPFSYK NVLSLTNKGE VFNBLVGKQR 240 

ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ 300 

CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL 360 

SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420 

GDEVQFEISI TSNKCPKKDS DSPKIRPLGF TEEVEVILQY ICECECQSEG IPBSPKCHEG 480 

NGTFECGACR QJEGRVGRHC ECSTDEVNSE DKDAYCRKEN SSEICSNNGE CVCGQCVCRK 540 

RDNTNEIYSG KPCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE 600 

ASNGQICNGR GICEOGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660 

CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCH FYPTYSVNGN NEVMVHWEN 720 

PECPTGPDII PIVAGWAGI VLIGLALLLI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780 
PIYKSAVTTV VNPKYEGK 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: NM_002425 

Coding sequence: 26.. 1453 

1 11 21 31 41 51 

I I I I I I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGOCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GOCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTQACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT IS 60 
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GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 

Seq ID MO: 8 Protein sequence 
Protein Accession #: NP 002416 



MHLAFLVLLC 
KIQGMQKFLG 
PDLPRDAVDS 
HAYPPGPGLY 
PTELAQFRLS 
RGEYLFFKDR 
IRGNEVQAGY 
PRLIADDPPG 



11 
I 

LPVCSAYPLS 
LEVTGKLDTD 
AIEKALKVWE 
GDIHPDDDEK 
QDDVNGIQSL 
YFWRRSHWKP 
PRGIHTLGPP 
VEPKVDAVLQ 



21 
1 



TLEVMRKPRC 
EVTPLTFSRL 
WTEDASGTNL 
YGPPPASTEE 
EPEFHLISAF 
PTIRKIDAAV 
AFGPFYFFSG 



31 
I 

DLAQQYLEKY 
GVPDVGHPSS 
YEGEADIMIS 
FLVAAHELGH 
PLVPTKSVPS 
WPSLPSYLDA 
SDKEKKKTYF 



41 

i 

YNLEKDVKQF 
FPGMPKWRKT 
FAVKEHGDFY 
SLGLFHSANT 
GSEMPAKCDP 
AYEVNSRDTV 
FAADKYWRFO 
RMVTHILKSN 



51 
I 

RRKDSNLIVK 
HLTYRIVNYT 
SFDGPGKSLA 
EALMYPLYNS 
ALSPDAISTL 
FIFKGNEFWA 
ENSQSMEQGF 
SWLHC 



Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: XM_058189. 

Coding sequence: 169.. 774 



I 

GAAGACCAGC 
AGTATTTCAC 
TATGAAGTGA 
AAGTGTGGAG 
AACATATTAT 
AACTAOGTGT 
ACAGTTCTTC 
AGCAAAAAAT 
GGATACTGCC 
GATGGCTGGG 
TGGATTCAGT 
CTCATAACCC 
TCCAAGATAC 
ACAAAATGTT 
GACTTGAGGG 
TTTGCAGTCC 
AATATGCATC 
GAATCTGTAC 
TGCAACATTA 
AGGATTAAGT 
ATAAAAAATA 



11 
[ 

TCAGCTCTTC 
ACTGAGATTG 
AGGGCTCTGA 
GCTGCCTAAG 
TGTATTTCCC 
GGTATTTTGA 
TGGTACTGGA 
ATGTGACACT 
TGGTCATCTC 
AGTATGCTTT 
GCCTGGAACC 
TCAGTGGGCT 
TGTGTGGAAG 
TTCCATTATC 
CAATATTGAA 
TCACTGCACA 
AGAAACTTCA 
TAGGGAAGTA 
TTTAATATTC 
AAAGAGTGGT 
AAGAGTACTA 



21 

I 

AGTTGTTGAT 
TCGGCTGCGG 
CCCTGGAAGT 
TTGTTTGCTG 
GAATGGGCAA 
AGGAATCTGT 
GAATAATAAC 
GCTGTCAATT 
TGOCTTGGGT 
TGAAGGCACT 
TGCACATGTT 
TCAAGTGATC 
CTATTCAGTG 
AAGACATGGC 
ATGATGGTGC 
TGCAAGTATA 
GAAATACTTC 
AATAAGAATA 
TGGAAAATTG 
ACATACTGTA 
CATGGTTGTA 



31 
I 

CATTGTCTAT 
GTATATTCCA 
GGTTCTAAGC 
ATTCCGCTTG 
ACTTCCTATG 
TTCTCAGGCA 
AACTATAAAT 
ATCTTTTCTT 
CTTGTCCAAG 
GCTGGACGTT 
GTGGAGTGGA 
ATCTGCCTCA 
ATCTTCCAGC 
CATCTATCTA 
TTTCTGCATT 
CCACCCTTCC 
TGCCCTTTGA 
TGAGAGAAAC 
GAAACACCCC 
AATGTTTTCT 



41 

I 

TGTTCTCCAA 
ATTCCCCGTC 
AGGGCAAAAT 
CACTTTGGAG 
CATCCAGCAA 
TCATGATGCT 
GTTGCCAGAG 
CCCTCGGAAT 
GGCCATATTG 
TCCTTACAGA 
ACATCATTTT 
TCAGAGTAGT 
CTGGAATCAT 
AATATTATAT 
TGGTGTTTAT 
ATTTAGTATG 
TCAAACAAAT 
CTTTATGCAA 
AAAATTCTAA 
GATATTAAAA 



51 

I 

ACAGTAAACC 
TCCTCATGAA 
GGGGTCTCGG 
TATAATCGTG 
TAAACTCACC 
TATAGTAACA 
TGAAAACTGC 
TGCTTTTTCT 
CGGCACCCTT 
TTCTAGCATA 
ATTTTCCATT 
CATGCAACTA 
TTGAATAAGG 
CAACTGTGTA 
TTGTAAAAAA 
TTTTTTAAGT 
CCATTTCCAA 
ATATGTATAT 
ACTCAGAGGA 
AAAAAATTAA 



Seq ID NO: 10 Protein sequence 
Protein Accession #: XP 058189.1 



1 11 21 31 41 51 

I I I I I I 

MGSRKGGGCL SCLLIPLALW SIIVNILLYF PNGQTSYASS NKLTNYVWYF EGICFSGIMM 
LIVTTVLLVL ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG IAFSGYCLVI SALGLVQGPY 
CRTLDGWEYA FEGTAGRFLT DSSIWIQCLB PAHWEWNII LFSILITLSG LQVIICLIRV 
VMQLSKILCG SYSVIFQPGI I 

Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession #i NM_002421.2 

Coding sequence: 1..1409 



ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCA AGGACA 
CTTTCTGAGG 
GATGAATATA 



11 
I 

TTCCTCCACT 
TAQAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAACC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 



21 
I 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TGGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
TCATCGGGAC 
AGGCCCAGGT 
CAGAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCC3 
GCTTGAAGCT 
GTACTGGGCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 



31 
I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 



41 
I 

GTGTGGTGTC 
TCCAGAAATA 
GGAGAAATAG 
AAGTGACTGG 
TGCCTGATGT 
TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 



51 
I 

ACACAGCTTC 
CCTGGAAAAA 
TGGCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 



AGACATCATG 
TGGAGGAAAT 
TGATGAAGAT 
TCATGAACTC 
COCTAGCTAC 
AGCCATATAT 
ATGTGACAGT 
TAAAGACAGA 
TTCTGTTTTC 
AGATGAAGTC 
ACACGGATAC 
CGATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 12 Protein sequence 
Protein Accession it: NP_0 02412.1 



1320 
13S0 



KHSFPPLLLL 
VEKLKQKQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 
I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVBLNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 

I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 
I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGOHRD 
GHSLGLSHST 
KLTFDAITTI 
RPFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



51 

1 

VEKRRNSGPV 
QTHLTYRIBN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDPP 



Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence : 1 . . 1409 



1 
I 

ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 
GGAATTGGCC 
GGAACAAGAC 
AATAGCTGGT 



11 
I 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAACC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 
ACAAAGTTGA 
AATACAAATT 
TCAACTGCAG 



21 
I 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TCGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
TCATCGGGAC 
AGGCCCAGGT 
CAGAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCCG 
GCTTGAAGCT 
GTACTGGGCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 
TGCAGTTTTC 
TGATCCTAAA 
GAAAAATTAG 



31 
I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 
ATGAAAGATG 
ACGAAGAGAA 



41 

1 

GTGTGGTGTC 
TCCAGAAATA 
GGAGAAATAG 
AAGTGACTGG 
TGCCTGATGT 
TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



51 

I 

ACACAGCTTC 
CCTGGAAAAA 
TGGCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 
CTTCCAACTC 
AGACATCATG 
TGGAGGAAAT 
TGATGAAGAT 
TCATGCCCTC 
CCCTAGCTAC 
AGCCATATAT 
ATGTGACAGT 
TAAAGACAGA 
TTCTGTTTTC 
AGATGAAGTC 
ACACGGATAC 
CGATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 
TTTCTTTCAT 
CCAGAAAGCT 



Seq ID NO: 14 Protein sequence 
Protein Accession 8: NP_002412.1 



I 

MHSFPPLLLL 
VEKLRQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 
I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 

I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 



41 



VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHAL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RPFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



51 
I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDPP 



Seq ID NO: IS DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 141. .1580 



1 

! 

TCTGCGTGTG 
AGGCAAACAG 
CTCTACAGGC 
TCCGCGAGTT 
GCGGGATCCT 
CAGTTCTCAT 
GGGTGGACGT 
ACTTCATGTA 
CCGCCACCTA 
GCTATGAACC 
CGGCCCCTCC 
CTCGAAGCTG 
GGAAAAAGTA 



11 

I 

CCGGGGCTAG 
AGGAGGGAAG 
CTGTGTCGCT 
CACTCGCCAC 
CACTGACGTC 
CGCCTGCAGT 
GCTCTCTCTG 
CACTTCGCGC 
TTTGCAGATG 
TCTGGGCATC 
ACCAGGTAGT 
CAGTCAAGGC 
CAAGTACATC 



21 
I 

GGGCTGGAAG 
GCGTCTTAGG 
ATGGGTTCCC 
TCCTOCGACG 
ACGCTGCTGG 
GGCTTCTTCT 
CCCGGGGGTC 
CTGCGCCTCT 
GAGCACGTGG 

CCCAGGCGCT 
CCCCCCAGTC 
GTGCTAAACT 



31 
I 

TCCTGGCTCT 
ACTGCCTGGA 
CCGCCGCCCC 
TGCTGGGCAA 
TTGGCGGGCA 
ATTCAATTTT 
CCGAAGCGAG 
CTCCAGCCAC 
TCCAGGCATG 
CCCTGGAAGC 
CCGAAGGACA 
CAGCCAGCCC 
CTCAGGCCTC 



41 
1 

AGTTGCACCT 
TCCAGAGCAC 
GGAGGGAGCG 
CCTCAACGAG 
ACCCCTCAGA 
CCGGGGCCGT 
AGGCTTCGCC 
TGCACCAGCA 
CCACCGCTTC 
AGAACCCCCA 
CCCAGACCCA 
TGACCCCAAG 
CCAAGCAGGG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 



51 
I 

CGGAAGGAAA 
TTTCCTCGGC 
CTGGGCTACG 
CTGCGCCTGC 
GCACACAAGG 
GCGGGAGTCG 
CCTCTATTGG 
GTCCTAGCGG 
ATCCAGGCCA 
ACACCCCCAA 
CCTACTGAAT 
GCCTGCAACT 
AGCCTGGTCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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GGGAGAGAAG TTCTGGTCAA 
CCAGCAGCAG CAGCAGCAGC 
GCAGGCTCTC TCCAACTGCT 
CCTACCTCCT CACATCCCAG 
CACTACCGGG AAGTGAATTT 
CGGGGCTGGA CTCCTTGGTT 
GGTCTTCGTT OCGCTACAAG 
AGCCTTACCA CTGCTCAATC 
ACAGCCGCAT CCATTCGGGA 
TACAGGTGGC ACATCTGCGG 
GCCCTACCTG CGGAACCCGC 
ACACCGGAGA GAAGCCTTAC 
AACTGCGGCT GCATCTGCGC 
ACATTCTCGG GGGGCCCTAQ 
GAAAGCTGCA GGCCCAGGCC 
CACTTTGGTA TCAGAAATTG 
GATCCTGGCT AGATCTGCCT 
GTTTCTGAGG AGAGAGCTAG 
CCTAAGGGAA TAGCCCTCCA 
TTTATTGAGG CCTTTGGQTG 
TTCCACAAGT GTGATTAAAA 
CAGAGATTAC TAGCCCTTGG 
TAACTTTTAT CTTTAGAATT 
TGGAAAAAGG GGTTCTCTGT 
TCTAGGGCAG CTCTGGGAAC 
TTCTGGATGT TGTAGGTTCT 
CAAGGGTGAT AGGAACCATT 
GAGGCTGTGG GTGTGGGGGA 
GGGTGTGGGG GATTCTGTAT 
TCTGCAAGAT GGTCCAGAAT 
GCTGTATCCA TCTATAGTGG 
CACGGGGGCC TGTTCTTAGC 
TTATCAGAGA TGATGTGACC 
GGGAAGAATC ATGAAACTCT 
ACTACAGAGG CATATGGGTT 
TCTTCCTTTA GTGGGTTTTG 
TCCTCTAGAA GGGATGGTGC 
TCTTCCCATC CCTGCATTCC 
AAGAAAAGGG GCTGAGTTCC 
ATTACAGATG TAAAAGATTG 
TTCAAGTAGG ATTAAGAGGT 
GAAAGTGAGG AACAGGGTTG 
GCTGAAGCCT TGATTGATAG 
OGAGGGTAGA AAGTAAGAAG 
GTTCTAGTGG CTGTCGCCTG 
TCTCCCCATG GCCCCACTGC 
AGAAGGAATC ATGATTTCTA 
AGAAATGTTA GATCTTGCAA 
AAAAAAAAAA AAAAAA 



CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 
AGCAGCAGCA GTGAAGAAGG ACCCATTCCT OGTCCCCAGA 
GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 
GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 
TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 
CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 
GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 
TGOGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAAOGC 
GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 
GCGCACGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 
TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 
CACTGCGACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 
CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 
CTGAGCGCAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 
TTGCTTCCCT ATCAGGCTTG GGCATAGGGG TGTGCCAGGC 
CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 
CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 
CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 
CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 
GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 
GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 
CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 
GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 
GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 
ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 
CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 
ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 
TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 
CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 
CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 
TAGAGACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 
ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 
TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 
TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 
TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCGCCT 
GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 
TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 
TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAGGAA 
ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 
ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 
TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 
CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 
TTCTGCCCCT TGTTGCCCTG GGGCTTATCT GATTATGGGA 
CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 
GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 
AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 
TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 
CATCAGATCC TTGGAATAAA GAAGCCTCTC TGYGCWRAAA 



Seq ID NO: 16 Protein sequence 
Protein Accession #: FGENBSH predicted 



MGSPAAPEGA 
GFPYSIFRGR 
EHWQACHRP 
PPSPASPDPK 



FSOQNCEAVA 
CGARFNRPAN 
PRHLQTLKSH 



11 
I 

LGYVREPTRH 
AGVGVDVLSL 
IQASYEPLGI 
ACNWKKYKYI 
GPQSRLSPTA 
GCSSGLDSLV 



21 
I 

SSDVLGNLNE 
PGGPEARGFA 
SLRPLEABPP 
VLNSQASQAG 
ATVQFKCGAP 
PGDEDKPYKC 
EKPYKCETCG 



31 

I 

LRLRGILTDV 
PLLDFMYTSR 
TPPTAPPPGS 



41 

1 

TLLVGGQPLR 
LRLSPATAPA 
PRRSEGHPDP 
PCPQARLPSG 



ASTPYLLTSQ 
QLCRSSFRYK 
SRFVQVAHLR 
VRIHTGEKPY HCDPCGLHFR HKSQLRLHLR 



GNLASHRTVH 
AHVLIHTGEK 
QKHGAATNTK 



Seq ID NO: 17 DMA sequence 

Nucleic Acid Accession #: XMJ)39209 

Coding sequence: 1..2049 



ATGCTGAAGA 
GGAGATGCTA 
AATGGGAACC 
CTGCTGAGTG 
CGGAGTGACA 
ACAGAATGTG 
AGCCTGTTCC 
TGCAAAGACT 
CAAACAACTG 
CCAGATTTTC 
GAATATGACA 
GAGGTTGTGA 
CGTCTCTTCA 
TTCAAGGAGC 



11 

I 

TGCTCTCCTT 
AGTTTGGGGA 
CCCCGAAGCG 
GGGGAGAGAT 
GCCCGGGGCT 
GGAAGTTACT 
ACTCACCTGA 
ATTGCAAAGA 
CGGATGAGTT 
CAAGAAAACA 
AAGTGGAAGA 
GTGGGCTGOG 
TTCTGGAAAA 
CTTATTTGGA 



21 

! 

TAAGCTGCTG 
AAGAAACGAA 
CCTGAAAAGG 
GCTGTGCGGT 
AGGGCGCCTG 
GGAGGAAATC 
GAGAGAAGTC 
ATTCTTTTAC 
TTGCTTTTAC 
AGTCAGAGGA 
GATCAGCAGA 
GCAGCCCGTT 
AGAAGGTTAT 
CATTCACAAA 



31 

I 

GGGAGCGGAG 
AGAGACAGGA 
GGCTTCTACC 
GAGAATAAGA 
AAATGTGCAC 
TTGGAAAGAG 
ACTTGCCGAG 
TATGCAAGAA 
CCAGCATCTA 
AAGCACAAAC 
GGTGCCCTGC 
GTGAAGATAC 
CTTGTTCAAA 



41 
I 

TGGCTCTGGG 
CAAGGAGGAG 
GGATGATGTC 
CTCGGCTGTC 
TATTTTCTGT 
TTTGCTCTCC 
ACCTAGTACT 
GCCATATTCC 
AAGATGGTGG 
ACTACTTGGA 
ACAACTGCTT 
ATAGTGGGGA 
TTACCCCTGA 
GTGGAATAAA 




640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



51 
I 

AHKAVLIACS 
VLAAATYLQM 
PTESRSCSQG 



RARPLPGSEF 
TGEKPYHCSI 
PYPCPTCGTR 
VHYHILGGP 



60 
120 
180 
240 
300 
360 
420 



CCAGATGGAA 
CTGTATTCAG 
TGGCTCGCAA 
AGGAGAAATT 
GGGAGGAGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



954 



WO 03/042661 



GAAAGAGQAC TGCTAAGCCT CGCATTCCAT CCCAATTACA AGAAAAATGG AAAGTTGTAT 900 
GTGTCCTATA CCACCAACCA AGAACGGTGG GCTATCGGGC CTCATGACCA CATTCTTAGG 960 

GTTGTGGAAT ACACAGTATC CAGAAAAAAT CCACACCAAG TTGATTTGAG AACAGCCAGA 1020 

GTCTTTCTTG AAGTTGCAGA ACTCCACAGA AACCATCTGG GAGGACAACT GCTCTTTGGC 1080 

CCTGAOGGCT TTTTGTACAT CATTCTTGGT GATGGGATGA TTACACTGGA TGATATGGAA 1140 

GAAATGGATG GGTTAAGTGA TTTCACAGGC TCAGTGCTAC GGCTGGATGT GGACACAGAC 1200 

ATGTGTAACG TGCCTTATTC CATACCAAGG AGCAACCCAC ACTTCAACAG CACCAACCAG 1260 

CCCCCCGAAG TGTTTGCTCA TGGGCTCCAC GATCCAGGCA GATGTGCTGT GGATAGACAT 1320 

CCCACTGATA TAAACATCAA TTTAACGATA CTGTGTTCAG ACTCCAATGG AAAAAACAGA 1380 

TCATCAGCCA GAATTCTACA GATAATAAAG GGGAAAGATT ATGAAAGTGA GCCATCACTT 1440 

TTAGAATTCA AGCCATTCAG TAATGGTCCT TTGGTTGGTG GATTTGTATA CCGGGGCTGC IS 00 

CAGTCAGAAA GATTGTATGG AAGCTAOGTG TTTGGAGATC GTAATGGGAA TTTCCTAACT 1560 

CTCCAGCAAA GTCCTGTGAC AAAGCAGTGG CAAGAAAAAC CACTCTGTCT CGGCACTAGT 1620 

GGGTCCTGTA GAGGCTACTT TTCOGGTCAC ATCTTGGGAT TTGGAGAAGA TGAACTAGGT 1680 

GAAGTTTACA TTTTATCAAG CAGTAAAAGT ATGACCCAGA CTCACAATGG AAAACTCTAC 1740 

AAAATTGTAG ATCCCAAAAG ACCTTTAATG CCTGAGGAAT GCAGAGCCAC GGTACAACCT 1800 

GCACAGACAC TGACTTCAGA GTGCTCCAGG CTCTGTCGAA ACGGCTACTG CACCCCCACG 1860 

GGAAAGTGCT GCTGCAGTCC AGGCTGGGAG GGGGACTTCT GCAGAACTGC AAAATGTGAG 1920 

CCAGCATGTC GTCATGGAGG TGTCTGTGTT AGACOGAACA AGTGCCTCTG TAAAAAAGGA 1980 

TATCTTGGTC CTCAATGTGA ACAAGTGGAC AGAAACATCC GCAGAGTGAC CAGGGCAGAC 2040 
ATCACCTAG 



Seq ID NO: 18 Protein sequence 
Protein Accession #: XP_039209 

1 11 21 31 41 51 

i I I I I t 

MLKMLSPKLL LLAVALGFFE GDAKFGKRNE GSGARHRRCL NGNPPKRLKR RDRRMMSQLE 60 

LLSGGEMLCG GFYPRLSCCL RSDSPGLGRL ENKIPSVTNN TECGKLLEEI KCALCSPHSQ 120 

SLFHSPEREV LERDLVLPLL CKDYCKEFPY TCRGHIPGFL QTTADEFCPY YARKDGGLCF 180 

PDFPRKQVRG PASNYLDQME BYDKVBEISR KHKHNCFCIQ EWSGLRQPV GALHSGDGSQ 240 

RLFILEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGIiLSLAFH PNYKKKGKLY 300 

VSYTTNQERW AIGPHDHILR WEYTVSRKN PHQVDLRTAR VFLEVAELHR KHLGGQLLFG 360 

PDGFLYIILG DGMITLDDME EMDGLSDFTG SVLRLDVDTD KCNVPYSIPR SNPHFNSTNQ 420 

PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 480 

LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 540 

GSCRGYFSGH ILGFGEDBLG EVYILSSSKS MTQTHNGKLY KXVDPKRPLM PEECRATVQP 600 

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAD IT 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession 8: NM_014331.2 

Coding sequence t 1..1506 

1 11 21 31 41 51 

I I I I I I 

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGOGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATAICCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAOTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTAOGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC OCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCX3G CTACTGGGAA ATTTCTCATT AGCAGTTCOG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGOGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TOAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 



955 



WO 03/042661 



PCT/US02/36810 



TTATCTGTCA TTT T T TTTT T TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 
AGCAAGAOTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 
TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 
TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 
5 CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 
TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 
CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 
AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 

10 CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 
GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 
CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 
GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 

15 GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 20 Protein sequence 
Protein Accession 8: NP_0S5146.1 

1 11 21 31 41 51 

I I I I I I 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQKKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIPISPKGVL QNTGSVGMSL TIWTVOGVLS LFGALSYABL GTTIKKSGGH YTYILEVFGP 120 

25 LPAFVRVWVE LLIIRPAATA VIS1AFGRYI LEPPFIQCEI PELAIKLITA VGITWMVLN 180 

SMSVSWSARI QIFLTPCKLT AILIIIVPGV MQLIKGQTQN FKDAPSGRDS SITRLPLAFY 240 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 300 

NAVAVTFSER LLGN7SLAVP IFVALSCFGS HNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

30 FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEWPEEDK L 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



Seq ID NO: 21 DNA sequence 
35 Nucleic Acid Accession #: NM_002422.2 
Coding sequence: 64.. 14 97 

1 11 21 31 41 51 

Aft I | I I I 1 

40 ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGA CAA CAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300 

45 GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

50 GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900 

55 GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

. _ TTCTGGGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

60 GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

^ CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

65 AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 

70 AGACGATTTG TCAGTTGTTT T 

Seq ID NO: 22 Protein sequence 
Protein Accession fl: NPJJ02413 

75 1 11 21 31 41 SI 

I I I I I I 

MKSLPILLLL CVAVCSAYPL DGAARGEDTS MNLVQKYLEN YYDLEKDVKQ PVRRKDSGPV 60 

VKKIREMQKP LGLEVTGKLD SDTLEVMRKP RCGVPDVGHF RTPPGIPKWR KTHLTYRIVN 120 

0 _ YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGBADIM ISPAVREHGD FYPFDGPGNV 180 

SO LAHAYAPGPG INGDAHPDDD EQWTKDTTGT NLPLVAAHEI GHSLGLFHSA NTEALMYPLY 240 

HSLTDLTRFR LSQDDINQIQ SLYGPPPDSP ETPLVPTEPV PPBPGTPANC DPALSFDAVS 300 

TLRGEILIFK DRKFWRKSUR. KLEPELHIiIS SFMPSLPSGV DAAYEVTSKD LVFIFKGNQP 360 

WAIRGNEVRA GYPRGIHTLG FPPTVRKIDA AISDKEKNKT YPPVEDKYWR FDEKRNSMEP 420 
GFPKQIAEDP PGIDSKIDAV FEEFGFFYFP TGSSQLEFDP NAKKVTHTLK SNSWI2IC 

956 



WO 03/042661 PCT/US02/36810 



Seq ID NO: 23 DNA sequence 
Nucleic Acid Accession #: NM_006528 
^ Coding sequence: S7..764 

1 11 21 31 41 SI 

I I J I I I 

GCC6CCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

1 n ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

1U GOGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTAOGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

1 J GTGAAAAATT CTTTTCOGGT GGGTGTCACC GGAACOGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 

^ A GGAT TGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720 

l\J GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 
CC 

„ Seq ID NO: 24 Protein sequence 
J\J Protein Accession #: NP_006519 

1 11 21 31 41 51 

I I I t i I 

MDPARPLGLS ILLLPLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 60 

J J CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 120 

TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG OGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 



40 



45 



Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession fh NMJ>0S4S8.1 

Coding sequence: 1..2825 



1 11 21 31 41 51 

I I I I I I 

ATGGCTTCCC CGCGGAGGTC CGGGCAGOCA GGGCGGCCGC CGCCGCCGCC ACCGCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC OGGGGCCTGG 120 

GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 

cr\ GTGGAACTGG CCATCGAGCA GATCOGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

DKJ CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC OGTCACATCC 420 

ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 

03 CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 

AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 

OU CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTOGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020 

GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080 

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 

- CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

CO AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

„ CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1S00 

/U TTCTTCAACA TCAAGAACOG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1S60 

AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCAOGCC 1740 

_ _ ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 

O GTGGGGGGCA TGCTGCTGAT OGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 

CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 

ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 

TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCOGCAAC 2040 

q/\ GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA GAAOGTGGGG 2100 

OU ATCATGTGCA TCATCGGGGC CGCK3TCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 

GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 

CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340 

AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATOG CCTGCGAATG 2400 
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AAGATCACAO AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGA7GGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTOGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 
CTGTAA 

Seq ID NO: 26 Protein sequence 
Protein Accession th NP_00S449.l 

1 11 21 31 41 51 

I I I I I I 

MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LIXPLAPGAW GWARGAPRPP PSSPPLSIMG 60 

LHPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYPLD LRLYDTECDN AKGLKAFYDA 120 

IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFPRTVPSDN 180 

AVNPA1LKLL KHYQWKRVGT LTQDVQRPSE VRNDLTGVLY GEDIEISDTB SFSNDPCTSV 240 

KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQVJI IPGWYEPSWW EQVHTEANSS 300 

RCLRKNLLAA MEGYIGVDPE PLSSKQIKTI SGKTPQQYER EYNNKRSGVG PSKFHGYAYD 360 

GIWVIAKTLQ RAMBTLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQWFRNG 420 

BRMGTIKPTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 480 

LYSILSALTI LGMIMASAFL PFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 540 

GSFVSEKTPE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 600 

VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEK THMTIWLGIV 660 

YAYKGLLMLF GCFLAWETRN V3IPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 720 

FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 780 

NQASTSRLEG LQSENHRLRM KITELDKDLE BVTMQLQDTP EKTTYIKQNH YQELNDILNL 840 

GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: NM_000450.1 

Coding sequence: 117.. 1949 

1 11 21 31 41 51 

I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCAOC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 

TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600 

CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTGTG AACTGTACAG 660 

CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT GTGGTTGAGT 840 

GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 900 

TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 960 

AGAGCCTTCA GTGTACCTCA TOTGGGAATT GGGACAACGA GAAGCCAACG TGTAAAGCTG 1020 

TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 1080 

CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 1140 

AGGGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 1200 

GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGOG AGGCTACATG AATTGTCTTC 1260 

CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGTGTTGAA GGGATCCAAA AGGCTCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 1360 

AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCCGAAG GGTTTGGTGA 1440 

GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCAGCTGTG 1500 

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGAAAGA 1620 

TCAACAIGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 1680 

AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 1740 

GCCTGCTACC TACCTGTGAA GCTCCCACTG AGTCCAACAT TCCCTTGGTA GCTGGACTTT 1800 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CGGAAATGCT 1860 

TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCTTTAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980 

GGAACTAGAG GGATACACTG AAGTTAACAG AGACAGATAA CTCTCCTOGG GTCTCTGGCC 2040 

CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

OGCTGTAAAA TCTTGGCACA GAAACACAAT ATTTTGTGGC T TTC TTT C TT TTGCCCTTCA 2340 

CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCATAAGA ATGAATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 2580 

GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TTATAATCTT CCATACTTCT 2640 

TCATTCAATA CAAGTGTGGT AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 2700 

GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 
TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAGGCAT GATGTTAACC 2820 
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AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 28B0 

AGTAAAAACT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940 

CCACGATGAA AAACTTCCAT GAGGCCAAAC GTTTTQAACT AATAAAAGCA TAAATGCAAA 3000 

CACACAAAGG TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060 

GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 

TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180 

TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 

GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300 

AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 

AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420 

CCATTAACTT AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT GAACACTGGC 3480 

AACGACAAAG CCAACAGTCA AAACAGAGAT GTGATAAGGA TCAGAACAGC AGAGGTTCTT 3540 

TTAAAGGGGC AGAAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CAGGCTATGT 3600 

ATGGAATACA GTGTTATTTT CTTTGAAATT GTTTAAGTGT TGTAAATATT TATGTAAACT 3660 

GCATTAQAAA TTAGCTGTGT GAAATACCAG TGTGGTTTGT GTTTGAGTTT TATTGAGAAT 3*720 

TTTAAATTAT AACTTAAAAT ATTTTATAAT TTTTAAAGTA TATATTTATT TAAGCTTATG 3780 
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT 

Seq ID NOi 28 Protein sequence 
Protein Accession fl: NP_00044l.l 

1 11 21 31 41 SI 

I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 

TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGBW SAPIPACNW 240 

ECDAVTNPAN GFVECFQNPG SPPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVUCGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSOGQ 480 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPPLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 

Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_007036 

Coding sequence : 56 . . 610 

1 11 21 31 41 51 

I I I I I I 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60 

GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCCGCG 180 

CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 240 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTG 420 

TGACAGGGGG AOGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480 

TTCCAACAGA TTTGTTTCTC TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 540 

GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600 

TCCACGCTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTOGTGA TTGTTCAACA 660 

CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720 

CCAAATTGTG ATGCATGGTG QATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960 

TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020 

GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080 

TCTAGAATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140 

ACCAGAAAAC CCCTGAAGGA AGTAAGATGT TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200 

AGCTTTGAAC TGAGAGCAAT TTCAAAAGGC TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260 

TGAAGGACGG TTCTGGGGCA TAGGAAACAC ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

CCACCTCAGA GATAAATCTA AGAAGTATTT TACCCACTGG TGGTTTGTGT GTGTATGAAG 1380 

GTAAATATTT ATATATTTTT ATAAATAAAT GTGTTAGTGC AAGTCATCTT COCTACCCAT 1440 

ATTTATCATC CTCTTGAGGA AAGAAATCTA GTATTATTTG TTGAAAATGG TTAGAATAAA 1500 

AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TATAGGAGTC ACTCTGGATT TCAAAAAATG TCAAAAAATG AGCAACAGAG GGA CCTTAT T 1620 

TAAACATAAG TGCTGTGACT TCGGTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TGTCAACTTT AAAATATAGG 1740 

TGGAATTAGG AGTATATTTG AAAGAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 1800 

CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

AGTATTTACC TGTATTTTAT TCTTGAAGTT GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920 

GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980 
TTGTTCAATA AAAAAGAACA AGATAC 

Seq ID NO: 30 Protein sequence 
Protein Accession 8: NP_0O8967.l 

1 11 21 31 41 51 

I 1 I I I I 

MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS PRCKRTVLDD CGCCHVCAAG 60 
RGETCYRTVS GMDGMKOGPG LRCQPSNGED PFGEEFGICK DCPYGTFGMD CRETCNOQSG 120 
I CDRGTGKCL KFPPPQYSVT KSSNRPVSLT EHDMASGDGN IVREEWKEN AAGSPVMRKW 180 
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Seq ID NO: 31 DNA sequence 
Nucleic Acid Accession 8: NM_000963 
Coding sequence: 135.. 1949 

1 11 21 31 41 51 

I I I I I I 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGCGC 60 

CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAAG CCTACCCCOG CGCC6CGCCC 120 

TGCCCGCCGC TOGGATGCTC GCCCGCGCCC TGCTGCTGTG CGCGGTCCTG GCGCTCAGCC 180 

ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AGGTGTATGT ATGAGTGTGG 240 

GATTTGACCA GTATAAGTGC GATTGTACCC GGACAGGATT CTATGGAGAA AACTGCTCAA 300 

CACOGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 

ACATACTTAC CCACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 

1 J ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480 

ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TATTATACTA S40 

GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC 600 

AGCTTCCTGA TTCAAATGAG ATTGTGGAAA AATTGCTTCT AAGAAGAAAG TTCATCCCTG 660 

ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 

TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACGAACGG GCTGGGCCAT GGGGTGGACT 780 

TAAATCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTCAAGGATG 840 

GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 

AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

0< AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 

1 J ACAGAGTATG CGATGTGCTT AAACAGGAGC ATCCTGAATG GGGTGATGAG CAGTTGTTCC 1080 

AGACAAGCAG GCTAATACTG ATAGGAGAGA CTATTAAGAT TGTGATTGAA GATTATGTGC 1140 

AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 1200 

AATTCCAGTA CCAAAATCGT ATTGCTGCTG AATTTAACAC CCTCTATCAC TGGCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 1380 

TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 1440 

CTTOCATTGA CCAGAGCAGG CAGATGAAAT ACCAGTCTTT TAATGAGTAC CGCAAACGCT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 1560 

AGTTGGAAGC ACTCTATGGT GACATCGATG CTGTGGAGCT GTATCCTGCC CTTCTGGTAG 1620 

AAAAGCCTGG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAGTTGGA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 

TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1800 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCAGAG CTCATTAAAA 1860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 

TACTAAAAGA ACGTTCGACT GAACTGTAGA AGTCTAATGA TCATATTTAT TTATTTATAT 1980 

GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 

AACATCTTCT GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT GAATTTCAAC 2220 

TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 2280 

AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAGAA 2340 

GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 

AGGTATCAGT GCATTATTAA ATGAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATG AAACAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2520 

ACCTGTAAAA GCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580 

GCTGTCTTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA - 2640 

TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCAC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTGGTAACAT 2820 

GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2880 

CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 2940 

ACXGCAGGCC TGGTACTCAG ATTTTGCTAT GAGGTTAATG AAGTACCAAG CTGTGCTTGA 3000 

ATAACGATAT GTTTTCTCAG ATTTTCTGTT GTACAGTTTA ATTTAGCAGT CCATATCACA 3060 

TTGCAAAAGT AGCAATGACC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

CATTAATTTT ATCTCAGTCT TGAAGCCAAT TCAGTAGGTG CATTGGAATC AAGCCTGGCT 3180 

ACCTGCATGC TGTTCCTTTT CTTTTCTTCT TTTAGCCATT TTGCTAAGAG ACACAGTCTT 3240 

CTCATCACTT CGTTTCTCCT ATTTTGTTTT ACTAGTTTTA AGATCAGAGT TCACTTTCTT 3300 

TGGACTCTGC CTATATTTTC TTACCTGAAC TTTTGCAAGT TTTCAGGTAA ACCTCAGCTC 3360 

AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAGAGA AAAAAAAAGG CCCTTTTAAA 3420 

AATAGTATAC ACTTATTTTA AGTGAAAAGC AGAGAATTTT ATTTATAGCT AATTTTAGCT 3480 

ATCTGTAACC AAGATGGATG CAAAGAGGCT AGTGCCTCAG AGAGAACTGT ACGGGGTXTG 3540 

TGACTGGAAA AAGTTACGTT CCCATTCTAA TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 

CAAATGATAT CTAAGTAGTT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 

ATCTCATTGT CACTGACATT TAATGGTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 

TTTATGTCTT ATTAGGACAC TATGGTTATA AACTGTGTTT AAGCCTACAA TCATTGATTT 3780 

TTTTTTGTTA TGTCACAATC AGTATATTTT CTTTGGGGTT ACCTCTCTGA ATATTATGTA 3840 

AACAATCCAA AGAAATGATT GTATTAAGAT TTGTGAATAA ATTTTTAGAA ATCTGATTGG 3900 

CATATTGAGA TATTTAAGGT TGAATGTTTG TCCTTAGGAT AGGCCTATGT GCTAGCCCAC 3960 

AAAGAATATT GTCTCATTAG CCTGAATGTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 

GGGATCTGTG GATGCTTCGT TA AT TT G TTC AGCCACAATT TATTGAGAAA ATATTCTGTG 4080 

TCAAGCACTG TGGGTTTTAA TATTTTTAAA TCAAACGCTG ATTACAGATA ATAGTATTTA 4140 

TATAAATAAT TGAAAAAAAT T TT CTTTTGG GAAGAGGGAG AAAATGAAAT AAATATCATT 4200 

AAAGATAACT CAGGAGAATC TTCTTTACAA TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 

AGAAATAGTC AATATGCTTG TATAAAACAC TGTTCACTGT TTTTTTTAAA AAAAAAACTT 4320 

GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGGGAA TTTGGGTTGT GTATGCGAAT 4380 

GTTTCAGTGC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 
TGTCTGTTTA TTTTTGTACT ATTTA 
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Seq ID NO: 32 Protein sequence 
Protein Accession #: NP_0009S4 

- 1 11 21 31 41 51 

5 I i I I I I 

MLARALLLCA VLALSHTANP CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY GENCSTPEFL 60 

TRIKLPLKPT PNTVHYILTH FKGFWNWNN IPFLRNAIMS YVLTSRSHLI DSPPTYNADY 120 

GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLLLR RKPIPDPQGS 180 

NMMFAFFAQH FTHQFFKTDH KRGPAFTNGL GHGVDLNHIY GETLARQRKL RLFKDGKMKY 240 

10 QIIDGEMYPP TVKDTQAEMI YPPQVPEHLR FAVGQEVFGL VPGLMMYATI WLREHNRVCD 300 

VLKQEHPEWG DEQLFQTSRL ILIGETIKIV IEDYVQHLSG YHFKLKFDPE LLFNKQPQYQ 360 

NRIAAEFNTL YHWHPLLPDT FQIHDQKYNY QQFIYNNSIL LEHGITQFVE SFTRQIAGRV 420 

AGGRNVPPAV QKVSQASIDQ SRQMKYQSFN EYRKRFMLKP YBSFEELTGE KEMSAELEAL 480 

YGDIDAVELY PALLVEXPRP DAIFGETMVE VGAPFSLKGL MGNVICSPAY WKPSTFGGEV 540 

15 GFQIINTASI QSLICNNVKG CPFTSFSVPD PELIKTVTIN ASSSRSGLDD INPTVLLKER 600 
STEL 

Seq ID NO: 33 DMA sequence 
Nucleic Acid Accession #: NM_001508.1 
20 Coding sequence: 1..1361 

1 .11 21 31 41 51 

I I I I I I 

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 

25 CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120 

TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180 

AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTOGGACATC 240 

TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

30 GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGOGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660 

35 CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780 

AOGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840 

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960 

40 GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTOGTGCA GGTGCTGTGC 1080 

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

45 TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA 
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Seq ID NO: 34 Protein sequence 
Protein Accession #: np 001499.1 



1 11 21 31 41 51 

I i I I I I 

MASPSLPGSD CSQIIDHSHV PBFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK 60 

_ _ KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFBACSY 120 

55 ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAKGTEYPL 180 

VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIPGAFW YLWLLSVAF 240 

MCWNMMQVLM KSQKGSItAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIW TLAVCWMPNQ 300 

IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVPVQVLC 360 

CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV 



Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_006475.1 

Coding sequence: 28.. 2538 



1 11 21 31 41 51 

I I I I I I 

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120 

70 AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 

ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 

AAAACGACTQ TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 

TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 

75 TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 

GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 

AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 

TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 

80 CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 

GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 

TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 

GGAGACAAAG TGGCTTCCGA ACCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
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GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 

GTGACAAATA ATGGTQTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 

CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 

GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 

CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 

ATCGGAGGCA AACAGCTCAG AGTCTTOGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 

TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 

ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 

ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 

CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 

TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1B00 

AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA I860 

TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 

GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 

CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 

AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 

ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 

AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 

AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 

ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 

GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 

GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 

CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 

AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATOG AAGAGTTGTG AACTGTTATC 2880 

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 

TCTCAAAOGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 36 Protein sequence 
Protein Accession #: NP_006466.1 



1 11 21 31 41 51 

I I I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 

45 KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

KLREEIEGKG SPTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180 

NGMIIPSMYN NLGLFINHYP NGWTVNCAR I IHGNQIATN GWHVIDRVL TQIGTSIQDF 240 

IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 

MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GBYTLLAPVN NAFSDDTLSM 420 

VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 

RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLBAAD LKELLTQPGD WTLFVPTNDA 540 

FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFXGKGFEPG VTNTLKTTQG SKIFLKEVND 600 

TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660 

JJ FKEIPVTVYT TKIITKWEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGBPE FRLIKEGETI 720 

TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIXTGPEI KYTRISTGGG ETEETLKKLL 780 
QEEVTKVTKF IEGGDGHLFE DBEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 



Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #t NM_002416 

Coding sequence : 40 . . 417 

1 11 21 31 41 51 

I I I I I I 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60 

TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 

AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 

GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 

AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 

AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 

AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 

ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 

TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 

ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 

TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 

TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 

TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 

TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 

CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960 

ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 

TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 

AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140 

CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
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AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 

AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 

AGATTGTCAG CTCCTTGAG6 GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAA6AA 1680 

GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 

TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 

TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 

GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 

CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 

TCCCACCCGA ACQTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 38 Protein sequence 
Protein Accession #: NP_002407 

1 11 21 31 41 51 

I I I I I 1 

MKKSGVLFLL GIILLVLIGV QGTPWRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 
QKKTT 

> 

Seq ID NO: 39 DKA sequence 

Nucleic Acid Accession #: NM_O06670 

Coding sequence: 85.. 1347 

1 11 21 31 41 51 

I I I I I I 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300 

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360 

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGGGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 640 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAQTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AAOCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740 

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 
ATTCTTAAAA GAA 

Seq ID NO: 40 Protein sequence 
Protein Accession #: NP_006661 

1 11 21 31 41 51 

I I I I I I 

MPGGCSRGPA AGDGRLRIAR LALVLLGWVS SSSPTSSASS PSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA PARRPPLAEL 120 

AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPP APSGSNASVS APSPLVELIL 160 

NHIVPPEDBR QNRSPEGMW AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240 
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bSKNSLVSLT YVSPRMLTHI* ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300 

HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360 
IVLALIGAIF LLVLYLNRKG IKKHMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

5 Seq n> NO: 41 DNA sequence 

Nucleic Acid Accession fh Eos sequence 
Coding sequence : 1 . . 92 7 

1 11 21 31 41 51 

10 I | | 1 | | 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGAOG GGCGTCTGCG GCTGGCGOGA 60 

CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGOGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 

AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 

GTGCAGGGCA AAGACOGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 

TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGOAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 

CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGOGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 
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20 
25 
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Seq ID NO: 42 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS PSSSAPPLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHPLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYVFLGIVLA LIGAIFLLVL YtURKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession ft: NM_058173 

Coding sequence: 68.. 340 



Ac 1 11 21 31 41 51 

45 i | | | | | 

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 

CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 

TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 

TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 

TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 

ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 

TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 

CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 



Seq ID NO: 44 Protein sequence 
Protein Accession ft: NP 477521 



1 11 21 31 41 51 

60 | | | i | | 

MKFLiAVLVLL GVSIPLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 

- Seq ID NO: 45 DNA sequence 
CO Nucleic Acid Accession g : NM_0 000 95.1 
Coding sequence: 2 6.. 2299 

1 11 21 31 41 51 

nCi I I I I I I 

/U CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTCC TCCACTGCGC 300 

I J GCCOGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACOGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG COGGGGTACA GCOGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA S40 

QA GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACOGGGCAAC ATAACTGCGT 600 

OU CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTTCGTGGGC GACCAGGCGT COGGCTGCCA GCGCGGOGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
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WO 03/042661 



CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGOC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGOG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GAOGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGAOG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CGGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGAOGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGAOGAC TATGOGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCOGA 1920 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCGGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TOGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCOGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGOGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 46 Protein sequence 
Protein Accession #: NP_000086.1 

1 11 21 31 41 51 

I I I I I I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQ<JVREIT 60 

PLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180 

INECETGQHN CVPNSVCINT RGSFQCGPOQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

ADCVLERDGS RSCVCRVGWA GNGILOGRDT DLDGPPDEKL RCPEPQCRKD NCVTVPMSGQ 300 

EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 

RDKCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD S40 

PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAPNGV DFEGTFHVNT VTDDDYAGFI 600 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

ESQVRLLWKD PRKVGWKDKK SYRWFLQHRP QVGYIRVRPY EGPELVADSN WLDTTMRGG 720 
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: NMJ)01S65.1 

Coding sequence: 67.. 363 

1 11 21 31 41 51 

I I I I I I 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTOTACCT GCATCAGCAT TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA .300 

TOGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

CCTCTCCCAT. CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTQACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA S40 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 

TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 48 Protein sequence 
Protein Accession 8: NP_001S56.1 

1 11 21 31 41 51 

I 1 1 1 1 1 

KNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 

BIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: XH_057Q14 

Coding sequence: 143.. 874 
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1 
I 

GGGAGGGAGA 
CGCGGCGGAG 
CGCTGCCCGG 
COGOGGCCTC 
CCCCAAGGGG 
AATGTGCTTA 
CATTCCGGGT 
TCTGAGGGAA 
ATTGAATTAT 
AAATAGTGCT 
CTGTCAGCGT 
AGCTATAATT 
CACTTCTTCT 
CTGGGTTGGC 
TTCTCGCATC 
TTTTTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 
GGTCTTTTGT 
TGTACAATTT 
CAACCTTAAA 



11 
I 

GAGGCGCGCG 
CCAGACGCTG 
CAGCCGGGAG 
CTGCTGCTCC 
AAGCAAAAGG 
CAAGGGCCAG 
ACACCTGGGA 
AGCTTTGAGG 
GGCATAGATC 
CTAAGAGTTT 
TGGTATTTCA 
TATTTGGACC 
GTGGAAGGAC 
ACTTGTTCAG 
ATTATTGAAG 
ATGCCTTGGA 
AAAAGCAAAG 
GCATTATTCA 
ACTTTCTTCA 
TTTTTCTCTT 
GTAAATGTTA 
AAAAAAAAAA 



21 
I 

GGTGAAAGGC 
ACCACGTTCC 
CCATGCGACC 
TGCTGCTGCA 
CGCAGCTCCG 
CAGGAGTGCC 
TCCCAGGTCQ 
AGTOCTGGAC 
TTGGGAAAAT 
TGTTCAGTGG 
CATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 



31 
I 

GCATTGATGC 
TCTCCTCGGT 
CCAGGGCCCC 
GCTGCCCGCG 
GCAGAGGGAG 
TGGTCGAGAC 
GGATGGATTC 
ACCCAACTAC 
TGCGGAGTGT 
CTCACTTOGG 
AGCTGAATGT 
TGAAATGAAT 
AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAGACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 

t 

AGCCTGOGGC 
CTCCTCCGCC 
GCCGCCTCCC 
CCGTOGAGCG 
GTGGTGGACC 
GGGAGCCCTG 
AAAGGAGAAA 
AAGCAGTGTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



51 

I 

GGCCTCGGAG 
TCCAGCTCCG 
CGCAGOGGCT 
CCTCTGAGAT 
TGTATAATGG 
GGGCCAATGG 
AGGGGGAATG 
CATGGAGTTC 
AGATGCGTTC 
GAAATGCATG 
TTCCCATTGA 
ATATTCATCG 
ATGTTGCTAT 
GG AATTCAGT 
TTGCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTT T TTAGT 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seq ID NO: 50 Protein sequence 
Protein Accession #: XP_057014 

1 11 21 31 41 51 

! I I I I I 

MRPQGPAASP QRLRGTiTiTiTiL LI/QLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 

GVPGRDGSPG ANGIPGTPGI PGRDGFKGBK GECLRESFEE SWTPNYKQCS HSSLNYGIDL 120. 

GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYPT FNGAECSGPL PIEAIIYLDQ 180 

GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK 

Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession fc: NM_020974 

Coding sequence : 8 1 .. 3 0 8 0 

1 .11 21 31 41 51 

I I 1 I 1 I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

GGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCOGTCCCGC 180 

CGGGTCGGGG CCGTGCOGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 

GCCTGAGCTG CATGAATAAG GATCAOGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 

GCAGCGTOGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTQTGACGAT ACAGCCGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 

CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT IS 00 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAAOGCT 2040 

GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
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GAGATTTCAC 
AGTGTACGTG 
TCTTCCTGCC 
CCAATTCTGT 
CCAGGTCAAA 
TCCAGGTCCC 
GAGATGGCAG 
TCAAGGCTCT 
AGTCCCGAGA 
TTTTGAGACC 
GGTTGGTGGG 
CCGTATCAGT 
GAACTTGGTT 
CAGCTTCTCA 
TTGGTCAGCC 
TGTAGTGGAA 
CCGGCCCTCT 
CAAGAGGGGA 
ACTCAGTTTC 
AGTTCTAAGC 
AGCACTTCTG 



TGGGTACATT 
GACCATCAAC 
CATAGAGGAC 
GACAACATAT 
GAAGCTGTGG 
ATACGTGACA 
GCTCTATGCA 
GTTTGATGTC 
GATGTTTCCA 
TTACAAATGA 
ACAGAGCTGT 
GACTCATTAG 
TTTCTTTCCC 
CTGCTGTGGG 
TAGGTGAGAC 
AGGAGGCCAC 
CTAAGGGAGC 
GGGAAGGAGA 
TCCACAGCCT 
AGTGCTCGTG 
GAGACAT 



GAATCCCCAA 
CCACCCCCCA 
GACTGTGGGG 
GAAACCTGCC 
ATTCAGTTCA 
TATGATGAGG 
TCTGAGAACC 
CTGGCCCATC 
AGATCGTTCA 
CTCAGCCCAC 
CTTCCTTCTG 
AGTTCAATTT 
AGCATCGTGG 
CGGATGTCTT 
TCACCTGTCC 
AGAATAAGCT 
CCTCTGCACT 
CCCCTGCAGG 
TCTCCAGCCT 
AAAAAAAAAA 



ACTACCCAGG 
AGCGCCGCAT 
ACTATCTGGT 
AGACCTACGA 
AGTCCAATGA 
ACTACCAGGA 
ATCAGGAAAT 
CCCAGAACTA 
TCCGATTGCT 
GTGCCACTCA 
CATGTCAGCA 
TTATAGATAA 
ATGTAGACTG 
GGATAGATCA 

GCTTATTCTG 
OGTGTGCAGG 
CTCCCTCCAC 
GTGTGATACA 
GCAGAAAGAA 



CAATTACCCA 
CCTGATCGTG 
GATGCGGAAA 
AOGCCCCATC 
AGGGAACAGC 
ACTCATTGAA 
ACTTAAGGAT 
TTTCAAGTAC 
ACGTTCCAAA 
ATACAAATGT 
CAGTCGGGTA 
TACAGATATT 
AGAATGGCTT 
CGGGCTGGCT 
TTACTCCTOC 
AAACTTCAGC 
CTCTGACCAG 
CCACCTTGAG 
AGTTTGATCC 
TTAGAAATAA 



GCCAACACCG 
GTCCCTGAGA 
ACCTCTTCAT 
GCCTTCACCT 
GCTAGAGGGT 
GACATAGTTC 
AAGAAACTTA 
ACAGCCCAGG 
GTGTCCAGGT 
TCTGCTATAG 
TTGCTGCCTC 
TTGGTAAATT 
TGAGTGGCAT 
GAGCTGGACT 
TCAAGGAGTC 
TTCCTCTAGC 
GCAGAACAGG 
ACCTGGGAGG 
CAGGAACTTG 
ATAAAAACTA 



2S80 
2640 
2700 
2760 
2B20 
2S80 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



Seq ID NO: 52 Protein sequence 
Protein Accession fc: NP 066025 



MGVAGRNRPG 
LOQNTPTSYK 
HDGHNCLDVD 
DHGCSRICKE 
PQYKMHTDGR 
STGVHCSCPV 
SCQDVDECSL 
TVGSYECQCH 
SSDVTTIRTS 
PGAPGRPSTP 
FHLQLSGMNL 
TPQNEEGQMT 
PEAGRTSCFP 
KKNCVSCPGN 
PPPKRRILIV 



LAHPQNYPKY 



11 

I 

AAWAVLLLLL 
CSCKPGYQGE 
ECLENNGGCQ 
APRGSVACEC 
SCLEREDTVL 
GFTLQLDGKT 
DRTCDHSCIN 
PGYKLHWNKK 
VTFKLNEGKC 
KEMFITVEPE 
DVAKKPPRTS 
CEPCPRPGNS 
CGGGLATKHQ 
TTTDFDGSTN 
VPEIFLPIED 
ARGFOVPYVT 
TAQESREMFP 



21 
I 

LLPPLLLLAG 
GRQCEDIDEC 
HTCVNVMGSY 
RPGFELAKNQ 
EVTESNTTSV 
CKDIDECQTR 
EPGTFACACN 
DCVEVKGLLP 
SLKNABLFPE 
IiETNQKEVTA 



GALKTPEAWN 
GATSFQDCET 
ITQCKNRRCG 
DCGDYLVMRK 
YDEDYQELIE 
RSFIRLLRSK 



31 
I 

AVPPGRGRAA 
GNELNGGCVH 
ECCCKEGFFL 
RDCILTCNHG 
VDGDKRVKRR 
NGGCDHFCKN 
RGYTLYGFTH 
TSVSPRVSLH 
GLRPALPEKH 
SCDfcSCIVKR 
QGHAENOCVS 
MSECGGLCQP 
RVQCSPGHFY 
GELGDFTGYI 
TSSSKSVTTY 
DIVRDGRLYA 
VSRFIiRPYK 



41 
I 

GPQEDVDECA 
DCLNIPGNYR 
SDNQHTCIHR 
KGGOQHSCDD 
LLMETCAVNN 
IVGSFDCGCK 
CGDTNECSIN 
CGKSGGGDGC 
SSVKESFRYV 
TEKRLRKAIR 
CRAGTYYDGA 
GEYSADGFAP 
NTTTHRCIRC 
ESPNYPGNYP 
ETCQTYERPI 



51 
I 

QGLDDCHADA 
CTCFDGFMLA 
SEEGLSCMNK 
TADGPECSCH 
GGCDRTCKDT 
KGFKLLTDEK 
NGGCQQVCVN 
FIjRCHSGIHL 
NLTCSSGKQV 
TLRKAVHREQ 
RERCILCPNG 
CQIiCALGTFQ 

pvgtyqpefg 
antectwtin 
aftsrskklw 
kklikalfdv 



Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM_014211 

Coding sequence: 157. .1479 



11 



21 
I 



GGGACAGGGC TGAGGATGAG GAGAACCCTG 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC 
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG 
GAGGTOGGCA GAAGTGACAA GCTTTCCCTG 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA 
ATTQCAAGTA TCTCTAGCAT TTCAGAGAGT 
CGACAGCGCT GGATGGACCA GCGGCTGGTG 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA 
TTCCTCCATG AAGTCACTGT GGGAAACAGG 
CTGTATGCCC TCAGAATCAC GACAACTGTT 
ATGGACACAC AGACATGCAA GTTGCAGCTG 
GAGTTCACCT GGCTGAGAGG GAACGACTCT 
CAGTACACCA TAGAGCGGTA TTTCACCTTA 
TACACTAGAT TGGTCTTACA GTTTGAGCTT 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA 
ATGATCGGGT COCGCACTTC TCTTCCCAAC 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA 
GTCAGTATTA CTAATATCAT CAACAGCTCC 
GCCAGCATTG AAATTTCCAG CGACAACGTT 
GACAAQTTCA AGTTTGTCTT COGAGAAAAG 
CAAAACCCCA GTAATGTTGA TCACTATTCC 
GCCAATGTAT TTTACTGGGC ATACTACATG 
TGCCATAGGT CTTCAACAGG ACAAGATAAT 
ACCCACATCC AATGGTGCTA CAAGTGACTG 
ATGAAGCTCC AACCATTGTT CTAAGCTGTG 
TAGAAACATC AGTCCATTCC TCTTTCATCT 
TTACAAATGT ACTCAGGGCT GTTTATTCGG 
AAGAATGGGA AGGAGACCAT TGGGTAACCC 
TATACATGTT TTTTACTAAA TCTCTGCAGT 
GAGTAACATT TTCTAGTTTT TGTTTCTGGT 
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31 41 51 

! I I 

GGGACCCAGA AGACCGTGCC TTGCCCGGAA 
TAACAGAGCC TCAACAACTA CCTGGTGATT 
CTCAACATGA ACTACAGCCT CCACTTGGCC 
ATGTGCATCC AGGGGAGTCA GTTCAACGTC 
CCTGGCTTTG AGAACCTCAC AGCAGGATAT 
GAACCCGTAC AGATAGCGCT GACTCTGGAC 
AACATGGACT ACACAGCCAC CATATACCTC 
TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 
GATACTTACA TTGTGGAGTC CAAGAAGTCC 
CTCATCCGCC TCTTCTCCAA TGGCACGGTC 
GCATGTAACA TGGATCTGTC TAAATACCCC 
GAAAGCTGGG GCTATGATGG AAATGATGTG 
GTGCGTGGAC TGGAACACCT GCGGCTTGCT 
GTCACCAGAT CGCAGCAGGA GACAGGAAAT 
CGGAGGAATG TTCTGTATTT CATTTTGGAA 
TTGTCCTGGG TTTCATTTTG GATCTCTCTC 
GTGACGACCG TGTTATCAAT GACCACACTG 
ACCAACTGCT TCATCAAGGC CATCGATGTG 
GGGGCCTTGC TAGAATATGC AGTTGCTCAC 
GATAGGGGGA CAACAAAGGA AGTAGAAGAA 
ATCTCCAGCT TTAAACGGAA GATCAGCTTT 
GACTACAGTG ACTTGACAAT GAAAACCAGC 
ATGGGCAGGA TTGTTGATTA TTTCACAATT 
AAACTACTGT TTCCTTTGAT TTTTATGCT A 
TATTTTTGAG TCAATGTTAA ATTTCTTGCA 
GATGTAAATG GTATTTTAGG CCAAGTGTGC 
AAATAATATT TGAGTCTTTC TGCTCAAAGA 
TAGAAGTCCT AGCATTATAG GATCTTGTAA 
TAATCAAGGA CATTCCCATG GAGCCGAAGA 
TGGCTCCCTG GTTTGCATTT ACCTCATATA 
TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 
GCTTATAAAA TACATTGTTG CCTATTTAGG 
TAAAATGAAA TATGGGCTTA TGTCAATTCA 
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TTGGAAGTCA 
ATACCACAAC 
AGTGAAGAAA 
GGATTCCCCA 
TTT ATTATT A 
GGAAATGGAA 
AGGCTTGCAG 
AAAGTACTGG 
GAGCAACACT 
TGCTCTAATG 
GGCTTAGCTT 
TGAGTGGCTT 
AACAATGAGA 
CCAGCCAGAG 
AGTGTGCCAG 
TTAATTTCTT 
ACCAAGGCTC 
CTTTTCCCAT 
ACAAAATATT 
TATGAGCCAA 
AACCCCACTT 
AATATACCAT 



ATGCACTAAC 
AGAATTATCC 
AAATTAGTAG 
TACTGGAAGG 
TACACACATC 
GATTTTTTTG 
AATTGAGTCC 
GTTGACTCAG 
CTCCCAGTGG 
ATCAGGAATG 
AAGTAAACTT 
ATCCCGCATG 
CAOGTTACAG 
ATGCACATTC 
GGTAAAGGCT 
AGCTTCCTGT 
TAAAAGATGA 
TATACTTCTC 
TCTAACGGGA 
TCATATTTGT 
AAGCATTGTT 
ATTAGCTACC 



TCAATACCAA 
CCAATTTCCA 
ATCAACAATC 
ACTCTGAGGC 
CATCCTAAAC 
TAACTTGTTC 
ATTTTCTAGC 
AGAGTGGCTG 
CAGATCCCCT 
ATGCTTATTA 
GGCTTTGCTC 
AGCAGGAGCG 
AACCTATGTT 
CTCGGCCAGT 
TCCAGTTCAG 
TCTAATAAAT 
TTTCCCTTCT 
ACAATTCAGT 
ATGGGTGGGA 
GATTTTTTAA 
TTTATATAAA 
CAOC 



GATGAGTTTT 
ATAAGTGCTA 
TAAACAAATC 
TTTATTCCCC 
TATACTAAAG 
TAGAAGTCTT 
TGCCTTTATT 
TCATTCTGTC 
GTATCATTCC 
GAAAACAAAC 
AGATCGCTGA 
TGCTGGCCCT 
CAGGTTGCGG 
CTCAGCCAAC 
CCTCAGTTAT 
GCAOGGCTTT 
GTAACTCCCT 
TTCTATGAGT 
GTGCTGGTGA 
AAAAAGTTTA 
AACAATGATA 



TAAATAATGA 
TCATTGAAAA 
CCTOGGTTCT 
CACTATGCAT 
CXXTTTTCCC 
AATATGGGCT 
CACATAGTGA 
ATTGCTGCTA 
AAGAGGAGCA 
TGCTTGACCC 
TCCTTCCAGC 
GAGTACTGAA 
GTGAGCTGCC 
AGTACCAAAA 
TTTAGACAAT 
ACCTTTCCTG 
AGAGCCACAG 
TTGATCACCT 
AAAGAGATGA 
AAAGGAAATA 
AAGATGTGAA 



ATATTATTTA 
TTCAAATATA 
AAGATACAAT 
ATCTTATCAT 
ATGCATGGAT 
GTTGCCATGA 
TGGGGTACTA 
CTCTAACACT 
TTCATCCCTT 
AGGAACAAGT 
TGGTCTGCTC 
CTTTCTGAGT 
CTCTCCAAAT 
GTGATTTTTG 
CTOGCCATCT 
TCAGAAATAA 
GTTCTCATTC 
GATTTTTTTA 
AATGTGGTTG 
TCTGTTCTGA 
CTGTGAAATA 
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Seq ID NO: 54 Protein sequence 
Protein Accession #: NP 055026 



1 
I 

MNYSLHLAFV 
VQIALTLDIA 
YIVE9KKSFL 
WGYDGNDVEP 
NVLYFILETY 
CFIKAIDVYL 
SFKRK1SPAS 
LFPLIFMLAN 



11 
I 

CLSLFTERMC 
SISSISESNM 
HBVTVGNRLI 
TWLRGNDSVR 
VPSTFLWLS 
GICFSPVFGA 
IEISSDNVDY 
VFYWAYYMYF 



21 
I 

IQGSQFNVEV 
DYTATIYLRQ 
RLFSNGTVLY 
GLEHLRXiAQY 
VJVSFWISLDS 
LLEYAVAKYS 
SDLTMKTSDK 



31 

I 

GRSDKLSLPG 
RWMDQRLVFE 
ALRITTTVAC 
TIERYFTLVT 
VPARTCIGVT 
SLQQMAAKDR 
FKFVFREKMG 



41 

I 

FENLTAGYNX 
GNKSFTLDAR 
NMDLSKYPKD 
RSQQBTGNYT 
TVLSMTTLMI 
GTTKEVEEVS 
RIVDYFTIQN 



51 

1 

FLRPNFGGEP 
LVEFLWVPDT 
TQTCKLQLES 
RLVLQFELRR 
GSRTSLPNTN 
ITNIINSSIS 
PSNVDHYSKL 



Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: XM_084007 

Coding sequence: 13 8.. 2405 



51 



60 
120 
180 
240 
300 
360 
420 



1 -11 21 31 41 

I I I I I 1 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGOGCCTG GTAGAGATTT CTCGAAGACA 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCCCAC CATAATCATG 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA OCATGACTCA GATAGTTCAG 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACQG ACCAGAACAT GCCAGTGGTA 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 
CCAAAGATGT AAGCAGCTGC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 
AAGAGGTCAT GAT AG CT CAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 

968 
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AGGGAGATGA 
TTGTATTGAA 
TATTCTATCT 
TAAACAAGAG 
TTTTCAAGAA 
TGTTTAGGAA 
AGCAAAGAAA 
AAAAATCACA 
CAGAATTAGT 
GTAGTGAGCA 
AAATATATTT 
TTCGTGCGGG 
TATTGCCAAG 
CAAAATTATC 
TCATTTGATT 
GAGCAATTGT 
GAT G TTTCTT 



GTTTGTATGC 
TATTGCTGTC 
TGGAGATAAA 
ATTTGGCATQ 
CTAACACAGT 
TAAGAATGTG 
TAAA6GAGAA 
AAATTTGTTG 
ATAGAGTACA 
CTCTCATATA 
AATGAATTCA 
TTATATACCA 
TTATATATCA 
AGAGTAGTAA 
CGATTCAGAA 
CTTTATATAC 
TTTTACACAA 



TGTACTATGC 
TGTTACAAAG 
ATCTGTATGT 
ACATGTTCTG 
TATTCCTATA 
CATGAAGCCT 
AAGAGAAGAA 
TAAATTAGAG 
TTCATTAAAC 
CTAATTAGTG 
AG CAATATAC 
GATGAGTACA 
CCAAAAGCTG 
AACTTTGATA 
AGTACTTTGA 
GGTACTGTAG 
TAAATTCCTT 



AGCGTTTAAA 
TCAGTTAAAG 
GCAATTCACC 
TATGTTTCAG 
CTGGATTTTA 
AAAATACCAA 
TCTGAGAATT 
GGGAGAAATT 
ATTTTTGTCA 
TACATTTAAC 
ACTTGACCAA 
GTGAGTAGTT 
TATGACTGGA 
TATATGAGGA 
TATCTCTCAG 
CCATACTAGG 
ATATCAGCTT 



GTTAGTGGGT 
GTACGTTTTA 
GGTATTACCA 
GGAAAAATGT 
GGTCTCTGAA 
GAAAGCTTAT 
GGGGAGGCAT 
TAGAATTAAG 
GGATTATTTC 
TTTGTATAAT 
GAAATTGGAA 
TATGTATCAC 
TGTTCTGGTT 
TATTAAAACT 
TGCTTCAGTG 
CCTGTCTGTG 
G 



TTTGTGATTT 
ATATTTAAGT 
GTTTATTATG 
CTTTAATGCT 
GAACTGCTGG 
ACTGAATTTA 
AGATTCTTAT 
TATAAAAAGG 
CCGTAAAAAC 
ACAGAAATCT 
TTTCAAAATG 
CAGACTGGGT 
ACCTGGTTTA 
ACACTAAGTA 
CTATCATTGT 
GCATTCTCTA 
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Seq ID NO: 56 Protein sequence 
Protein Accession 8: XP 084007 



1 
I 

MARKLSVILI 
FYRYGENNSIT 
EHHSDHDHHS 
SVSASEVTST 
NESVSEPRKG 
RSCLIHTSEK 
LVALAVGTLS 
TWKGLTALGG 
EEKVDTDDRT 
HSHFHDTLGQ 
MGDGLHNFSD 
NALSAMLAYL 
RWGYFFLQNA 



11 

I 

LTFALSVTNP 
SVEGFRKLLQ 
HHNHAASGKN 
VYNTVSEGTH 
FMYSRNTNEN 
KABIPPKTYS 
GDAFLHLLPH 
LYFMFLVEHV 
EGYLRADSQE 
SDDLIHHHHD 
GLAIGAAFTE 
GMATGIFIGH 
GMLLGFGIML 



21 
I 

LHELKAAAFP 
NIGIDKIKRI 
KRKALCPDHD 
FIiETIETPRP 
PQECFNASKL 
LQIAWVGGFI 
SHASHHHSHS 
LTLIKQFKDK 
PSHFDSQQPA 
YHHILHHHKH 
GLSSGLSTSV 
YAENVSMWIF 
LISIFEHKIV 



31 
I 

QTTEKISPNW 
HIHKDHDHHS 
SDSSGKDPRN 
GKLFPKDVSS 
LTSHGMGIQV 
AISIISFLSL 
HEEPAMEMKR 
KKKNQKKPEH 
VLEEEEVMIA 



AVFCHELPHE 
ALTAGLFMYV 
FRINF 



41 
I 

ESGINVDLAI 
DHEHHSDHER 
SQGKGAHRPE 
STPPSVTSKS 
PLNATEFNYL 
LGVILVPLMN 
GPLFSHLSSQ 
DDDVBIKKQL 
HAHPQEVYNE 
RYSREELKDA 
LGDFAVLLKA 
ALVDMVPEML 



51 
I 

STRQYHLQQL 
HSDHEHBSOK 
HASGRRNVKD 
RVSRLAGRKT 
CPAIINQIDA 
RVFFKFLLSF 
NIEESAYFDS 
SKYESQLSTN 
YVPRGCKNKC 
GVATLAWKVI 
GMTVKQAVLY 



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #t NM_015419. 

Coding sequence: 1..8487 

21 



1 11 21 31 41 51 

I I I I I I 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCOGTGGTGC TGATCCTGCT TTGGGGCCAT 
CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 
TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 
AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 
TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 
CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 
TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTOCACGT TCACATTTTT GGATTATTTC 
AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 
TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 
AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 
GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 
AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 
GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 
GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 
AT AG CAT ACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 
GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 
CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 
CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 
CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 
CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 
AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 
GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 
GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 
GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 
GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 
CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 
GGCTOGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 
GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 
AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 
GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 
GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 
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TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTQTGCAGA CAGTAACCAG TGCTGAAGAA 2520 
TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2S80 
GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 
c AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 
5 ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 
TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 
ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 
GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 
A GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 
10 CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTOCAGTA CATCACAGTT ATTTGAGGAT 3060 
TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 
ATCCACCTT G TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 
AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGAOCC CACACACTCC 3240 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 
15 GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 
CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 
ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 
AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 
20 GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 
AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 
CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 
CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 
25 AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 
TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 
ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 
ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 
AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 
30 GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 
GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 
ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 
GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 
CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 
35 ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 
GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 
GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 
CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 
TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 
40 GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 
CTACCTGAAA TGTCCACACA AAGOGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 
CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 
CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 
45 GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 
AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 
CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA S280 
GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 
GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 
50 ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA S460 
TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 
CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 
CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 
55 AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGXTCAAGTA 5760 
CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC S940 
^ CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

00 CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 
GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCOGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 
CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG OGCCCCTGCC CAGCGTGCGC 6240 
Jlc TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 
05 GTTTTCCCCA ACGGGACGCT CTACATCOGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 
' GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 
CGTGCAGCAG CCAACGCGCG CATCAOGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 
GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 
70 TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 
TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 
AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 
GAOGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 
75 TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 
GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 
GTGGTGACAG CGCCOGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 
GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 
OA TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 
OO GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 
AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGOCAOCC 7320 
AAGATCAACG GTAACCCCAA COCCATCACC ACOGTGCGGG AGATAGCAGC GGGGGGCAGT 7380 
CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 
COCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 
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GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA OGGCCCTTCG 8160 

GTCACCAGCA TCCCOGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCOGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8S20 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8S80 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTOGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCAGTT atttcctctg TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCC TTTC AA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID NOt 58 Protein sequence 
Protein Accession #; NP_056234.l 

1 11 21 31 41 51 

I I I I I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

NLGFNSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 120 

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 

RLSTIRHLYL AENMVRTLPA SMLRNMPLLB NLYLQGNPWT GDCEMRWFLE WDAKSRGILK 240 

CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360 

VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELKLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLEGGPCQLS CKVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 540 

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPOJ 600 

ALAI PEARLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HPTVGITVTK KGSGLPSKRG RRPGAKALSR VREDXVEDEG GSGMGDEENT SRRLLHPKDQ 720 

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS BKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVGTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTBEITS 900 

TEGDLKGTAA PTLXSEPYEF SPTLHTLDTV YBKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSIiA ESEPMQYEDP DLETKSQPDE DKMKEDTFAH LTPTPTINVN DSSTSQLFED 1020 

STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKHGM KEMSQTLQGG NMLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 1200 

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPKK HRYTPSTVSS RASGSKPSPS 1260 

PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

SDGKEIKDDV ATNVDKHKSD ILVTGBSITN AIPTSRSLVS THGBFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KKLEDVDFTS EFLSSLTVST PFHQEEAGSS 1440 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500 

MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HMSGPKELST 1560 

PSSDRDAFNL STKLBLEKQV FGSRSLPRGP DSQRQDGRVH ASRQLTRVPA KPILPTATVR 1620 

LPEMSTQSA8 RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP RIPKYSHGRL PFFTNKTLSF 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPOTTGSPS 1800 

TNLQNIPMVS STQSSISPIT SSVQSSGSFH QSSSKFPAGG PPASKFWSLG BKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920 

QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES RITLHENRTL SIKBASFSDR GVYKCVASNA AG ADS LAI RL 2040 

HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 2100 

VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIRV FANGTLWK9 VTDKDAGDYL 2220 

CVARNKVGDD YWLKVDWM KPAK1EHKBE NDHKVFYGGD LKVDCVATGL PNPBISWSLP 2280 

DGSLVNSFKQ SDDSGGRTKR YWFHNGTLY FNEVGMREEG DYTCFAEHQV GKDEMRVRVK 2340 

WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 

GTLLXQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 2520 
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RNEGGEARLI 
GTDLQSGQQL 
KQYHNLVSII 
ASVFDRGTYV 
PKADITWELP 
KTTYIHVF 



VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVUN 2580 

QRPYHKADGM LHISGLSSVD AGAYRCVARN AAGHTBRLVS LKVGLKPEAN 2640 

NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

CRMBTEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

CKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGPYKC MAKNILGSDS 2820 



Seq ID NO: 59 DMA sequence 

Nucleic Acid Accession tt: Eos sequence 

Coding sequence: 1..5001 

1 11 21 31 41 51 

I I I I I 1 

ATGCCAGGCA CAAAACTAAC COGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACOGTGOG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 

TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACAOGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1S00 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GOGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CAOGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TOAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCQ ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

972 



WO 03/042661 
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GATGAGTTCT 
TATGATGAAG 
ACTGCTACCA 
GAATTTGATC 
GACCCATCAG 
GATGAAATCA 
AACATCACCG 
GCCACCCCAG 
ATCAGGAACA 
AAGCCCAACA 
ATCAGCCCTT 
CCAGGCGGTG 
GCCATGGACG 
GTAATTCACT 
TTGGAGACAQ 
GAAGAACAGG 
AGTATCGTCA 
TGGGCTGGTA 
CTGCAAGCTT 
CAGCCAGCAT 
CATTCT6GTC 
TTCAGTTTTG 
CCAGAGACAT 
AATTCGGACA 
TGCTACATGC 
TCAGGTCCCT 
ATATCCTACT 
TTATTTGTAA 
TGTCTGCAAG 
AAAAAAA 



CAGGCTTGGA 
ATTATGAATT 
CACCGAGGGT 
TGGCTGGAAG 
CCCCGTGCTC 
TCCCCAATGA 
TGGTGGCCGT 
GAGATTTGGT 
AGTTTTCCAC 
CGAGGTATTA 
CGGTCTCATT 
AGCTATCTGG 
GCAATATGTG 
GAGGTATAAA 
CTGGGQAAGA 
GCCTCAGTCC 
GGAGCCTGTC 
CGAGTGTGGG 
GCCCTGCCCA 
GCTCAGCCCC 
ATCTCAGTCT 
CTGTTAACTT 
CAGAAACCAG 
CTTCAGTATT 
TTTCTGTTTT 
TTGTATGCAG 
TGAAATTTAC 
ATTCTCAATT 
AATATTGATT 



GACTGACACT 
TGAGACGTCA 
GATCCCAGAG 
GAAACGATTT 
TCTGACTGAT 
CCTGAAGAAG 
GGAAGGTTGC 
CACAGGTTAT 
TCAAGCTTCA 
TTTTAAAGTG 
TGTCACCGAA 
ATCCCATTCG 
AAGCGCACGT 
ATCTACCTCA 
GGTGAAGACC 
TATGTAGAAG 
AGGTTTGGGA 
GTCTCCATCC 
GCCCCACCAA 
GCTGCCCTAG 
GGAACTCAGT 
TGCTTCTCTA 
CAACTGATTC 
TCCAGGAATA 
TCTCATTTTG 
TAGAAAGGAA 
TCTATGGACT 
TTGATATATA 
AAAATTGCTA 



GCAGTACCTA 
AGGCCACCAA 
GAAGGCGCCA 
GTTGCTCCTT 
GCACTGGATC 
AGTGATCTGC 
CACTCATTTG 
TTGGTTTACA 
TCAGTAACTC 
CAAGCACAAA 
TCAGATAATC 
CTTTCAAACA 
GGTATCGAAA 
GTGACAACCT 
ATTGCCAATT 
CCCTCCCTAC 
ACATCGGCTT 
CTGGAAAGTG 
CTAAGTCGCA 
GTGCCAGGAA 
CCCACTTCTT 
CTTTTTTTTG 
AGTGTGATTT 
GCATATGCAC 
GATTTCTCCA 
TTATTAAAAA 
TACCCACTGC 
TATGTATATA 
AATTTGTACT 



CGGAAGAGGC 
CCACCACTGA 
TCAGTTCCTT 
ACGTGACGTA 
ACTTCCAAGT 
CTCCCCAGCA 
TCATTGTGGA 
GTGCATCCTA 
ACTTGCCCAT 
ATCCTCATGG 
CTCTGCTTGT 
TGATCCCAGC 
GTTCGTGGGA 
GAAAGATACA 
TGTGGATTCA 
TATTCAAGGC 
CGGAACCCCC 
GTAATCACAG 
CTAGGGGCTG 
GGTCACAGAT 
GGCCTGGACA 
TTTGTTTGTA 
CCCAGACTTT 
GCTGTTCTTG 
AAACTAACTG 
CACCACCAAA 
TAGAATAAAT 
TGCATATACA 
TGTTCACCAA 



CTACGTTATA 
GCCTTCGACC 
TCCTGAAGAA 
CCTAAATAAA 
GGACAGCCTG 
TGCTCCCCGC 
TTGGGACAAA 
TGAAGATTTC 
TGAGAACCTA 
CTACGGACCT 
TGTGAGGCCC 
TACACGGACT 
GTTGTTCTTT 
TTCTACAGCA 
CACCTTGATG 
TACTATCGCC 
TACTACTATG 
GACCGTCATG 
TGAGCAAAGA 
GGACACTGGC 
ATGAACAGGA 
ATAGCACATC 
TTAGGCATGA 
CTTCATGGAA 
AATTTAAGCT 
GAAAATAAAT 
GTATCAAATC 
TATCCACACT 
AAAAAAAAAA 



Seq ID NO: 60 Protein sequence 
Protein Accession #: Eos sequence 



MPGTKLTRTG 
SRQYTVRYRE 
TPESAPTTAP 
FQNTPFHTPR 
SLCQQECSCT 
MCYEDPVSSL 
QGRMAKDLLL 
DTQDQKRTLR 
ASPAHHASTQ 
SSVLRDRSSV 
SPLSKGGKDG 
GRQAEATAQT 
HPRVPSHSDS 
SLHRKEPIPE 
PARPPAARSQ 
AGRPRPTSQG 
ARAKEAAASL 
KSEPPSKRPL 
SLAKEEREPA 
WPRYTTRAPP 
SNGKPNGQRI 
TPWSPDGLP 
TPLPTTTTPR 
TPKPTTPIPT 
DEDYEFETSR 
PSAPCSLTDA 
TPGDLVTGYL 
SPSVSFVTES 



11 
I 

APADYRVILK 
KGELARWDYK 
EKLNVWPVNG 
LSNHLEQSFS 
QKDPSCLAYL 
TGNSLKSVAA 
DLKNKILANG 
PPSRHGHSW 
GTSHRPSLPA 
HPGAKPASPA 
EDAPATNSNA 
LRARPASGHF 
HPKLSSGIHG 
NPKSTGADTH 
0HPSVPRRMT 
RSHSSSDPYT 
PKHQQVESPT 
SSKSQQSVSA 
1ALAPRGGSL 
GHFSTTPMLS 
INGPQGTKWV 
LFGQGRHGTP 
PTTATTMQPT 
CPPGTLERHD 
PPTTTEPSTT 
LDHPQVDSIiD 
VYSASYEDPI 
DNPLLWRPP 



21 
I 

TSQEDELDVP 
QIANRRVLIB 
KPTWAASWD 
PILETLLLPW 
IDIQTKQVNK 
SKADVQQNTE 
GAPRKPQLRA 
APGRTAVRAR 
SLNDNDLVDS 
RRTFHSGAAE 



31 
I 

DDISVRVMSS 
MLIPDTVYEF 
ALPETEGKVK 
WMVCSLGNAI 
DPQLEGSVFG 
DNGKPEKPEP 
KKAEELDLQS 
MPALPRREGV 
DEDERAVGSL 



41 
I 

QSVLVSWVDP 
AVRISQGERD 
VCLLDTGLFS 
FSKSGPQTGE 
PCFLFYFLTF 
SSPSPRAPAS 
TEITGEEELG 
DKPGFSLATQ 
HPKGAFAQPR 



HLLRHKPFAA 
DEEDEKPLPA 
PQGKYSSLAS 
PGRAPEQQPP 
ASSRGMLPTA 
GAGAGGDHRS 
EDEEEEDAGF 
APVKRPLPPP 
LRQRMMHARF 
VDLDRGLVLN 
LANAQDKPIL 
TTTTPLPTTT 
DDGNLIMSSN 
ATTPRVIPEE 
EIIPNDLKKS 
RNKFSTQASS 



SSHLSSRTQV 
KGRSPSRFSI 
TWNDHVPSS 
KAQDVQQSTD 
PPVATSQHHP 
LQNQDEDAQG 
QRGHAASPAR 
FKGGKEDIiS 
PGSSPRASHV 
RNPLSRQPAR 
AEGRYLQDSH 
SLGGKPLVGL 
PRPTTATTRR 
GIPECYAEED 
GAISSFPBEE 
DLPPQHAPRN 
VTHLPIENLK 
SNMIPATRTA 



SEGAEASDGE 
GRGPRLQPSS 
SRQPISRGWE 
ADTEGHSPKA 



SYDDD STEVE 
PSRPGGPQSR 
SSVPKWPSSS 
PSRPPPRSAA 
PSYRQGYNGR 
GNPLRIKUGG 
EVIKKTTHPP 
TTTRRPTTTV 
EFSGLETDTA 
FDLAGRKRFV 
ITWAVEGCH 
PNTRYYFKVQ 
MDGNM 



4320 
43B0 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
S940 
6000 



51 
I 

VLEKQKKWA 
GKWSTSVFQR 
VSSFQPSAKS 
AWDLTPKPSL 
MLDIGGFSFI 
SQHPSVPASP 
SREDSPMSPS 
PRPGAPPSAS 
PALSPSRQSP 
RLLPTQPHLS 



SPQSTVPSRA 
DLRRSPQRGA 



PSQPRLSLTQ 
AQDVRAPAHA 
ARVPSRAAPG 
TPRGGKDADG 
TVSPVAGTHP 
PNVEGKVLPG 
DGRTIVDLEG 
TTTMQPTTTT 
RTTTRTTTTT 
VPTEEAYVIY 
APYVTYLNKD 
SFVIVDWDKA 
AQNPHGYGPI 



Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession §: NM_022743 

Coding sequence: 128.. 1237 



GTGGATTTTA 
TCTCTGCTGC 
AAAGCTGATG 
AAAAGCTTGG 
TCCTCCAGAC 
TTCAGAATCA 
TGAAGATAAG 
AGAAATACAG 
AGTGATCTGC 
ATATCCCAGT 
TGGGCCCCAC 
CTGCTACCTG 
CTGCTTTGAA 



11 
I 

GAGATACCTC 
AACACCCCTA 
CGATGCTCTC 
CCAGACCACA 
TCCGTTCGAC 
GAGAAGCTTT 
AAAGAGGGCC 
GATGCCTCTC 
AACTCTTTCA 
ATCTCTTTGC 
CTCTTACTGC 
GATATGCTGA 
TGTGACTGTT 



21 
I 

CCCTCCTTCT 
CTGTTCTCCG 
AGTGCCGCGT 
AGCGGGAATG 
TTCTTGGCAG 
ACTCATTTTA 
TCAGGCAACT 
AGCTGCCACC 
CCATCTGTAA 
TCAATCACAG 
GAGCAGTCCG 
TGACCAGTGA 
TCCGTTGCCA 



31 
I 

GCTCAGCTGC 
TGTATTGGCT 
CGCCAAATAC 
CAAATGCCTT 
AGTTGTCTTC 
TGATCTGGAG 
CGTAATGACA 
TGCCTTTGAC 
TGCGGAGATG 
CTGTGACCCC 
AGACATCGAG 
GGAGCGCCGG 
AACCCAGGAC 



41 

I 

CTTGCAGTAA 
TTTCTGGGCA 
TGTAGTGCTA 
AAAAGCTGCA 
AAACTTATGG 
TCAAATATTA 
TTTCAACATT 
CTTTTTGAAG 
CAGGAAGTTG 
AACTGTTCGA 
GTGGGAGAGG 
AAGCAGCTGA 
AAGGATGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 



51 
I 

TTAAACTCTT 
GCAGGAAGGA 
AGTGTCAGAA 
AACCCAGATA 
ATGGAGCACC 
ACAAACTGAC 
TCATGAGAGA 
CCTTTGCAAA 
GTGTTGGCCT 
TTGTGTTCAA 
AGCTCACCAT 
GGGACCAGTA 
ATATGCTAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



973 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



TGGTGATGAG 
GGCACACTGG 
TGAAOGGCTT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAG 
TGATATTATG 
TTTAGAAGAA 
CGGCGTGTGT 
GTGAACCTCT 
GGTTTGCAAA 
ATTTGGTTGA 



CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA 
CTOGGCCTGT 
TTTTTCCCAG 
CTACATCAAG 
AGAGTGACAC 
TGCGACGCCA 
CTTTGTTGAA 
CTTATTGGAA 
CCACAAGAAT 
GGATGCCAAA 



AGGAAGTTCA 
AGGTTCTGGC 
ACATCTACCA 
TGGAGGAAGC 
GAAGCCATCC 
GCATGTTTCC 
ATGGCAGAGA 
ACATCAGAGC 
TGCCTTATTG 
ATTCTGTTCC 
CATTAGTTGT 
AAAAAAAAAA 



AGAATCCCTG 
CATGTGCCAG 
GCTGAAGGTG 
CTTGTTCTAT 
CGTCAGAGGG 
CCAAGCAATG 
ACACAGCCTG 
ATCCTAAGGG 
AGGTCACACA 
GTGTTTGTGT 
AGAGAAGCAC 
AAAAAAA 



AAAAAAATTG 
GCGATCATAA 
CTCGACTGCG 
GGTACTCGGA 
GTTCAAGTGA 
AAGAATCTGA 
ATTGAAGATT 
AACGCAGTCA 
CTCTATGCTT 
AGGTAAATAA 
GATTATAATA 



Seq ID NO: 62 Protein sequence 
Protein Accession #: NP_073580 



MRCSQCRVAK 
SEKLYSFYDL 
CNSFTICNAE 
LDMLMTSEER 
WKWEQVLAMC 
IFFPGSHPVR 
ECDANIRAS 



11 
I 

YCSAKCQKKA 
ESNINKLTED 
MQEVGVGLYP 
RKQLRDQYCF 
QAIISSNSER 
GVQVMKVGKL 



21 
1 

WPBHKRECK.C 
KKEGLRQLVM 
SISLLKHSCD 
ECDCFRCOTQ 
L.PDINIYQLK 
QLHQGMFPQA 



31 
I 

LKSCKPRYPP 
TPQHFMREBI 
PNCSrVFNGP 
DKDADMLTGD 
VLDCAMDACI 
MKNLRLAFDI 



41 
I 

DSVRLLGRW 
QDASQLPPAF 
HLLLRAVRDI 
BQVWKEVQES 
NLGLLEEALF 
MRVTHGRBHS 



AAGAACTGAA 
GCAGCAATTC 
CCATGGATGC 
CCATGGAGCC 
TGAAAGTTGG 
GACTGGCTTT 
TGATTCTACT 
GAGGGAAATA 
TGTTAGCTGT 
AGGCAGACAT 
AATTCAAAAC 



SI 
I 

FKLMDGAPSE 
DLPEAFAKVI 
EVGBELTICY 
LKKIEELKAH 
YGTRTMEPYR 
LIEDLILLLE 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 63 DMA sequence 

Nucleic Acid Accession #: NM_003014.2 

Coding sequence: 238.. 648 

1 11 21 31 41 51 

I I I I I I 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 
CGGAGCTCCG CGGCGGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCT G TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 
GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 
AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 
AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 
ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TAOCTTTTGA 
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACOGGAATTC 

Seq ID NO: 64 Protein sequence 
Protein Accession ft: NP_00300S.l 

1 11 21 31 41 51 

974 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



WO 03/042661 



I I I I I I 

MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILAIEQ 60 

YBELVDVKCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV OQRARDDCBP LKKMYNHSWP 120 

BSIACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQBRPLDVDC KRLSPDRCKC 180 

KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTWDV KBIFKSSSPI PRTQVPLITN 240 

SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVBKHRDQL SKRSIQWBER LQEQRRTVQD 300 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession #: BC010423 
Coding sequence: 248.. 1780 

1 11 21 31 41 51 

I I I I I 1 

CACAGCGTGG GAAGCAGCTC TGGGGGAGCT CGGAGCTCCC GATCACGGCT TCTTGGGGGT 60 

AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120 

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180 

TCTGCAGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG AAACGCTGGG CAGTCTGGCT 240 

TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 300 

GCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCG GGTGAGCTGG AGAOCTCAGA 360 

CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCAA GTGGGGCAAG TGGCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 480 

ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGGCTTACG AGGGCCGCGT 540 

GGAGCAGCCG CGGCCCCCAC GCAACCCCCT GGAOGGCTCA GTGCTCCTGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGGTCCAGC 720 

ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840 

CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT CCACTTGGTG CCTAGCCGCA GCATGAATGG 900 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080 

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140 

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

CAATGAGTTC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260 

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320 

ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

CCGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAG AG CCG AG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740 

CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCAGGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 2280 

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 
CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 66 Protein sequence 
Protein Accession #: AAH10423 

1 11 21 31 41 51 

I I I I I I 

MPLSLGAEHW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CT AEG S PAPS 180 

VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 

GFPPIjTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 360 

FCLLWVWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIBTQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHPVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_001203 

Coding sequence: 274.. 1782 

1 11 21 31 41 51 

I I I I I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 

GAGGACGCGG GAGCOGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 

GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 
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CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGOGAAGTGC AGGAAAATTA 300 

AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCOG TCCAAAGGTC 360 

TTGOGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTOGGG ACACTCCCAT TCCTCATCAA 540 

AGAAGATCAA TTGAATGCTG CACAGAAAGG AAGGAATGTA ATAAAGACCT ACACCCTACA 600 

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGAOCTA TACACCACAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTOGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGOGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGAOGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 

AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 

GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 

AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 

CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 

CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCOGTG 1980 
TCTGTTTGTA GGOGGAGAAA CCGTTGGGTA ACTTGTTGAA GATATGATGC AT 

Seq ID NO; 68 Protein sequence 
Protein Accession NP_001194 

1 11 21 31 41 51 

I I I I I I 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCRHHCPB DSVNNIC9TD GYCPTMIEED 60 
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERH ECNKDLHPTL PPLKNRDPVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDU 180 
EQSQSSGSGS GLPLLVQRT1 AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG PIAADIKGTG SWTQLYLITD YHENGSLYDY LK5TTL0AKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLII»WEV ARRCVSGGIV 420 
EEYQLPYHDL VPSOPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NOt 69 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 166.. 1737 

1 11 21 31 41 SI 

I I I I I I 

TTGGGGGTTT ATTCTCTTCC CTTCTAACTT GACAGGGTCT TGCTCTGTCA TTCAGGCAAG 60 

AGTGCAGTAG TGTGATCACT TCTTACTGCC GCCTCAAGCT TCCAGCCTCA ACTCAAGCAA 120 

TCCTCCCACC TCAGCCACCC AAGTGGCTGG GACTACAGAT TAAGAATGAC CCAAAATAAA 180 

TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 240 

GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT ACGGCATCAT CAAGACATTT 300 

GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG GATCTCATGG 360 

ATAATCTCAA TCTGTGTGTT TGTCTTAACA TTTTCAGCTC CCCTCGCCAC AGTCCTGAGC 420 

AATCGTTTCG GACACCGTCT GGTAGTGATG TTGGGGGGGC TACTTGTGAG CACCGGGATG 480 

GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG CATCATCTCT 540 

GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA ATATTTTGGC 600 

AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 660 

TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG CCTCCTCTTC 720 

GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGAOC CATCTTTATC 780 

AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 840 

CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT 900 

ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCCGACATG 960 

CAGCAGGTCC TGGTGAAGAC CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 1020 

TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGGAACA 1080 

CTGGGATTCT TTGCACCTTC CTTGTACATC ATTOCTCTGG GCATTAGTCT GGGCATTGAC 1140 

CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC 1200 

GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 1260 

GTCATCTTAT TGACTGTGTC T CTGTT T GCC TTTACTTTTG CTACTGAATT CTGGGGTCTA 1320 

ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGGAC TCACATTCCA 1380 

CTGCTTGCTG AGGATGATGT CGTGGGCATT GAGAAGATGT CTTCTGCAGC TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAG CTGGCATGGC CCTGGCTGCT 1560 

GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCATCA TCACTCAGGT 1620 

GAAACAAAGG TAGTGAGCCA TGGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

GAAATGGATC TTGCAAAAAA TQAGCACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAACGGGG 1800 

ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGG 1860 
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AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TGTTTTGTTT TGTTTTAAGC 1920 

TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 1980 

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 2040 

TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 2100 

TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 2160 
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C 

Seq ID NO: "70 Protein sequence 
Protein Accession # : Eos sequence 



1 11 21 31 41 51 

I I I 1 I I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS PPFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 

SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASP SQEVSHMYVA 120 

15 IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC PAVPAFAPAI MALKERIGWR 180 

YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 

GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 

GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 

IELICVILLT VSLFAFTFAT EPWGLMSCSI FFGFMVGTIG GTHIPLLAED DWGIEKMSS 420 

20 AAGVYIPIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEDPLEMDLA KNEHRVHVQM EPV 

Seq ID NO* 71 DNA sequence 
„ Nucleic Acid Accession #» NM_004694 
25 Coding sequence: 166.. 1737 

1 11 21 31 41 51 

I I I I I I 

TTGGGGGTTT ATTCTCTTCC CTTCTAACTT GACAGGGTCT TGCTCTGTCA TTCAGGCAAG 60 

30 AGTGCAGTAG TGTGATCACT TCTTACTGCC GCCTCAAGCT TCCAGCCTCA ACTCAAGCAA 120 

TCCTCCCACC TCAGCCACCC AAGTGGCTGG GACTACAGAT TAAGAATGAC CCAAAATAAA 180 

TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 240 

GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT ACGGCATCAT CAAGACATTT 300 

GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG GATCTCATGG 360 

35 ATAATCTCAA TCTGTGTGTT TGTCTTAACA TTTTCAGCTC CCCTCGCCAC AGTCCTGAGC 420 

AATCGTTTCG GACACCGTCT GGTAGTGATG TTGGGGGGGC TACTTGTCAG CACCGGGATG 480 

GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG CATCATCTCT 540 

GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA ATATTTTGGC 600 

AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 660 

40 TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG CCTCCTCTTC 720 

GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGACC CATCATTATC 780 

AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 840 

CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT 90 0 

ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCCGACATG 960 

45 CAGCAGGTCC TGGTGAAGAC CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 1020 

TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA 1080 

CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC 1140 

CAGGACOGOG CTGC T T T TTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC 1200 

<A GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 1260 

50 GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTGAATT CTGGGGTCTA 1320 

ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGACT CACATTCCAC 1380 

TGCTTGCTGA AGATGATGTC GTGGGCATTG CAGAAGATGT CTTCTGCAGC TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGOGCAG CTGGCATGGC CCTGGCTGCT 1560 

55 GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCGTCA TCACTCAGGT 1620 

GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

GAAATGGATC TTGCAAAAAA TGAGCACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAACGGGG 1800 

ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGG 1860 

60 AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TOT1TGTTT TGTTTTAAGC 1920 

TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 1980 

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 2040 

TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 2100 

TAAATGATTT T CT T T TTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 2160 

65 CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C 

70 



Seq ID NO: 72 Protein sequence 
Protein Accession ft: NP 004685 



1 11 21 31 41 51 

I I I I I I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FPPVEVFTYG IIKTFGVFFN DLMDSFNESN 60 

SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASP SQEVSHMYVA 120 

IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 180 

75 YSLLFVGLLQ LNIVIFGALL RPIIIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 

GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 

GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 

IELICVILLT VSLFAFTFAT EPWGLMSCSI FFGFMVGTIG GLTFHCLLKM MSWALQKMSS 420 

AAGVYIPIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
RHHSGETKW SHRGKTLQDI PEDPLEMDLA KNEHRVHVQM EPV 



Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession ft: NM_002184.1 

Coding sequence: 256.. 3012 
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1 11 21 31 41 51 

I I I 1 I.I 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 

CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGOQCC AAGGGGTTOG TGOGCTGTGG 120 

AGACGOGGAG GGTCGAGGOG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 

TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATOCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 

CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 

GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 

AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTOG GACAGCTTGA ACAGAATGTT 600 

TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 

ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 

CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 

GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 

CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 

CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 

GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 

ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 

CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 

CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 

TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 

ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 

GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 

GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 

ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 

GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTCCAAAGG ACCTACTGTT OGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 

GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 

ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 

GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 

ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 

GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 

CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 

TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 

ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 

AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 

TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 

ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 

GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 

CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 

CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 

ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 

GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 

GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 

ATGCCTCAGT GAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 74 Protein sequence 
Protein Accession #: NP_002175.1 

1 11 21 31 41 51 

I I I I I I 

MLTLQTWWQ ALPIPLTTES TGELLDPCGY ISPESPWQL HSNPTAVCVL KEKCMDYFHV 60 

NANYIVWKTN HPTIPKEQYT IINRTASSVT PTDIASLNIQ LTCNILTPGQ LEQNVYGITI 120 

ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNPTL KSEWATHKPA DCKAKRDTPT 180 

SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 

KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSPTVQDLK PFTEYVFRIR 300 

CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SPWYKIDPSH TQGYRTVQLV WKTLPPPEAN 360 

GKILDYEVTTj trwkshlqny tvnatkltvn LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 

FQATHPVMDL KAPPKDNMLW VEHTTPRESV KKYILEWCVL SDKAPCITDW QQBDGTVHRT 460 

YLRGNIiAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 

QLPVDVQNGP IRNYTIFYRT IIGNETAVNV DSSHTBYTLS SLTSDTLYMV RMAAYTDEGG 600 

KDGPEPTPTT PKPAQGEIEA IWPVCLAPL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 

SHZAQHSFHT PPRHNFNSKD QMYSDGNPTD VSWEIEAND KKPPPEDLKS LDLFKKEKIN 720 

TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 

ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYPKQNCS QHESSPDISH PERSKQVSSV 840 

NEEDPVRLKQ QISDHISQSC GSGQMKMPQE VSAADAFGPG TEGQVERPET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq XD NO: 7S DNA sequence 

Nucleic Acid Accession ft: NMJ>22131 

Coding sequence: 11.. 2878 

1 11 21 31 41 51 

I I I I I I 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
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GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 

CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 180 

CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 

AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTQ CCCTTTGAGG CTGTGGTGCT 300 

CAACAAGACA TCAGGAGAGG GCOGGCTCOG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 

GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 

CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 

TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGAOG GAGGGCAAGA TCTATGACAG 540 

CATTCTGCAG GTGGAGGCCA TTGAOGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 

CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 

CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 

CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 

CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 

GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 

CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 

GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 

TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1060 

GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1140 

TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 

GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 

TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 

TCGCCCCGCG' GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 

TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 

ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 

CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 

AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 

CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 

AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 

CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 

TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860 

TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTGCAGGCC ATCGAGCCCC GGATCACCCT 1920 

CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 

CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 

GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 

TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 

CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 

CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 

GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 

CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 

TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 

TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 

CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 

CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 

CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 

GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 

TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 

GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAGTG 2B80 

CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 

CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 

TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 

CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 

TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 

GGCCCTGGGG TTCCAACTCA CTGTGOGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 

CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACAOGT AAGGTCTTAG 3360 

TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 

CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 

CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 

TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 

CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 

GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3760 

AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 

GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 

TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 

AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 

AAGCATATTT GCAATCATTG CAGCTTCTTC T TTCTT CTGC TCATAAAAGG AGGAACACTT 4080 

TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 

TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 

CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT GAGAACATGT TTGTTTCGAA . TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA A 

Seq ID MO: 76 Protein sequence 
Protein Accession 8 : NP_071414 

1 11 21 31 41 51 

I I I I I I 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIBTSYHG VITENNDTVI 60 
LDPPLVALDK DAPVPFAGBZ CAFK1HGQEL PPEAWLNKT SGEORLRAKS PIDCELQKEY 120 
TPIIQAYDCG AGPHBTAMKK SHKAWHIQV KDVNEPAPTF KEPAYKAWT EGKIYDSIM) 180 
VEAIDEDCSP QYSQICKYEI VTTDVPPAID RNGNIRNTBK LSYDKQHQYB ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLP PSIHLETCDG AVSSLQIVTE 300 
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5 



LQTNYIGKGC DRETYSBKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 

RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 

CRLVPLLRKD FDQADTFRPA EPHWKLDQIC DKBWHYYVIN VBPPWTLYM DGATYEPYLV 480 

TNDWPIHPSH IAMQLTVGAC WQGGBVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 

CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQPPTAGVRR 600 

LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IBPRITLRGT DHFWRPAAQP ESARGVTLFP 660 

DIKIVSTFAK TEAPGDVKTT DPKSEVLBEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 

HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRPRIK CSELNGRYTS 780 

NEFNLBVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIIIS 840 

10 VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEBEAE EEMSSSSGSD DSEBEEEBEG MGRGRHGQNG ARQAQLEWDD STL FY 

Seq ID NO: 77 DMA sequence 
Nucleic Acid Accession § : Eos sequence 
15 Coding sequence: 482.. 3007 

1 11 21 31 41 51 

1 I I I.I I 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 

TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 

TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT ISO 

ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 

GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 

ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 

CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 

AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 

TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 

CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 

CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAQCTCAA 960 

CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 

CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 

TGTTGGTTTT CTOGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 

AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATOGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 

ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 

TCTTACGGAC GCTGTGOCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 

GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 

GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 

ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 
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Seq ID NO: 78 Protein sequence 
/J Protein Accession #: Eos sequence 



80 



1 11 21 31 41 51 

I I I I I 1 

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCBA KGIKMVSEIS 60 

VPPSRPFQLS LLNNGLTMLH TNDPSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 

HNSLBILKBD TPHGLENLEP LQABNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 180 

RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDWCNSP PPFRGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRNSTKT 300 

TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIBSLSDLRP 360 
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PPQNPRKLIL 
NHLTKLSKGM 
GVPLTKVNLK 
TDDILCTSPG 
LTDAVPLSVL 
YGHKTTHHTT 
LLBQENHSPL 
QLQPDMBAHY 
T 



AGNI IHSLMK 
FLGLHNLBYL 
TNQFTHLPVS 
HLDKKELKA1* 
ILGLLIMFIT 
ERPSASLYEQ 
TGSNMKYKTT 
PGAHEELKLM 



SDLVBYFTLE 
YLSYNAIKEI 
NILDDLDLLT 
NSEILCPGLV 
IVFCAAGIW 
HMVSPMVHVY 
NQSTEFLSFQ 
ETLMYSRPRK 



MLHLGNNRIE 
LPGTFNPMPK 
QIDLEDNPWD 
NNPSMPTQTS 
LVLHRRRRYK 
RSPSPGPKHL 
DASSLYRNIL 
VLVEQTKNEY 



VLEEGSFMNL 
LKVLYLNNNL 
CSCDLVGLQQ 
YLMVTTPATT 
KKQVDEQMRD 



EKERBLQQLG 
FEliKANLHAE 



TRLQKLYLNG 420 

LQVLPPHIFS 460 

WIQKLSKNTV 540 

TNTADTILRS 60O 

NSPVHLGYSM 660 

GSDAKHLQRS 720 

ITEYLRKNIA 780 

PDYLEVLEQQ 840 



Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NMJU6640.2 

Coding sequence: 39.. 1358 



GCTTAAGTTG 
GGCCTTTGCT 
CTACAGAAAC 
TGGCCTCCAT 
CGACGGTGCA 
TTATGAAGTA 
ACTTCACCAA 
AGCCCGAACC 
ACTGCCTGCT 
AGAGCGAGGT 
TCAGCCCACA 
TTTACTGGGT 
ACTTGOGATA 
CAGAGTTTGT 
CAGACAAACT 
CTGCAGATCC 
GGGAAAGGCT 
CTATTGCAAG 
AAGCAGATGT 
CCTTTTTCTG 
CTCGTAAAAA 
ATGATGTGAA 
GACCAAAAGA 
TGTGGGAATA 
ATTAAATACA 



11 
I 

ACCTCTGGGT 
ACGCGGTCCG 
GACCTCCCAA 
GACAGCOGAC 
OGCTGCGGAG 
CATGGTTTAC 
GACCGTGTTC 
CGAACCTGAA 
GCAGGAGCAC 
CATATCTTTG 
CAACCCGGCC 
GCGTGGTGAA 
CCAGATAGAT 
GCCATTGGAT 
TCCATTATTC 
TTGCTGTTAC 
TTTGAGACAA 

TACTOGACCT 
CTACCAGCTA 
TATATGTTGG 
AGGTTTTAAT 
AGAAAAATCA 
TTTAAATTTT 
TTGATTTTTG 



21 
I 

CCGGAATCGC 
AGGCTTTCAT 
GACGTCGCGG 
AGCAAAGCTG 
TCGGTAGACG 
CCGCAGACCT 
CTGTCGGGTC 
CCTGCGCTGG 
TTCTACCTGC 
CCCTTCCTGG 
CTGGCCGCTG 
GAAATTATTC 
GATAAACCAA 
TATTCTGTTC 
AAACGGCAGT 
GGTCACACCC 
AACTGTGCTG 
TGGACTGGAG 

AATACTTTGG 
GGTACACAAA 
GATGATGTTC 
CAGCTGTTGG 
ACTGAAGGAA 
AGACAAAAAA 



31 
I 

GGGCAAAGAT 
TGCACACCGC 
CGACCCCCGT 
CAOGGCTGCG 
AGAAGCTGCG 
TCGCGCTGAA 
TGCCGCCGCC 
ACCTCGOGGC 
GGCGCAGGCG 
ATCAGCTGGT 
CCGCCCTCGA 
CTCGTGGTCA 
ACAACCAGAT 
CTATAGAAAT 
ATGAAAACCA 
AGTTTCATCT 
ATCAGATAGA 
CACAAGCTAT 
AGGCTGTGAT 
CACTGACTAC 
GTAAGCCTCT 
TACTTCAGAT 
AAAACTGAAA 
CAATAATGAT 
AAAAAAAAAA 



41 

I 

GGCGGOGGCC 
GGCTAATGCC 
CGCGCGGTAC 
GCGGATGGAG 
AATCCTCACC 
TGCGGACCGC 
CCCAGCGGAG 
GCTGCGTGCG 
GCGCGTGCAC 
GTCAACCCTC 
TTATAGATGC 
TCGAAGAGGT 
TCGAATATCC 
CCCCACTATA 
CATATTTGTT 
GTTACCTGAC 
AGTTGTTTTT 
GTATCAAGGA 
CACAGATGGA 
ACAAGCTGAT 
TTATGAAACA 
AGTTCACTTT 
AAGCATATTT 
GAGATTTGTA 
AA 



51 
I 

AGGTGTTGGA 
GCCGCCACGG 
CCGCCGATTG 
CGCTGGCAGG 
AAGATGCAGT 
TGGTACCAGT 
CCCGAGCCCG 
GTCGCCTGGG 
CGTTACGAGG 
GTGGGCCTCC 
CCAGTTCATT 
CGAATTGATG 
AAGCAACTCG 
AAATGTAAAC 
GGCTCAAAAA 
AAATTAAGAA 
AGAGCTAATG 
TTCTGGAGTG 
AAATACTTTT 
CAAAATAACC 
ATTGAGGATA 
CTACTGAATA 
GATTGAGAAC 
ACTGTCAACT 



Seq ID NO: 80 Protein sequence 
Protein Accession NP 057724.1 



1 
I 

MAAARCWRPL 
RRIERWQATV 
PPAEPEPEPE 
VSTLVGLLSP 
IRISRQLAEF 
LLPDKLRRER 
ITDGKYFSFF 
IVHFLLNRPK 



11 
I 

LRGPRLSLHT 
HAAESVDEKL 
PEPEPALDLA 
HNPALAAAAL 
VPLDYSVPIE 
LLRQNCADQI 
CYQLNTLALT 



21 
I 

AANAAATATE 
RILTKMQFMK 
ALRAVACDCL 
DYRCPVHFYW 
IPTIKCKPDK 
EWFRANAIA 
TQADQNNPRK 



31 
I 

TTSQDVAATP 
YMVYPQTFAL 
IjQEHFYLRRR 
VRGEEIIPRG 
LPLFKRQYEN 
SLFAWTGAQA 
NICWGTQSKP 



41 

I 

VARYPPIVAS 
NADRWYQYFT 
RRVHRYEESE 



HIFVGSKTAD 
KYQGFWSEAD 
LYETIEDNDV 



Seq ID NO: 81 DNA sequence 
Nucleic Acid Accession #: 
Coding sequence: 1..2070 



FGENESH predicted 



ATGAGCGGTG 
CGGCGCCGGC 
CTCAGGCAGA 
CAGCAGCAGC 
GAAAACAAGG 
ACACTGCCGC 
GGGGGAACAC 
CCTGTATGCC 
AGCCGTGGCT 
GGGCCTGAGG 
CCAAGTAGAG 
CCTCAGATTG 
ATGCTGGGGG 
GCAGCAACCA 
GCACTTCCCC 
GCTCACTTCC 
TGGAGCCAGC 
GACATGGAGA 
CTGTTCTGGG 
GACAGGACAC 
CCCTCCTGCT 
GGCGCTCGCT 



11 

I 

CGGGGGTGGC 
GCCAGCGCCC 
GCGACCCGCA 
ACTCGGAGAT 
GTGAGCCGGC 
TCCCGCAGCA 
AGGACGGGGA 
AACCCAGTGG 
GGACGATGTT 
TCATTGCAGG 
CTGAAATGGG 
CTGCTGTGGC 

TGGGGACAAA 
ATCCTGACAG 
CATTATCTTT 
CTGGGAACAT 
AGGGGGTTGA 
CAAAGTGTGG 
GGGAAGAGGC 
TTCCAGATGG 
GGGTCTGCAT 




CAGAAACACA 
GCCCCTCCAG 
GTACAGGTTC 
ATGCAGCCAA 
GCGGCAGGTG 
AAGGAACCCC 
CAGGCCCAGG 
ATGGACACAC 
GGGAGGAAGC 
CGGCCCCCAC 
GGGGCTGGGG 
CGCAGCTGGG 
GGGAGGGCCC 
CCCAAGTCGG 
CATGCTTTCC 
CCCCrCAGGA 
CAACGGAGTG 



31 
I 

CGGCCCCCCA 
GTCCAGTCCT 
CTGGACCTGG 
CTCCATGAGG 
AGGCCGGCCC 
GCCATCAACT 
ACTGTCCTTG 
TGGGGGACCT 
GCACAGCACG 
GCCACAGGGT 
TGGGACAGCC 
ATTTCCAGCC 
TCCATCCAGG 
AGAGTCCTGT 
CCAGCCCAGG 
CTGACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAGCCCCAGC 
CTCGGGACCT 
AACCACCTTT 
TGGGTAGAGC 



41 
I 

GCTCGCCGAC 
TGAGGCOGCA 
AGAAAAGCCT 
AGATCGAGCA 
TGCCTCCCCA 
CCAGCACACG 
CCCACCTGGC 
GGACAGATGC 
TGCTGCTCTC 
GCTCCCCAGA 
CCTGCCCTGC 
CTATGGCTCT 
GATCCCTTCC 
TTCCTTGCCA 
ATCCTGGGCT 
GAGGACATCT 
GGGCTCTCCC 
GCTGTGGCAA 
CCTGCAGTGC 
GCTGTTCCAT 
CCAGGGCCTC 
CGGGAGGACC 



60 
120 
180 
240 
300 
360 
420 
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600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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1320 
1380 
1440 



51 
I 

MTADSKAARL 
KTVFLSGLPP 
VISLPFLDQL 
YQIDDKPNNQ 
PCCYGHTQFH 
VTRPFVSQAV 
KGFNDDVLLQ 



60 
120 
180 
240 
300 
360 
420 



51 
I 

CCCGGGCTCT 
GAGCCCGCAG 
GCAGTTCCTG 
TCTGAAGCGG 
GGCACACTCA 
CCTGGGCTCA 
TGCACTGGCC 
CGCTACCTCT 
GGGAAGCCCA 
CCTCCCTCCT 
TAGATCTTTG 
GAGTCCTCAC 
TGCCATCTGG 
CTTGTCCAAG 
GTGGTCTCAA 
GACTGGTGGA 
TTCCCAGGGA 
CTCCAGTGAG 
TGGGGACGCT 
GTGTCCCAAG 
TGCTCCCTTG 
CAGCCCTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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1320 
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AGGCTGAAGG 
GGCGGTAGCG 
TCTGTCAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAG 
AGGCAGATGG 
ACCACACTTA 
ACCCAAGAGC 
CCGGAGGAAG 
AAGAGCCTCT 
CTGAAGCAGA 
AAACGGCGCC 



AGGGCTCCTC 
CCGACACTGT 
CCATCTCTAA 
AGCAAGATTC 
ACAACAGCAA 
AGGCCTCTAA 
GGGCGGGGGC 
GGCAGTGCGA 
TGCGGCACCT 
CTAGCTTTCC 
CCAAGAAATG 
CCCCGAAGAA 
TGCATCGCTC 



ACGGACACAC 
GCGCTCTCCT 
TTCAGCCAAC 
AAAAGCTGAC 
GCTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AGTGCTCATC 
CAAGTCCCTC 
CAGGGACCAA 
CCTGAGCCCA 
CAACTTTGCC 
AGTGCTTTGA 



AGGCCAGGAG 
GCAGACAGCC 
TCTCAAGGCA 
GTCTCCCAGA 
GTTCCTGGGG 
GCCTGTATGG 
ATGATCCTGC 
CGCGAGCTGT 
CTGGAAGGGA 
GAAGCCACGC 
CCTGTGGCGG 
GAGAGGCAGA 



GCAAGCGTGG 
TCTCCATGTC 
AGGCCAOGCC 
AGGOGGACCT 
TACAAGGGCA 
GGAACAGCCA 
CCCTTCCCCT 
GGAATACCAA 
GCCAGAGGCC 
ATTTCCCCAA 
AGCGTGCCAT 
AGAGGCTGCA 



GCGTCTTGCG 
AAGCTTCCAG 
CCAGCCCGGC 
GGAAGAGGAG 
GGCCAGAAAG 
GCACCAGGGC 
GCGAAAGCCC 
CCTCCTGCAG 
CCAGGCAGCC 
GGTCTCCACC 
CCTGCCCGCA 
GGCAATGCAG 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO: 82 Protein sequence 
Protein Accession FGENESH predicted 



MSGAGVAAGT 
QQQHSEMLAK 
GGTQDGEPLQ 
GPBVIAGRQV 
WLGAQGIWTH 
AHFPLSLGLG 
LFWAKCGPSR 
GARWVCXNGV 
SVKSISNSAN 
EKAEASNAGA 
TQELRHLKSL 
LKQTPKNNFA 



11 
1 

RPPSSPTPGS 
LHEEIEHLKR 
TVLAHLAALA 
ATGCSPDLPP 
SIQGSLPAIW 
LTSGGHLTGG 
QPQPCSAGDA 
WVEPGGPSPA 
SQGKARPQPG 



21 
I 

RRRRQRPSVG 
ENKGEPARGP 
PVOQPSGYRF 



LEGSQRPQAA 
ERQKRLQAMQ 



AATMGTKGGS 
WSQPGNIAAG 
DRTREEAMLS 
RLKEGSSRTH 
SFNKQDSKAD 
RQMGAGAHPP 
PEEASFPRDQ 
KRRLHRSVLi 



31 
I 

VQSLRPQSPQ 
RPALPPQAHS 
WGTWTDAATS 
WDSPCPARSL 
RVLPPCHLSK 
AVPRALP5QG 
LGTCCSMCPK 
RPGGKRGRLA 
VSQKADLEEE 
MILPLPLRKP 
EATHPPKVST 



Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NMJJ05264.1 

Coding sequence: 557.. 1954 



11 



41 
I 

LRQSDPQKRN 
TLPIiPQHRNT 
SRGWTMLCSQ 
PQIAAVARPR 
ALPHPDSGPH 
DMEKGVEGGP 
PSCPPDGPSG 
GGSADTVRSP 
PLLHNSKLDK 
TTLRQCBVLI 
KSLSKKCLSP 



41 



51 
I 

LDLEKSLQFL 
AINSSTRLGS 
AQHVLLSGSP 
ISSPMALSPH 
PAQDPGLWSQ 
FPSRCGNSSE 
NHLSRASAPL 
ADSLSMSSFQ 
VPGVQGQARK 
RELWNTNLLQ 
PVAERAILPA 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



21 31 

I I I I 1 I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGOGGGGA CACCATTGCC 
CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TOCTCTTTCA 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCX3G CTCGGGATTT TTTTGGGGGG GCGGGGACCA 
GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCOGC TCTTGGACTT 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CGAGTGATCA 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 
GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 
CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CQAATTACAT 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 
AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGOCCCAT GGTGTGACTG 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 
TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 
GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 
TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 



60 
120 
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600 
660 
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900 
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Seq ID NO: 84 Protein sequence 
Protein Accession #i NM_005264.1 
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i 
I 

MFLATLYFAL 
SLASGLEAKD 
YEPVNSRLSD 
SVSNDVCNRR 
PNCLNLQDSC 
YIDSSSLSVA 
QTTTATTTTA 
EKEGLGASSH 



11 
I 

PLLDLLLSAE 
ECRSAMEALK 
IPRWPFISD 
KCHKALRQFF 
KTNYICRSRL 
PWCDCSNSGN 
LRVKNKPLGP 
ITTKSMAAPP 



21 
I 

VSGGDRLDCV 
QKSLYNCRCK 
VFQQVEHIPK 
DKVPAKHSYG 
ADFFTNCQPE 
DLEECLKFLN 
AGSENBIPTH 
SCGLSPLLVL 



31 
I 

KASDQCLKEQ 
RGMKKEKNCL 
GNNCLDAAKA 
MLFCSCRDIA 
SRSVSSCLKE 
FFKDNTCLKN 
VLPPCANLQA 
WTALSTLLS 



41 
I 

SCSTKYRTLR 
RIYWSMYQSL 
CNLDDICKKY 
CTERRRQTIV 
NYADCLLAYS 
AIQAFGNGSD 
QKLKSNVSGN 
LTETS 



51 
I 

QCVAGKETNF 60 

QGNDLLEDSP 120 

RSAYITPCTT 180 

PVCSYEERBK 240 

GLIGTVMTPN 300 

VTVWQPAFPV 360 

THLCISNGNY 420 



Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession 8: XM_027l72.l 

Coding sequence: 143 . . 1405 



11 



21 



31 



41 51 

1111)1 

GGTGGAGACA CCGCCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGCG TTGTGGGTGA 60 

CCCCCACACA GGCACTGGGA ATGCAGG6GA GAGGGGGCCA AGGGGGAAAG GGGCCAGAGT 120 

GTTGGCTTTG GATTCAGGAG GGATGGATTC CAGTCCTAGC TTGCCACTTA TTAGGACTCC 180 

TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGGACC TCTTCCTCAC 240 

CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 300 

CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCTTTG ACCCTCAGGA 360 

CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA GGCACCGGAG 420 

GAAGTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTGAAC CGCCCGACGC TGGGCCAATT 480 

CACTGAAGAG GAAATCCACG CTGAGGTCTG CTATGCAGAG TGCCTGCTGC AGCGAGCAGC 540 

CCTGACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 600 

TCCCTCTCAC GCCTGCAGCT GCCCACCTGG GCCAGGCCGT CAGCATCTTT TCCTCCTGCA 660 

GGACGAGAAC ATGGTGAGCT TCATCAAAGG CGGCATCAAA GTTCGAAACA GCTACCAGAC 720 

CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC TGCAAGGGTG AGAACCACCC 780 

GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC AACCTGACAC TGTCCATGCT 840 

TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 900 

GCTGCTGCAG CTGGAGGAGG GAGOGTCAGG GCACAGCTTC CGCTCTGTGC TCTGTGTCAT 960 

GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC GGTACTGGGA ACGTCAACAT 1020 

CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG TACCCTAAGG GTGCCATCTT 1080 

CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATTGATGCAG TGAGTGATGG 1140 

GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG ACCAGCAGGA AGTCAGGCAC 1200 

ATGTGACATA CTCAGGGACA GGATAGACTG GGGGOGGGGG GGGGGCCAAG AGAGAACCAA 1260 

CCAGAGAGCA GGGGCAGGAG AGGGCCTTCT GGCAGAGCAG CCTGGGAAGA CAAGGGAGGA 1320 

GGAGGCATTT GTGGTGCCTG GGATTTTGAC TGGGAGATAT AGGACTGCAG CATTGCAGTG 1380 

GAGGGAGGTG GAGGGAGGTG CTTGAGGGAG GCAGAGGTTA GGAAAGCCCA TCTGTTTAGG 1440 

GCATGACGAT TAGGCTGGAG TCTGGTACCT CCCCTCCATT ATAGCTCTCT CCTGCTCTTT 1500 

CATTTTGTTA CTAAAAACCA GAGTCCTAGG CGGGGGCTGT ATTTGAGCCC AACGTCATGT 1560 

AAGACTTAGG AGGTAAAACC AGGACTGGAG GCCAGATCTC CTGGCTCCTG GGGCCCCACC 1620 

TGAGCCTAGC ACAGGGCTGG ACCACTATGC CCTGGAGGAG TCCCGGTCTG CTGTGGTGTT 1680 

GGGAGGTTCG GAGGATGCAG AGGGGTTGGG GCTGGGTGGG CACCCGTCAG GCTGACCAGA 1740 

AGGTGCCTGC AGGCCATCCG GCGTTTCGAG GAGTGCTGTG AGGCCCAGCA GCACTGGAAG 1800 

CAGTTCCACC ACATGTGCTA CTGGGAGCTG ATGTGGTGCT TCACCTACAA GGGCCAGTGG 1860 

AAGATGTCCT ACTTCTACGC CGACCTGCTC AGCAAGGAGA ACTGCTGGTC CAAGGTGGGC 1920 

TGATGCCACG TGTTAGGGGC ATTGGGTGAC CAGGGCTGAC TGTGTGCCTC CAGACCACGG 1980 

GCCAAATCCC TAACTGAACA CAGATGTCTC AGCTGGAATC TAAACATAAC CTTAAATTCT 2040 

AATAGGACTC AGGCTTGGAA GGAACTAAAG ACCACAAGAG AAACTTCTGA ACCACAATGT 2100 

GTCACAAAGA GATTTCTAGT CACAAAGGAC AGAAACATGG CTCCCTCTGT CCAGTAGAAC 2160 

TGTTTCTGTG GTAGAAATGT TCTACTTATG CACTGCCCAA ATATGGTAGT CACCAGCCAC 2220 

ATGTGCACAA TGAGCACATG AAATGTGCCT AGTGCAACTG GGAAACTGAT TGTTTTCAGT 2280 

TTTATTTAAT TTTAATTAAC TAAATGTTAA ATTTAAATAG CCATGTAGGG CTTGTGGCCA 2340 

CTATATTGGA CTATGCAGGT CCAAAACACA AAAGGCTCAT ATAACTGAAC ATTCTTGGCA 2400 

CATCCGACTT CAGGTAGGGC TGGATCCAGG AATTCAAATG ATGTCGTCTG GCTTGGTCTT 2460 

TCCATTTGTG GCTGTGCTCT CTCCTATGAC ATCTTTGTTT CTGCTGCATC TTTGCAGGAA 2520 

GGTTCTCTCC ATGTGACAGG CAAGGTGGCC ACGGGCTGCT TCTACTCATA TCCTCCCCTT 2580 

GGTTTCAACC •AGAGTCCCAC AGTTTGAGCC TCACTGGTCT GACTTGTCAC CTGCCAATGC 2640 

CTGGAACAGA GGGGTGGGAA GATTCCTCAT GGGCTGAGAG TAGGAGAGGG GTGGTTCCAC 2700 

AGAAGAAAAT GATGCACCCA GAAATAGTGG AGGGATTAAC AAGATGCCAT ACAGGCAAAA 2760 

CAAAGCCAAA CAGATGCCCG CCTACGAAAG ATGAAATTTA TCATGGTAAG TATTGAAATA 2820 

AGTGTTAGCT TGTACCATCA TAGTAATGAT AGTGCAGAAA TTGGAACCAA GAGTCTTACA 2880 

ACCACCTAGC TCAGCAAACG TCTAATCTGT TATTTGTAAA TACACAGGAC ATGTGTCTTC 2940 

ATGGCTTCAT CCCCTGCCCA TAGCAGACAT TGCTAATCAA TCCTCTGCCA TGAGCCTAGC 3000 

TGTGACCTTA GATACCTTCC CTGCAAAGCT CCAGGCAGCC GTTAGAACTG ACCCACATTG 3060 

GCACCTGTTA GGAGACCTGT CATCTGCATC TGGTCCCTCC CCCTCCTCTG CATGGAGGTC 3120 

TCTTCCGTAG CCCTGCCTGG GGACCAAGCT GAAGGGGAAT TCTCTGGGTC TGGGGCAGGT 3180 

GGGGCTGGGG AAGGGAGCAG GCTTCTACTG AGCTCTCAAC ATGTCTGGCA CCTGTCATGT 3240 

CCTCCACACC ACACCCAGTG AGGAGAAGTT ATTATCTCCA TTGGGCACAT GAGGGCCACT 3300 

GATGCTAAGC GAGGACTGTG ACTTTCCCAC CCACATGGCT GTTGGTGACA GACACCGGAT 3360 

TTGAGGACAG TCCTGTGTGA CTCAGAGGCC TCTGCTTCTC TCTTGCTCCT AGAGTCCTCC 3420 

TGAGGAGTCG GGGCTTGCCC TGAGCCCACC CTGCTGTTGA AGGTGCTTCC TCAGGCCCAG 3480 

CTCCCATGGC CCCCACACCC CCTCCTCATC ACCTCCTACT CCCAAAAAGG ACAAAGCCTC 3540 

AGGGAACCTT ITXTCTTTTT TTAGAGACAG GGTCTTGCTA TGTTGGTCAG GCTAGTCTTG 3600 

AACTCCTGGG CCCAAGCAAT CTTCCCACCT CTGCCTCCTA AAGTGCTGGG ATTCTGGGCC 3660 

TTAGGGAACC TTTTTGAACT GAAAGTGACT CTCGAAGCCT TCCTGTAGAG TGAGGTGGGT 3720 

GGCTGGGGCT AACCATATGG GGAAGGAGAG ACCTGGTGGG GGCACACAGC TGCTATATAG 3760 

AGGAACAGAA GGTGGGCCAG GCCTCCCAGC TGTCAGGAAA TCTGGCTCAG TCCCCAGACC 3840 

GCTGTGTGGC TTTGGGGTGG CCAGTCCCTC TCTCTCTGCA ACACTCTCCT CATGAGGACT 3900 

TCTGTGAAAA ATGGGGGTTG TAACCCTCTG AGTTCTGGGG CCCCACCCAT TTCTAATCTC 3960 
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CAGAACTTGG CCACCACAGA CTCCACCAAC TTCTAGTCCT GGGGCCTGGG CCTCTGGCCA 4020 

TTGCCATAGG CACCACCTGC TCTGTGCAGG CAGCGCCCCC CTCTGCCAGG ATCCTCCGAG 4080 

GTCAGCTGCT GGGTCTGACC CGCAGACCCT GGCTGAGCGA OGGATGAACG GAGTATGCAG 4140 

ACACAGGCTT GCCTGTCAGC AGATGGGGGA CCCCTGCCAG AGTCAGCAGC GGCCCCCATA 4200 

AGCCTGCCAC GCTTGCATTT ATTTAGTACA GATGTAATGA CAAAGGCCTA AAGCAAACTC 4260 

CATTTGTGGG TAATTAACAT TGTCGCCCCC CCAGAAAGAG CAGTCCTCOG CATGATGATT 4320 

AAAGGCCAGG TTCCGAGGCC TAAGTAAACC AACTTATCTA GATCAATTCC CTTACTTCTT 4380 

GTTATCTACT CTGAGAGAAT TCAGCTGCCT TCAGCCAAAT CCTTTCCCGA AGCTTTTGCA 4440 

AAACCTCCGA GCCTTCCAAG GTTTGCTTCT TTCTGTAATT TTTCTCACCA CCCTGACCTA 4500 

TCTCCTGCAG TCAGCCCTGT GGAGGCCTTT GTGTTTCCCC CAGTGCTGGC AGCCTAGAGG 4560 

CTGAGATGGC CAGAAACAAG GTGGTGACAG TGGCGTGCTC AGGGCTTGGG AAACCCAAGG 4620 

AGCTAAAGGC ATGCCCAGGC AACCAAAGAG GACAGGAAGG CTTCTGAGGA GAGACCTCTG 4680 

AGGTGGGTCT TGGAGAGGAA GGACTTAGGG AGGCAGAGTG GAGGAAGTGA GAGGACACCC 4740 

CAAGCCAAGA GGGCGGCAGG ACCAAAGGCT CAGAAGCCAG GGCGCTGCAG AGGGGCTGTG 4 800 

TGCCACAGGG TGAAGAGTTT GTGTGGCAGA AGGGCAGGGG GCTTGCATCA GGGGTGACAG 4860 

CTGCTCTTTT GTCCCAGCAT AGCCCCTGTA CATCCCTGGA GAGCTGGGGC GTCCACAACT 4920 

CTAAGTCACA GCCCCCATCC TAACCCTGGT GGTGCAGTGA GGGTGAGCTG TCTGTGGGCA 4980 

GGAGGGAAGA CTCTTGGAGA TGAGCCTGGT GAAGGGATAA TGGCATCCOG GGCCGAGGAG 5040 

CAGCACAGGC AGAGGCCTGG GGAGAGTTTA AGGAGTGTAG GGGAGGAAAT GGCAGAAGAT 5100 

GAGCCAGAAA AAGAAAGGTT AGGGCAGGTC CTGGAGGACA TGAGTGGCTG TTTGGGCTTT 5160 

ATCCAGCAGT GGGGGAGCCT TGGCAGGCTT GTGGCTTAGA TAGGTGCTTT AGAAAGCCCA 5220 

CCAGCAGTTG CTGGGCCACC CCGCTGGCTG GGTCCTGTTC TAAGGCAGGA AATACAAGCA 5280 

TGAGCAGGAA AAGACCCCCT CAAGGCTCAC GTCCTAGTGG GGAGACAAGA AACACAGATG 5340 

GGCAATATAA CACGATGTCT GGTTCCAGTA AGTGCAGTGA AGAACAAGCG AGGCTGGATG 5400 

CAGGGAGTGA TGGGAGGGGC TTTGTAAGGG GAGGTCGGGG GAAGCCTGTC TCAGAGGACA 5460 

CCAGAATGGA GCGCAGGAGC AGCACGTGGC AGTCACATGG CAGGCCGTTA GGGCAGAGGG 5520 

AGCTGGGCAG GGCACAGCAG GGCAGGAGTG TGTTTGATGT GTCCTGGGAA CCGCCCTGAG 5580 

GCCGTCGTGT GGCTGGAGTG CTGCAGGTGT CAAGGAAATT GTAGGAGATG TCTCCTGAGT 5640 

GTGATGGAAT ATAACCAGAT TTCCAGAAGG AACTGACATG ATCTGACTTA AAAAGGTCAG 5700 

TGTGCGAAAT GGCTTGCAGG GGACAGGAGT GGGAGCAGGG AGATAGGAGA CAATGTGTAC 5760 

CAGGACAGCA GAAAGACATC CCGGGTAGCC TGGAACAGGG AGACGGTGTG GAGATGGTGG 5820 

CAGTCCGATA ATGAGAGCCG TAGGGCAAGG CCAGCAGGAT CCTAGAGTGA GACGGGAGGT 5880 

AAAGTCACCG GGACTTGGTG TCTCCACGTC AGGGGCAGGG GAAAGGGAGA GGACAAGGGT 5940 

GACCCGGGAG GTTAAAGATG GGACGGGGGC CAGACGCAGT GGCTCATGCC TGTAATCCTA 6000 

GCACTTTGGG AGGCTGAGGC GGGCGGATAG CTTGAGGTCA GGAGTTTGAA ACCAGCCTGG 6060 

CCAACATGGT GAAACCCCGT CTCTACTAAA ATATACAAAA ATTAGCCTGG CGTGGTGGTG 6120 

CATGCCTGCA GTCCCAGCTA TTCAGGAGGC TGAGGCAACA AGAATCGCTT GAACCTGGGA 6180 

GGCGGAGGTT GCAGTGAGCC GAGATCGCGC CATAGCACTC CAGCCTTAGC CTGGGCGACA 6240 
GAGCGAGACC ACATC 



Seq ID NO: 86 Protein sequence 
Protein Accession #s XP 027172.1 



MDSSPSLPLI 
LEMQAMMTFD 
EVCYAECLLQ 
IKGGIKVRNS 
LEPVGFSGNK 
KPYLNRYPKG 
IOWGRGGGQE 



11 
I 

RTPESSLHEA 
PQDILLAGKM 
RAALTPLQGS 
YQTYKKLDSL 
DYGLLQLEEG 
AIPLPFAGRI 
RTNQRAGAGE 



21 
I 

LDQCMTALDL 
MKEAQMLCQR 
SHGGAVRPRA 
VQSSQYCKGE 
ASGHSPRSVL 
EVIKGNIDAV 
ALLAEQPGKT 



31 
I 

FLTNQFSEAL 
HRRKSSVTDS 
LHDPSHACSC 
NHPHFEGGVK 
CVMLLLCYHT 
SDGGPGRGWG 
REEEAFWPG 



41 
I 

SYLKPRTKES 
FSSLVNRPTL 
PPGPGRQHLF 
LGVGAFNLTL 
PLTFVLGTGN 
SLGVSQTSRK 
ILTGRYRTAA 



Seq ID NOj 87 DNA sequence 

Nucleic Acid Accession 8: AB007921 

Coding sequence: 14 3.. 13 63 

1 11 21 31 

1 I 1 I 

GGTGGAGACA CGGCCTCAGG GCTCGGTGCA CAGTGGACAT 
CCCCCACACA GGCACTGGGA ATGCAGGGGA GAGGGGGCCA 
GTTGGCTTTG GATTCAGGAG GQATGGATTC CAGTCCTAGC 
TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC 
CAACCAGTTC TGAGAAGCAC TCAGCTACCT CAAGCCCAGA 
CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG 
CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG 
GAAGTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTGAAC 
CACTGAAGAG GAAATCCACG CTGAGGTCTG CTATGCAGAG 
CCTGACCTTC CTGCAGGGTT CCTCACAOGG AGGGGCAGTC 
TCCCTCTCAC GCCTGCAGCT GCCCACCTGG GCCAGGCCGT 
GGACGAGAAC ATGGTGAGCT TCATCAAAGG CGGCATCAAA 
CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC 
GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC 
TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT 
GCTGCTGCAG CTGGAGGAGG GAGCGTCAGG GCACAGCTTC 
GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC 
CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG 
CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC 
GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG 
ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGGGG 
ACCAGAGAGC AGGGGCAGGA GAGGCCCTTC TGGCAGAGCA 
AGGAGGCATT TGTGGTGCCT GGGATTTTGA CTGGGAGATA 
GGAGGGAGGT GGAGGGAGGT GCTTGAGGGA GGCAGAGGTT 
GGCATGACGA TTAGGCTGGA GTCTGGTACC TCCCCTCCAT 



AGGGGGAAAG 
TTGCCACTTA 
GCCCTGGACC 
ACCAAGGAAA 
ATGACCTTTG 
CTGTGTCAGA 
CGCCCCACGC 
TGCCTGCTGC 
AGGCCCAGAG 
CAGCATCTTT 
GTTCGAAACA 
TGCAAGGGTG 
AACCTGACAC 
TCAGGAAACA 
CGCTCTGTGC 
GGTACTGGGA 
TACOCTAAGG 
ATTGATGCAG 
ACCAGCAGGA 
GGGGGGCCAA 
GCCTGGGAAG 
TAGGACTGCA 
AGGAAAGCCC 
TATAGCTCTC 



51 
I 

MYHSLTYATI 
GQFTEEEIKA 
LLQDENMVSF 
SMLPTRILRL 
VNIEEAEKLL 
SGTCDILRDR 
LQWREVEGGA 



51 
I 

TTGTGGGTGA 
GGGCCAGAGT 
TTAGGACTCC 
TCTTCCTCAC 
GCATGTACCA 
ACCCTCAGGA 
GGCACCGGAG 
TGGGCCAATT 
AGOGAGCAGC 
CCTTGCATGA 
TCCTCCTGCA 
GCTACCAGAC 
AGAACCACCC 
TGTCCATGCT 
AGGACTATGG 
TCTGTGTCAT 
ACGTCAACAT 
GTGCCATCTT 
TGAGTGATGG 
AGTCAGGCAC 
GAGAGAACCA 
ACAAGGGAGG 
GCATTGCAGT 
ATCTGTTTAG 
TCCTGCTCTT 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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TCATTTTGTT ACTAAAAACC AGAGTCCTAG GGGG06GCTG TATTTGAGCC CAACGTCATG 1S60 
TAAGACTTAG GAGGTAAAAC CAGGACTGGA GGCCAGATCT CCTGGCTCCT GGGGCCCCAC 1620 
CTGAGCCTAG CACAGGGCTG GACCACTATG CCCTGGAGGA GTCCOGGTCT GCTGTGGTGT 1680 
TGGGAGGTTC GGAGGATGCA GAGGGGTTGG GGCTGGGTGG GCACCCX3TCA GGCTGACCAG 1740 
5 AAGGTGCCTG CAGGCCATCC GGCGTTTCGA GGAGTGCTGT GAGGCCCAGC AGCACTGGAA 1800 
GCAGTTCCAC CACATGTGCT ACTGGGAGCT GATGTGGTGC TTCACCTACA AGGGCCAGTG 1860 
GAAGATGTCC TACTTCTAOG COGACCTGCT CAGCAAGGAG AACTGCTGGT CCAAGGTGGG 1920 
CTGATGOCAC GTGTTAGGGG CATTGGGTGA CCAGGGCTGA CTGTGTGCCT CCAGACCACG 1980 
GGCCAAATCC CTAACTGAAC ACAGATGTCT CAGCTGGAAT CTAAACATAA CCTTAAATTC 2040 
10 TAATAGGACT CAGGCTTGGA AGGAACTAAA GACCACAAGA GAAACTTCTG AACCACAATG 2100 
TGTCACAAAG AGATTTCTAG TCACAAAGGA CAGAAACATG GCTCCCTCTG TCCAGTAGAA 2160 
CTGTTTCTGT GGTAGAAATG TTCTACTTAT GCACTGCCCA AATATGGTAG TCACCAGCCA 2220 
CATGTGCACA CTGAGCACAT GAAATGTGCC TAGTGCAACT GGGAAACTGA TTGTTTTCAG 2280 
TTTTATTTAA TTTTAATTAA CTAAATGTTA AATTTAAATA GCCATGTAGG GCTTGTGGCC 2340 
15 ACTATATTGG ACTATGCAGG TCCAAAACAC AAAAGGCTCA TATAACTGAA CATTCTTGGC 2400 
ACATCCGACT TCAGGTAGGG CTGGATCCAG GAATTCAAAT GATGTCGTCT GGCTTGGTCT 2460 
TTCCATTTGT GGCTGTGCTC TCTCCTATGA CATCTTTGTT TCTGCTGCAT CTTTGCAGGA 2520 
AGGTTCTCTC CATGTGACAG GCAAGGTGGC CACGGGCTGC TTCTACTCAT ATCCTCCCCT 2580 
TGGTTTCAAC CAGAGTCCCA CAGTTTGAGC CTCACTGGTC TGACTTGTCA CCTGCCAATG 2640 
20 CCTGGAACAG AGGGGTGGGA AGATTCCTCA TGGGCTGAGA GTAGGAGAGG GGTGGTTCCA 2700 
CAGAAGAAAA TGATGCACCC AGAAATAGTG GAGGGATTAA CAAGATGCCA TACAGGCAAA 2760 
ACAAAGCCAA ACAGATGCCC GCCTACCAAA GATGAAATTT ATCATGGTAA GTATTGAAAT 2820 
AAGTGTTAGC TTGTACCATC ATAGTAATGA TAGTGCAGAA ATTGGAACCA AGAGTCTTAC 2880 
AACCACCTAG CTCAGCAAAC GTCTAATCTG TTATTTGTAA ATACACAGGA CATGTGTCTT 2940 
25 CATGGCTTCA TCCCCTGCCC ATGGCAGACA TTGCTAATCA ATCCTCTGCC ATGAGCCTAG 3000 
CTGTGACCTT AGATACCTTC CCTGCAAAGC TCCAGGCAGC CGTTAGAACT GACCCACATT 3060 
GGCACCTGTT AGGAGACCTG TCATCTGCAT CTGGTCCCTC CCCCTCCTCT GCATGGAGGT 3120 
CTCTTCCGTA GCCCTGOCTG GGGACCAAGC TGAAGGGGAA TTCTCTGGGT CTGGGGCAGG 3180 
TGGGGCTGGG GAAGGGAGCA GGCTTCTACT GAGCTCTCAA CATGTCTGGC ACCTGTCATG 3240 
30 TCCTCCACAC CACACCCAGT GAGGAGAAGT TATTATCTCC ATTGGGCACA TGAGGGCCAC 3300 
TGATGCTAAG CGAGGACTGT GACTTTCCCA CCCACATGGC TGTTGGTGAC AGACACCGGA 3360 
TTTGAGGACA GTCCTGTGTG ACTCAGAGGC CTCTGCTTCT CTCTTGCTCC TAGAGTCCTC 3420 
CTGAGGAGTC GGGGCTTGCC CTGAGCCCAC CCTGCTGTTG AAGGTGCTTC CTCAGGCCCA 3480 
GCTCCCATGG CCCCCACACC CCCTCCTCAT CACCTCCTAC TCCCAAAAAG GACAAAGCCT 3540 
35 CAGGGAACCT TTTTTCTTTT TTTAGAGACG GGGTCTTGCT ATGTTGGTCA GGCTGGTCTT 3600 
GAACTCCTGG GCCCAGGCAA TCTTCCCGCC TCTGCCTCCT AAAGTGCTGG GATTCTGGGC 3660 
CTTAGGGAAC CTTTTTGAAC TGAAAGTGAC GCTCGAAGCC TTCCTGTAGA GTGAGGTGGG 3720 
TGGCTGGGGC TAACCATATG GGGAAGGAGA GACCTGGTGG GGGCACACAG CTGCTATATA 3780 
GAGGAACAGA AGGTGGGCCA GGCCTCCCAG CTGTCAGGAA ATCTGGCTCA GTCCCCAGAC 3840 
40 CGCTGTGTGG CTTTGGGGTG GCCAGTCCCT CTCTCTCTGC AACACTCTCC TCATGAGGAC 3900 
TTCTGTGAAA AATGGGGGTT GTAACCCTCT GAGTTCTGGG GCCCCACCCA TTTCTAATCT 3960 
CCAGAACTTG GCCACCACAG ACTCCACCAA CTTCTAGTCC TGGGGCCTGG GCCTCTGGCC 4020 
ATTGCCATAG GCACCACCTG CTCTGTGCAG GCAGCGCCCC CCTCTGCCAG GATCCTCCGA 4080 
GGTCAGCTGC TGGGTCTGAC CCGCAGACCC TGGCTGAGCG ACGGATGAAC GGAGTATGCA 4140 
45 GACACAGGCT TGCTTGTCAG CAGATGGGGG ACCCCTGCCA GAGTCAGCAG CGGCCCCCAT 4200 
AAGCCTGCCA CGCTTGCATT TATTTAGTAC AGATGTAATG ACAAAGGCCT AAAGCAAACT 4260 
CCATTTGTGG GTAATTAACA TTGTCGCCCC CCCAGAAAGA GCAGTCCTCC GCATGATGAT 4320 
TAAAGGCCAG GTTCCGAGGC CTAAGTAAAC CAACTTATCT AGATCAATTC CCT TACT TCT 4380 
TGTTATCTAC TCTGAGAGAA TTCAGCTGCC TTCAGCCAAA TCCTTTCCCG AAGCTTTTGC 4440 
50 AAAACCTCCG AGCCTTCCAA GGTTTGCTTC TTTCTGTAAT TTTTCTCACC ACCCTGACCT 4500 
ATCTCCTGCA GTCAGCCCTG TGGAGGCCTT TGTGTTTCCC CCAGTGCTGG CAGCCTAGAG 4560 
GCTGAGATGG CCAGAAACAA GGTGGTGACA GTGGCGTGCT CAGGGCTTGG GAAACCCAAG 4620 
GAGCTAAAGG CATGCCCAGG CAACCAAAGA GGACAGGAAG GCTTCTGAGG AGAGACCTCT 4680 
GAGGTGGGTC TTGGAGAGGA AGGACTTAGG GAGGCAGAGT GGAGGAAGTG AGAGGACACC 4740 
55 CCAAGCCAAG AGGGCGGCAG GACCAAAGGC TCAGAAGCCA GGGCGCTGCA GAGGGGCTGT 4800 
GTGCCACAGG GTGAAGAGTT TGTGTGGCAG AAGGGCAGGG GGCTTGCATC AGGGGTGACA 4660 
GCTGCTCTTT TGTCCCAGCA TAGCCCCTGT ACATCCCTGG AGAGCTGGGG CGTCCACAAC 4920 
TCTAAGTCAC AGCCCCCATC CTAACCCTGG TGGTGCAGTG AGGGTGAGCT GTCTGTGGGC 4980 
AGGAGGGAAG ACTCTTGGAG ATGAGCCTGG TGAAGGGATA ATGGCATCCC GGGCCGAGGA 5040 
60 GCAGCAGAGG CAGAGGCCTG GGGAGAGTTT AAGGAGTGTA GGGGAGGAAA TGGCAGAAGA 5100 
TGAGCCAGAA AAAGAAAGGT TAGGGCAGGT CCTGGAGGAC ATGAGTGGCT GTTTGGGCTT 5160 
TATCCAGCAG TGGGGGAGCC TTGGCAGGCT TGTGGCTTAG ATAGGTGCTT TAGAAAGCCC 5220 
ACCAGCAGTT GCTGGGCCAC CCCGCTGGCT GGGTCCTGTT CTAAGGCAGG AAATACAAGA 5280 
ATGAGCAGGA AAAGACCCCC TCAAGGCTCA CGTCCTAGTG GGGAGACAAG AAACACAGAT 5340 
65 GGGCAATATA ACACGATGTC TGGTTCCAGT AAGTGCAGTG AAGAACAAGC GAGGCTGGAT 5400 
GCAGGGAGTG ATGGGAGGGG CTTTGTAAGG GGAGGTOGGG GGAAGCCTGT CTCAGAGGAC 5460 
ACCAGAATGG AGCGCAGGAG CAGCACGTGG CAGTCACATG GCAGGCCGTT AGGGCAGAGG 552 0 
GAGCTGGGCA GGGCACAGCA GGGCAGGAGT GTGTTTGATG TGTCCTGGGA ACCGCCCTGA 5580 
GGCCGTCGTG TGGCTGGAGT GCTGCAGGTG TCAAGGAAAT TGTAGGAGAT GTCTCCTGAG S640 
70 TGTGATGGAA TATAACCAGA TTTCCAGAAG GAACTGACAT GATCTGACTT AAAAAGGTCA 5700 
GTGTGCGAAA TGGCTTGCAG GGGACAGGAG TGGGAGCAGG GAGATAGGAG ACAATGTGTA 5760 
CCAGGACAGC AGAAAGACAT CCCGGGTAGC CTGGAACAGG GAGACGGTGT GGAGATGGTG 5820 
GCAGTCCGAT AATGAGAGCC GTAGGGCAAG GCCAGCAGGA TCCTAGAGTG AGAOGGGAGG 5880 
TAAAGTCACC GGGACTTGGT GTCTCCACGT CAGGGGCAGG GGAAAGGGAG AGGACAAGGG 5940 
75 TGACCCGGGA GGTTAAAGAT GGGACOGGGG CCAQACGCAG TGGCTCATGC CTGTAATCCT 6000 
AGCACTTTGG GAGGCTGAGG CGGGCGGATG GCTTGAGGTC AGGAGTTTGA AACCGGCCTG 6060 
GCCAACATGG TGAAACCCOG TCTCTACTAA AATATACAAA AATTAGCCTG GCGTGGTGGT 6120 
GCATGCCTGC AGTCCCAGCT ATTCAGGAGG CTGAGGCGAC AAGAATCGCT TGAACCTGGG 6180 
o _ AGGCGGAGGT TGCAGTGAGC CGAGATOGCG CCATAGCACT CCAGCCTTAG CCTGGGCGAC 6240 
80 AGAGCGAGAC CACATC 
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VBTPPQGSVH SQHLGSWGD PHTGTGNAGE RGPRGKGARV LALDSGOCDS SPSLPLIRTP 60 

5 ESSLHEALDQ CMTALDL.PLT NQFSEALSYL KPRTKESMYH SLTYATILEM QAMMTFDPQD 120 

ILUVGNMMKE AQMLCQRHRR KSSVTDSFSS LVNRPTLGQF TBEEIHAEVC YAECLLQRAA 180 

LTPLQGSSHG GAVRPRALHD PSHACSCPPG FGRQHLFI&Q DENMVSFIKG GIKVRNSYQT 240 

YKELDSLVQS SQYCKGENHP HFEGGVKLGV GAFNLTLSML PTRILRLLSF VGPSGNKDYG 300 

LLQLEEGASG HSFRSVLCVM LLLCYHTFLT FVLGTGNVNI EEAEKLLKPY LNRYPKGAIF 360 

10 LFFAGRIEVI KGNIDAVSDG GPGRGWGSLG VSQTSRKSGT CDILRDRIDW GRGGGPRENQ 420 
PESRGRRGPS GRAAWEDKGG GGICGAWDFD WEI 

Seq ID NO; 89 DMA sequence 
Nucleic Acid Accession ft : APC07170 
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1 11 21 31 41 51 

I I I I I I 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 

20 CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 

AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 

CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 

ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 

25 GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGAOGAG 420 

AACATGGTGA GCTTCATCAA AGGOGGCATC AAAGTTGGAA ACAGCTACCA GACCTACAAG 480 

GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT S40 

GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

AGGATCCTGA GGCTGTTGGA GTTTGTGGGQ TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 

30 CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 

CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 

GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 

TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 

35 ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 

AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 

ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 

GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

CGGAAGTCCC GGQGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 

40 ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCOGAAACT CACGGATGGG 1320 

ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 

TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 

TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 

45 GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

50 TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 

GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 

CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

55 TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 

AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

60 AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

65 Seq ID NO: 90 Protein sequence 
Protein Accession ft: AAC39582-1 

1 11 21 31 41 51 

70 KEAASGKSDR RTPESSLHEA LDQCMTALDL FLTNQFSEAL SYLKPRTKES MYHSLTYATI 60 

LEMQAMMTFD PQDILLAGNM MKEAQMLCQR HRRKSSVTDS FSSLVNRPTL GQFTEEEIHA 120 

EVCYAECLLQ RAALTFLQDE NMVSFIKGGI KVRNSYQTYK ELDSLVQSSQ YCKGENHPHF 180 

EGGVKLGVGA FNLTLSMLPT RILRLLEFVG FSGNKDYGLL QLEEGASGHS PRSVLCVMLL 240 

LCYHTFLTFV LGTGNVNIEE AEK1XKPYLN RYPKGAIFLF FAGRIEVIKG NIDAAIRRFE 300 

75 ECCEAQQHWK QFHHMCYHEL MWCFTYKGQW KMSYFYADLL SKENCWSKAT YIYMKAAYLS 360 

MFGKEDHKPP GDDEVELFRA VPGLKLKIAG KSLPTEKFAI RKSRRYFSSN PISLPVPALE 420 

KMYIWNGYAV IGKQPKLTDG ILEIITKABE MLEKGPENBY SVDDECLVKL LKGLCLKYLG 480 

RVQEAEENFR SISANEKKIK YDHYLIFNAL LELALLLMEQ DRNEBAIKLL ESAKQNYKNY 540 
SMESRTHFRI QAATLQAXSS LENSSRSMVS SVSL 
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CGGGTCGACC CACGCGTCOG GGGAGAAAGG ATGGCOGGCC TGGOGGOGCG GTTGGTCCTG 60 

CTAGCTGGGG CAGCGGCGCT GGCGAGCGGC TCCCAGGGCG ACOGTGAGCC GGTGTACCGC 120 

GACTGCGTAC TGCAGTGOGA AGAGCAGAAC TGCTCTGGGG GOGCTCTGAA TCACTTCCGC 180 

TCCCGCCAGC CAATCTACAT GAGTCTAGCA GGCTGGACCT GTCGGGACGA CTGTAAGTAT 240 

GAGTGTATGT GGGTCACCGT TGGGCTCTAC CTCCAGGAAG GTCACAAAGT GCCTCAGTTC 300 

CATGGCAAGT GGCCCTTCTC CCGGTTCCTG TTCTTTCAAG AGCCGGCATC GGCCGTGGCC 360 

TCGTTTCTCA ATGGCCTGGC CAGCCTGGTG ATGCTCTGCC GCTACCGCAC CTTCGTGCCA 420 

GCCTCCTCCC CCATGTACCA CACCTGTGTG GCCTTCGCCT GGGTGTCCCT CAATGCATGG 480 

TTCTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAGAAAAT GGACTACTTC 540 

TGTGCCTCCA CTGTCATCCT ACACTCAATC TACCTGTGCT GCGTCAGCCT CATCCGCTTC 600 

GACTATGGCT ACAACCTGGT GGCCAACGTG GCTATTGGCC TGGTCAACGT GGTGTGGTGG 660 

CTGGCCTGGT GCCTGTGGAA CCAGOGGCGG CTGCCTCACG TGCGCAAGTG CGTGGTGGTG 720 

GTCTTGCTGC TGCAGGGGCT GTCCCTGCTC GAGCTGCTTG ACTTCCCACC GCTCTTCTGG 780 

GTCCTGGATG CCCATGCCAT CTGGCACATC AGCACCATCC CTGTCCACGT CCTCTTTTTC 840 

AGCTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTG 900 

GACTGAAGAC CTTGGAGCGA GTCTGCCCCA GTGGGGATCC TGCCCCOGCC CTGCTGGCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGACATG 1020 

AAGGATGTGG GCCCAGAATC ATGTGGCCAG CCCACCCCCT GTTGGCCCTC ACCAGCCTTG 1080 

GAGTCTGTTC TAGGGAAGGC CTCCCAGCAT CTGGGACTCG AGAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGG TGGAACTGAG TGTGCTCTTA GCTCTACCGG GAGGACAGCT 1200 

GCCTGTTTCC TCCCCATCAG CCTCCTCCCC ACATCCCCAG CTGCCTGGCT GGGTCCTGAA 1260 

GCCCTCTGTC TACCTGGGAG ACCAGGGACC ACAGGCCTTA GGGATACAGG GGGTCCCCTT 1320 

CTGTTACCAC CCCCCACCCT CCTCCAGGAC ACCACTAGGT GGTGCTGGAT GCTTGTTCTT 1380 

TGGCCAGCCA AGGTTCAOGG CGATTCTCCC CATGGGATCT TGAGGGACCA AGCTGCTGGG 1440 

ATTGGGAAGG AGTTTCACCC TGACCRTTGC CCTAGCCAGG TTCCCAGOAG, GCCTCACCAT 1500 

ACTCCCTTTC AGGGCCAGGG CTCCAGCAAG CCCAGGGCAA GGATCCTGTG CTGCTGTCTG 1560 

GTTGAGAGCC TGCCACCGTG TGTCGGGAGT GTGGGCCAGG CTGAGTGCAT AGGTGACAGG 1620 

GCCGTGAGCA TGGGCCTGGG TGTGTGTGAG CTCAGGCACT AGGTGCGCAG TGTGGAGACG 1680 

GGTGTTGTCG GGGAAGAGGT GTGGCTTCAA AGTGTTGTGT GTGCAGGGGG TKGGTGTGTT 1740 

AAGCGTGGGT TAGGGGAACG TGTGTGCGCG TGCTGGTGGG CATGTGAGAT GAGTGACTGC 1800 

CGGTGAATGT GTCCACAGTT GAGAGGTTGG AGCAGGATGA GGGAATCCTG TCACCATCAA 1860 

TAATCACTTG TGGAGCGCCA CTTGGCCCAA GACGCCACCT GGGCGGACAG CAGGAGCTCT 1920 

CCATGGCCAG GCTGCCTGTG TGCATGTTCC CTGTCTGGTG CCCCTTTGCC CGCCTCCTGC 1980 

AAACCTCACA GGGTCCCCAC ACAACAGTGC CCTCCAGAAG CAGCCCCTCG GAGGCAGAGG 2040 

AAGGAAAATG GGGATGGCTG GGGCTCTCTC CATCCTCCTT TTCTCCTTGC CTTGGCATGG 2100 

CTGGCCTTCC CCTCCAAAAC CTCCATTCCC CTGCTGCCAG CCCCTTTGCC ATAGCCTGAT 2160 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGG AGAAGGGGAG AAAGCTTATG GCTGGGTCTG 2220 

GTTTCTTCCC TTCCCAGAGG GTCTTACTGT TCCAGGGTGG CCCCAGGGCA GGCAGGGGCC 2280 

ACACTATGCC TGCGCCCTGG TAAAGGTGAC CCCTGCCATT TACCAGCAGC CCTGGCATGT 2340 

TCCTGCCCCA CAGGAATAGA ATGGAGGGAG CTCCAGAAAC TTTCCATCCC AAAGGCAGTC 2400 

TCCGTGGTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 

GAGGGGAGCT ATGCTAGGAC TCCAACCTCA GGGACTCGGG TGGCCTGCGC TAGCTTCTTT 2520 

TGATACTGAA AACTTTTAAG GTGGGAGGGT GGCAAGGGAT GTGCTTAATA AATCAATTCC 2580 
AAGCCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 
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MAGLAARLVL LAGAAALASG SQGDREPVYR DCVLQCEEQN CSGGALNHFR SRQPIYMSLA 60 
GWTCRDDCKY ECMWVTVGLY LQEGHKVPQF HGKWPFSRFL PPQEPASAVA SPLMGIASLV 120 
MLCRYRTFVP ASSPMYHTCV AFAWVSLNAW PWSTVFHTRD TDLTEKMDYF CASTVILHSI 180 
YLCCVRTVGL QHPAWSAPR ALLLLMLTVH VSYLSLIRFD YGYNLVANVA IGLVNWWWL 240 
AWCLWNQRRL PHVRKCVWV LLLQGLSLLE LLDFPPLFWV LDAHAIWHIS TIPVHVLFFS 300 
FLEDDSLYLL KESEDKFKLD 
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CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 

CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 

AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 180 

TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 

TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 

CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC OGGCATCGGC OGTGGCCTCG TTTCTCAATG 360 

GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 

TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 

TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 

TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 

GCCTCATCCG CTTCOACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 

ACGTGGTGTG GTGGCTGGCC TGOTGCCTGT GGAACCAGCG GCGGCXGCCT CACGTGOGCA 780 

AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 

CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 

ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 

ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 

CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 

TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
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CCTCACCAGC 
GGCAGCCCCT 
COGGGAGGAC 
GGCTGGGTCC 
CAGGGGGTCC 
GGATGCTTGT 
ACCAAGCTGC 
GGAGGCCTCA 
TGTGCTGCTG 
GCATAGGTGA 
CAGTGTGGAG 
GGTGGGTGTG 
TGAGTGACTG 
GTCACCATCA 
AGCCAGGAGC 
GCCOGOCTCC 
TCGGAGGCAG 
TGCCTTCGCA 
GCCATAGCCT 
ATGGCTGGGT 
GCAGGCAGGO 
AGCCCTGGCA 
CCCAAAGGCA 

CGCTAGCTTC 
ATAAATCAAT 



CTTGGAGTCT 
CTACCTCCTG 
AGCTGCCTGT 
TGAAGCCCTC 
CCTTCTCTTA 
TCTTTGGCCA 
TGGGATTGGG 
CCATACTCCC 
TCTGGTTGAG 
CAGGGCCGTG 
ACGGGTGTTG 
TTAGCGTGGG 
CCGGTGAATG 
ATAATCACTT 
TCTCCATGGC 
TGCAAACCTC 
AGGAAGGAAA 
TGGCTGGCCT 
GATTTTGGGG 
CTGGTTTCTT 
GCCACACTAT 
TGTTCCTGCC 
GTCTCCGTGG 
AGGGAGGGGA 
TTTTGATACT 
TCCAAGCCTC 



GTTCTAGGGA 
GAGCTGAACT 
TTCCTCCCCA 
TGTCTACCTG 
CCACCCCCCA 
GCCAAGGTTC 
AAGGAGTTTC 
TTTCAGGGCC 
AGCCTGCCAC 
AGCATGGGCC 
TCGGGGAAGA 
TTAGGGGAAC 
TGTCCACAGT 
GTGGAGCGCC 
CAGGCTGCCT 
ACAGGGTCCC 
ATGGGGATGG 
TCCCCTCCAA 
AGGAGGAAGG 
CCCTTCCCAG 
GCCTGCGCCC 
CCACAGGAAT 
TTGAAGCAGA 
GCTATGCTAG 
GAAAACTTTT 
AAAAAAAAAA 



AGGCCTCCCA 
GGGGTGGAAC 
TCAGCCTCCT 
GGAGACCAGG 
CCCTCCTCCA 
ACGGCGATTC 
ACCCTGACCA 
AGGGCTCCAG 
CGTGTGTCGG 
TGGGTGTGTG 
GGTGTGGCTT 
GTGTGTGCGC 
TGAGAGGTTG 
AGCTCTGCCC 
GTGTGCATGT 
CACACAACAG 
CTGGGGCTCT 
AACCTCCATT 
GGCGATTTGA 
AGGGTCTTAC 
TGGTAAAGGT 
AGAATGGAGG 
CTGGATTTTT 
GACTCCAACC 
AAGGTGGGAG 
AAAAAAAAAA 



GCATCTGGGA 
TGAGTGTGCT 
CCCCACATCC 
GACCACAGGC 
GGACACCACT 
TCCCCATGGG 
TTGCCCTAGC 
CAAGCCCAGG 
GAGTGTGGGC 
TGAGCTCAGG 
CAAAGTGTGT 
GTGCTGGTGG 
GAGCAGGATG 
AAGGOGCCAC 
TCCCTGTCTG 
TGCCCTCCAG 
CTCCATCCTC 
CCCCTGCTGC 
GGGAGAAGGG 
TGTTCCAGGG 
GACCCCTGCC 
GAGCTCCAGA 
GCTCTGCCCC 
TCAGGGACTC 
GGTGGCAAGG 
AA 



CTCGAGAGTG 
CTTAGCTCTA 
CCAGCTGCCT 
CTTAGGGATA 
AGGTGGTGCT 
ATCTTGAGGG 
CAGGTTCCCA 
GCAAGGATCC 
CAGGCTGAGT 
CCTAGGTGCG 
GTGTGCAGGG 
GCATGTGAGA 
AGGGAATCCT 
CTGGGCGGAC 
GTGCCCCTTT 
AAGCAGCCCC 
CTTTTCTCCT 
CAGCCCCTTT 
GAGAAAGCTT 
TGGCCCCAGG 
ATTTACCAGC 
AACTTTCCAT 
TGACCCCTTG 
GGGTGGCCTG 
GATGTGCTTA 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 



Seq ID NO: 94 Protein aeguence 
Protein Accession #: NP_219487.1 



11 21 31 41 SI 

I I I I I 

QPTPCWPSPA LESVLGKASQ HLGLESGQPL YHiELNWGGT ECALSSTGRT 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLDQ DTTRWCWMLV 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGP KVCVCRGWVC 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: XMJJ90469 

Coding sequence: 



ATGGGGTTTG 
TGCCGCGTGC 
AGCCCACACC 
CCACCTGAAG 
TCCCTGGCAC 
ATGTCTCTTC 
AACTCCGCAG 
GCTGCCCACC 
CAATCAGTTC 
AGGGCACCTT 
CTCAGCCGCA 
CGCGAGCTAG 
TGCCTGCCCG 
GGCAGCCCAC 
CTGCATCAAG 
TGTATCAAAG 
GAATGA 



11 
I 

GAGACCAGGG 
TGGACAGGAC 
CCTTGCATGG 
TGAAAATGTT 
AAGGCACAGG 
CTGGAGTGCT 
ATACTGACTG 
AGGAAGACAC 
AGCAGCAGAA 
GCGOGCCACC 
GCCCGGCGCC 
ACGCGCCAGG 
CGGCTCCAGA 
CACCTAAGGA 
AATCACCAGG 
GGGCTCTGGG 



21 
I 

AACGGTGGAA 
GCAGGTGAAC 
ACAGTOGGTC 
TGGAGCCAGT 
CTGCTCAGTG 
CAGTGCTGAT 
GGGAACTGAG 
TTTCTTTTTG 
TTGCATTTAC 
AAGGCGAGAG 
GTCCCCACGT 
CCCGCCCCCC 
AGCCCAGAAA 
CATCTTGCAG 
TTCATTTCCA 
TGTGGCTGCT 



31 
i 

GGGAGCCTAG 
AAGCCCTGCC 
AACAGCCACC 
CAAGGTTTGC 
TGCTGGGAGG 
GCTGGCCAGG 
GAGGGACCAG 
AAAGACATCA 
AGCCCCAGAG 

ACACCCATGG 
GGGCTGOGGA 
CTCTTCTTCC 
ACGCTCCAGC 
ATAGGTTTAT 
GAAGCAGCAT 



Seq ID NO: 96 Protein sequence 
Protein Accession th XP_090469 



31 



41 
I 

GAACGTCGAA 
TGCTGGAGGC 
CATCAGGTGC 
TGACAATGGA 
TGAAOGGCCC 
TGGAGCACAG 
GAAGTCTAGA 
AGCACACAAG 
AAAAACCCTG 
CCCTCAGCAG 
CGCCCTTCCC 
GCTCGGCAGC 
TGCTGCCCCT 
ACAAGGCACC 
GTAACAGAAC 
TTCACTTACA 



41 



60 
120 
180 



51 
I 

AGATAGCTCC 
TGCCAAGCAC 
TCATCAGAAA 
AACAAACCAG 
AGGGTTGGAA 
AAGACAAATG 
TTTCGCCGTG 
TACGTTCAGG 
TGGGAATGTC 
GCCCTGGOGG 
GACCTCTGAC 
AGCCCCTCAC 
TTATCCAGAT 
TGGGAAGAGA 
AGCTGAACCA 
GTTTTCCAGT 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1 11 21 

I ! I I I I 

MGFGDQGTVE GSLGTSKDSS CRVLDRTQVN KPCLLEAAKH SPHPLHGQSV NSHPSGAHQK 
PPBVKMFGAS QGLLTMETNQ SLAQGTGCSV CWEVNGPGLE MSLPGVLSAD AGQVEHRRQM 
NSADTDWGTE EGPGSLDPAV AAHQEDTFFL KDIKHTSTFR QSVQQQNCIY SPREKPCGNV 
RAPCAPPRRE APLALSRPWR LSRSPAPSPR TPMAPPPTSD RELDAPGPPP GLRSSAAAPH 
CLPAAPEAQK LPFLLPLYPD GSPPPKDILQ TLQHKAPGKR LHQESPGSFP IGLCNRTAEP 
CIKGALGVAA EAAFHLQFSS E 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NMJ>03474.2 

Coding sequence: 37.. 3036 

1 11 21 31 41 51 

I I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTOGGC 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 

988 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



WO 03/042661 



PCT/US02/36810 



GCGACGATGG CAGCGCGCCC GCTGCCCSTQ TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATGCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
5 CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
10 TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTOGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 
15 CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
20 ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TSGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
25 GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
30 AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
35 GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
40 CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGOGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
45 CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
50 GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
55 TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TQAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
60 CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
65 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
„ CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
70 AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
_ _ TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
75 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
OA AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
O0 ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO; 98 Protein sequence 
Protein Accession #: NPJ>03465 



989 



WO 03/042661 



1 11 21 31 41 51 

I I I t I I 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLNNEGRAD EWSASVRSG DLWIPVKSFD 60 

SKNHPEVLNI RLQRBSKELI INLERNBGLI ASSPT ETHYL QDGTDVSLAR NYTVILGHCY 120 

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 

DNAQLVSGVY PQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 

DTLDRGCSOQ MAVEKGGCIM NASTGYPFPM VPSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 

ACRDSSHSCD LPEFCTGASP HCPANVYLHD GHSCODVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPGICPE RVNSAGDPYG NCGKVSKSSP AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GXICLNRQCQ NISVFGVHEC 660 

AMQCHGRGVC NNRKNCHCEA EWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 

DSYPPKDNPR RLLQCQNVDI SRPUJGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 

ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM_003714 

Coding sequence: 135.. 1043 

1 11 21 31 41 51 

) I i 111 

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCOGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GOGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA OGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGGGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC 

Seq ID NO: 100 Protein sequence 
Protein Accession *h NP_003705 

1 11 21 31 41 51 

I I I I I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 

GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 

ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180 

CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 

EAGHHLPEPS SRETGRGAKG BRGSKSHPNA HARGKVGGLG AQGPSGSSSH EDEQSEYSDI 300 
RR 

Seq ID NO: 101 DNA sequence 
Nucleic Acid Accession &: NM_005940 
Coding sequence: 23.. 1489 

1 11 21 31 41 51 

I I I I I I 



990 



WO 03/042661 



AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCOQCCTGG CTCCGCAGCG CGGCOGOGOG 60 

CGCCCTCCTG CCCCCGATGC T6CTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATCC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC QGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCAOG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC OGAGAAGGGG ATGTCCACTT 600 

CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 7B0 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC OGCCAGATGC 900 

CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTAOGAC GGTGAAAAGC CAGTCCTGGG 1140 

CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCOGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCXGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTOG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 

Seq ID NO: 102 Protein sequence 
Protein Accession ft: NP_ 005931 

1 11 21 31 41 51 

I I I I I I 

MAPAAWLRSA AARALLPPML IiLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA ' 60 

PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 

LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180 

PPPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP FDACEASFDA 300 

VSTIRGELPP PKAGFVWRLR GGQLQPGYPA LASRHMQGLP SPVDAAFEDA QGHIHFFQGA 360 

QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LWGPEKNKI YPFRGRDYWR FHPSTRRVDS 420 

PVPRRATDWR GVPSEIDAAF QDADGYAYPL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480 
AEPANTFL 

♦ 

Seq ID NO: 103 DNA sequence 

Nucleic Acid Accession 8 s NM_0331S1.2 

Coding sequence-. 3 51.. 44 99 

1 11 21 31 41 51 

I 1 I I I I 

ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 

GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 

ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 

TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 

CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 

AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 

GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 

GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 

ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 

AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 

TGATCCAAAG CTTACGGAGT OGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 

ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 

GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 

TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 

TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 

GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTC CAGT T 1020 

GGATCATCAA CCAACGCACA GCCATCAGGT TGGGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 

AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1140 

TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 

CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 

TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 

CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 

AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
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TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 

GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGOGGTC TGGGTTCTCA 1560 

TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 

TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 

5 CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 

AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 

CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 

GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 

10 GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 

TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 

TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 

TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 

TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 

15 TGGCCCGCGC OGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 

TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 

AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 

TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 

GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 

20 ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAQAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 

AAGAGTCTCT CAAOGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 

AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 

TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 

TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 

25 ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 

TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 

TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 

TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 

CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 

30 TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 

TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 

TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 

ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 

AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 

35 GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 

TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 

TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCXTGGAG ACAGAGGCAC 3660 

AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 

TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 

40 AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 

CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
45 ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
50 ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
55 TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

60 Seq ID NO: 104 Protein sequence 
Protein Accession #: NP_149163.2 

1 11 21 31 41 51 

« 1 I I I 1 1 

OJ MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLOD GPWSQQERNP EAPGRAAVPP 60 

WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LPSYLTVSWL TPLMIQSLRS RLDENTIPPL 120 

SVHDASDKNV QRLHRLWEEB VSRRGIEKAS VLLVML.RFQR TRLIPDALLG ICFCIASVLG 180 

PILIIPKILE YSEEQLGNW HGVGLCPALF LSECVKSLSP SSSWIINORT AIRFRAAVSS 240 

PAPEKLIQFK SVIHITSGEA ISPPTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 

70 YTAFIAILCY LLVPPLAVFM TRMAVKAQHH TSBVSDQR1R VTSEVLTCIK LIKMYTWEKP 360 

PAKIIEDLRR KERKLLEKCG LVQSLTSITIi FIIPTVATAV WVLIHTSLKL KLTASMAFSM 420 

LASLNLLRLS VFPVPIAVKG LTNSKSAVMR FKKFFLQESP VFYVQTLQDP SKALVFBEAT 480 

LSWQQTCPGI VNGALBLERN GHASEGMTRP RDALQPBEEG NSLGPBLHKI NLWSKGMML 540 

GVCGNTGSGK SSLLSAILEE MHLLEGSVGV QGSLAYVPQQ AWIVSGNIRE NILMGGAYDK 600 

75 ARYLQVLHCC SLNRDLELLP FGDMTEIGBR GLNLSGGQKQ RISLARAVYS DRQIYLLDDP 660 

LSAVDAHVGK HIPEECIKKT LRGKTWLVT HQLQYLEFOG QIILLENGKI CENGTHSELM 720 

QKKGKYAQLI QKMHKEATSD MLQDTAKIAE KPKVBSQALA TSLEESLNGN AVPEHQLTQE 780 

EEMEEGSIiSW RVYHHYIQAA GGYMVSCIIF PFWLIVPLT IPSFWWLSYW LEQGSGTNSS 840 

RESNGTMADI* GNIADNPQLS PYQLVYGLNA LLLICVGVCS SGIPTKVTRK ASTALHNKLP 900 

80 NKVFRCPMSF FDTIPIGRLL NCFAGDLEQL DQLLPIFSEQ PLVLSLMVIA VLLIVSVLSP 960 

YILLMGAIIM VICFIYYNMP KKAIGVFKRL ENYSRSPLPS HIUfSLQGLS SIHVYGKTED 1020 

FXSQFKRIiTD AQNNYLLLFL SSTRWMALRL EIMTNLVTIA VAI*FVAFGIS STPYSPKVMA 1080 

VNIVLQLASS FQATARIGLE TBAQPTAVER ILQYMKMCVS EAPLHMEGTS CPOGWPQHGE 1140 

IIFQDYRMKY RDNTPTVLHG INLTIRGHEV VGXVGRTGSG KSSLGMALFR LVEPMAGRIL 1200 
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IDGVDICSIG LEDLRSKLSV IPQDPVLLSG TIRFNLDPFD RHTDQQIWDA LERTFLTKAI 1260 
SKFPKKLHTD WENGGNPSV GERQLbCIAR AVLRNSKIIL IDRATASIDM ETDTLIQRTI 1320 

REAPQGCTVL VIAHRVTTVL NCDHILVMGN GKWEFDRPE VXiRKKPGSLF AALMATATSS 1380 
LR 

5 

Seq ID NOt 105 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..4043 

10 1 11 21 31 41 SI 

I I 1 1 I I 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 

ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 

GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 

15 TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 

CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 

ACCCCGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 

TCAGTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 

GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480 

20 ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG S40 

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 

CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 

TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 

25 ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 

GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 

TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 

ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 

GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 

30 TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCQGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 

CTCCAGGAGA GCGCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 

TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 

35 GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 

CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680 

40 GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGGGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 

CTGCTGGAOG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 

45 TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAAOG GAAATGCTGT GCCGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 

50 TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 

GGGACCAATA GGAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCAOGA GGAAGGCATC CACGGCCCTG 2S80 

55 CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 2640 

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 

^ TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 

60 TCTCCTTTAT TCTCCCACAT OCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 

CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 

- TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 3180 

65 GGGAITGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240 

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 

CCACAGCATG GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360 

ACCGTGCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420 

AGGACGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCGGCCTGGT GGAGCCCATG 3480 

70 GCAGGCCGGA TTCTCATTGA CGGOGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGOGG 3540 

TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600 

CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660 

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720 

GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTOGCAAC 3780 

75 TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840 
ATCCAGCGCA CAATCCGTGA AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900 
GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960 
TTTGATCGGC CGGAGGTACT GCGGAAGAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020 
OA ACAGCCACTT CTTCACTGAG ATAA 
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MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 

WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL' 120 

SVHDASDKNV QRLHRL.WEEE VSRRGIBKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 180 

PILIIPKILE YSEEQLGMW HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240 

FAPEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 

YTAPIA1LCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360 

FAKIIEGMES LTPCSKPGDG MAFSMLASI21 LLRLSVFFVP IAVKGLTNSK SAVMRFKKFF 420 

LQESPVFYVQ TLQDPSKALV FEEATliSWQQ TCPGIVNGAL ELERNGHASE GMTRPRDALG 480 

PEEEGNSLGP ELHKINLWS KGMMLGVCGN TGSGKSSLLS AILEEMHLLE GSVGVQGSLA 540 

YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 

GGQKQRISLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT WLVTHQLQY 660 

LEFCGQIILL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 

SQALATSLEE SLNGNAVPSH QLTQEEEMEE GSLSWRVYKH YIQAAGGYMV SCIIPFFWL 780 

IVFLTIFSFW WLSYWLEQGS GTNSSRESNG TMADLGNIAD NPQLSFYQLV YGLNALUjIC 840 

VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLLP 900 

IFSEQPLVLS LMVIAVLLIV SVLSPYILLM GAIIMVICPI YYMMPKKAIG VFKRLENYSR 960 
SPLFSHILNS LQGLSSIHVY GKTEDFISQF KRLTDAQNNY LLLFLSSTRW MALRLEIMTN 1020 
LVTLAVALFV AFGISSTPYS FKVKAVNTVL QLASSFQATA RIGLETEAQP TAVERILQYM 1080 
KKCVSEAPLH MEGTSCPQGW PQHGEIIFQD YHMKYRDNTP TVLHGINLTI RGHEWGIVG 1140 
RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFM 1200 
LDPFDRHTDQ Q I WD ALERT F LTKAISKFPK KLHTDWENG GNFSVGERQL LCIARAVLRN 1260 
SKI I LIDEAT ASIDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKWE 1320 
FDRPEVLRKK PGSLFAALMA TATSSLR 

Seq ID NO: 107 DMA sequence 
Nucleic Acid Accession 4h NM_024022 
Coding sequence: 202.. 1563 

1 11 21 31 41 51 

I I I t 1 I 

ACCGGGCACC GGACGGCTCG GGTACTTTOG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGOGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG • 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCAGGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGAXGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATGATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTC TCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TAGAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 

Seq ID NO: loa Protein sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APPSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 
LALAIGLGIH FDCSGKYRCR SSFKCIBLIA RCDGVSDCKD GEDEYRCVRV GGONAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSUiSQHPW QASLQFOGYH 240 
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LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSBE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
j GIGCAEVNKP GVYTRVTSFL DWIKEQMERD LKT 

Seq ID KO: 109 DNA sequence 

Nucleic Acid Accession 8: NMJ>00493.2 

Coding sequence: 97.. 2139 

10 1 11 21 31 41 51 

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180 

15 ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360 

GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

20 GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA S40 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 

25 GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 

CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020 

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

30 GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCGA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

35 GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT IS 00 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560 

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

40 GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 

ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

45 ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCA GACC 2280 

50 AACAAACCTT CCCCCTGAAA AGTGAGCAGC AAOGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

55 AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAA AAGC 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880 

60 GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060 

ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

65 AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 



70 



Seq ID NO: 110 Protein sequence 
Protein Accession NP_000484.2 



11 21 31 41 SI 

1 I I I I I 

MLPQIPFUiL VSLNLVHGVP YAERYQMPTG IKGPLPNTKT QFPIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 

75 QPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GPPGEKGAPG 180 

VPGMNGQKGB MGYGAPGRPG ERGLPGPOGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGPP 240 

GBMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

PGLPGLKGBR GPAGLPGGPG AKGBQGPAGL PGKPGLTGPP GNKGPQGPKG IPGSBGLPGP 360 

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 

80 PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GPPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGBPGLPGPP GPPGPPGQAV MPEGPIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP PDKILYNRQQ HYDPRTGIPT CQIPGIYYFS 600 

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDOV WUJLPNAESN 660 
GLYSSBYVHS SFSGFLVAPK 
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Seq ID NO: 111 DNA sequence 
Nucleic Acid Accession #: NM_000949 
Coding sequence: 285.. 2153 

1 11. 21 31 41 51 

I I I I I I 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 

ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 

TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 

GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 

GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 

GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 

AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 

ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 

TCCTGTATGA AATTCGATTA AAACCOGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 

GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 

TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 

CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 

GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 

AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 

TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 

ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 

AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 

CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 

CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 

CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 

AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 

CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC OCCCTTTGGC TCCGCTAAAC 1800 

CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA I860 

AACAGAGAGA GAACAGOGGC AAGCCCAAGA AGCCOGGGAC TCCTGAGAAC AATAAGGAGT 1920 

ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 

CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 

ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

TGGGTGGTTT GGATTACCTG OATCCOGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 

ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 

AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 

TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 

AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 

GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 

AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 

AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 

AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 112 Protein sequence 
Protein Accession ft: NP_000940 

1 11 21 31 41 51 

I I t I ) 1 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHRBGET LMHECPDYIT GGPNSCHFGK QYTSKWRTYI MMVNATNQMG SSFSDBLYVD 120 

VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWPTLLYEI RUCPEKAAEW 180 

EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYNSAWSP ATFIQIPSDF TMNDTTVWIS 240 

VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 

FPPTSDYBDL LVEYI*EVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 

SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 

PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 

SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP S40 

ENNKEYAKVS. GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNOAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession ft: XM_0628ll 

Coding sequence t 1..888 

1 ll 21 31 41 51 

I I I I I I _ 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCOGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 
GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 
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AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAQ CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG S40 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

5 CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC AOGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 

10 Seq ID NO: 114 Pr otein seque nce 
Protein Accession fh XP_06281l 

1 11 21 31 41 SI 

1C I I I I I I 

I J MWGARRSSVS SSWNAASLLQ LLLAALLAAG AHASGEYCHG WLDAQGVWRI GFQCPERFDG 60 

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120 

VGSVFVAPII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180 

SSTAASSSSS ANSGARAPPT RSQTNCCLPB GTMNNVYVNM PTNPSVUTCQ QATQIVPHQG 240 
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV 
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Seq ID NO: 115 DNA sequence 
Nucleic Acid Accession #: NM_013257 
Coding sequence: 223.. 1512 



25 1 11 21 31 41 51 

I I I I I I 

GGTGTGCTCT TGAGGGATTA AATGCAAAGA GATCACACCA TGGACTACAA GGAAAGCTGC 60 

CCAAGTGTAA GCATTCCCAG CTCCGATGAA CACAGAGAGA AAAAGAAGAG GTTTACTGTT 120 

TATAAAGTTC TGGTTTCAGT GGGAAGAAGT GAATGGTTTG TCTTCAGGAG ATATGCAGAG 180 

30 TTTGATAAAC TTTATAACAC TTTAAAAAAA CAGTTTCCTG CTATGGCCCT GAAGATTCCT 240 

GCCAAGAGAA TATTTGGTGA TAATTTTGAT CCAGATTTTA TTAAACAAAG ACGAGCAGGA 300 

CTAAACGAAT TCATTCAGAA CCTAGTTAGG TATCCAGAAC TTTATAACCA TCCAGATGTC 360 

AGAGCATTCC TTCAAATGGA CAGTCCAAAA CACCAGTCAG ATCCATCTGA AGATGAGGAT 420 

GAAAGAAGTT CTCAGAAGCT ACACTCTACC TCACAGAACA TCAACCTGGG ACCGTCTGGA 480 

35 AATCCTCATG CCAAACCAAC TGACTTTGAT TTCTTAAAAG TTATTGGAAA AGGCAGCTXT 540 

GGCAAGGTTC TTCTTGCAAA ACGGAAACTG GATGGAAAAT TTTATGCTGT CAAAGTGTTA 600 

CAGAAAAAAA TAGTTCTCAA CAGAAAAGAG CAAAAACATA TTATGGCTGA ACGTAATGTG 660 

CTCTTGAAAA ATGTGAAACA TCCGTTTTTG GTTGGATTGC ATTATTCCTT CCAAACAACT 720 

GAAAAGCTTT ATTTTGTTCT GGATTTTGTT AATGGAGGGG AGCTTTTTTT CCACTTACAA 780 

40 AGAGAACGGT CCTTTCCTGA GCACAGAGCT AGGTTTTACG CTGCTGAAAT TGCTAGTGCA 840 

TTGGGTTACT TACATTCCAT CAAAATAGTA TACAGAGACT TGAAACCAGA AAATATTCTT 900 

TTGGATTCAG TAGGACATGT TGTCTTAACA GATTTTGGGC TTTGTAAAGA AGGAATTGCT 960 

ATTTCTGACA CCACTACCAC ATTTTGTGGG ACACCAGAGT ATCTTGCACC TGAAGTAATT 1020 

AGAAAACAGC CCTATGACAA TACTGTAGAT TGGTGGTGCC TTGGGGCTGT TCTGTATGAA 1080 

45 ATGCTGTATG GATTGCCTCC TTTTTATTGC CGAGATGTTG CTGAAATGTA TGACAATATC 1140 

CTTCACAAAC CCCTAAGTTT GAGGCCAGGA GTGAGTCTTA CAGCCTGGTC CATTCTGGAA 1200 

GAACTCCTAG AAAAAGACAG GCAAAATCGA CTTGGTGCCA AGGAAGACTT TCTTGAAATT 1260 

CAGAATCATC CTTTTTTTGA ATCACTCAGC TGGGCTGACC TTGTACAAAA GAAGATTCCA 1320 

CCACCATTTA ATCCTAATGT GGCTGGACCA GATGATATCA GAAACTTTGA CACAGCATTT 1380 

50 ACAGAAGAAA CAGTTCCATA TTCTGTGTGT GTATCTTCTG ACTATTCTAT AGTGAATGCC 1440 

AGTGTATTGG AGGCAGATGA TGCATTCGTT GGTTTCTCTT ATGCACCTCC TTCAGAAGAC 1500 

TTATTTTTGT GAGCAGTTTG CCATTCAGAA ACCATTGAGC AAAATAAGTC TATAGATGGG IS 60 

ACTGAAACTT CTATTTGTGT GAATATATTC AAATATGTAT AACTAGTGCC TCATTTTTAT 1620 

ATGTAATGAT GAAAACTATG AAAAAATGTA TTTTCTTCTA TGTGCAAGAA AAATAGGGCA 1680 

55 TTTCAAAGAG CTGTTTTGAT TAAAATTTAT ATTCTTGTTT AATAAGCTTA TTTTTAAACA 1740 

ATTTAAAAGC TATTATTCTT AGCATTAACC TATTTTTAAA GAAACCTTTT TTGCTATTGA 1800 

CTGTTTTTTC CCTCTAAGTT TACACTAACA TCTACCCAAG ATAGACTGTT TTTTAACAGT 1860 

CAATTTCAGT TCAGCTAACA TATATTAATA CCTTTGTAAC TCTTTGCTAT GGCTTTTGTT 1920 

ATCACACCAA AACTATGCAA TTGGTACATG GTTGTTTAAG AAGAAACCGT ATTTTTCCAT 1980 

60 GATAAATCAC TGTTTGAAAT A T TTG GTTC A TGGTATGATC GAAATGTAAA AGCATAATTA 2040 

ACACATTGGC TGCTAGTTAA CAATTGGAAT AACTTTATTC TGCAGATCAT TTAAGAAGTA 2100 

ACAGGCCGGG CGCGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGC 2160 

AGATCACCTG AGGTCAGGAG TTGGAGACCA GCCTGACCAA CATGGACAAA CCCCGTCTCT 2220 

- ACTAAAAATA CAAAATTGGC AGGGTGTGGT GGCACATGCC TATAATCCCA GCTACTTGGG 2280 

65 AGGCTAAGGC AGGAGAATCG CTTGAACCCG GGAGGCGGAG GTTGCAGTGA GCCGAGATCG 2340 
- CACCATTGCA CTCCTGCCTG GGCAACAAGA GTGAAACTCC ATCTCCAAAA A 



Seq ID NO: 116 Protein sequence 
Protein Accession 8: NP_037389 



1 11 21 31 41 51 

I I I I I I 

MALKIPAKRI FGDNFDPDFI KQRRAGLNEF IQNLVRYPEL YNHPDVRAFL QMDSPKHQSD 60 

c PSEDEDERSS QKLHSTSQNI NLGPSGNPHA KPTDFDFLKV IGXGSFGKVL LAKRKLDGKF 120 

75 YAVKVLQKKI VLNRKEQKHI MAERNVLLKN VKHPPLVGLH YSFQTTEKLY FVLDFVNGGE 180 

LFFHUQRERS FPEHRARFYA AEIASALGYL HSIKIVYRDI> KPENILLDSV GHWLTDFGL 240 

CKEGIAISDT TTTFCGTPEY LAPBVIRKQP YDNTVDWWCL GAVLYEMLYG LPPFYCRDVA 300 

EMYDNILHKP LSLRPGVSLT AWSILEBLLE KDRQNRLGAK BDFLEIQNHP PFESLSWADL 360 

OA VQKKIPPPFN PNVAGPDDIR NFDTAFTEET VPYSVCVSSD YSIVNASVLE ADDAFVGFSY 420 

OU APPSEDLFL 



Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession ft: NM 004004.1 



997 



WO 03/042661 



Coding sequence: 1..681 

1 11 21 31 41 SI 

I I I I I I 

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 

AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240 

CTGATCTTCG tgTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 300 

GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATOGAGGAG 360 

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420 

TTCTTCCGGG TCATCTTOGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 480 

TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540 

TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600 

ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 660 
AAGTCAAAAA AGCCAGTTTA A 

Seq ID NO: 118 Protein sequence 
Protein Accession ftt NP_003 995.1 

1 11 21 31 41 51 

1 I I I I I 

MDWGTLQTIlt GGVNKHSTSI GKIWLTVLFI FRIMILWAA KEVWGDBQAD PVCNTLQPGC 60 
KNVCYDHYFP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IKSBFKDIEE 120 
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 180 
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV 



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: XMJJ61091.1 

Coding sequence : 1 . . 2481 

1 11 21 31 41 SI 

11)111 

ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCTCCAAGA GCCGCCCGGG 60 

CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120 

CCCGGGTACC CGCCAGTGCC GGCTGCCGAT GACCGATTCA CGCTCCCGAT GATTGGAGGT 180 

CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240 

GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300 

ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCGGCTGCAG TGGACATCAT GTTTCTGTTA 360 

GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420 

GTCTGTGACG GTCTGGACAT CAGCCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480 

TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540 

AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600 

CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660 

GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720 

GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 780 

GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 840 

GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC AGCTGGGAGC 900 

CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACTCG 960 

CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 1020 

TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080 

GTOGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140 

GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200 

GTGGGTGTGG CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260 

GATGTGCCTG ACCTGGTCTG GAGCCTCGAT GGCATTCCCT TCCGTGGTGG CCCCACCCTG 1320 

ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 1380 

CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440 

GCGGGCCCAG CGCGTCACGC AAGGGCGCGA GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG 1500 

GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560 

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620 

CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680 

GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860 

CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980 

GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040 

GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220 

ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400 

ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460 
GTCTGTGCCC CAGGTCCTTA G 

Seq ID NO: 120 Protein sequence 
Protein Accession 8: XP_061091.1 

1 11 21 31 41 51 

I I I I I I 

MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELALA PGYPPVPAAD DRFTLPMIGG 60 

QMHGEKVDLW SLGVLCYEFL VGKPPFEANE VHVSKET1GK I8AASKMMWC SAAVDIMFLL 120 



998 



WO 03/042661 PCT/US02/36810 



DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 180 

RIKRMVFKGQ RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 240 

GVTVFAVGVR FPRWEELHAL ASBPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300 

PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360 

5 VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSRBL LVAVPVGEYQ 420 

DVPDLVWSLD GIPFRGGPTL TGSALRQAAB RGFGSATRTG QDRPRRWVL LTESHSEDEV 480 

AGPARHARAR- BLLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540 

RPGCRTQALD LVFMLDTSAS VGPENFAQHQ SFVRSCALQF EVNPDVTQVG LWYGSQVQT 600 

AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVWLTG 660 

10 GRGAEDAAVP AQKLRNNGIS VLWGVGPVL SEGI*RRLAGP RDSLIHVAAY ADLRYHQDVL 720 

IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 780 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP 



Seq ID NO: 121 DNA sequence 
15 Nucleic Acid Accession #: Eos sequence 
Coding sequence : 1 . . 2424 

1 11 21 31 41 51 

OA I I I I 1 1 

ZU ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGQT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

25 CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

30 AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGGGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

35 TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CGACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

40 CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGOG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 

45 TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC GGGGCTGGGA TGCTGCGGGC CATTAGCCAG 1800 

50 GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGAOGTG 2100 

55 CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340 

- CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400 

60 AATGTCTGTG CCCCAGGTCC TTAG 



Seq ID NO: 122 Protein sequence 
Protein Accession #i Eos sequence 

65 1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LBFPLDSFST QQEVKARIKR 120 

MVPKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

70 FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLPS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPBGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW IiLTESHSEDB VAGPARHARA RELLLLQVG3 480 

75 EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIVDK VMTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

OA CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EHSSCSVCVS QGWILETPLR HNAPVQEGSS 780 

80 RTPPSNYREG LGTEMVPTFW NVCAPGP 



Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession ft: Bos sequence 

Coding sequence: 89.. 2356 



999 



WO 03/042661 



1 11 21 31 41 51 

I I I I I I 

GCCCCCTGGC CCGAGCCGCG CCCGGGTCTG TGAGTAGAGC CGCCOGGGCA CCGAGCGCTG 60 

GTCGCOGCTC TCCTTCCGTT ATATCAACAT GCCCCCTTTC CTGTTGCTGG AAGCCGTCTG 120 

TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180 

AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCA6TGGA 240 

CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 300 

GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 360 

AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA 420 

ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480 

ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 540 

CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGQ GATGTGGCAC TGCCATCCAA 600 

GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTXTC CCAGGTGGGA 660 

GGAGCTGCAT GCACTGGCCA GOGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGOGC 780 

CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCOG 840 

GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900 

TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG CCACCTGCTA 960 

CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCOGCTG GCCTTTGGAG GGGAGGCTAA 1080 

CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

TGCGGGCACC ACTCTGGAOG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200 

GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT GGAGCCTCGA 1320 

TGGCATTCCC TTCOGTGGTG GCCCCACCCT GAOGGGCAGT GCCTTGOGGC AGGCGGCAGA 1380 

GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CGAGGACOGG CCAOGTAGAG TGGTGGTTTT 1440 

GCTCACTGAG TCACACTCCG AGGATGAGGT TGOGGGCCCA GCGOGTCACG CAAGGGOGCG 1500 

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680 

CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740 

GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1920 

CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG CCCGGCCTGG 1980 

TGTCCCGAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG CAGCOGTTCC 2040 

TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGGGTGG GGCCTGTCCT 2100 

AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC COGGGATTCC CTGATCCACG TGGCAGCTTA 2160 

CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220 

GCCAGTCAAC CTCTGCAAAC CCAGCCOGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280 

TGGGAGCTAC CGCTGCAAGT GTOGGGATGG CTGGGAGGGC CCOCACTGCG AGAACOGATT 2340 

CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400 

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460 

CCAGGTCCTT AGAATGTCTG CTTCCOGCOG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580 

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700 

GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC AGAATGTTGT 2760 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGAOGG 



Seq ID NO: 124 Protein sequence 
Protein Accession th Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSPERS KHPAITVCDG LDISPBRVRV GAFQFSSTPH LEPPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQt/KERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPIAPGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFIi 360 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VKTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 



Seq ID NOt 125 DKA sequence 

Nucleic Acid Accession #: NM_031942.l 

Coding sequence t 145.. 1260 



1 11 21 31 41 51 

I I I I I I 

CCCGAGCCCC GCCCCTCCGG GCCGGGGTCG GCGCGCCCAG CCTGOCAGCC GOGCTGCTGC 60 

TGCTCCTCCT GCTGTGGGAC OGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 120 

COGATCTGGG CACCCGCCAC CAGCATGGAC GCTOGCOGCG TGCCGCAGAA AGATCTCAGA 180 

GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 240 

TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 300 

TCAGTTCGGG AAGGCTGTAG GACCCGGAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG 360 

GCGATGAAGT TTCCAGCGOG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 420 

CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCOGATT CAGAAGATGA AAGTGGAATG 480 

1000 



WO 03/042661 



PCT/US02/36810 



AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAOCAATCCT TGCAAAACTC 540 



15 



35 



60 



Seq ID NO: 126 Protein sequence 
Protein Accession Ut NP 114148.1 



Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession fit KM 004952.1 



600 



ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC ouu 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660 

CCTGAACGGA GAGCTOGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900 

GTCTGCAGCA ATTCTOGAGA GAAGATATAT AACOGTTCAC TGGGCTCTAC TTGTCATCAA 960 

1 n TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020 

iU OGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACOGTTATG GTGAAGAGGT CAGGGATGCT 1080 

CTGCTGGATC OGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140 

CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGG AAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA A TGCTCC TCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

<yft TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

syr CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

ZJ AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

1(\ CAAG ? TCRA GTT TAQA TTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

D\J GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 22 B0 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTQT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 



40 t i 1 r I 1 ; x r 

MDARRVPQKD LRVKKNLKKP RYVKLISMET SSSSDDSCDS FASDNPANTR LQSVREGCRT 60 

RSQCRHSGPL RVAMKPPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNPLEKRAL 120 

NIKQNKAMLA KLMSELESPP GSPRGRHPLP GSDSQSRRPR RRTPPGVASR RNPERRARPL 180 

AC TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

HJ PVEEITEEBli ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300 

LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360 
A 



Seq ID NO: 127 DNA sequence 
OU Nucleic Acid Accession #: AF305616.1 
Coding sequence-. 1..863 

1 11 21 31 41 51 

<^ 1 I I I i | 

ATGCACCGCT TGATGGGGGT CAACAGCACC GCCGCCGCCG CCGCCGGGCA GCCCAATGTC 60 

TCCTGCACGT GCAACTGCAA ACGCTCTTTG TTCCAGAGCA TGGAGATCAC GGAGCTGGAG 120 

TTTGTTCAGA TCATCATCAT CGTGGTGGTG ATGATGGTGA TGGTGGTGGT GATCAOGTGC 180 

CTGCTGAGCC ACTACAAGCT GTCTGCACGG TCCTTCATCA GCCGGGACAG CCAGGGGCGG 240 

AGGAGAGAAG ATGCCCTGTC CTCAGAAGGA TGCCTGTGGC CCTCGGAGAG CACAGTGTCA 300 

GGCAACGGAA TCCCAGAGCC GCAGGTCTAC GCCCCGCCTC GGCCCACCGA CCGCCTGGCC 360 

GTGCCGGCCT TCGCCCAGCG GGAGCGCTTC CACCGCTTCC AGCCCACCTA TCCGTACCTG 420 

CAGCACGAGA TCGACCTGCC ACCCACCATC TCGCTGTCAG ACGGGGAGGA GCCCCCACCC 480 

TACCAGGGCC CCTCCACCCT CCAGCTTOGG GACCCOGAGC AGCAGCTGGA ACTGAACCGG S40 

^- GAGTCGGTGC GCGCACCOCC AAACAGAACC ATCTTCGACA GTGACCTGAT GGATAGTGCC 600 

OJ AGGCTGGGCG GCCCCTGCCC CCCCAGCAGT AACTCGGGCA TCAGCGCCAC GTGCTACGGC 660 

■ AGCGGCGGGC GCATGGAGGG GCCGCCGCCC ACCTACAGCG AGGTCATCGG CCACTACCCG 720 

GGGTCCTCCT TCCAGCACCA GCAGAGCAGT GGGCCGCCCT CCTTGCTGGA GGGGACCCGG 780 

CTCCACCACA CACACATCGC GCCCCXAGAG AGCGCAGCCA TCTGGAGCAA AGAGAAGGAT 840 
AAACAGAAAG GACACCCTCT CTAG 

Seq ID NO: 128 Protein sequence 
Protein Accession #: AAL09357.1 

nc J }1 21 31 41 51 

MHRLMGVNST AAAAAGQPNV SCTCNCKRSL FQSMEITELE FVQIIIIWV MMVMVWITC 60 

LLSHYKLSAR SFISRHSQGR RREDALSSEG CLWPSBSTVS GNGIPEPQVY APPRPTDRLA 120 

VPPFAQRERF HRFQPTYPYL QHEIDLPPTI SLSDGBEPPP YQGPCTLQLR DPEQQLELNR 180 

orj ESVRAPPNRT IFDSDIWDSA RLGGPCPPSS NSGISATCYG SGGRMEGPPP TYSEVIGHYP 240 

0\J GSSFQHQQSS GPPSLLEGTR LHHTHIAPLE SAAIWSKEKD KQKGHPL 



1001 



WO 03/042661 



PCT/US02/36810 



Coding sequence: 1..718 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
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70 
75 
80 



ATGGCGGCGG 
CTGGCCCAAG 
AACCAGCACC 
ATTTACTGCC 
GGCGGGGCAG 
GCCAGCCAGG 
AAGTTCTCGG 
GGCCACGAGT 
ATGAAGGTGT 
CTCCCCCAGT 
GAGAACCCTC 
CACCTGCCCC 



11 
I 

CTCCGCTGCT 
GGCCCGGAG6 
TGCGGCGAGA 
CGCACTACAA 
AGCAGTACGT 
GCTTCAAGCG 
AGAAGTTCCA 
ACTACTACAT 
TCGTCTGCTG 
TCACCATGGG 
AGGTGCCCAA 
TGGCCGTGGG 



21 
I 

GCTGCTGCTG 
GGCGCTGGGA 
GGGCTACACC 
CAGCTCGGGG 
GCTGTACATG 
CTGGGAGTGC 
GCGCTACAGC 
CTCCACGCCC 
CGCCTCCACA 
CCCCAATGTG 
GCTTGAGAAG 
CATCGCCTTC 



31 
I 

CTGCTCGTGC 
AACCGGCATG 
GTGCAGGTGA 
GTGGGCCCCG 
GTGAGCCGCA 
AACCGGCCGC 
GOCTTCTCTC 
ACTCACAACC 
TOGCACTCCG 
AAGATCAACG 
AGCATCAGCG 
TTCCTCATGA 



41 

I 

CCGTGCCGCT 
CGGTGTACTG 
ACGTGAACGA 
GGGCGGGACC 
ACGGCTACCG 
ACGCCCGGCA 
TGGGCTACGA 
TGCACTGGAA 
GGGAGAAGCC 
TGCTGGAAGA 
GGACCAGCCC 
CGTTCTTGGC 



51 
I 

GCTGCCGCTG 
GAACAGCTCC 
CTATCTGGAT 
GGGGCCCGGA 
CACCTGCAAC 
CAGCCCCATC 
GTTCCACGCC 
GTGTCTGAGG 
GGTCCCCACT 
CTTTGAGGGA 
CAAACGGGAA 
CTCCTAG 



Seq ID NO: 130 Protein Bequence 
Protein Accession #: NP_004 943.1 

} " 21 31 41 51 

• I I I I I 

MAAAPLLLLL LLVPVPLLPL LAQGPGGALG NRHAVYWNSS NQHLRREGYT VQVNVNDYLD 
IYCPHYNSSG VGPGAGPGPG GGAEQYVLYM VSRNGYRTCN ASQGFKRWEC NRPHAPHSPI 
KFSEKFQRYS AFSLGYEFHA GHEYYYISTP THNLHWKCLR MKVFVCCAST SHSGEKPVPT 
LPQPTMGPNV KINVLEDFEG ENPQVPKLEK SISGTSPKRE HLPLAVGIAF FLMTFLAS 

Seq ID NO: 131 DNA sequence 

Nucleic Acid Accession #: NMJU2445.1 

Coding sequence: 276.. 1271 " 




21 
I 

CCGACCCGGG 
CGCAGCCCCT 
TGCAGCATCG 
GGCTTAAATA 
TGCTCCTGCC 
GCTCTCCTCC 
TCCGCCAGAG 
TTCCCCAAGC 
GCGCATAGCT 
GACTTTGCGG 
GCGCTGCAGA 
ACGTCGGCGG 
GTGCCCAGCC 
TGGCGGGAAC 
ACCTTCTCCT 
TCCTCTCCCA 
ATCGCCAGGG 
CCAGTCCTGC 
CTGGACTGCG 
CTCGGGACCA 
TGCCCCGAGC 
CCCGCAGCCC 
CTGCAGGCGG 
GACCATCTCT 
TCCTTTCCCA 
CCTGCAGGAT 
TTATTGCTGC 
AGATACCTCA 
AGCGGGGGCC 
GTTATGGATC 



31 
I 

GAAGGTCGCT 
TCTCCTCCTT 
AAGACAGGAG 
GGAGCTCCGG 
GGGTGATGGA 
TGGCCACTCT 
CCCCGGCCAA 
AGTACCCCCT 
CCGACTACAG 
AGCGCGGCGA 
GCGTGCACGC 
AGCTGGAGGT 
CCGACTGGTT 
AGGOGGCGCT 
CCCCCAACTT 
GCCACCCGGC 
TGACACTGGT 
CCAGCAGGGA 
AGGTCTCCCT 
AGAGCAGGAC 
TOGAAGAAGA 
CTGGGGCCCC 
CCGAGGCACA 
GCACTGAAGG 
ACCTTGCTTC 
AAAGTCATCC 
TCCAGGAGAT 
GACCTGGTGC 
ACTTGAGAAG 
TCTCTGCGTT 



41 

I 

GGGCAGGGCG 
TCTCCCACGT 
GAACTGGAGC 
GCTCTGGCTG 
AAACCCCAGC 
CGGCGCCGCC 
ATACAGCATC 
GTTCCGCCCC 
CATGTGGAGG 
GGCCTGGGCG 
GGTGTTTTCG 
GCAGCGCAGG 
CGTGGGCGTG 
GGACCTGTAC 
CGCCACCATC 
CAACTCCTTC 
GCGGCTGCGA 
CAA7GAGATT 
GTGGTCGTCC 
TCGCTACGTC 
GGCTGAGTGC 
CGGAGCCATG 
GGGGGTTTCG 
GCCCTCTGGT 
TTAGGGGCCC 
CCAAGGCTCC 
TCTCCTTCAT 
TCTAGGCTGT 
TGAATAAATG 
TGAATAAAGA 



51 
I 

AGTTGGGAAA 
CCTATCTGCC 
CTCATTGGCC 
GGACCCGACC 
CCGGCCGCCG 
GGCCAGCCTC 
ACCTTCACGG 
CCTGCGCAGT 
AAGAACCAGT 
CTGATGAAGG 
GCGCCCGCCG 
CACTCGCTGG 
GACAGCCTGG 
CCCTACGACG 
CCGCAGGACA 
TACTACCCGC 
CAGAGCCCCA 
GTAGACAGCG 
TGGGGACTGT 
CGGGTCCAGC 
GTCCCTGATA 
GGGTGTCGGG 
CGCTGCTCCT 
GGCCGGCACG 
CCGTGTCCCG 
AGCTACTCTA 
CGTCCAGGGG 
GCTGAGCCCA 
GGGCGGTTTC 
CTATCTCTGT 



51 



41 

I I 
AKYSITFTGK WSQTAFPKQY 
GEAWALMKEI EAAGEALQSV 
WFVGVDSLDL CDGDRWREQA 
PANSFYYPRL KALPPIARVT 
SLWSSWGLCG GHCGRLGTKS 



Seq ID NO: 132 Protein sequence 
Protein Accession #: NP_036S77.1 

1 11 21 31 

I I I I 

MENPSPAAAL GKALCALLLA TLGAAGQPLG GBSICSARAP 
PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDPAER 
HAVPSAPAVP SGTGQTSAEL EVQRRHSLVS PWRIVPSPD 
ALDLYPYDAG TDSGFTPSSP NPATIPQDTV TBITSSSPSH 
LVRLRQSPRA PIPPAPVLPS RDNEIVDSAS VPETPLDCEV 
RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 

Seq ID NO: 133 DNA sequence 

Nucleic Acid Accession #: NM__019894 

Coding sequence: 1..1314 

1 11 21 31 41 51 

I I I I I | 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 

1002 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



60 
120 
180 
240 
300 



60 



WO 03/042661 



AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT OVTTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGAOGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC A6GTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 4B0 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGO TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



Seq ID NO: 134 Protein sequence 
Protein Accession ft: NP_063947.l 



1 11 21 31 41 51 

I 1 I I I ( 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSIASIIIW VLIKVILDKY 60 

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSPPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALABTAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSOKLGSPPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TPSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 



Seq ID NO: 135 DNA sequence 
Nucleic Acid Accession ft: NMJJ03045 
Coding sequence: 148.. 2 03 7 

1 11 21 31 41 51 

1 I I J I I 

CGATCCTGCC GGAGCCCCGC CGCCGCCGGC TTGGATTCTG AAACCTTCCT TGTATCCCTC 60 

CTGAGACATC TTTGCTGCAA GATCGAGGCT GTCCTCTGGT GAGAAGGTGG TGAGGCTTCC 120 

CGTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAG TCCTGCTCAA CATTGGGCAG 180 

CAGATGCTGC GGCGGAAGGT GGTGGACTGT AGCOGGGAGG AGACGCGGCT GTCTOGCTGC 240 

CTGAACACTT TTGATCTGGT GGCCCTCGGG GTGGGCAGCA CACTGGGTGC TGGTGTCTAC 300 

GTCCTGGCTG GAGCTGTGGC CCGTGAGAAT GCAGGCCCTG CCATTGTCAT CTCCTTCCTG 360 

ATCGCTGCGC TGGCCTCAGT GCTGGCTGGC CTGTGCTATG GCGAGTTTGG TGCTCGGGTC 420 

CCCAAGACGG GCTCAGCTTA CCTCTACAGC TATGTCACCG TTGGAGAGCT CTGGGCCTTC 480 

ATCACCGGCT GGAACTTAAT CCTCTCCTAC ATCATCGGTA CTTCAAGCGT AGCGAGGGCC 540 

TGGAGCGCCA CCTTCGACGA GCTGATAGGC AGACCCATCG GGGAGTTCTC ACGGACACAC 600 

ATGACTCTGA ACGCCCCCGG CGTGCTGGCT GAAAACCCCG ACATATTCGC AGTGATCATA 660 

ATTCTCATCT TGACAGGAGT TTTAACTCTT GGTGTGAAAG AGTCGGCCAT GGTCAACAAA 720 

ATATTCACTT GTATTAACGT CCTGGTCCTG GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 780 

GGATCGGTTA AAAACTGGCA GCTCACGGAG GAGGATTTTG GGAACACATC AGGCCGTCTC 840 

TGTTTGAACA ATGACACAAA AGAAGGGAAG CCCGGTGTTG GTGGATTCAT GCCCTTCGGG 900 

TTCTCTGGTG TCCTGTCGGG GGCAGCGACT TGCTTCTATG CCTTCGTGGG CTTTGACTGC 960 

ATCGCCACCA CAGGTGAAGA GGTGAAGAAC CCACAGAAGG CCATCCCCGT GGGGATCGTG 1020 

GCGTCCCTCT TGATCTGCTT CATCGCCTAC TTTGGGGTGT CGGCTGCCCT CACGCTCATG 1080 

ATGCCCTACT TCTGCCTGGA CAATAACAGC CCCCTGCCCG ACGCCTTTAA GCACGTGGGC 1140 

TGGGAAGGTG CCAAGTACGC AGTGGCCGTG GGCTCCCTCT GCGCTCTTTC CGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 1260 

TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATCGC CACATTAGCC 1320 

TCGGGTGCCG TTGCTGCTGT GATGGCCTTC CTCTTTGACC TGAAGGACTT GGTGGACCTC 1380 

ATGTCCATTG GCACTCTCCT GGCTTACTCG TTGGTGGCTG CCTGTGTGTT GGTCTTACGG 1440 

TACCAGCCAG AGCAGCCTAA CCTGGTATAC CAGATGGCCA GTACTTCCGA CGAGTTAGAT 1500 

CCAGCAGACC AAAATGAATT GGCAAGCACC AATGATTCCC AGCTGGGGTT TTTACCAGAG 1560 

GCAGAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGGAGCC TTCCAAAATC 1620 

TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC 1680 

ATTGTGACCG TGCTTGGAAG GGAGGCTCTC ACCAAAGGGG CGCTGTGGGC AGTCTTTCTG 1740 

CTCGCAGGGT CTGCCCTCCT CTGTGCCGTG GTCACGGGCG TCATCTGGAG GCAGCCCGAG 1800 

AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTCGTGAACG TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGGTCOG GTTTGCTGTG 1920 

TGGATGCTGA TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 

TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 

ACAGCCCCGC CCCCCGGAGG TGGCAGCAGC CCOGAGGGAC GCCCCCAGAG GACCGGGAGG 2100 
CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 

Seq ID NO: 136 Protein sequence 
Protein Accession ft: NM_003045 

1 11 21 31 41 51 

I I 1 1 I I 
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MGCKVLLNIG 
NAGPAIVISF 
YIIGTSSVAR 
LGVKESAMVN 
KPGVGGFMPF 
YFGVSAALTL 
VIYAMAEDGL 
SLVAACVLVL 
LSPKNMEPSK 
WTGVIWRQP 
FGYGLWHSEE 



QCKLRRKWD 
LIAALASVLA 
AWSATFDELI 
KIPTCINVt.V 
GFSGVLSGAA 
MMPYPCLDNN 
LFKFLANVND 
RYQPEQPNLV 
ISGLIVNIST 
ESKTKLSFKV 
ASLDADQART 



CSREETRLSR 
GLCYGEFGAR 
GRPIGEPSRT 
LGFIMVSGFV 
TCFYAFVGFD 
SPLPDAFKHV 
RTKTPIIATL 
YQMASTSDEL 
SLIAVLIITF 
PFLPVLPILS 
PDGNLDQCK 



CLNTFDLVAL 
VPKTGSAYLY 
HMTLNAPGVIi 
KGSVKNWQLT 
CIATTGEBVK 
GWEGAKYAVA 
ASGAVAAVMA 
DPADQNEIAS 
CIVTVLGREA 
IFVNVYLMMQ 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: NM_032044.1 

Coding sequence: 182.. 658 



I 

AAGATATAAA 
GCCCTTAGAG 
AGAAGCGCTA 
GATGGCTTCC 
CCTGGGTGAT 
TTGCTATGGT 
TTACGGAAAC 
AGAGTACATA 
GAAGAGGCAG 
CAAGTCCATG 
TTGGAGCAGC 
CAAGAATCAA 
GCTAAATCTG 
ACTACACTGG 
CTAGCTCTAA 
GTCTCTGGCT 
CTTTGGCCAT 
AGACCOCTTC 
CACTCAACTC 
TAGGTTGATG 



11 
I 

AGCTCCAGAA 
TCTTGGTTGC 
GTAAGGTCTC 
AGAAGCATGC 
ATCATCATGA 
TACTTCAGGA 
GGAGCCCACC 
AGTGGCTATC 
CAGTGGCAGT 
GGTGGGAACA 
AACGAATGCA 
GATTCTGCTA 
CTCATTATTT 
CTTTTTTAGG 
ATGTTTGCCC 
GTCTCGAGCA 
AAGAAGTAAA 
AGCTTCTACA 
CTGCTTGTTT 
TGGGCCATAC 



21 
I 

ACGTTGACTG 
CAAACAGATT 
TGAGATCCTT 
GGCTGCTCCT 
GACCCAGCTG 
AGCTGAGGAA 
TGGCATCTAT 
AGAGAAGCCA 
GGATTGATGG 
AGCACTGTGC 
ACAAGCGCCA 
ACTCCTGCAC 
CAGAGGGGAA 
CTTAGAGACA 
CGCCATCCCT 
GTCTAGAAGA 
GATTTGAAGA 
CCCTTCTGCC 
TTCCTTTGGC 
ATTCCTTTAA 



31 
i 

GGACCACTGG 
TGCAGATCAA 
GCACTAGCTA 
ATTGCTGAGC 
TGCTCCTGGA 
CTGGTCTGAT 
CCTGAGTTTA 
GCOGATATGG 
GGCCATGTAT 
TGAGATGAGC 
ACACTTCCTG 
AGCCCCGTCC 
ACCTAGCAAA 
GAAACTTTAG 
TTCCACAGTA 
GTGCATCTCC 
CAGAAGGAAG 
CTCTCTCCAT 
CATAGGAAGG 
TAAACCATTG 



GVGSTLGAGV 
SYVTVGELWA 
AENPDIFAVI 
BEDFGNTSGR 
NPQKAIPVGI 
VGSLCALSAS 
FLFDLKDLVD 
TNDSQLGFLP 
LTKGALWAVP 
LDQGTWVRFA 



41 

t 

AGACACTGAA 
GGAGAACCCA 
CATCCTCAGG 
TGCCTGGCCA 
TGGTTTTACC 
GCCGAGCTCG 
AAGGAAGCCA 
ATTGGCCTGC 
CTGTACAGAT 
TCCAATAACA 
TGCAAGTACC 
TCTTCCTTTC 
CTAAGAGTGA 
CATTGGCCCA 
TCCTTCTTCC 
AGCCTATGAA 
AAACTCAGGA 
TGCCTGCACC 
TTTACCAGTA 
TGTACATAAG 



YVLAGAVARE 
FITGWNLILS 
IILILTGLLT 
LCLNNDTKEG 
VASLLICFIA 
LLGSMFPMPR 
LMSIGTLLAY 
EAEMFSLKTI 
LLAGSALLCA 
VWMLIGPIIY 



51 
I 

GAAGGCAGGG 
GGAGTTTCAA 
GTAGGAGGAA 
AAACAGGAGT 
ACAAGTCCAA 
AGTGTCAGTC 
GCACCATAGC 
ACGACCCACA 
CCTGGTCTGG 
ACTTTTTAAC 
GACCATAGAG 
TGCTAGCCTG 
TAAGGGCCCT 
GTAGTGGCTT 
CTCCTCCCCT 
ACAGCTGGGT 
GTAAGCTTCT 
CCACCCCAGC 
GAATCCTTGC 
AAAAAAAAAA 



Seq ID NO: 138 Protein sequence 
Protein Accession #: NP 114433.1 



11 



21 



31 41 51 

f ! 1 I | | 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQHIDG AMYLYRSWSG 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HPLCKYRP 

Seq ID KO: 139 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52.. 3 042 



1 
I 

GCTCACCCAG 
GTTAACCTCA 
GACCGGGGCA 
CCCAAACTCA 
AATGTACAGT 
TACCAGGCAG 
GTGGCAGGGA 
GCGGAGGTTG 
TACCCCTACA 
AAGTTTGCTC 
CAGCAGCTGG 
GGAGGTTATG 
TGCGTCACAG 
TTGGGCCACT 
CTTGGCCTCC 
AAGATGATCA 
GCTGTGTCCA 
GGATCTGAGG 
GGAATGTACT 
GCACATTCCA 
TCTGCCAAGG 
GACGCCGACC 
AACCAGGACC 
GCTGACAATG 
TCCAAGCAAG 
ATGATGGACA 
ATAGGCCAGA 
AACTGCACTT 
CGCCTGAATA 
GAOGTTCCGA 



11 
I 

GAAAAATATG 
GCACCGAGGT 
GAGCCTGCCG 
CAGTCACCAT 
CATGGAAACC 
AAGAGTTCCA 
AACCAATGTA 
GGCTTCTGAG 
GAAACCACAT 
TGGGATTTAA 
TGGGTCAGTA 
ACCCACCCAC 
TCCATGGCTC 
GCTTCTTCAC 
TTGTCAAGTC 
CAGGAGACTC 
CCTTCTGGAT 
AAACTGGATT 
CCCCAGGTTA 
ACTACCGGGC 
ACAAGCGGCC 
CGCTGAAGCC 
ACGGGGCCTG 
GCATTGGCCT 
AGATAAAGAA 
ATAGGATCTG 
ATTTTCCAAT 
TCCGAAAGTT 
ATGCCTGGCA 
TTACTTCCAG 



21 
I 

CAATCGTCCC 
TGTCTACAAA 
GAGCTACCGT 
TGACACCAAT 
TGGAGATACC 
GGTGCTTCCC 
CCTGCACATC 
CCGGAACATC 
CTGCAATTTC 
GGCAGCACAC 
CCCGATTCAC 
ATACATCAGG 
CAATGGCTTG 
GGAAGATGGG 
TGGAACCCTC 
CTACCCAGGG 
GGCCAATCCC 
TTGGTTTATT 
TTCAGAGCAC 
TGGCATGATC 
GTTCCTCTCA 
CCGGGAGCCG 
GCTGCGCGGC 
GACCCTGGCC 
CAGCTTGTTT 
GGGCCCTGGC 
TAGAGGAATT 
TGTGGCCCTG 
GAGCTGCCCC 
AGTGTTCTTC 



31 
I 

ATTGATATAC 
AAAGGCCAGG 
GTACGGTTCC 
GTGAACAGCA 
CTGGTCATTG 
TGCAGATCCT 
GGGGAGGAGA 
ATAGTGATGG 
TTTGACTTCG 
TTGGAGGGGA 
TTCCACCTGG 
GACCTCTCCA 
TTGATCAAGG 
CCGGAGGAAC 
CTCCCCTCGG 
TACATCCCCA 
AACAACAACC 
TTTCACCACG 
ATTCCACTGG 
ATAGACAAGG 
ATCATCTCTG 
GCCATCATCA 
GGGGATGTGT 
AGTGGTGGAA 
GTTGGCGAGA 
GGCTTGGACC 
CAGTTATATG 

CATAACAACG 
GGAGAGCCTG 



41 
I 

AGGCCACTAC 
ATTATAGGTT 
TCTGTGGGAA 
CCATTCTGAA 
CCAGTACTGA 
GCGCCCCCAA 
TAGACGGCGT 
GGGAGATGGA 
ATACCTTTGG 
CGGAGCTGAA 
COGGTGATGT 
TCCATCATAC 
ACGTTGTGGG 
GCAACACTTT 
ACCGTGACAG 
AGCCCAGGCA 
TCATCAACTG 
TACCAACGGG 
GAAAATTCTA 
GAGTCAAAAC 
CCAGATACAG 
GACACTTCAT 
GGCTGGACAG 
CCTTCCCGTA 
GTGGCAACGT 
ATAGCGGAAG 
ATGGCCCCAT 
ACACCAGCGC 
TGACCGGCAT 
GGCCCTGGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 



51 
I 

AATGGATGGA 
TGCTTGCTAC 
GCCTGTGAGG 
CTTGGAGGAT 
TTACTCCATG 
CCAGGTCAAA 
GGACATGCGG 
GGACAAATGC 
GGGCCACATC 
GCATATGGGA 
AGACGAAAGG 
ATTCTCTCGC 
CTATAACTCT 
TGACCACTGT 
CAAGATGTGC 
AGACTGCAAT 
TGCCGCTGCA 
CCCCTCCGTG 
TAACAACCGA 
CACCGAGGCC 
CCCTCACCAG 
TGCCTACAAG 
CTGCCGGTTT 
TGACGACGGC 
GGGGACGGAA 
GACCCTCCCT 
CAACATCCAA 
CCTGGCCTTC 
TGCCTTTGAG 
CAACCAGCTG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



1004 



WO 03/042661 



PCT7US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG AOGGCTCOGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTCCCGACT QOAGAGGGGC CATTTGCAGT GGQTQCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CGGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGA6TGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG AOGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAAGQAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGGGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCOGGT ( 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAQCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTOGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTT CC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 

ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 

GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTQTA GGCATCACCA TGAACAAAGA 5700 

TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 140 Protein sequence 
Protein Accession #: XPJ)51860.2 



I 

MDGVNLSTEV 
LEDNVQSWKP 
DMRAEVGLLS 
HMGQQLVGQY 
VNSLGHCPPT 
DCMAVSTPWM 
NNRAHSNYRA 
AYKNQDHGAW 
GTEKMDNRIW 
LAPRLKNAHQ 



11 
I 

VYKKGQDYRF 
GDTLVIASTD 
RNIIVMGEME 
PIHFHLAGDV 
EDGPBKRNTP 
ANPNKNLINC 
GMIIDNGVKT 
LRGGDVWLDS 
GPGGLDHSGR 
SCPHNNVTGI 



21 
I 

ACYDRGRACR 
YSMYQAEEFQ 
DKCYPYRNHI 
DERGGYDPPT 
DHCLGLLVKS 



TBASAKDRRP 
CRPADNGIGL 
TLPIGQHPPI 
APEDVPITSR 



31 
I 

SYRVRFLOGK 
VLPCRSCAPN 
CNFFDFDTFG 
YIRDLSIHHT 
GTLLPSDRDS 
WPIFHHVPTG 
FLSIISARYS 
TLASGGTPPY 
RGIQLYDGPI 
VFFGEPGPWF 



41 

I 

PVRPKLTVTI 
QVKVAGKPMY 
GHIKFALGPK 
PSRCVTVHGS 
KMCKMITGDS 
PSVGMYSPGY 
PBQDADPLKP 
DDGSKQEIKN 
NXQNCTPRKF 
NQLDMDGDKT 



51 
I 

DTNVNSTILN 
LHIGEKIDGV 
AAHLEGTELK 
KGIXIKDWG 
YPGYIPKPRQ 
SEHIPLGKFY 
REPAIZSHPI 
SLPVGESGNV 
YALEGHHTSA 
SVPHDVDGSV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



1005 



WO 03/042661 



PC17US02/36810 



5 

10 
15 
20 
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SEYPGSYLTK 
HPLYLEGALT 
RGTTFSILSD 
NEREKFAFCS 
TKDHFLEVKM 
NSIIXJGIPWQ 
GFKGSFRPIW 



NDNWLVRHPD 
RSTHYQQYQP 
VHNRLLKQTS 
MKGCERIKIK 
ESSKQHFPHL 
LFNYVATIPD 
VTLDTEDHKA 



CINVPDWRGA 
WTLQKGYTI 
KTGVPVRTLQ 
ALIPKNAGVS 
WNDFAYIEVD 
NSIVLMASKG 
KIFQWPIFV 



ICSGCYAQMY 
HWDQTAPAEL 
MDKVEQSYPG 
DCTATAYPKF 
GKKYPSSEDG 
RYVSRGPWTR 
VKKKKL 



IQAYKTSNLR 
AIWLINFNKG 
RSHYYWDEDS 
TERAWDVPM 
IQVWIDGNQ 
VLEKLGADRG 



MKIIKNDPPS 
DWIRVGLCYP 
GLLFLKLKAQ 
PKKLFGSQLK 
GRWSHTSFR 
LKLKEQMAFV 



660 
720 
780 
840 
900 
960 



Seq ID NO: 141 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence* 261.. 2861 



I 

GAGCTAGCGC 
CGGCGCGGGG 
AGCTACCACT 
ACGTCOGGGG 
AGAGGGAGCA 
TGCTGACCAT 
CTGGGTGCCC 
ACCATGTGCA 
CCATCCACAT 
TGCGAACCCG 
GCCCTTTCCA 
CGGATCCTTA 
ATGGACAGAA 
CAGAAGGAGG 
TCGACCCCAA 
AAGAGAGTGA 
TTGCAGTGAA 
AATTGGGAAG 
TGAAAGGAAA 
CTGCTGCTGC 
CTTTGTCCAG 
TATCTCAGAC 
TATGCAATCG 
AGGTTGTCTA 
GCOGGAGCTA 
CCATTGACAC 
AACCTGGAGA 
TCCAGGTGCT 
TGTACCTGCA 
TGAGCCGGAA 
ACATCTGCAA 
TTAAGGCAGC 
AGTACCCGAT 
CCACATACAT 
GCTCCAATGG 
TCACGGAAGA 
AGTCTGGAAC 
ACTCCTACCC 
GGATGGCCAA 
GATTTTGGTT 
GTTATTCAGA 
GGGCTGGCAT 
GGCCGTTCCT 
AGCCCCGGGA 
CCTGGCTGCG 
AAGGCTTCTT 
CCTCTGGAAT 
CGGGGTCGCC 
GGGAGCAATG 
CTCTGACTCC 
TCTTCATCCA 
CCTCGGOCTG 
TAACAGATGG 
TGGGGAGCCC 
GTGCCCAGGC 
AAATGACTAG 
AACCCGOCCT 
AGCCCAGCCT 
TCTGCAGCTC 
CCAATTTGGC 
TGCTGCTTAA 
TAAGAGGGAG 
GTCTTGTGAT 
AGAGGGCCTG 
TTCCCCAGGT 
GGGAGGTGAC 
GGAAGGACTT 
TCCTTCTTGT 
GAACCACTTA 
TTCATTTAAA 



11 
I 

TCAAGCAGAG 
AGCCAGCGGG 
CCGCTTGCCC 
CCGCTGCGCT 
CACTGCCAGG 
CAGCTGGCTC 
TGACCAGAGC 
TATCGGCCAG 
CTCAGAGGGA 
GCACATCCTG 
GGGCAATTTC 
CTATGGTCTG 
AAAGCTCTCC 
CTATTTTTTT 
ATCAGGCACA 
ACGTCTGGTC 
TGATGAAGGT 
CAAACACTTC 
TCCATCATCT 
CCGGGTATTC 
TGAGTGGGTT 
TAAAGGTGGG 
TCCCATTGAT 
CAAAAAAGGC 
CCGTGTACGG 
CAATGTGAAC 
TACCCTGGTC 
TCCCTGCAGA 
CATCGGGGAG 
CATCATAGTG 
TTTCTTTGAC 
ACACTTGGAG 
TCACTTCCAC 
CAGGGACCTC 
CTTGTTGATC 
TGGGCCGGAG 
OCTCCTCCCC 
AGGGTACATC 
TCCCAACAAC 
TATTTTTCAC 
GCACATTCCA 
GATCATAGAC 
CTCAATCATC 
GCCGGCCATC 
CGGCGGGGAT 
GCTTACAGGA 
GGCTCAGGGA 
CTTTGCTCAC 
GGCTTTGCTG 
AAGAGGGTGA 
GGGAACTGAG 
GGATTTCAGA 
GGAAAGTGAG 
CACCCTAGCC 
ACTCCTGAGG 
AGTAGAATGA 
CCCCTTGGTG 
GGGTGCACAG 
TACAGGTGAG 
TGATCTTGGG 
TGCCCTGCTC 
AACTCTATCT 
GAACTACATT 
CCTGGCTCCC 
GGGAGCCAAC 
CATAGGGCTC 
CTTCCAGGGA 
CCACGGTTTT 
GGATGTGATC 
AAAGATATCT 



21 
I 

CCCAGCGCGG 
GCTGAGCGCG 
AOGCCCCGGG 
CCTGGCCOGC 
ATGGGAGCTG 
ACTCTGACCT 
CCTGAGTTGC 
GGCAAGACAC 
GGCAAGCTGG 
ATTGACAACG 
ACCATCATTT 
AAGTACATTG 
TGGACATTTC 
GAAAGGAGCT 
GTCATCCATT 
CAGTATTTGA 
TCTCGAAATC 
CTGCACCTTG 
TCAGTGGAAG 
AAATTGTTCC 
CAAGACGTGG 
GAGAAAATTT 
ATACAGGCCA 
CAGGATTATA 
TTOCTCTGTG 
AGCACCATTC 
ATTGCCAGTA 
TCCTGCGCCC 
GAGATAGACG 
ATGGGGGAGA 
TTCGATACCT 
GGCACGGAGC 
CTGGCOGGTG 
TCCATCCATC 
AAGGACGTTG 
GAACGCAACA 
TCGGACCGTG 
CCCAAGCCCA 
AACCTCATCA 
CACGTACCAA 
CTGGGAAAAT 
AACGGAGTCA 
TCTGCCAGAT 
ATCAGACACT 
GTGTGGCTGG 
ATGAAGGCTG 
TTCAGCCCTC 
GTCTCTCTGG 
CTTATGAGCA 
AGTCCACAGA 
CACAGGGGGC 
GCTGGAAATA 
CCCCCAAGAT 
CTTGCTGCCA 
TAGCTTCTGG 
CAGCTAGCAG 
TTGGCGGTCC 
TAGCTGCAAC 
GCCCAGCAGA 
TGTCTGAACA 
TCTCCCTGGC 
GTGGTTTATA 
TATCCCCTTT 
TCCACCCAAC 
TGTCAGGGAG 
TGCTTTTAAA 
GATTAGTGGT 
GTTGAGTTTT 
ACTTTCAGGT 
ATTTGAAAGT 



31 
I 

TGCTATCGGA 
GCCAGGGTCT 
AGCTCGCGGC 
GAGGCGTGAC 
CTGGGAGGCA 
GCTTCCCTGG 
AACCCTGGAA 
TGCTGCTCAC 
TCATTAAAGA 
GAGGAGAGCT 
TGTATGGAAG 
GGGTTGGTAA 
TGAACAAGAC 
GGGGCCACCG 
CTGACCGGTT 
ACGCGGTGCC 
TCGATGACAT 
GATTTAGACA 
ACCATATTGA 
AGACAGAGCA 
AGTGGACGGA 
CAGACCTCTG 
CTACAATGGA 
GGTTTGCTTG 
GGAAGCCTGT 
TGAACTTGGA 
CTGATTACTC 
CCAACCAGGT 
GCGTGGACAT 
TGGAGGACAA 
TTGGGGGCCA 
TGAAGCATAT 
ATGTAGACGA 
ATACATTCTC 
TGGGCTATAA 
CTTTTGACCA 
ACAGCAAGAT 
GGCAAGACTG 
ACTGTGCCGC 
CGGGCCCCTC 
TCTATAACAA 
AAACCACCGA 
ACAQCCCTCA 
TCATTGCCTA 
ACAGCTGCCA 
GGGGCATTTT 
CCTGCCGCTG 
CCCACTCATG 
CAGAGGAATT 
AGTGAGCTCC 
CTCCAGGAGA 
TAGAAAATAT 
GGGAAAGAAC 
CACCACATTG 
AAATGGGGAC 

ATCTcrrccc 

CTGTGGCCTT 
TCCCCATTGG 
GGGAGTAGGG 
GCTATTGGGT 
CCACCTTATA 
ATCTTGCAOG 
CCTGCCCCAA 
TGCACCCATG 
GTCTTTCCCA 
GATATGGCTG 
GATGGAGAGG 
CACTCTTCTA 
GGCCAGGAAT 
TCTCAGAGTT 



41 

I 

CAGAGCCTGG 
GAACCCAGAT 
GCCTGGCGGT 
ACTGTCTOGG 
GGACTTCCTC 
GGOCACATCC 
CCCTGGCCAT 
CTCTTCTGCC 
CCAOGACGAG 
GCATGCTGGG 
GGCTGATGAA 
AGGAGGCGCT 
CCTTCACCCA 
TGGAGTTATT 
TGACACCTAT 
CGATGGCAGG 
GGCCAGGAAG 
CCCTTGGAGT 
ATATCATGGA 
TGGCGAATAT 
GTGGTTCGAT 
GAAAGCTCAC 
TGGAGTTAAC 
CTACGACCGG 
GAGGCCCAAA 
GGATAATGTA 
CATGTACCAG 
CAAAGTGGCA 
GCGGGCGGAG 
ATGCTACCCC 
CATCAAGTTT 
GGGACAGCAG 
AAGGGGAGGT 
TC GCTG CGTC 
CTCTTTGGGC 
CTGTCTTGGC 
GTGCAAGATG 
CAATGCTGTG 
TGCAGGATCT 
CGTGGGAATG 
CCGAGCACAT 
GGCCTCTGOC 
CCAGGACGCC 
CAAGAACCAG 
TTTCAGAGGG 
GCTGGGGGGA 
CCTGCTGAAG 
ATGGAGAAGT 
CAGTCCCCAG 
TGCCTTAGGG 
CCCTAGATGT 
CTAGCCCAAA 
CACACAGCTA 
CCTCAACAAC 
AAGTOCCCTC 
TCCTGCTCCC 
CACTTTGTTC 
TGCTACCTGG 
CTCGCCATGT 
OCACCCCAGT 
GAGAGCCCAA 
AGGCACCAGA 
CCACAAACTC 
AGACTOGGTC 
CCAAACATCT 
CTTCAAAGGC 
AGAGTTAAAA 
ATGCAAGGGT 
GTTGAATGTC 
GTACATATGT 



51 
I 

CGAGCGCAAG 

TTCCCAGACT 

CAGCGACCAG 

CTACAGACCC 

TTCAAGGCCA 

ACAGTGGCTG 

GACCAAGACC 

ACGGTCTATT 

CCGATTGTTT 

AGTGCCCTCT 

GGTATTCAGC 

CTTGAGTTGC 

GGTGGCATGG 

GTTCATGTCA 

AGATCCAAGA 

ATCCTTTCTG 

GCGATGACCA 

TTTCTAACTG 

CATCGAGGCT 

TTCAATGTTT 

CATGATAAAG 

CCAGGAAAAA 

CTCAGCACCG 

GGCAGAGCCT . 

CTCACAGTCA 

CAGTCATGGA 

GCAGAAGAGT 

GGGAAACCAA 

GTTGGGCTTC 

TACAGAAACC 

GCTCTGGGAT 

CTGGTGGGTC 

TATGACCCAC 

ACAGTCCATG 

CACTGCTTCT 

CTCCTTGTCA 

ATCACAGAGG 

TCCACCTTCT 

GAGGAAACTG 

TACTCCCCAG 

TOCAACTACC 

AAGGACAAGC 

GACCCGCTGA 

GACCACGGGG 

GAGGCTCAGG 

GATGAGGCAG 

CTGGTGACTA 

GTGGTCAGAG 

GCAGCCCTGC 

CCTCATTTGC 

GCTCGTACTC 

GCCTTCATTT 

AGGGAGGGCC 

CGGCCCCAGA 

GAAGGAAAGG 

AGCGCACACA 

ACTACCTGTC 

CTCTCCTGTC 

TTCTGGTGAG 

CCCTTTCAGC 

AGAGCTCCTG 

GTCTCCCTGG 

TTTCCTTCAA 

GAAGAGTCCA 

TTCAGCTGCT 

CAGAGTCACA 

TGACCTCATG 

CTCACACTGT 

TTTGGCTCAG 

TTCACAGTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 



1006 



WO 03/042661 



AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 

TGTAA GACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 

TTTTTCTGTT GCCGAAATAG CTGCTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4S60 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 142 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

III 111 

KGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PBLQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDB PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFP 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEQ 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS PLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGBY FNVSLSSEWV QDVBWTBWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 

STILNLEDNV QSWKPGDTLV IASTDYSMYQ ABEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIPH HVPTGPSVGM YSPGYSEHIP 720 

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 

IRHFIAYKNQ DHGAWLRGGD VWLDSCHPRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840 
FSPPCRCLLK LVTTGSPFAH VSLAHS 

Seq ID NO: 143 NM DNA sequence 
Nucleic Acid Accession #: NM_006017 
Coding sequence: 38.. 2635 

] 11 21 31 41 51 

I I I I I I 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300 

TGATTATGAC AAGCCAGAAA CTGTAATCTT AGGTCTAAAG ATTGTCTACT ATGAAGCAGG 360 

GATTATTCTA TGCTGTGTCC TGGGGCTGCT GTTTATTATT CTGATGCCTC TGGTGGGGTA 420 

TTTCTTTTGT ATGTGTCGTT GCTGTAACAA ATGTGGTGGA GAAATGCACC AGCGACAGAA 480 

GGAAAATGGG CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 540 

AATAAGCATT GGCATCTTCT ATGGTTTTGT GGCAAATCAC CAGGTAAGAA CCCGGATCAA 600 

AAGGAGTCGG AAACTGGCAG ATAGCAATTT CAAGGACTTG CGAACTCTCT TGAATGAAAC 660 

TCCAGAGCAA ATCAAATATA TATTGGCCCA GTACAACACT ACCAAGGACA AGGCGTTCAC 720 

AGATCTGAAC AGTATCAATT CAGTGCTAGG AGGCGGAATT CTTGACCGAC TGAGACCCAA 780 

CATCATCCCT GTTCTTGATG AGATTAAGTC CATGGCAACA GCGATCAAGG AGACCAAAGA 840 

GGCGTTGGAG AACATGAACA GCACCTTGAA GAGCTTGCAC CAACAAAGTA CACAGCTTAG 900 

CAGCAGTCTG ACCAGCGTGA AAACTAGCCT GOGGTCATCT CTCAATGACC CTCTGTGCTT 960 

GGTGCATCCA TCAAGTGAAA CCTGCAACAG CATCAGATTG TCTCTAAGCC AGCTGAATAG 1020 

CAACCCTGAA CTGAGGCAGC TTOCACCCGT GGATGCAGAA CTTGACAACG TTAATAACGT 1080 

TCTTAGGACA GATTTGGATG GCCTGGTCCA ACAGGGCTAT CAATCCCTTA ATGATATACC 1140 

TGACAGAGTA CAACGCCAAA CCACGACTGT CGTAGCAGGT ATCAAAAGGG TCTTGAATTC 1200 

CATTGGTTCA GATATCGACA ATGTAACTCA GCGTCTTCCT ATTCAGGATA TACTCTCAGC 1260 

ATTCTCTGTT TATGTTAATA ACACTGAAAG TTACATCCAC AGAAATTTAC CTACATTGGA 1320 

AGAGTATGAT TCATACTGGT GGCTGGGTGG CCTGGTCATC TGCTCTCTGC TGACCCTCAT 1380 

CGTGATTTTT TACTACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 1440 

CCCGACCACC CG AGGCT GTG TCTCCAACAC CGGAGGCGTC TTCCTCATGG TTGGAGTTGG 1500 

ATTAAGTTTC CTCTTTTGCT GGATATTGAT GATCATTGTG GTTCTTACCT TTGTCTTTGG 1560 

TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACACGAGC AAGGAATTAT TCCGGGTTTT 1620 

GGATACACCC TACTTACTAA ATGAAGACTG GGAATACTAT CTCTCTGGGA AGCTATTTAA 1680 

TAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACTGCAAAA AAAATAGAGG 1740 

CACTTACGGC ACTCTTCACC TGCAGAACAG CTTCAATATC AGTGAACATC TCAACATTAA 1800 

TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAGTCTG AAGGTAAATC TTAATATCTT 1860 

TCTGTTGGGT GCAGCAGGAA GAAAAAACCT TCAGGATTTT GCTGCTTGTG GAATAGACAG 1920 

AATQAATTAT GACAGCTACT TGGCTCAGAC TGGTAAATCC CCCGCAGGAG TGAATCTTTT 1980 

ATCATTTGCA TATGATCTAG AAGCAAAAGC AAACAGTTTG CCCCCAGGAA ATTTGAGGAA 2040 

CTCCCTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGCAACGAG TCCTTCCTAT 2100 

AGAACAATCA CTGAGCACTC TATACCAAAG CGTCAAGATA CTTCAACGCA CAGGGAATGG 2160 

ATTGTTGGAG AGAGTAACTA GGATTCTAGC TTCTCTGGAT TTTGCTCAGA ACTTCATCAC 2220 

AAACAATACT TCCTCTGTTA TTATTGAGGA AACTAAGAAG TATGGGAGAA CAATAATAGG 2280 

ATATTTTGAA CATTATCTGC AGTGGATCGA GTTCTCTATC AGTGAGAAAG TGGCATCGTG 2340 

CAAACCTGTG GCCAC CGCTC TAGATACTGC TQTTGATGTC TTTCTGTGTA GCTACATTAT 2400 

OGAC CCCTT G AATTTGTTTT GGTTTGGCAT AGGAAAAGCT ACTGTATTTT TACTTCOGGC 2460 

TCTAATTTTT GGGGTAAAAC TGGCTAAGTA CTATOGTCGA ATGGATTCGG AGGACGTGTA 2520 

CGATGATGTT GAAACTATAC CCATGAAAAA TATGGAAAAT GGTAATAATG GTTATCATAA 2580 

AGATCATGTA TATGGTATTC ACAATCCTGT TATGACAAGC CCATCACAAC ATTGATAGCT 2640 

GATGTTGAAA CTGCTTGAGC ATCAGGATAC TCAAAGTGGA AAGGATCACA GATTTTTGGT 2700 

AGTTTCTGGG TCTACAAGGA CTTTCCAAAT CCAGGAGCAA GGCCAGTGGC AACGTAGTGA 2760 

CTCAGGCGGG CACCAAGGCA ACGGCACCAT TGGTCTCTGG GTAGTGCTTT AAGAATGAAC 2820 
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ACAATCACGT TATAGTCCAT GGTCCATCAC TATTCAAGGA TGACTCCCTC CCTTCCTGTC 2880 

TATTTTTGTT TTTTACTTTT TTACACTGAG TTTCTATTTA GACACTACAA CATATGGGGT 2940 

GTTTGTTCCC ATTGGATGCA TTTCTATCAA AACTCTATCA AATGTGATGG CTAGATTCTA 3000 

ACATATTGCC ATGTGTGGAG TGTGCTGAAC ACACACCAGT TTACAGGAAA GATGCATTTT 3060 

GTGTACAGTA AACGGTQTAT ATACCTTTTG TTACCACAGA GTTTTTTAAA CAAATGAGTA 3120 

TTATAGGACT TTCTTCTAAA TGAGCTAAAT AAGTCACCAT TGACTTCTTG GTGCTGTTGA 3180 

AAATAATCCA TTTTCACTAA AAGTGTGTGA AACCTACAGC ATATTCTTCA CGCAGAGATT 3240 

TTCATCTATT ATACTTTATC AAAGATTGGC CATGTTCCAC TTGGAAATGG CATGCAAAAG 3300 

CCATCATAGA GAAACCTGOG TAACTCCATC TGACAAATTC AAAAGAGAGA GAGAGATCTT 3360 

GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG AGTTGTTTTA ACAGATGCCA ATTACGGTGT 3420 

ACAGTTTAAC AGAGTTTTCT GTTGCATTAG GATAAACATT AATTGGAGTG CAGCTAACAT 3480 

GAGTATCATC AGACTAGTAT CAAGTGTTCT AAAATGAAAT ATGAGAAGAT CCTGTCACAA 3540 

TTCTTAGATC TGGTGTCCAG CATGGATGAA ACCTTTGAGT TTGGTCCCTA AATTTGCATG 3600 

AAAGCACAAG GTAAATATTC ATTTGCTTCA GGAGTTTCAT GTTGGATCTG TCATTATCAA 3660 

AAGTGATCAG CAATGAAGAA CTGGTCGGAC AAAATTTAAC GTTGATGTAA TGGAATTCCA 3720 

GATGTAGGCA TTCCCCCCAG GTCTTTTCAT GTGCAGATTG CAGTTCTGAT TCATTTGAAT 3780 
AAAAAGGAAC TTGG 

Seq ID NO: 144 NP Protein sequence 
Protein Accession #: NP_006008.1 

1 11 21 31 41 51 

I I I I I I 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFLYWQPRD FPEDTLRKPL QKAYESKIDY DKPBTVILGL KIVYYEAGII LCCVLGLLFI 120 

ILMPLVGYPP CMCRCQTKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN 180 

HQVRTRIKRS RKLADSMFKD LRTLLNETPE QIKYILAQYN TTKDKAPTDL NSINSVLGGG 240 

ILDRLRPNII PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS 300 

SLNDPLCLVH PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TOLDGLVQQG 360 

YQSLNDIPDR VQRQTTTWA GIKRVUJSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI 420 

HRNLPTLEEY DSYWWLGGLV ICSLLTLIVI FYYLGLLCGV OGYDRHATPT TRGCVSNTGG 480 

VPLMVGVGLS FLFCWILMII WLTFVFGAN VBKLICEPYT SKELFRVLDT PYLLNEDWEY 540 

YLSGKLFNKS KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES 600 

LKVHLNIFIiL GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS 660 

LPPGNLRNSL KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTRILASL 720 

DFAQNFITNN TSSVIIBETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP VATALDTAVD 780 

VPLCSYIIDP LNLFWFGIGK ATVPLLPALI FAVKLAKYYR RMDSEDVYDD VETIPMKNME 840 
NGNNGYHKDH VYGIHNPVMT SPSQH 

Seq ID NO i 145 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 38.. 2605 

1 11 21 31 41 51 

I I I I I I 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TOGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300 

TGATTATGAC ATTGTCTACT ATGAAGCAGG GATTATTCTA TGCTGTGTCC TGGGGCTGCT 360 

GTTTATTATT CTGATGCCTC TGGTGGGGTA TTTCTTTTGT ATGTGTCGTT GCTGTAACAA 420 

ATGTGGTGGA GAAATGCACC AGCGACAGAA GGAAAATGGG CCCTTCCTGA GGAAATGCTT 480 

TGCAATCTCC CTGTTGGTGA TTTGTATAAT AATAAGCATT GGCATCTTCT ATGGTTTTGT 540 

GGCAAATCAC CAGGTAAGAA CCCGGATCAA AAGGAGTCGG AAACTGGCAG ATAGCAATTT 600 

CAAGGACTTG CGAACTCTCT TGAATGAAAC TCCAGAGCAA ATCAAATATA TATTGGOCCA 660 

GTACAACACT ACCAAGGACA AGGOGTTCAC AGATCTGAAC AGTATCAATT CAGTGCTAGG 720 

AGGCGGAATT CTTGACOGAC TGAGACCCAA CATCATCCCT GTTCTTGATG AGATTAAGTC 780 

CATGGCAACA GCGATCAAGG AGACCAAAGA GGCGTTGGAG AACATGAACA GCACCTTGAA 840 

GAGCTTGCAC CAACAAAGTA CACAGCTTAG CAGCAGTCTG ACCAGCGTGA AAACTAGCCT 900 

GCJGGTCATCT CTCAATGACC CTCTGTGCTT GGTGCATCCA TCAAGTGAAA OCTGCAACAG 960 

CATCAGATTG TCTCTAAGCC AGCTGAATAG CAACCCTGAA CTGAGGCAGC TTCCACCCGT 1020 

GGATGCAGAA CTTGACAACG TTAATAACGT TCTTAGGACA GATTTGGATG GCCTGGTCCA 1080 

ACAGGGCTAT CAATCCCTTA ATGATATACC TGACAGAGTA CAAOGCCAAA CCACGACTGT 1140 

CGTAGCAGGT ATCAAAAGGG TCTTGAATTC CATTGGTTCA GATATCGACA ATGTAACTCA 1200 

GCGTCTTCCT ATTCAGGATA TACTCTCAGC ATTCTCTGTT TATGTTAATA ACACTGAAAG 1260 

TTACATCCAC AGAAATTTAC CTACATTGGA AGAGTATGAT TCATACTGGT GGCTGGGTGG 1320 

CCTGGTCATC TGCTCTCTGC TGACCCTCAT CGTGATTTTT TACTACCTGG GCTTACTGTG 1380 

TGGCGTGTGC GGCTATGACA GGCATGCCAC CCCGACCACC CGAGGCTGTG TCTCCAACAC 1440 

CGGAGGCGTC TTCCTCATGG TTGGAGTTGG ATTAAGTTTC CTCTTTTGCT GGATATTGAT 1S00 

GATCATTGTG GTTCTTACCT TTGTCTTTGG TGCAAATGTG GAAAAACTGA TCTGTGAACC 1560 

TTACACGAGC AAGGAATTAT TCCGGGTTTT GGATACACCC TACTTACTAA ATGAAGACTG 1620 

GGAATACTAT CTCTCTGGGA AGCTATTTAA TAAATCAAAA ATGAAGCTCA CTTTTGAACA 1680 

AGTTTACAGT GACTGCAAAA AAAATAGAGG CACTTACGGC ACTCTTCACC TGCAGAACAG 1740 

CTTCAATATC AGTGAACATC TCAACATTAA TGAGCATACT GGAAGCATAA GCAGTGAATT 1800 

GGAAAGTCTG AAGGTAAATC TTAATATCTT TCTGTTGGGT GCAGCAGGAA GAAAAAACCT 1860 

TCAGGATTTT GCTGCTTGTG GAATAGACAG AATGAATTAT GACAGCTACT TGGCTCAGAC 1920 

TGGTAAATCC CCOGCAGGAG TGAATCTTTT ATCATTTGCA TATGATCTAG AAGCAAAAGC 1980 

AAACAGTTTG CCCCCAGGAA ATTTGAGGAA CTCCCTGAAA AGAGATGCAC AAACTATTAA 2040 

AACAATTCAC CAGCAACGAG TCCTTCCTAT AGAACAATCA CTGAGCACTC TATACCAAAG 2100 

CGTCAAGATA CTTCAACGCA CAGGGAATGG ATTGTTGGAG AGAGTAACTA GGATTCTAGC 2160 

TTCTCTGGAT TTTGCTCAGA ACTTCATCAC AAACAATACT TCCTCTGTTA TTATTGAGGA 2220 

AACTAAGAAG TATGGGAGAA CAATAATAGG ATATTTTGAA CATTATCTGC AGTGGATCGA 2280 

GTTCTCTATC AGTGAGAAAG TGGCATCGTG CAAACCTGTG GCCACCGCTC TAGATACTGC 2340 
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TGTTGATGTC TTTCTGTGTA GCTACATTAT CGACCCCTTG AATTTGTTTT GGTTTGGCAT 2400 

AGGAAAAGCT ACTGTATTTT TACTTCCGGC TCTAATTTTT GOGGTAAAAC TGGCTAAGTA 2460 

CTATCGTCGA ATGGATTCGG AGGAOGTGTA CGATGATGTT GAAACTATAC CCATGAAAAA 2520 

TATGGAAAAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTGT 2580 

J TATGACAAGC CCATCACAAC ATTGATAGCT GATGTTGAAA CTGCTTQAQC ATCAGGATAC 2640 

TCAAAGTGGA AAGGATCACA GATTTTTGGT AGTTTCTGGG TCTACAAGGA CTTTCCAAAT 2700 

CCAGGAGCAA OGCCAGTGGC AACGTAGTGA CTCAGGCGGG CACCAAGGCA ACGGCACCAT 2760 

TGGTCTCTGG GTAGTGCTTT AAGAATGAAC ACAATCACGT TATAGTCCAT GGTCCATCAC 2 920 

TATTCAAGGA TGACTCCCTC CCTTCCTGTC T AT T TTTGT T TTTTACTTTT TTACACTGAG 2880 

1U TTTCTATTTA GACACTACAA CATATGGGGT GTTTGTTCCC ATTGGATGCA TTTCTATCAA 2940 

AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTGCC ATGTGTGGAG TGTGCTGAAC 3000 

ACACACCAGT TTACAGGAAA GATGCATTTT GTGTACAGTA AAOGGTGTAT ATACCTTTTG 3060 

TTACCACAGA GTTTTTTAAA CAAATGAGTA TTATAGGACT TTCTTCTAAA TGAGCTAAAT 3120 

AAGTCACCAT TGACTTCTTG GTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 3180 

AACCTACAGC ATATTCTTCA CGCAGAGATT TTCATCTATT ATACTTTATC AAAGATTGGC 3240 

CATGTTCCAC TTGGAAATGG CATGCAAAAG CCATCATAGA GAAACCTGCG TAACTCCATC 3300 

TGACAAATTC AAAAGAGAGA GAGAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG 3360 

AGTTGTTTTA ACAGATGCCA ATTACGGTGT ACAGTTTAAC AGAGTTTTCT GTTGCATTAG 3420 

on GATAAACATT AATTGGAGTG CAGCTAACAT GAGTATCATC AGACTAGTAT CAAGTGTTCT 3480 

Z\) AAAATGAAAT ATGAGAAGAT CCTGTCACAA TTCTTAGATC TGGTGTCCAG CATGGATGAA 3540 

ACCTTTGAGT TTGGTCCCTA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 3600 

GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAG CAATGAAGAA CTGGTCGGAC 3660 

AAAATTTAAC GTTGATGTAA TGGAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 3720 
GTGCAGATTG CAGTTCTGAT TCATTTGAAT AAAAAGGAAC TTGG 
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Seq ID NO: 146 Protein sequence 
Protein Accession #: 



on } 11 . 21 31 41 51 

30 | | | | | | 

MALVLGSLLL LGLCGNSPSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFBLVH 60 

IFLYWQPRD FPEDTLRKFL QKAYESKIDY DIVYYEAGII LCCVLGLLFI ILMPLVGYPP 120 

CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIPYGFVAN HQVRTRIKRS 180 

RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG ILDRLRPNII 240 

J J PVLDEIKSMA TAIKETKEAL ENMNSTLKSL, HQQSTQLSSS LTSVKTSLRS SLNDPLCLVH 300 

PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG YQSLNDIPDR 360 

VQRQTTTWA GIKRVLNSIG SDIDNVTQRL PIQDILSAPS VYVMNTESYI HRNLPTLEEY 420 

DSYWWLGGLV ICSLLTLIVI PYYLGLLCGV CGYDRHATPT TRGCVSNTGG VFLMVGVGLS 480 

FLFCWILMII WLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY YLSGKLFNKS 540 

4U KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES LKVNLNIFLL 600 

GAAGRKNLQD PAAOGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS LPPGNLRNSL 660 

KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL BRVTRILASL DFAQNFITNN 720 

TSSVIIEETK KYGRTIIGYP EHYLQWIEFS ISEKVASCKP VATALDTAVD VFLCSYIIDP 780 

LNLFWFGIGK ATVPLLPALI FAVKLAKYYR RMDSEDVYDD VETIPMKNME NGNNGYHKDH 840 

4J VYGIHNPVMT SPSQH 



Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: BC012089.1 

Coding sequences 236.. 2806 



1 11 21 31 41 51 

I I i I 1 | 

ATGCTCTCAG CTCTCCCGCC GCGGGATGGT GCCTTGAGTG AATGACCCCC TTGGAGAACA 60 

- TTCTTCCGCA TCCCTCGCCT CAAGOCAGCC TCAGACAGAA AACTGAAGAT TCAGCAGATC 120 

J 3 CAGTGCTTCC TGCTCCTCTT CTGCCCAGGA ACAOGCTTGC CTTCCCCAAG GCTTCCAGAA 180 

GCTCTGAGGC AGGAGGCACC AAGTTCTACC TCATGTTTGG AGGATCTTGC TAGCTATGGC 240 

CCTCGTACTC GGCTCCCTGT TGCTGCTGGG GCTGTGCGGG AACTCCTTTT CAGGAGGGCA 300 

GCCTTCATCC ACAGATGCTC CTAAGGCTTG GAATTATGAA TTGCCTGCAA CAAATTATGA 360 

rr\ GACCCAAGAC TCCCATAAAG CTGGACCCAT TGGCATXCTC TTTGAACTAG TGCATATCTT 420 

OU TCTCTATGTG GTACAGCCGC GTGATTTCCC AGAAGATACT TTGAGAAAAT TCTTACAGAA 480 

GGCATATGAA TCCAAAATTG ATTATGACAA GATTGTCTAC TATGAAGCAG GGATTATTCT 540 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGGGGT ATTTCTTTTG 600 

TATGTGTCGT TGCTGTAACA AATGTGGTGG AGAAATGCAC CAGCGACAGA AGGAAAATGG 660 

GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTGTATAA TAATAAGCAT 720 

OD TGGCATCTTC TATGGTTTTG TGGCAAATCA CCAGGTAAGA ACCCGGATCA AAAGGAGTCG 780 

GAAACTGGCA GATAGCAATT TCAAGGACTT GCGAACTCTC TTGAATGAAA CTCCAGAGCA 840 

AATCAAATAT ATATTGGCCC AGTACAACAC TACCAAGGAC AAGGCGTTCA CAGATCTGAA 900 

CAGTA TCAAT TCAGTGCTAG GAGGCGGAAT TCTTGACCGA CTGAGACCCA ACATCATCCC 960 

- n TGTTCTTGAT GAGATTAAGT CCATGGCAAC AGCGATCAAG GAGACCAAAG AGGCGTTGGA 1020 

/U GAACATGAAC AGCACCTTGA AGAGCTTGCA CCAACAAAGT ACACAGCTTA GCAGCAGTCT 1080 

GACCAGCGTG AAAACTAGCC TGOGGTCATC TCTCAATGAC CCTCTGTGCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTGCAACA GCATCAGATT GTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCCG TGGATGCAGA ACTTGACAAC GTTAATAACG TTCTTAGGAC 1260 

rjc AGATTTGGAT GGCCTGGTCC AACAGGGCTA TCAATCCCTT AATGATATAC CTGACAGAGT 1320 

/-> ACAACGCCAA ACCACGACTG TCGTAGCAGG TATCAAAAGG GTCTTGAATT CCATTGGTTC 1380 

AG ATATCG AC AATGTAACTC AGCGTCTTCC TATTCAGGAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAGAGTATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGTCAT CTGCTCTCTG CTGACCCTCA TCGTGATTTT 1560 

cn TTACTACCTG GGCTTACTGT GTGGCGTGTG CGGCTATGAC AGGCATGCCA CCCCGACCAC 1620 

OU CCG AGGCT QT GTCTCCAACA GCGGAGGCGT CTTCCTCATG GTTGQAGTTO GATTAAGTTT 1680 

CCTCTTTTGC TGGATATTGA TGATCATTGT GGTTCTTACC TT' l ' G lXr m ' G GTGCAAATGT 1740 

GGAAAAACTG ATCTGTGAAC CTTACACGAG CAAGGAATTA TTCOGGGTTT TGGATACACC 1800 

CTACTTACTA A ATGAA GACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1860 

AATGAAGCTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACTTACGG 1920 
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CACTCTTCAC CTGCAGAACA GCTTCAATAT CAGTOAACAT CTCAACATTA ATGAGCATAC 1980 

TGGAAGCATA AGCAGTGAAT TGGAAAGTCT GAAGGTAAAT CTTAATATCT TTCTGTTGGG 2040 

TGCAGCAGGA AGAAAAAACC TTCAGGATTT TGCTGCTTGT GGAATAGACA GAATGAATTA 2100 

TGACAGCTAC TTGGCTCAGA CTGGTAAATC CCCCGCAGGA GTGAATCTTT TATCATTTGC 2160 

ATATGATCTA GAAGCAAAAG CAAACAGTTT GCCCCCAGGA AATTTGAGGA ACTCCCTGAA 2220 

AAGAGATGCA CAAACTATTA AAACAATTCA CCAGCAACGA GTCCTTCCTA TAGAACAATC 2280 

ACTGAGCACT CTATACCAAA GCGTCAAGAT ACTTCAACGC ACAGGGAATG GATTGTTGGA 2340 

GAGAGTAACT AGGATTCTAG CTTCTCTGGA TTTTGCTCAG AACTTCATCA CAAACAATAC 2400 

TTCCTCTGTT ATTATTGAGG AAACTAAGAA GTATGGGAGA ACAATAATAG GATATTTTGA 2460 

ACATTATCTG CAGTGGATCG AGTTCTCTAT CAGTGAGAAA GTGGCATCGT GCAAACCTGT 2520 

GGCCACCGCT CTAGATACTG CTGTTGATGT CTTTCTGTGT AGCTACATTA TCGACCCCTT 2S80 

GAATTTGTTT TGGTTTGGCA TAGGAAAAGC TACTGTATTT TTACTTCCGG CTCTAATTTT 2640 

TGCGGTAAAA CTGGCTAAGT ACTATCGTCG AATGGATTCG GAGGACGTGT ACGATGATGT 2700 

TGAAACTATA CCCATGAAAA ATATGGAAAA TGGTAATAAT GGTTATCATA AAGATCATGT 2760 

ATATGGTATT CACAATCCTG TTATGACAAG CCCATCACAA CATTGATAGC TGATGTTGAA 2820 

ACTGCTTGAG CATCAGGATA CTCAAAGTGG AAAGGATCAC AGATTTTTGG TAGTTTCTGG 2880 

GTCTACAAGG ACTTTCCAAA TCCAGGAGCA ACGCCAGTGG CAAOGTAGTG ACTCAGGOGG 2940 

GCACCAAGGC AACGGCACCA TTGGTCTCTG GGTAGTGCTT TAAGAATGAA CACAATCACG 3000 

TTATAGTCCA TGGTCCATCA CTATTCAAGG ATGACTCCCT CCCTTCCTGT CTATTTTTGT 3060 

TTTTTACTTT TTTACACTGA GTTTCTATTT AGACACTACA ACATATGGGG TGTTTGTTCC 3120 

CATTGGATGC ATTTCTATCA AAACTCTATC AAATGTGATG GCTAGATTCT AACATATTGC 3180 

CATGTGTGGA GTGTGCTGAA CACACACCAG TTTACAGGAA AGATGCATTT TGTGTACAGT 3240 

AAACGGTGTA TATACCTTTT GTTACCACAG AGTTTTTTAA ACAAATGAGT ATTATAGGAC 3300 

TTTCTTCTAA ATGAGCTAAA TAAGTCACCA TTGACTTCTT GGTGCTGTTG AAAATAATCC 3360 

ATTTTCACTA AAAGTGTGTG AAACCTACAG CATATTCTTC ACGCAGAGAT TTTCATCTAT 3420 

TATACTTTAT CAAAGATTGG CCATGTTCCA CTTGGAAATG GCATGCAAAA GCCATCATAG 3480 

AGAAACCTGC GTAACTCCAT CTGACAAATT CAAAAGAGAG AGAGAGATCT TGAGAGAGAA 3540 

ATGCTGTTCG TTCAAAAGTG GAGTTGTTTT AACAGATGCC AATTAOGGTG TACAGTTTAA 3600 

CAGAGTTTTC TGTTGCATTA GGATAAACAT TAATTGGAGT GCAGCTAACA TGAGTATCAT 3660 

CAGACTAGTA TCAAGTGTTC TAAAATGAAA TATGAGAAGA TCCTGTCACA ATTCTTAGAT 3720 

CTGGTGTCCA GCATGGATGA AACCTTTGAG TTTGGTCCCT AAATTTGCAT GAAAGCACAA 3780 

GGTAAATATT CATTTGCTTC AGGAGTTTCA TGTTGGATCT GTCATTATCA AAAGTGATCA 3840 

GCAATGAAGA ACTGGTOGGA CAAAATTTAA CGTTGATGTA ATGGAATTCC AGATGTAGGC 3900 

ATTCCCCCCA GGTCTTTTCA TGTGCAGATT GCAGTTCTGA TTCATTTGAA TAAAAAGGAA 3960 
CTTGGAAAAA AAAAAAAAAA AAA 

Seq ID NO: 148 Protein sequence 
Protein Accession #: AAH12089.1 

1 11 21 31 41 51 

I I I I ! I 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFLYWQPRD FPEDTLRKFL QKAYESKIDY DKIVYYEAGI ILCCVLGLLF IILMPLVGYF 120 

FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240 

IPVLDEIKSM ATAIKBTKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300 

HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 360 

RVQRQTTTW AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420 

YDSYWWLGGL VICSLLTLIV IFYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540 

SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600 

LGAAGRKNLQ DPAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 660 

LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 720 

NTSSVIIEET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780 

PLNLPWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM EN6NNGYHKD 840 
HVYGIHNPVM TSPSQH 

Seq ID NO: 149 DNA sequence 

Nucleic Acid Accession #t NM_033049.1 

Coding sequence: 28. .1566 

1 11 21 31 41 51 

I I I I I I 

CCAOGOGTCC GAGCAAGAAC AGCTAAAATG AAAGCCATCA TTCATCTTAC TCTTCTTGCT 60 

CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTGT AACAACCACA 120 

GAAACTGCGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CGACTGAAAC TAATTTCCCT 180 

GAAACTGCTA GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC • 240 

CCCATAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCCCCC CATAATTAGT 300 

ACACATAGTT CCTCCACAAT TCCTATACCT ACTGCTGCAG ACAGTGAGTC AACCACAAAT 360 

GTAAATTCAT TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA TGGATTAATC 420 

ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 480 

CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 540 

GGTCCCAGGA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATG TGTTAAGCTG 600 

CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 660 

AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT CAGAAACATT TGACCCAGAA 720 

GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTGAAA TTACTAGCTT GTTTAAAGAT 780 

GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 840 

CCAAGATCTG AAATGCGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG 900 

GCAGAAACCA CAAGTGACAA TGAGAAGACT GTGACTGAGA AAATTAATAA AGCAATTAGA 960 

AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 1020 

AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 1080 

AGGCCTAACC CACAGAGCCC TTTCTGOGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 1140 

AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC 1200 

ACCAGGAAGA TGCTAATGGG. AACTGCCAAA AGTGTGCATT TGGCTACAGT 1260 
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GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC 1320 

ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 1380 

AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 1440 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 1560 

TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 1620 

TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 16 BO 

TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 1740 

AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGOG 1800 

CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 1860 

TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 1980 

GGAGACATAC GCATCAACTG TGGAGGTCOG AGGGGATGAG AAGGGATACC CACCACCTTT 2040 

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 2100 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 2160 

AGACT GGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAG GCCTGOACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 2280 

TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 2460 

GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 2520 

TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCGAGA GGGAGCTTGT GCTCAGGAGT 2580 

CCAGOCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTGGTGACAG AGGCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 2700 

ACAAATGGGG GCAACCACAT CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 2760 

ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 2820 

TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 2880 
AAAAAAA 

Seq ID NO: 150 Protein sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

I I I I I i 

MKAIIHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSFCLCL. EGYYYNSSTC KKGKVPPGKI SVTVSETFDP EEKHSMAYQD 240 

LHSEITSLPK DVFGTSVYGQ TVILTVSTSL SPRSBMRADD KFVNVTIVTI LAETTSDNEK 300 

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQBDAN GNCQKCAFGY SGLDCKDKFQ 420 

LILTIVGTIA GIVILSMIIA LXVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 
GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 

Seq ID NO: 151 DMA sequence 

Nucleic Acid Accession S: Eos sequence 

Coding sequence: 87.. 914 

1 11 21 31 41 51 

I I I I I I 

AGTACGCGGG GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC 60 

AGCTGCTCCA ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG 120 

CTGGTAGCTC GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG 180 

AGTTCTACAA CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT 240 

TGTTTCGTCA GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG 300 

TTGTAGTGGG AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC 360 

TTGATGAACT TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA 420 

GGTATCTACC AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC 480 

TGTCTGCGGT TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA 540 

TGACCCTGGG AGTTCCTTGC ACTGCACTGC TCATOGCAGA GCTAAAGAGG TGTGACAACA 600 

TGCGTGTGTT TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT 660 

GGATCAGTGA CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT 720 

GCATGTGGCA CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT 780 

TTGATGCTGC CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG 900 

TCAAGATCAC GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT 960 

GGGCTTCCTT TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAGGGATT 1080 

TAAACTTTGT CATATGGTAC AAATATTCCC TGCCCCOCTG CAGTTTCCCA TTTGTCTTTC 1140 

A GTATGTTAA TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT 1260 

AATGTGTCTG TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA 1380 

GATGCAGTGA CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA 1440 

TCCAGCCCTG TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT 1500 

TCCCAGGCGC CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTOA GAGACAAGAC 1560 

CCTCTGACTC TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA 1620 

AAACCAAACA GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC 1680 

ATTTCTCAAC TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG 1740 

GAGAGATGGA GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA 1800 

CCTACACTCC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCCTCTTCC OCCTCCCACC OCAGCTCAGC AGCCTCAAAT 1920 

CTACAQAGAA GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG 1980 

ACCTCTGTAA GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT 2040 
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TGGAGCAAAC ATGAAGTTTT TGGAAACGTT TTCTCATTTQ AAGCCTCCAG TATGCTGTAC 2100 

TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA 2160 

TCATGTTCTT CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC 2220 

TCACCTTTCT CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG 2280 

CAGCAGCGCA AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT 2340 

GAATCTTCTC TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG 2400 

GGAATTATCA ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC 2460 

CATTGTTGGG AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC 2520 

CTACATATGT AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA 2S80 

GTTAGAATTT GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA 2640 

AAAATGAATG ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA 2700 

GAAOGCTGTT TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT 2760 

TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA 2820 

AATGCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA 2880 

TATGTTGGTT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG 2940 

GTCTTGCTCT ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT 3000 

GTCTCCCAGG CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG 3060 

CGTGCCTCCA CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120 

GCCCAGGCTG GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT 3180 

GCTGGGAATA GGCATGAGCC GTCAOGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG 3240 

AATTAAATAT TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA 3300 

GAACTGCTGT TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT 3360 

CATTCTTATT TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA 3420 

TAAATGCTCC AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT TTTTTTTTTT 3480 

TTTTTTTTTT TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA 3540 

TGAGCTGAGA AAGAAATGTG GAATGGAGTA TATTTGAGGA GQACAAAACA TAACTTCACT 3600 

TTTGAACAGA AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT 3660 

ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC 3720 

TCTTTAAAGA AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780 

TCTATTGCCC AGGTATAATC CTTGTTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC 3840 

AATATAGTTG TAAATAAATG ATGTCTTTCA GAGTTGACAT TTATGCTGTA GCTTGAATGG 3900 

CATGTAAATG CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA 3960 

GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA 4020 

ATCAAGCCTA ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG 4080 

CCTCTGCACT TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT 4140 

TTTGATATGT TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200 
GAATCTCCCT CC 

Seq ID NO: 152 Protein sequence 
Protein Accession §: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

KGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLPRQYATCP 60 

NSGIYLIWTL LVWGIGSVY FHATLSFLGQ MLDBLAVLWV LMCALAMWFP RRYLPKIFPN 120 

DRGRFKVWS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVPKLGLF 180 

SGLWWTLALF CWISDRAFCB LLSSFNFPYL HCMWHIL.ICL AAYLGCVCFA YFDAASEIPE 240 
QGPVTKFWPN EKWAFIGVPY VSLLCANKKS SVKIT 

Seq ID NO: 153 DNA sequence 

Nucleic Acid Accession #i NM_001432.1 

Coding sequence: 167.. 676 

1 11 21 31 41 51 

I I I I I I 

TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA AOGTGGGGTC CCTTCTAGGC 60 

TGACAGCCGC TCTCCAGCCA CTGCCGOGAG CCCGTCTGCT CCXXJCCCTGC CCGTGCACTC 120 

TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA TCGCCGATGA CCGCGGGGAG 180 

GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC TGGGTTTCCA 240 

TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTCCATCA TGTATCCCAG GAGAGTCCAG 300 

TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC AAGTGTCAAT 360 

AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT GGACAGTGCA TCTATCTGGT 420 

GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT ACTGGTGTCC GATGTGAACA 480 

CTTCTTTTTA ACOGTCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA CCGTGATTCT 540 

TATTATTTTG TTTCTTATCA CAGTGGTCGG TTCCACATAT TATTTCTGCA GATGGTACAG 600 

AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG GGGATCCAGA 660 

GTTGCCGCAA GTCTGAATGG CGCCATCAAA CTTATGGGCA GGGATAACAG TGTGCCTGGT 720 

TAATATTAAT ATTCCATTTT ATTAATAATA TTTATGTTGG GTCAAGTGTT AGGTCAATAA 780 

CACTGTATTT TAATGTACTT GAAAAATGTT TTTATTTTTG TTTTATTTTT GACAGACTAT 840 

TTGCTAATGT ATAATGTGCA GAAAATATTT AATATCAAAA GAAAATTGAT ATTTTTATAC 900 

AAGTAATTTC CTGAGCTAAA TGCTTCATTG AAAGCTTCAA AGTTTATATG CCTGGTGCAC 960 

AGTGCTTAGA AGTAAGCAAT TCCCAGGTCA TAGCTCAAGA ATTGTTAGCA AATGACAGAT 1020 

TTCTGTAAGC CTATATATAT AGTCAAATCG ATTTAGTAAG TATGTTTTTT ATGTTCCTCA 1080 

AATCAGTGAT AATTGGTTTG ACTGTACCAT GGTTTGATAT GTAGTTGGCA CCATGGTATC 1140 

ATATATTAAA ACAATAATGC AATTAGAATT TGGGAGAAGC AAATATAGGT CCTGTGTTAA 1200 

ACACTACACA TTTGAAACAA GCTAACOCTG GGGAGTCTAT GGTCTCTTCA CTCAGGTCTC 1260 

AGCTATAATT CTGTTATATG AGGGGCAGTG GACAGTTCCC TATGCCAACT CACGACTCCT 1320 

ACAGGTACTA GTCACTCATC TACCAGATTC TGCCTATGTA AAATGAATTG AAA AACAAT T 1380 

TTCTGTAATC TTTTATTTAA GTAGTGGGCA TTTCATAGCT TCACAATGTT CCTTTTTTGT 1440 

ATATTACAAC ATTTATGTGA GGTAATTATT GCTCAACAGA CAATTAGAAA AAAGTCCACA 1500 

CTTGAAGCCT AAATTTGTGC TTTTTAAGAA TATTTTTAGA CTATTTCTTT TTATAGGGGC 1560 

TTTGCTGAAT TCTAACATTA AATCACAGCC CAAAATTTGA TGGACTAATT ATTATTTTAA 1620 

AATATATGAA GACAATAATT CTACATGTTG TCTTAAGATG GAAATACAGT TATTTCATCT 1680 

TTTATTCAAG GAAGTTTTAA CTTTAATACA GCTCAGTAAA TGGCTTCTTC TAGAATGTAA 1740 
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AGTTATGTAT 
TGGTGTATTT 
CACACCTTCC 
TTTTTGGAAG 
AGGAGTATTT 
GGAGGCTCCT 
TCAGGGAGGA 
ATAATAAGCC 
CATGCATTAT 
TTACTATTAG 
TATATATATT 
GTCATTTGGT 
AATTAGATCT 
AGAACACATT 
GATTCAACCA 
ATTATAACAA 
CAGGTTGGAG 
GATCCTCCTG 
CTAATTTTTG 
CTCCTAACCT 
CCACCGCACG 
AGGAGGATGT 

CAAGGTACAA 
TGATCTTGGC 
ATTTCATTTA 
GCATGTCAAT 
AAGCTAATCC 
CACATTACTT 
GAGTAAACAT 
TTAGTTCACA 
AAGGGGCTAT 
AATTTATGGT 
ATATTAAGTT 
TAACCTTAAA 
TGAATATTTT 
TGGTATGACT 
ACCACCAGGT 
GTTAAATTCA 
CCAATATCCA 
TGACCGTTAA 
TATTTCTGGT 
TCAAATTTGA 
TTGCACTTTT 
AACATAAAGT 
TTACTGAATT 
GATATTGAAG 
GGGAATTGAA 
CACTGAC 



TTAAAGTTGT 
TTCCAAATGA 
TGTGGAGGCT 
AGACAAAGAT 
TTGGGCTATT 
TCATCGAATG 
TCTGTTCTTC 
ATGGCTGACC 
ATACCCTGGT 
GAAAATAAAC 
CATATACAAA 
GCTCTGCTAA 
AAACAGTTAT 
TTAAATACTC 
ACCAAGGACG 
CTATTTACTT 
TGCAGTGGCA 
CCTCAGCCTC 
TATTTTTAGT 
TGAGATCCAC 
TAGCATTTAC 
GAATAGGTTA 
ATTTTAGTAT 
CTGTATTTAT 
TGTTTCAAAA 
CAAAAGATTT 
TCATAAAAAC 
CTAAAATAGT 
TTTATGGGTG 
AAAAACTGAT 
CTTAGTCTAA 
GCATATTCAA 
CTTCAACCAA 
CTATAACTAG 
ATTGATATTA 
TTGCTCTAAT 
ACGTCTTAAG 
ATCAAATCAA 
GTAATGCAGT 
TTCTGTAGAC 
AAAATTCTTG 
GTATATTTTA 
TGATGTTAAA 
TAAGTTTGAA 
TGTATTGCAT 
GTATCAATTT 
GAGAAAATAG 
AAAAATTGAA 



ATCTTGACAC 
AAAATCTCAA 
GAGATGAAAA 
TTCTTCTGCA 
GCATAAGGAG 
CTAAACCTTT 
CTCTACGTTT 
TCTGGAGCAC 
GCAATCACAC 
AGACAAAAAC 
CATGTATGTA 
CCATTTATAT 
TTCTGTTTCC 
TACTTACAGT 
GAAAATGCTT 
TTTTTTTTTT 
CGATCTCGGC 
CTGAGCAGCT 
AGAGGCGGGG 
CCTCCACAGC 
ATTAGGTATT 
TATGCAAGCA 
GTGCAGGGGG 
TAAGGCTTAC 
AAATCTATTG 
ATTGTAAGCC 
AAGTCATTTT 
TAGATCTTTG 
AAAATAAGAC 
TTGCTCTCAG 
CTCCCCCATT 
TGTATTGAGA 
GGCCACATAA 
TACTCAAGGT 
TATTAAACAT 
CTCTCTGCCG 
TCAGATTTTT 
CTTTTATGCA 
TTTTAAAAAC 
TTTTGAAAAA 
TTCATATGGG 
ATGTTTTTAA 
TGTAATATAA 
GAGCCATTTT 
GCAATTTAAA 
GTTTGTGTTC 
ATAATTTACA 
ATAAATAAAA 



AGGAAATGGG 
TTGAAAGCTT 
CTAGGGCTCA 
CTCTGAGCCC 
CCACTGCTGC 
GAGTAGAGTC 
ATCCTGGCAT 
CAGGTGCCAG 
GACTGTCATC 
AAGTAAATAT 
TACATGACCT 
AAAACTTAAA 
TATTTAATAT 
TGGCCCTCTG 
AAAAAATAAT 
CTTTTTGAGA 
TCACTGCAAC 
GGGACTACAG 
TTTCACCATG 
CTCCCAAACT 
ACAAGTAATG 
CTATGCCCTT 
GCGATCTGGG 
TAGATGTGAG 
ACTTTTCAAT 
TCTCAATCTT 
TGTATTTTTC 
TAAATGCATA 
AAAAATAATA 
CTGATGTGTC 
TTACAGATTT 
ACCAAAGCAA 
TAACCCAGTT 
TTAACCTTAA 
ACATAATACA 
AAAGTCAAAG 
ATTTATGAGT 
GCAAATATAT 
CTGTATCTGA 
AAAGTTTTTA 
AGAAGGGGGA 
AAAGAGTAAT 
TGTATTTTCT 
GGTAAACGGT 
GTAACTTATT 
AATATCAGCT 
AG AT ATT ATT 
ATGCATTGAA 



AAAAAACTTA 
TTAAAATGTA 
TTTTCCTGAC 
ATAGGTCTCA 
CAOCACTTTT 
TCCCTGGATC 
GTGCTAGGGT 
GACTTGTCTC 
TAAAGTCCTG 
ATATGGTCCT 
TAATGGATCA 
AAGAAGAGAA 
AGCTGAAGTC 
TGGTTAGTTC 
ACAACAACAA 
TGGAGTCTOG 
CTCACCTCCC 
GCGCATGCCA 
TTGGCCAGGA 
GCTGGGATTA 
TAAAGATGAT 
TTATATAAGT 
AATCAGTCCC 
GAGAGTCTGA 
AAATCAGCTG 
GGTTTTTCAG 
ATCTTTAAGA 
TTAAATAATA 
GTTTTAGTGA 
CTGTACACAG 
CTCACTATAT 
CCACAAATGC 
AACTTACTCT 
AATTAAGATT 
ATGTAACTCC 
TGATGGGAGA 
CTTTGAGACT 
GATTCTAGTG 
CCCACTTTGT 
ATTTGATGCC 
GTAATGACTT 
TTCATTTAAA 
TTTTATTTTG 
TTTTATTAAA 
TGACTATGAA 
TTGATAATTG 
AATTTTTATT 
CATCTTGCAT 



AAAATTAATA 
GAAACTTAAA 
ATTTGTTTAT 
GAGAGTTAAT 
GGATTTTATG 
ACATACCAGG 
AAACGAAGGC 
CATGTGTATC 
GCCCTGGCCC 
ATACATATTG 
TAGAATTGCA 
AAGAAAAATC 
AAAATATGTA 
CACATCTGTG 
CAAAAAATAC 
CTCTGTTGCC 
GGGTTCAAGA 
CCATGCCCAG 
TGGTCTCAAT 
CAGGCGTGAG 
TTAAGTATAC 
GACTTGAACA 
CTGTGGATAC 
ATATTTTCAG 
CAATCCATTT 
TTGATCTTAA 
ATGCTTAAAA 
AAGTATGACC 
GGATGGTGCT 
TGGGAAGATT 
ATATTTCTAG 
ATAAATGCAT 
TTAACCAGGA 
TCCTTAACCT 
ACTGTTCTCC 
ATTGGTATAC 
AAATTCAATC 
TCTGACTTTT 
AATTTTTGCT 
CAATATATTC 
GTACAAACAG 
TATCTGTTAT 
CACTCTGTAA 
GATGCTATGG 
TATTATCGGA 
TGTACCTTAA 
TATTTTTCTT 
TCAAAATCTT 



Seq id NO: 154 Protein sequence 
Protein Accession #: NP_001423.1 

1 11 21 31 41 51 

I I I I I I 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSDNCTA LVQTEDNPRV 
AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN YCRCBVGYTG VRCEHFFLTV HQPLSKEYVA 
LTVILIILPL ITWGSTYYF CRWYRNRKSK EPKKBYERVT SGDPELPQV 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 

Coding sequence : 85 . .2466 



CGACTCCTTA 
GTCCCTCCCC 
ACCCACACGG 
CAGGAGCTGT 
GAGGACGGCC 
GTCCGCCAGA 
ACCGACTCCG 
GCGGCCGCCG 
GGGCTGTACA 
GAGGCGCAGG 
ACGTCCAGGC 
GAGAACGGCG 
AAGTGGCAGG 
AAGGAGCGGG 
CGGGAACTCT 
TTCGTGGACG 
CCCATGAGAC 
TGCCGGGTCT 
TGCGATGAGT 



11 
I 

GAGCATGGCA 
TCAGCGCCGA 
TGGACTCGCT 
TCCACGTGGA 
ATACCCTCTT 
GCCTCGTGCT 
GCTGCTGCCT 
AGACTGACAG 
AGGTCAATGA 
TGGTCAGGGT 
CGGCGCTGGA 
TGGTCCAGAT 
ACCTGGAGGT 
GCTTCTGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 
GOGCCTGCCA 
GCGACATGGC 



21 
I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 



31 

I 

GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTOCGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 
GCCCCCTCCC 
ATTTACCAOG 
GACGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCCGACAA 
ACCCGCCCCT 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 



60 
120 



51 

I 

GGTTTTTGCT 
CGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
CGACTACCOG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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CCCAGCCAGG ACGAGTGGTA CTGCCCTGAG TGCOGGAATG ATGCCAGCGA GGTGGTACTG 1200 

GCGGGAGAGC GGCTGAOAGA GAGCAAGAAG AAGGOGAAGA TGGCCTCGGC CACATCGTCC 1260 

TCACAGGGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

GTCCCGTCCA ACCACTACGG ACCCATCCOG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT OGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AGCAAGGACG GAGOGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCOGGCAA CAAGAGGACC 1560 

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740 

AACOQCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC COGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GAOGATGATG AGCCTGGCCC TTGGACGAAG I860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920 

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 2100 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGAGGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

OGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATOGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880 

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940 

TGGCCOGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGAGGGACGC CAGCACAOGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGAOGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACOGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA AOGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

Seq 3D NO: 156 Protein sequence 
Protein Accession #: NP_037414.2 

1 11 21 31 41 51 

I I I I I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELPEVEP GLQRLFYRGK QMKDGHTLFD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTNMGA WPEAQWHVT RKAPSRDEPC SSTSRPALEE 180 

DVIYHVKYDD YPENGWQMN SRDVRARART IIKWQDLEVG QWMLNYNPD NPKERGPWYD 240 

AEISRKRETR TARELYANW LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300 

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 360 

PECRNDASBV VLAGERLRES KKKAKMASAT SSSQRDWGKO MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMW RPRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNPFTYTG 480 

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCPAPIMDQE GAEAKDWRSG KPVRWRNVK 540 

GGKNSKYAPA EGNRYDGIYK WKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 600 

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKHXRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVPSCP ACRYDLGRSY AMOVNQPLQT 

Seq ID NO: 157 DNA sequence 
Nucleic Acid Accession #: NM_000756.1 
Coding sequence: 186.. 776 

1 11 21 

I I I 

AGAAACTCAG AGACCAAGTC CATTGAGAGA 
GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC 
CTAACATGCG GCTGGCGCTG CTTGTGTCCG 
GCCCGCCATG CAGGGCGCTC CTGAGCCGCG 
AGCACCCTCA GCCCTTGGAT TTCTTCCAGC 
CGCAGGCTCG GCCGGTCCTG CTCOGCATGG 
TCAACAAGAG COCGGCCGCT CCCCTTTCGC 
GCAGCCGCCC TTCGCCGGAA CAGGCGACCG 
TGCTGCTGCC TCGGCGCTCG CTCGACAGCC 
ATGCCCTCGG CGGCCACCAG GAGGCACCGG 
TCTCCCTGGA TCTCACCTTC CACCTCCTCC 
AGTTAGCACA GCAAGCTCAC AGCAACAGGA 



31 41 51 

I 1 i 

CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60 

CAACCTCCAG AGAAAGCCCC CGGAGACGTC 120 

ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 180 

CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 240 

GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 300 

CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 360 

GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 420 

CCGCCTCCTC GCTCCTOGCC GGAGGCAGCG 480 

CCAACTTTTT C C GCGT GT TO CTGCAGCAGC 540 

CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600 

AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 720 

AACTCATGGA GATTATTGGG AAATAAAACG 780 
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PCT/US02/36810 



10 



20 



35 



60 



GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA OGTTTAGTTT COGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAG AGAATG T AGACCTGTGA GAAAAOGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 

Seq ID NO: 158 Protein sequence 
Protein Accession 8: NP 000747.1 



ic ] " 21 31 41 51 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 60 

ARPVLLRMGB EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN PFRVLLQQLL 120 

LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 
AQQAHSNRKL MEIIGK 



Seq ID NO: 159 DMA sequence 

Nucleic Acid Accession #: NM_00l2O0.1 

Coding sequences 32S..1514 



25 1 U 21 31 41 51 

I I I I | | 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGOGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 

n TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

JU GAG AAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300 

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 

TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCOCTC TGAGGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 

Ap. TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

W TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA AGAAGGTGTC TCGAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

HO ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGOCA GGGAGAATGC CCTTTTCCTC 1320 

CA TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

JV ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC OGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 

Seq ID NO: 160 Protein sequence 
3D Protein Accession ih NP_00119l.l 



1 11 21 31 41 51 

I I I I I I 

MVAGTRCLLA LLLPCVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEPHLRLLSM 60 
FGLKQR PTPS RDAWPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SPHHEESLEE 120 
LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVPREQMQDA LGNNSSFHHR INIYEIIKPA 180 
TANSKPPVTR LLDT 



Seq ID NO: 161 DNA sequence 
Oj Nucleic Acid Accession ft: NMJ)01999.2 
Coding sequence: 1..8736 

1 11 21 31 41 51 

7H 1 > I I i I 

/V ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

fj TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

opj GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

5U TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

OGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 
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AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 
TGCAGCATCA TTCCTGGQAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 
TICTUTUTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 
AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 
CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 
GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 
GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 
AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 
AATGGGGTTC TTTGTAAAAA CGGTCGATCC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 
TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 
ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 
GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 
AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 
ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 
GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 
TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 
CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGOG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 
GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 
AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 
TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 
AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 
GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 
ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 
ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 
AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 
AATGCTCAGT GTGTAAATAC CCOGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 
GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 
CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4S00 
GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 
TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 
TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 
AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 
AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 
CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCOG GAGGTGAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 
TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 
GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 
TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAAOC ATGCCCAACT 5340 
CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 
CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 
GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 
AATGACCTGC TG1TGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 
CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 
TTCAAACTTT CACCCAATGG GGCCTGTGTA GATOGCAATG AATGTTTAGA AATTCCTAAC 5700 
GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 
GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 
CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 
GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 



1016 



WO 03/042661 PCT/DS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTC AA GTGTCTATGT 6000 
AAOGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 
GCCCTTCCC6 GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 
ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 
GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 
CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACG6A GATGCTTTGA TACTOGCCAG 6300 
AGCTTCTGCT TCACAAATTT TQAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 
ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 
CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 
GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 
TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 
TACAACCTTG ACTACACTGG AGTAGGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 
CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 
GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 
CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 
GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 
TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 
ATCTGCCCTC CTGGAATGGC COGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 
TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 
AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 
CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 
CGCAATCTOG TCACTAAGTC AGAATGCTGC TGTGATGOTG GGOGAGGCTG GGGCCACCAG 7260 
TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 
GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 
AATGGTCAGT GCATCAATAC CATGGGCTCA TTCOGATGCT TCTGCAAGGT TGGCTACACC 7440 
ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 
TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7S60 
GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 
AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 
TTCACACAGC ATCAGACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 
TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 
GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 
CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 
GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 
AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 
TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 
TCCAAGAACC CCTGCAATTA OGGCTGCTCT AACAOGGAGG GGGGCTACCT CTGTGGCTGC 8160 
CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 
GGGCAGTACC TGTCACTGGA TACAOAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 
TGCTAOGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 
CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTOGA CATGGACAGC 8400 
CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTOGGCTCTA AGGAGCACAT CCTGGAACTA 8460 
AGGCCOGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 
GACAGOGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 
AAGCTCATGC COGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG B640 
GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 
GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 
TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 
AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 
CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 
GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 
TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 
CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 
GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 
AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 
CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTT C T G OC AGTCATCCAG 9300 
CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 
TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 
CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 
CAACCACIGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 
GTACTGTATT TOCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 
TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 
ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 
GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 
TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO: 162 Protein sequence 
Protein Accession #: NPJJ01990.1 

1 11 21 31 41 51 

I I II I I 

MQRRRRLCLQ LYFLWLGCW LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 
PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGKQC ZVPICRNSOG 120 
DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 
ENGOQNGGRC IAQPCACVYG PTGPQCERDY RTGPCFTQVN NQNOQGQLTG IVCTKTLCCA 240 
TTGRAWGHPC EMCPAQPQPC RRGPIPNIRT GAOQDVDECQ AIPGICQGGN CINTVGSPBC 300 
RCPAGHKQSB TTQKCEDIDE CSI1PGICBT GECSNTVGSY PCVCPRGYVT STDGSRCIDQ 360 
RTGMCFSGLV NGRCAQBLPG RKTKMQCCCE PGRCWGIGTI PBACPVRGSB BYRRLCMDGL 420 
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PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 

GPIITGLTIL NQTIDICKHH ANLOaiGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

OTAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSPKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCIHSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYBP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

KDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGFANR GDVLTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCM0IDG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMKG GCDTQCTNSE 1260 

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDI* NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHKCDMH 1380 

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAENINLCB NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCIODDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTBIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGQEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN BLPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSP RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQC1CHM 1920 

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCB LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAONP LLCALRCMNT FGSYECTCPI 2280 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKI CPHGP GYTTDGRDID ECKVMPNLCT 2460 

NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH KCQFLCVNTL GGPTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSFSCEGQR GFSU)ATGI2J CEDVPEODGN HROQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLESVDMDS PVNMKFNLSH LGSKEHILEL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2880 
ELKKLEBSKE DDYLLGELGE ALRMRLQIQL Y 

Seq ID NO: 163 DNA sequence 

Nucleic Acid Accession #; NMJ>13372.1 

Coding sequence: 63.. 617 

1 11 21 31 41 * 51 

I I I I I I 

GCGGCOGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG OCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCOGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACOGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCOGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGOGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGG TGCC T GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTC CTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC 7CTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TQTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 
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TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TQTTTTCTT C AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

5AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAjG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2S80 

1 - CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

1U ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTOGTTAA TGGAGATGAC TTAAGTTGGC 2820- 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

ID TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTAT GGCA AG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT G ATGTACA CT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

Z\) GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA COGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

Z3 GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

^ AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

35 Seq ID NO: 164 Protein sequence 
Protein Accession #: NP_037504.l 

1 11 21 31 41 51 

Ad 1 1 1 I I I 

MSHTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQBA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTI INRPCYG 120 

QCNSPYIPRH IRKBEGSFQS CSFCKPKKPT TMMVTLNCPE LQPPTKKKRV THVKQCRCIS 180 
IDLD 

45 Seq ID NO: 165 DNA sequence 

Nucleic Acid Accession #» CAT cluster 

1 11 21 31 41 51 

ca I I I I I | 

JW GAATTGCATC GGACAGAGCT TCGCCATGGC CGAGTTGCGC GTGGTTGTGG CACTAACACT 60 

GCTACGTTTC CGCCTGAGCG TGGACOGAAC GCGCAAGGTG CGGCGGAAGC CGGAGCTCAT 120 

ACTGOGCACG GAGAACGGGC TCTGGCTCAA G 



55 



Seq ID NO: 166 DNA sequence 

Nucleic Acid Accession #: FGE2JESH predicted 

Coding sequence: 1..1650 



&(\ } 11 21 31 41 51 

ATGCCACCTC TGCCATCCAG AATGTCTCGG GGGCCCTCCT TGGTGACAGG CAGGATGCTG 60 

CCCATCACAG ACOGCCTGCT GCACCTCCTG GGGCTGGAGA AGACGGCGTT CCGCATATAC 120 

GCGGTGTCCA CCCTTCTCCT CTTCCTGCTC TTCTTCCTGT TCCGCCTGCT GCTGCGGTTC 180 

, CTGAGGCTCT GCAGGAGCTT CTACATCACC TGCCGCCGGC TGCGCTGCTT CCCCCAGCCT 240 

Oj CCCCGGCGCA ACTGGCTGCT GGGCCACCTG GGCATGTACC TTCCAAATGA GGCGGGCCTT 300 

CAAGATGAGA AGAAGGTACT GGACAACATG CACCATGTAC TCTTGGTATG GATGGGACCT 360 

GTCCTGCCGC TGTTGGTTCT GGTGCACCCT GATTACATCA AACCCCTTTT GGGAGCCTCA 420 

GCTGCCATCG CCCCCAAGGA TGACCTCTTC TATGGCTTCC TAAAACCTTG GCTAGGGGAT 480 

« n GGGCT GCTGC TCAGCAAAGG TGACAAGTGG AGCCGGCACC GTCGCCTGCT GACACCCGCC 540 

/U TTCCACTTTG ACATCCTGAA GCCTTACATG AAGATCTTCA ACCAGAGCGC TGACATTATG 600 

CATGCTAAAT GGCGGCATCT GGCAGAGGGC TCAGCGGTCT CCCTTGATAT GTTTGAGCAT 660 

ATCAGCCTCA TGACCCTGGA CAGTCTTCAG AAATGTGTCT TCAGCTACAA CAGCAACTGC 720 

CAAGAGAAGA TGAGTGATTA TATCTCCGCT ATCATTGAAC TGAGCGCTCT GTCTGTCCGG 780 

*jc CGCCAGTATC GCTTGCACCA CTACCTCGAC TTCATTTACT ACCGCTOGGC GGATGGGCGG 840 

I J AGGTTCCGGC AGGCCTGTGA CATGGTGCAC CACTTCACCA CTGAAGTCAT CCAGGAACGG 900 

CGGCGGGCAC TGCGTCAGCA GGGGGCCGAG GCCTGGCTTA AGGCCAAGCA GGGGAAGACC 960 

TTGGACTTTA TTGATGTGCT GCTCCTGGCC AGGGATGAAG ATGGAAAGGA ACTGTCAGAC 1020 

GAGGATATCC GAGCCGAAGC AGACACCTTC ATGTTTGAGG GTCACGACAC AACATCCAGT 1080 

oft GGGATCTCTT GGATGCTGTT CAATTTGGCA AAGTATCCGG AATACCAGGA GAAATGCOGA 1140 

OU GAAGAGATTC AGGAAGTCAT GAAAGGCOGG GAGCTGGAGG AGCTGGAGTG GGACGATCTG 1200 

ACTCAGCTGC CCTTTACAAC TATGTGCATT AAGGAGAGCC TGCGCCAGTA CCCACCTGTC 1260 

ACTCTTGTCT CTCGCCAATG CACGGAGGAC ATCAAGCTCC CAGATGGGCG CATCATCCCC 1320 

AAAGGAATCA TCTGCTTGGT CAGCATCTAT GGAACCCACC ACAACCCCAC AGTGTGGCCT 1380 

GACTCCAAGG TGTACAACCC CTACCGCTTT GACCCGGACA ACCCACAGCA GCGCTCTCCA 1440 

1019 
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5 



CTGGCCTATG TGCOCTTCTC TGCAGGACCC AGGAATTGCA TCGGACAGAG CTTCGCCATG IS 00 

GCCGAGTTGC GCGTGGTTGT GGCACTAACA. CTGCTACGTT TCOGCCTGAG CGTGGACOGA 1560 

AOGCGCAAGQ TGCGGCGGAA GCCGGAGCTC ATACTGCGCA CGGAGAACGG GCTCTGGCTC 1620 
AAGGTGGAGC CGCTGCCTCC GCGGGCCTGA 

Seq ID NO: 167 Protein sequence 
Protein Accession <h FGENESH predicted 

1A f «■ 21 31 41 51 

MPPLPSRMSR GPSLVTGRML PITDRLLHLL GLEKTAFRIY AVSTLLLFLL FFLFRLLLRF 60 

LRLCRSFYIT CRRLRCFPQP PRRNWLLGHL GMYLPNEAGL QDEKKVLDNM HHVLLVWKGP 120 

VLPLLVLVHP DYIKPLLGAS AAIAPKDDLF YGFLKPWLGD GLLLSKGDKW SRHRRLLTPA 180 

FHFDILKPYM KIFNQSADIM HAKWRHLAEG SAVSLDMFEH ISLMTLDSLQ KCVFSYNSNC 240 

1J QEKMSDYISA IIELSALSVR RQYRLHHYLD FIYYRSADGR RFRQACDMVH HFTTEVIQBR 300 

RRALRQQGAE AWLKAKQGKT LDFIDVIiLUV RDEDGKELSD EDIRABADTF KFEGHDTTSS 360 

GISWMLFNLA KYPBYQEKCR EEIQBVMKGR ELEELEWDDL TQLPFTTMCI KESLRQYPPV 420 

TLVSRQCTED IKLPDGRIIP KOHCLVSIY GTHHNPTVWP DSKVYNPYRF DPDNPQQRSP 480 

LAYVPFSAGP RNCIGQSFAM AKLRWVALT LLRFRLSVDR TRKVRRKPEL ILRTEMGLWL 540 

L\) KVEPLPPRA 



25 



65 



Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession fl: AK0S8088.1 
Coding sequence : 252 . . 1772 



60 
120 
180 



1 11 21 31 41 SI 

I I I I | | 

AGGAAACCAA GGCAAGCTCC CCCTGTCAAA GCACCTTGGC CCATAAGAAG AAAAGGGGGA 
GCCCCAGATG TGATGAGCGC TTCCAGGCTT CAGGCTCAGA AGGCGCCGCC AGCTCTCCTG 
JU TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACG 

CCTGTGGTGG GCAAAAGGGC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GOGTAATTTC 240 

CACTTTCATT CATGATAATG TCGAACACGC ACAAAGCTCG GCTGGAACGC CGGGTCACXG 300 

GCTCAACCAA COGGTGGCGT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 360 

or CCCAGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420 

JO TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 480 

GTGGGGAAGC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CATCAGCCCC 540 

GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAGGGCACCA 600 

CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 660 

A/\ AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

W AAGTCGCCTT CGCCACGGCC CTQAAGAACC TCTACATGAG TGAGGTGGAG ATTAACTTGG 780 

AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840 

GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 900 

CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 960 

ac ACTTGGTTCC TCTAGGGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 1020 

ACAAAGTGCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080 

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATGAAA 1140 

CCGTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 1200 

GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 1260 

C(\ ATCTGGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320 

TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CCACCTGAAA GATCTTAACA 1380 

CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TACAACTTAT TCGAAAACGA 1440 

TTGCTCTATA TGGATTCTTC TTTAAGATAA AGGGACTCAA ACATGATACT ACCTCTTATA 1500 

GTTTTTACAT GCAGAGAATA AAGCACACAG ACCTGGAATC TCCCTCTGCG GTCTACGAGC 1560 

pr ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TGAGATCAGA GCAGAGGCCC 1620 

DO TGGTTGACGG CAAGTGGCAG GAGTTCAGGA CAAACCAGAT CAAGCAGAAG TTTGGGTTGA 1680 

CCACGTCATC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740 

TAAGTTTTGC ATTCATCTTC CCAGCATCTT GACAGTTTCC AGAAGAATCT ATGGGATTTT 1800 
CCCCCCACTG GTCTGCATAA AAGAAAATAA AATGACATAA AAGGGAGC 

60 Seq ID NO: 169 Protein sequence 
Protein Accession #: BAB71658.1 



1 H 21 31 41 51 

I I I I I I 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60 

QIQKFFFENF KNKDIQSGEA DVILECLGFK WBLHQPRLFQ SBTLAKLYLK ALAQGTTHPL 120 

RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180 

GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 240 

_~ U3GTQIHXHK IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKEQ AIPTYETVMT 300 

l\) PPKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360 

HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GPFFKIKGLK HDTTSYSFYM 420 

QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 480 
CKSHTLKIQT VQIPIYVSFA FIFPAS 

75 Seq ID NOV 170 DNA sequence 

Nucleic Acid Accession #: NM__007000 .1 
Coding sequence: 1...777 

oft } 11 21 31 41 51 

ATGGCGTCTG OGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 60 

GTTGTGGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA GACCATATGG 120 

GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 180 

TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 240 

1020 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



GGTGTGGGTG 
CTCATCGTCT 
ATCGTGTCCA 
GACCAGGGCC 
GGCACATCTG 
GAGGTGGTGT 
AACGAGGAGG 
CACATCGGCC 
CTGATGTGGA 



CCGCACTCTG 
ACATCTTCGA 
ACCCATCCCT 
AGGAGCTGAC 
GTCCCATGGA 
TCCCCTGGCC 
GCTGCCGCCT 
ACGCCATCGA 
CGCTCCOGGT 



CCGCCGCCGG 
GTGCGCCTCC 
GATCACCAAG 
CGGCCTCTGG 
CTGGGTGAAC 
CCCACTGTGC 
GGGGCACATG 
CAGCTACACG 
CATGCTGATA 



TCCATGGTCC 
TGCATCAOGT 
CAGATGCTGA 
GACCGCGTCA 
TTCACGTCAG 
TGTCGCOGGA 
GACTACCTGT 
TGGGGTATCT 
GCCATGTATT 



TCACGTACCT 
CCTAC ACCCA 
CCTTCTACAG 
TQATTGAGCA 
CCTTCCGGGC 
CGGGAAACTT 
TCACCAAGGG 
CGTGGTTTGG 
TCTACACCAT 



GGTGCTCATG 
CCGTGACTAC 
CGCGGACACC 
AGAATGCTGT 
GGCCACTCCG 
CATCCCCCTC 
CTGCTTCGAA 
GTTTGCCATC 
GCTCTGA 



Seq ID NO: 171 Protein sequence 
Protein Accession ft: NP 008931.1 



i " r r r t 

MASAAAAEAE KGSPVWGLL WGNIIIU>S GLSLFAETIW VTADQYRVYP LMGVSGKDDV 
PAGAWIAIFC GFSPFMVASF GVGAALCRRR SMVLTYLVLM LIVYIPECAS CITSYTHRDY 
MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAPRAATP 
EWPPWPPLC CRRTGNPIPL NEBGCRLGHM DYLPTKGCFE HIGKAIDSYT WGISWFGPAI 
IMWTLPVMLI AMYFYTML 

Seq ID NO: 172 DNA sequence 

Nucleic Acid Accession #: NM_006760.1 

Coding sequence : 3 9 . . 5 93 



I 

GAAAGCCTGC 
GGACCTTGCC 
ACATCTCAAG 
TGCCCCCCTG 
GCAAAGTGGT 
GTGTGGTGGA 
ACCTCGTGCC 
AGTCCAGCAG 
TGGGTATGGC 
TCCTGCTGGT 
TGCCCGGAGC 
GCTCCCAGGC 
CTCCCCTAGA 
ACACTCCATT 
CTTTAAACAC 
GCTGAGGCTG 



11 
I 

CAGCACCTAT 
CTTGATCCTG 
CCTCTCTGGT 
TCACCTCACA 
GAOGTCCAGC 
CAGTGGTGCT 
AGGAACCAAA 
AGAGATCCCA 
CCGCACAGGG 
GCTGGGCTTC 
AGCAGCTTCT 
CCCAGGCCTG 
GCCCTCTCCT 
ATTCCTCTCA 
CCCCATAACA 
CATCTGCCAA 



21 
I 

TCCACCTOCC 
ATTCTGCTGG 
CTGCTGTCCC 
GGAGGCAATG 
TTTGTGGTGC 
GGCTTCACAG 
TTCTACATTT 
ATGTCCACAC 
GGCATGGTGG 
ATCATTGCCC 
CCAGGAAGCC 
TCGCTCCCTT 
CCCTCTGTCC 
CCCCACTCCT 
ATTCCCCCAT 
AAAAAAAAAA 



31 
I 

AGCCCAGCAT 
CTCTGCTGTC 
CGGCGCTAAC 
CCACACTGAT 
CTCCGTGCCG 
TCACTCGGCT 
CCTACCTAGT 
TCCCTCGAAG 
TCATCACGGT 
TGGCACTGGG 
CAGGGCACCA 
GGTGCCCTCG 
CTCTCCTTGC 
GTCAGAGTTG 
CCTTCAGTGA 
AA 



41 
I 

GGCACCCCTG 
CCCAGGGGCT 
GGAGAGCCTG 
GGTCCGGAGA 
TGGGCGCAGG 
CAGTGCATAC 
GAAGAAGGGG 
GAACATGGAA 
GCTGCTCTCT 
CTCCCGCAAG 
TCCAGCTCCC 
CCTCCTCCTC 
CCCCAGTGCC 
ACTTTCCTCC 
ACTAAGTCCC 



51 
I 

CTGCCCATCC 
GCAGACTTCA 
CTGGTTGCCT 
GCCAATGACA 
GAACTGGTGA 
CAGGTGACAA 
ACAGCCACTG 
TCCATTGGGC 
GTCGCCATGT 
TAAGGAGGTC 
CAGCCCACCT 
CTGCCCTCCT 
T CACCT TCCA 
CATTTTACCA 
TATAATAAAG 



51 



Seq ID NO: 173 Protein sequence 
Protein Accession ft: NP_00675l.l 

* 11 21 31 41 

I I I I I I 

MAPLLPIRTL PLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 
MVRRANDSKV VTSSPWPPC RGRRELVSW DSGAGPTVTR LSAYQVTNLV PGTKFYISYL 
VKKGTATESS REIPMSTLPR RNMESIGLGM ARTGGMWIT VLLSVAMFLL VLGFIIALAL 
GSRK 

Seq ID NOi 174 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..2733 



1 
I 

ATGAAAGTTG 
TTCCTGGGGA 
CATCTAGGCC 
GAGAAAAGAG 
GGGCTAATTA 
CTGCAGTGTA 
AACTGCTACC 
AGCCAGAGTG 
AGGTTTACAA 
ATTGAAATTC 
ACCCAATTTC 
TCTGAACTGC 
CTGTTTCCAT 
GTCTTTGGAT 
GGAAACATCA 
CTCTCTCTGC 
GCAGCTGTGT 
ACAGTGGGGA 
GCCAGCCATT 
ATCCTTAATT 
AGCTCACGGT 
CCTCTGAATT 
CTCAAAAGGG 
AGAGGCCGTG 



11 

I 

GAGTGCTGTG 
AAAATGATGG 
CAGTCGAAGA 
ATTTGAGAAA 
GAATTATCAG 
CCTGTGAAGA 
TTCACACGGC 
TCAATTTCTG 
ATGACCTTTT 
AACTTAAAAA 
GAAATGGAAG 
TGTCAGCCAT 
TAGAAGACGG 
TTGGGTCCAA 
CAGCCAAGTG 
TTGAAGAACT 
CATCCTTCGT 
ATCTGGCTTC 
TCAGGGTGTC 
CAGCCTCAGT 
TACTAGAGAC 
TTTCTOGGAA 
GTTACAGCTA 
TGTTAATTGG 



21 
I 

GCTCATTTCT 
CATCAAAACA 
ATATCAGCTG 
TTTTCTGAAG 
AGCAAAGGCT 
CAGCTACACC 
TGGAGCACTC 
TGAGAGAACA 
GAATTCATCT 
AGCATATGAA 
CATCGTTGCT 
TGAACATGTT 
CTCTTTCAGA 
GGATGATGAA 
TGAGTCCTCT 
GAACAAGAAT 
GCAAAATCTT 
GGTGGTGTCG 
CAATTCAACA 
AACCAACTGG 
ATTAGAAAAC 
ATTCATTGAC 
TCAGATTAAA 
GTCAGACCAA 



31 
I 

TTCTTCACCT 
AAAAAAGAAC 
CTGCTTCAGG 
CTCTTGAAGC 
ACCACAGACT 
TGGTTTCCTC 
CCAAGCTGTG 
AAGATTTGGG 
TCTGCTATAT 
AGAATTCAAG 
GGGTATGAAG 
GCCGAGAAGG 
GTGTTCGGAA 
TATACCCTGC 
GGGTGGCAGG 
TTCAGTATGA 
TCTGTCATCA 
ATTCTGAGCA 
ATGGAGGATG 
ACAGTCTTAC 
ATCAGCACTC 
TGGAAAGGGA 
ATGTGTCCCC 
TTCCAGAGAT 



41 
I 

TCACTGACGG 
TCATTGTGAA 
TGACCTATAG 
CTCCATTATT 
GCAACAGCCT 
CCTCATGCCT 
AATGTCATCT 
GCACTTTCAA 
ACTCCAAATA 
GTTTTGAGTC 
TTGTTGGCTC 
CTAAGACAGC 
AAGCCCAGTG 
CCTGCAGCAG 
TCATCAGGGA 
TTGTAGGCAA 
TTCGGCAAAA 
ATATTTCATC 
TCATCAGTAT 
TGCGGGAAGA 
TGGTGCCTCC 
TTCCAGTGAA 
AAAATACATC 
CCCTTCCAGA 



51 
I 

CCACGGTGGC 
TAAGAAAAAA 
AGATTCCAAG 
ATGGTCACAT 
GAATGGAGTC 
TGATCCCCAG 
CAACAACCTC 
AATTAATGAA 
TGCAAATGGA 
GGTTCAGGTC 
CAGCAGTGCA 
CCTTCACAAG 
TAATGACATT 
TGGCTACAGG 
GACTTGTGTG 
TGCCACTGAG 
CCCATCAACC 
TCTGTCACTG 
AGCTGACAAT 
AAAGTATGCC 
GACAGCTCTT 
CAAAAGCCAA 
TATTCCCATC 
AACTATTATC 



300 
360 
420 
480 
S40 
600 
660 
720 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1021 
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OGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATCCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

C CATTTGCAGT GGAACQATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

^SS ATGTA CTCACmAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

~Z CCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

10 ^^T TGTTG GTGCCACAGT GGACACCACG GTGAACOCTT CTGGAGTCTG CACAGCTGCT 1980 

AU GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

AOGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

1 S TCCTGGCTTT TGTTCTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

OH TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTOCCTTAAG TTCTTGGAAG 2580 

ZU CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 

~ - Seq ID NOi 17S Protein sequence 
ZD Protein Accession #: Eos sequence 

} " 21 31 41 51 

OH MKVGVLWLIS FPTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

-3U EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCBRT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

i 1 ?™™^ FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASWS ILSNISSLSL 360 

DJ ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPWKWITY VGLGISIGSL 600 

A(\ IliCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLL1ADVW FIVGATVDTT VNPSGVCTAA 660 

tU VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKL3ALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA PSHTCDSSDM 900 
IMLTQFVSNE 



45 



50 



Seq ID NO: 176 DMA sequence 

Nucleic Acid Accession #: AB035089.1 

Coding sequence: 9845.. 10219 



] " ?1 31 41 51 

I I I ! j ( 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

4TC CCAAQAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

JJ TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

GAGACTATTT CCCTCTCTOC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

fZ(\ GTAGACAGAA TOCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

OU AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

fiZ GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

70 ^ CAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

/U CCATAGATTG GTCCCCTGTA ACCCCQGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGOG 1260 

CTGGATGCAG ACT CAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 

7S IT r S TCrrC 1TTCTTTrTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGQ AAACACAGCT TOCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 

OA AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

OV TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATO CACACTTGTG CAGAATTCTA 1740 

TGGAGAAGAG TC TGGCA TTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGOGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
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AAAAGGAATG ATGTACTGAT CCATQCAATG ATGTGGACAA ACCATGAAAA TAA CAGT AQA 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 
5 GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 
10 CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 
15 ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 
20 ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 
25 CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 
30 GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GG AACACAGA GGGAGAGTGC CTTGCAGCCA 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 
35 AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 
40 TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 
45 AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 
50 TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 
AAATGATGAA TAAGAGCAGG ACACAACTGC TOGGAGTCCC AGTGACCTCA TCCCAGAAAA 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 
55 TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 
60 TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAO TCGTATCCTT GATGACTGTT 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 
65 AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 
70 ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG AOGTATCAAT TTTTACAGGT 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 
75 TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 
80 CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGA CAGGA GA TACTGCATCT 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 
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AAGTGGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
5 GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7360 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
10 ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
15 GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA OGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
20 GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
25 GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGQTAAGAA 8760 
30 CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTT T TTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
35 CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
40 ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
45 TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
A TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
50 GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
55 GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAAOCAAACC AAATCATACT TTCTCTTCAA 10560 
60 ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
65 CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TQATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAAT AGTTA C 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGT TTCT 11160 
70 GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
75 CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 
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DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWEKKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 

LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKMB ESYDLKDTLR 300 

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VBAAAATAW WELSSPSTN 360 
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 

Seq ID MO: 178 DKA sequence 

Nucleic Acid Accession #: NM_001910.1 

Coding sequence: 50.. 1240 



1 11 21 31 41 51 

I I I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

lO GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240' 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTQ CAAGAOGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAAGAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720 

L J AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1090 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1360 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA OGTTATACAT TCATATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTOC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

50 Seq ID NO: 179 Protein sequence 
Protein Accession ft: NP_001901.1 

1 11 21 31 41 51 

cr I 1 I I I I 

J J MKTLUjLLLV LLELGEAQGS lhrvplrrhp SLKKKLRARS QLSEFWKSHN LDMIQFTESC 60 

SMDQSAKEPL INYLDMEYFG TISIGSPPQN PTVIFDTGSS NLWVPSVYCT 3PACKTHSRF 120 

QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGOTFVDAE 180 

FDGILGU3YP SIAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 240 

FSGSLNWVPV TKQAYWQIAIi DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 

IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 360 
HPPAGPLWIL GDVFIRQFYS VPDRGNNRVG LAPAVP 



Seq ID NO: 180 DNA sequence 
Nucleic Acid Accession ft: NM 018058.1 
(O Coding sequence: 319.. 1575 

1 11 21 31 41 51 

I I I I I I 

TACGCGCTGC GGGACCGGCA GGGGAAOGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 

/U GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 

TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180 

GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 

AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300 

„ CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 

/J CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 

GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480 

AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 

GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 

on AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 

OU ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

GTCCGCACGG TCATCACCGC CGACTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AACATTGCCT ACOGCAGCTC CTCAGCCAAC OGCCTCTTCC GCGTCATCCG TAGAGAGCAC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 
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TGGCTCAGCC 
TGGTGCCACG 
AGAAGAGTGG 
AGCCCGTGGC 
CAGATGGCAA 
TCCTCTACCC 
ATGCATCCAG 
AAGCTACAGG 
CACAGCCTGC 
TGCTGCTGCC 
CGTAGATGGA 
AGCAGGGGTG 
GTGCTGCTGC 
TCCATGCACA 
CACATAGCTG 
GTCTAATGAC 
AGTATGAGGA 
GACCTTACAC 
TAAGAATAGA 
ACAAAACCTG 
AACACGTG 



GCTGTCCGTC 
CACCCGGGTT 
GGCCCACCTG 
ACACTTTGGC 
GATGGTCAGC 
CCGGGATGAG 
TTCCCATTCG 
TGCCGGACCA 
GTGGGGACTC 
ACTGCOGCTG 
GATCTCAATC 
GGACATGAAC 
CTAGACAGTA 
TTACTTAGCT 
TGATCACAGC 
CATATCTTAG 
CTTTAGTGTC 
CAGTCACTTA 
ATCTTGGGGT 
GCACATAGTA 



TTCCGGGGCA 
GGGGOCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGGAACGTGG 
GACACACTTC 
TGTGCCCTCG 
ACAAGAAGTG 
TCGGCCAGTC 
CTGCTGCCGC 
TGGGGTCGGT 
CAGCGGATGG 
GGGATGTAAA 
AACAATTAGG 
AGACAGGGTC 
GACACAGATG 
CTGAGTTCAA 
ACTTGTTAGC 
TAGTGTGGAG 
AAGGCTCAAT 



ATCAGGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAGCCAG 
CCAGCGGGGA 
AGGACCCAGC 
AGACAAGCCC 
CAGTCGGGGC 
ACCGGGCCCC 
TGCTGGAGCT 
GGTTAAGGAG 
AGTCCAGCAG 
GGCCTGGGAG 
GAGACTCGTA 
GCTGCCCTGA 
TGCCCAGGGA 
ATCCTGATTC 
CATCCATTAT 
ATTAGATTAA 
AAAAACAAGT 



CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 
CCCACTGGAG 
GTATGTGTCA 
TACGAGCCCA 
CGCCCCACCA 
GCCACTGCTG 
AGCTGCGAGC 
GGGAGTGGGA 
CTAGACCCTC 
AGGCCAGGCC 
TGGCGCTTAC 
GGTGGTGTCA 
AGGAACTCAC 
CGCATCTGCA 
ATGTATGTAA 
GCCTCTCACT 



1020 
1080 
1140 
1200 
1260 
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1380 
1440 
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Seq ID NO: 181 Protein sequence 
Protein Accession #: NP 060528.1 



MDPEASDLSR 
RGDGTFVDAA 
PRDIASPKPS 
ELNPGDALEP 
RTRVGAFARG 
KMVSRNVASG 
GAGPTRSAVG 



11 
I 

GILALRDVAA 
ASAGVDDPHQ 
MPSPVRTVIT 
EGRGTGGWT 
AKVVLYTKKS 
EMNSVLEILY 
ATSPTRMAQP 



21 
I 

EAGVSKYTGG 
HGRGVALADF 
ADFDNDQELE 
DFDGDGMLDL 
GAHIiRIIDGG 
PRDEDTLQDP 
AWGLSASHRA 



31 
I 

RGVSVGPILS 
NRDGKVDIVY 
IPFNNIAYRS 
ILSHGESMAQ 
SGYLCEMEPV 
APLETPMNAS 
PAPPPPPLLL 



41 
I 

SSASDIFCDN 
GNWNGPHRLY 
SSANRLFRVI 
PLSVFRGNQG 
AHFGLGKDEA 
SSHSCALETS 
PLPLLLPLLE 



LOMSTHGKVR 
RREKGDPLIE 
FNNNWLRWP 
SSVEVTWPDG 
PYVSTPMEAT 
LPLLHRSS 



Seq ID NO: 182 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence t 1 . .1962 



60 
120 
180 
240 
300 
360 



1 11 21 31 41 51 

I I 1 I I I 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 
CAGOGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 
AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 
TTTGAGATOG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 
CAGAAGCGGC TGGTGAACAT CGCGGTOGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 
GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 
GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 
TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 
CGTGGTGTGG CCAGCCTCTT TGCOGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 
GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 
ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCOGGGGCA TTCTGGCGCT CAGAGATGTG 
GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GOGTCAGCGT GGGCCCCATC 
CTCAGCAGCA GTGCCTOGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 
CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 
CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 
GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 
GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 
ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 
CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 
ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 
GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 
GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 
GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 
AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 
CCOGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 
GATGGCAAGA TGGTGAGCCG GAAOGTGGCC AGOGGGGAGA TGAACTCAGT GCTGGAGATC 
CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 
TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 
GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 
AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 
CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 
GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 
CTGGGGTOGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 
CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 
AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 
TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 
CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA C CATA TCTTA 
GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 
CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCA GTCA CTT 
AACTTGTTAG CCATCCATTA TOGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 
TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCXTTGT CAACACG 
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Seq ID NO: 183 Protein sequence 
Protein Accession S: CAC08451 
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1 
I 

MSRMLPFLLL 
FEIWAGYKG 
EIYFLNTNNA 
GRYSIYIANY 
LSSSASDIFC 
VYGNWNGPHR 
RSSSANRLFR 
AQPLSVFRGN 
FVAHFGLGKD 
FSQQENGHCM 
LGQSPGPRPT 



11 
I 

LWPLPITEGS 
PNLVLKYDRA 
FSGVATYTDK 
AYGNVGPDAL 
DNENGPMPLP 
LYLQMSTHGK 
VIRREHGDPL 
QGFNNNWLRV 
EASSVEVTWP 
DTNECIQFPF 
TPTAAAATAA 



21 
I 

QRAEPMFTAV 
QKRLVNIAVD 
LFKFRNNRWB 
IEMDPEASDL 
HNRGDGTFVD 
VRFRDIASPK 
IEELNPGDAL 
VPRTRFGAFA 
DGKMVSRNVA 
VCPRDKPVCV 
AAAAAGAATA 



31 
I 

TNSVLPPDYD 
ERSSPYYALR 
DItiSDBVNVA 
SRGILALRDV 
AAASAGVDDP 
FSMPSPVRTV 
EPEGRGTGGV 
RGAKWLYTK 
SGEMNSVLEI 
NTYGSYRCRT 
APVLVDGDLN 



41 
I 

SNPTQLNYGV 
DRQGMAIGVT 
RGVASLFAGR 
AAEAGVSKYT 
BQHGRGVALA 
ITADFDNDQE 
VTDFDGDGKL 
KSGAHLRIXD 
LYPRDEDTLQ 
NKKCSRGYEP 
LGSWKESCE 



51 
I 

AVTDVDHDGD 
ACDIDGDGRE 
SVACVDRKGS 
GGRGVSVGPI 
DFNRDGKVDI 
LEIFFNNIAY 
DLILSHGESM 
GGSGYLCBME 
DPAPLECGOG 
NEDGTACVGT 



Seq ID NO i 184 DMA sequence 
Nucleic Acid Accession #: 
Coding sequence: 1..4794 

1 11 21 31 41 51 

1 I I I I 1 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 
AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 
GTTCTGAAGT ATGACCGGGC CCAGAAGOGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 
TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGOGAC 
ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 
CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 
CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 
TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 
TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 
CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 
GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 
GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 
GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 
TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 
GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 
TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 
CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 
TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 
GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 
CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGAGTGTC 
CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCOGGC 
CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 
CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG 
GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 
CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC GAGGGA6GAG AAGGGACTCG 
CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 
GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 
CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 
GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 
CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 
TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 
CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 
TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGOCCC 
ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 
TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 
GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 
CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 
CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 
TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 
CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 
ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCACAGCTAT 
TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 
GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 
CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 
TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 
GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 
CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 
GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 
AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 
AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 
AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 
CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTAGGGG 
GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 
CGGGGTCCAA TCACTACCAG GAAAAGGGGC TAOGGGGTCC AATCACTACC AGGAAAAGGG 
GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 
AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 
AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 
CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 
ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TAOGGGGTCC 
AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 

1027 
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GTCCAATCAC 
CGGGGTCCAA 
GCTATGGGQT 
AGGGGCTATG 
AGAGAGCACG 
GGCCGGGGCA 
TTGTCCCATG 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTGTGGAGG 
ATGAACTCAG 
CCACTGGAGT 
GAATGCATCC 
GGAAGCTACA 
GGCACAGCCT 
CCCAAAAAGG 
CCGGGTTGCC 
CAGAAAGCTC 



TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
OGGTCCAATC 
GAGACCCCCT 
CAGGGGGTGT 
GAGAGTCCAT 
GGCTGCGAGT 
TCTACACCAA 
GTGAGATGGA 
TGAOGTGGCC 
TGCTGGAGAT 
GTGGCCAAGG 
AGTTCCCATT 
GGTGCCGGAC 
GCGTGGGTAC 
AGCTGCAACT 
GGCTGCTCCT 
CAGGTATTCC 



AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCACAG 
CATCGAGGAG 
GGTGACCGAC 
GGCTCAGCCG 
GGTGCCACGC 
GAAGAGTGGG 
GCCOGTGGCA 
AGATGGCAAG 
CCTCTACCCC 
ATTCTCCCAG 
OGTGTGCCCT 
CAACAAGAAG 
TGAGCTAGGC 
TTCCCAAGGC 
CAAAAGAGCT 
AGAAGCCCAA 



GGCTCCAATC 
TACGGGCTCC 
GGGCTACGGG 
AAAGGGGCTA 
CTCAATCCCG 
TTOGACGGAG 
CTGTCCGTCT 
ACCOGGTTTG 
GCCCACCTGA 
CACTTTGGCC 
ATGGTGAGCC 
OGGGATGAGG 
CAGGAAAATG 
CGAGACAAGC 
TGCAGTCGGG 
TCTAGGCATA 
ATCTGCACCC 
CAGCTCCAGG 
GTGTATGAAC 



ACTACCAGGA 
AATCACTACC 
CTCCAATCAC 
CGGGGTCCAA 
GCGACGCCTT 
ACGGGATGCT 
TCCGGGGCAA 
GGGCCTTTGC 
GGATCATCGA 
TGGGGAAGGA 
GGAACGTGGC 
ACACACTTCA 
GCCATTGCAT 
COGTATGTGT 
GCTACGAGCC 
CAATGACGTG 
COGTCTGGTC 
CTGCTCCCAG 
AAGATCAGGA 



AAAGGGGCTA 
AGGAAAAGAG 
TACCAGGAAA 
CGTCATCCGT 
GGAGCCTGAG 
GGACCTCATC 
TCAGGGCTTC 
CAGGGGAGCT 
CGGGGGCTCA 
TGAAGCCAGC 
CAGCGGGGAG 
GGACCCAGCC 
GGACACCAAT 
CAACACCTAT 
CAACGAGGAT 
GAAACCAAGG 
CTTTTTCCTG 
CACCCTTCTC 
ATAA 
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Seq ID NO: 185 Protein sequence 
Protein Accession #: FGENESHH 

1 11 21 31 

I I I I 

MACPGGLPAR CSGWKGLGGP SGSSPASPPH SSSRYNGPNL 
SPYYALRDRQ GNAIGVTACD IDGDGREEIY PLNTNNAFSG 
PPTTPAGLLG LPPLSGRDPS SSLGQASPDS RQGERVPVPC 
GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV 
GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG 
GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE 
SKSHLADKNL PGPPCYYSVC APSPAHPFPA RQAPQHYPVA 
PHPRAPGMDP KCKGRHAEPG LMAEALGAWP ALSTTWPGG 
ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS 
GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLBAGT 
LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL 
SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL 
AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWHARLLQA 
FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP 
LSSERVNVGV DDPHOHGRGV ALADFNRDGK VDIVYGNWNG 
SPKFSMPSPV RTVITADFDN DQELBIFFNN IAYRSSSANR 
GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA 
KGKGNVAQSV PRTQAPQDTK PHYHKKGLOG PITTRKRGYG 
RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK 
RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI 
NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL 
AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR 
GRGTGGWTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF 
KWIiYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS 
MNSVLBILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN 
GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQB 

Seq ID NO: 186 DNA sequence 

Nucleic Acid Accession g: NM_000584.l 

Coding sequence: 75.. 374 

1 11 21 31 

I I 1 I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA 
CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT 
CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC 
GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT 
AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA 
TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT 
CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA 
GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT 
CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA 
CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA 
CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT 
TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC 
GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA 
TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT 
CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT 
TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC 
TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT 
CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT 
AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT 
GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG 
ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA 
AGTACATTAT TOTTTATCTG AAAGTTTTAA TTGAACTAAC 
AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA 
AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA 
TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA 



41 
I 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
GREAGAAGVP 
PLVTQLMTHG 



51 
I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFLLRPKS 
YSIYIANYAY 



PKVTQECHLV 
VPGAALPGNP 
LQFPSGLRGS 
FKNRGCGTFV 



TAYYIVLWSA 
PRRLYLQMST 
LFRCSILARG 
RGCGNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPITTRKRG 
REHGDPLIEB 
NNNWLRWPR 
SVEVTWPDGK 
ECIQFPFVCP 
PKKELQLSQG 



RGRVRTALOT 
RIAGKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGLA 
DAAASAERRL 
FTRTGSRFYS 
IPESLMTHSY 
HGKVRFRDIA 
SSSLTAGGRN 
AKEPASAIAG 



KGATGSNHYH 
SLPGKGATGS 
YGIiQSLPGKE 
LNPGDALEPE 
TRFGAFARGA 
MVSRNVASGE 
RDKPVCVNTY 
ICTPVWSFFI* 



41 

1 

ACCACCGGAA 
CTTGGCAGCC 
TAAAGAACTT 
CAAAGAACTG 
GCTTTCTGAT 
GGAGAAGTTT 
GAATCAGTGA 
GTGGGTCTGT 
GTAAACAATG 
GTATTAXTTA 
AGATATTGCA 
GAACTTTAAT 
AAAATGATGG 
TTTCTGTTAA 
GTGCCTTGGT 
AGTGATGTTG 
TTCAAGTGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCCTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



51 
I 

GGAACCATCT 
TTCCTGATTT 
AGATGTCAGT 
AGAGTGATTG 
GGAAGAGAGC 
TTGAAGAGGG 
AGATGCCAGT 
TGTAGGGTTG 
AATAGTTTTT 
TTTGAATCTA 
CGGGAGAATA 
TTCAGGAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 
GGTTGAAACT 
TTTTAAATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 
TTCTGATTGT ATQGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 
AAGTAAAAAA AAAAAAAAA 

Seq ID NO: 187 Protein sequence 
Protein Accession #: NP_000575.1 



1560 
1620 



1 11 21 31 41 51 

I I 1 I 1 I 

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 
CANTEIIVKL SDGRELCLDP KENWVQRWB KFLKRAENS 

Seq ID NO: 168 DNA sequence 

Nucleic Acid Accession §: NMJJ03661.1 

Coding sequence : 1 . . 1152 



ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AA6GAAAAAG 
GGATTOGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGGATG 
GAACAGGTGQ 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 

I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 



TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGQAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AGGAGAA6CT 
GA 



21 
I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
ATCCAACTTT 
CATCCGTGCC 



TGAACCCAGC 
CTTCTTTCTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
CTCAGACGAG 
GTCACTGAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTG 
AACAATAATT 



41 
I 

AAGCTGGAGC 
GTAAGCCCCT 
AGGATGCCAT 
CTGATAATGA 
ATGAGCTGCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



51 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATOGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
GGAGGTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



Seq ID NO: 189 Protein sequence 
Protein Accession #: NP 003652.1 



1 
I 

MSALFLGVGV 
KEKVSTONLL 
YRNWFLKEFP 
LAPFTEGGSL 



EQVERVNEPS 
QELEEKLNIL 



11 
I 

RAEEAGARVQ 
LLLTDNEAWN 
RLKSELEDNI 
VLLEPGMELG 
LSLAGNTYQL 
ILEMSRGVKL 
NNNYKILQAD 



21 
I 

QNVPSGTDTG 
GFVAAAELPR 
RRLRAIADGV 
ITAALTGITS 
TRGIGKDIRA 
TDVAPVSFPL 
QEL 



31 

I 

DPOSKPLGDW 
NEADELRKAL 
QKVHKGTTIA 
STMDYGKKWW 
LRRARANLQS 
VLDWYLVYE 



41 
I 

AAGTMDPESS 
DNLARQMIMK 
NWSGSLSIS 
TQAQAHDLVI 
VPHASASRPR 
SKHLHEGAKS 



SI 
I 

IFIEDAIKYF 
DKNWHDKGQQ 
SGILTLVGMG 
KSLDKLKEVR 
VTEPISAESG 
ETAEELKKVA 



Seq ID NO: 190 DNA sequence 

Nucleic Acid Accession ft: NM_0144S2.1 

Coding sequence: 1..1968 



1 11 21 

I I I 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC 
GCCACAGCCA CGATGATCGC GGGCTCCCTT 
GCTCAGCCAG AACAGAAGGC CTCGAATCTC 
ACCGGCCAGG TGCTAACCTG TGACAAGTGT 
ACCAACACAA GCCTGOGCGT CTGCAGCAGT 
AATGGCATAG AGAAATGCCA TGACTGTAGT 
TTACCTTGTG CTGCCTTGAC TGACCGAGAA 
AACGCTACCT GTGCCCCCCA TACGGTGTGT 
ACAGAGACTG AGGATGTGCG GTGTAAGCAG 
TCTAGTGTGA TGAAATGCAA AGCATACACA 
AAGCCGGGGA CCAAGGAGAC AGACAACGTC 
ACCTCACCTT CCCCTGGCAC AGCCATCTTT 
GTCCCTTCCT CCACTTATGT TCCCAAAGGC 
TCTGTTAGAC CAAAGGTACT GAGTAGCATC 
TCAGCAAGGG GGAAGGAAGA CGTGAACAAG 
CAGCAAGGCC CCCACCACAG ACACATCCTG 
GGCGAGAAGT CCAGCACGCC CATCAAGGGC 
CACAAGCATT TTGACATCAA TGAGCATTTG 
GTGCTTGTGG TGATTGTGGT GTGCAGTATC 
CCCCGGCAGG ATCCCAGTGC CATTGTGGAA 
ACCCAGAACC GGGAGAAATG GATCTACTAC 
CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG 
AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT 
GCAGCTCTGC AGCACTGGAC CATCOGGGGC 
GCCCTGCGCC AGCACCGGAG AAACGATGTT 
ACCACCCAGC TGGAAACTGA CAAACTAGCT 



31 41 

I I 
CTCGCCTCCT GCAGCCGCAT 
CTCCTGCTTG GATTCCTTAG 
ATTGGCACAT ACCGCCATGT 
CCAGCAGGAA CCTATGTCTC 
TGCCCTGTGG GGACCTTTAC 
CAGCCATGCC CATGGCCAAT 
TGCACTTGCC CACCIGGCAT 
CCTGTGGGTT GGGGTGTGCG 
TOTGCTCGGG GTACCTTCTC 
GACTGTCTGA GTCAGAACCT 
TGTGGCACAC TCCCGTCCTT 
CCACGCCCTG AGCACATGGA 
ATGAACTCAA CAGAATCCAA 
CAGGAAGGGA CAGTCCCTGA 
ACCCTCCCAA ACCTTCAGGT 
AAGCTGCTGC CGTCCATGGA 
CCCAAGAGGG GACATCCTAG 
CCCTGGATGA TTGTGCTTTT 
CGGAAAAGCT CGAGGACTCT 
AAGGCAGGGC TGAAGAAATC 
TGCAATGGCC ATGGTATCGA 
AAAGATATCT ATCAGTTTCT 
GGGTACACAG CCGACCACGA 
CCCGAGGCCA GCCTCGCCCA 
GTGGAGAAGA TTCGTGGGCT 
CTCCCGATGA GCCCCAGCCC 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 



51 
I 

OGCCCGCCGA 
CACCACCACA 
TGACCGTGCC 
TGAGCATTGT 
CAGGCATGAG 
GATTGAGAAA 
GTTCCAGTCT 
GAAGAAAGGG 
AGATGTGCCT 
GGTGGTGATC 
CTCCAGCTCC 
AACCCATGAA 
CTCTTCTGCC 
CAACACAAGC 
AGTCAACCAC 
GGCCACTGGG 
ACAGAACCTA 
CCTGCTGCTG 
GAAAAAGGGG 
CATGACTCCA 
TATCCTGAAG 
TTGCAATGCC 
GCGGGCCTAC 
GCTAATTAGC 
GATGGAAGAC 
GCTTAGCCCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



1029 



WO 03/042661 



PCT/US02/36810 
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AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 
TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 
GACTCTACAT CCAGOGGCTC CTCOGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 
AAGAAGGACA CAGTGTTGCG GCAGGTAOGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ID NOs 191 Protein sequence 
Protein Accession 8: NP 0S5267.1 



1620 
1680 
1740 
1800 
1860 
1920 



1 
I 

MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 



DDMLHFLNPE 



11 
i 

IASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQ9 
DCLSQNLWI 
MNSTESNSSA 
KLLPSHEATG 
RKSSRTLKKG 
KDIYQFLCNA 
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 
I 

ATATMTAGSL 
TNTSLRVCSS 
HATCAPHTVC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
SEREVAAFSN 
TTQLETDKLA 



AEDKLDRLFE 



31 
I 

LLLGFLSTTT 
CPVGTFTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
LPMSPSPLSP 
SRNGSFITKE 
IIGVKSQEAS 



41 
I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIP 
SARGKEDVNK 
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVIiRQVR 
QTLLDSVYSH 



51 
I 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWNH 
PWMIVLFLLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPCDLQPIF 
LPDLL 



Seq ID NOs 192 DNA sequence 

Nucleic Acid Accession #s XM_044533 

Coding sequence: 238.. 2751 



11 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 
AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 
GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GOGGACCGCG GGGCGGAGCT GCCGCCCGTG 
AGTCOGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 
CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 
CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 
TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 
TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTC7GC TGAGCAGGGA TGGCAGGACC 
CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 
GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 
TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 
AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 
AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 
AAGGGCOGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGGGAG 
CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 
AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 
GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 
TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 
ATTGCCGGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 
TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 
CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 
GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 
ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 
CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 
AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 
TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 
CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 
GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 
CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 
CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 
CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 
TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 
ACCAGGCOGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 
TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 
CAAACAGATG AGGGTGGCAG TGTACCOGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 
GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 
TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 
AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 
CCTGTG G TGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 
CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTGCCG AGTCTTCACT 
GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 
CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCOSTGACT CTGT GGT GTG AGAGCTGACT 
TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 
TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 
GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 
ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 
TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 
GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 
GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 

1030 
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TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGOGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCOGT GTGCCCCTTC 3540 

CCACCATATC CACCCTOGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID MO: 193 Protein sequence 
Protein Accession #: XP 044533.3 



1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSBERPFL 60 

RFEAEHISNY TALLfcSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADABKKQQC 120 

SFKGKDPQRD OQNYIKILLP LSGSHLFTOG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSP QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YPPPSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGPPFN VLQDVPTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHPLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLPLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FOPNTVNTLA CPLLSNLATR LWbRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLPLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
iU PLDHRGYQSL SDSPPGSRVP TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 



Seq ID NO: 194 DNA sequence 

Nucleic Acid Accession &: NM_022819.1 

Coding sequence : 1 . . 63 5 

1 11 21 31 41 51 

I I I I I I 

ATGGCAGATG GGGCAAAGGC CAACCCCAAA GGGTTCAAAA AGAAGGTGCT GGATAGATGC 60 
TTCTCTGGGT GGAGGGGCCC ACGCTTCGGG GCCTCCTGTC CTTCAAGAAC CTCCAGGTCT 120 
AGCCTGGGTA TGAAGAAGTT CTTCACCGTG GCCATCCTTG CTGGCAGOGT TCTGTCCACA 180 
GCTCACGGCA GCCTGCTCAA CCTGAAGGCC ATGGTGGAGG CCGTCACAGG GAGGAGOGCC 240 
ATCCTGTCCT TCGTGGGCTA CGGTTGCTAC TGTGGGCTGG GGGGCCGTGG CCAGCCCAAG 300 
GATGAGGTGG ACTGGTGCTG CCACGCCCAC GACTGCTGCT ACCAGGAACT CTTTGACCAA 360 
GGCTGTCAGC CCTATGTGGA OCACTATGAT CACACCATCG AGAACAACAC TGAGATAGTC 420 
TGCAGTGACC TCAACAAGAC AGAGTGTGAC AAGCAGACAT GCATGTGTGA CAAGAACATG 480 
GTTCTGTGCC TCATGAACCA GACGTACCGA GAGGAGTACC GTGGCTTCCT CAATGTCTAC 540 
TGCCAGGGCC CCACGCCCAA CTGCAGCATC TATGAACCGC CCCCTGAGGA GGTCACCTGC 600 
AGTCACCAAT CCCCAGCGCC CCCCGCCCCT CCCTAG 



50 Seq ID NO i 195 Protein sequence 

Protein Accession #: NP 073730 



1 11 21 31 41 51 

I I I I I I 

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST 60 
AHGSLLNIiKA MVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWOCHAH DCCYQELPDQ 120 
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCIWNQTYR EEYRGFIcNVY 180 
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO i 196 DNA sequence 

Nucleic Acid Accession ft: XM_028196.1 

Coding sequence: 1315.. 1791 



^- 1 11 21 31 41 51 

65 | | | | | | 

GGCATTGATG CTGTGTGCGC GTGCGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 

GTGTGTGTGT GTCTGGAGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCCC 120 

AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCTGCTTAT TCCCGATGGG 180 

ATGGCCTGGG CTGGGCCCCT GAGGCCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 240 

CTCTTGTGGG CAAAGCAGGG GAGGOGCCAA TGTGGAGGAA CAGAGTCTCC TGGCTGGCTG 300 

CTGCTCTCTG GAGCGGGTGG AGTCAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 360 

GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTCCCCTGC CTGTAGGTGA CAGGAGCCAG 420 

CCCAGCCAGG TGTGCTCCCT CCCCAGGCCC TAGGCAGGCG GGTACAGGGG CCAGCAGCTG 480 

CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGT GGCCAGGCAG GCAGGTGGAC 540 

GAGTCCAGGC AGCGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCCGCTCC CAGCTGCACC 600 

CTGCCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAGGCG CTGCGGCCCT GAGCCCCTGC 660 

CAGGAATGCA CCTTTAGCCC AGGCCTGCTC AGTGAGCTCC GCCGACAGCC AGCCCTGCTC 720 

CTCCCGCCAT GACCCTGCAG ACCCCTCTGO GCTTCCAAGT TCCTGGGGGC TGCAGTGAAC 780 

ATGCTCCACC TGCATGGCTG GCAAACCATG GTGGGCCCCA GCTGTGGTGC GTGCTGGGGT 840 

AGAGGCAAGG AAGTGATGGG ACCGCAGAGA TGAGACCCCC AGGGATGAGA TGGGACCCCC 900 

AGGCAGGGCC CAGGGTCCAG GGCCCAGGAG AGAGAAGCAG GGAGGGAGAG AGCTTCCTGG 960 

TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 1020 

AGGCTGCCCA GGCCCTGCCT GCTTGGCTGG GGCTGGGGGC TGCTGGGAGG TGGCTGGGAG 1080 

GCTGGGCCTG GGCAGCTAAG CTGGAGCTTT GGCCAGGGTC CAGAGCCTCC CTCCCTTCAG 1140 

1031 



WO 03/042661 



CTTCCTGCTG CACAGAACCC TCGCCCCTGG CCACCCCGTG CTGCCTCCTT GCCCTGGCAG 1200 

ACCCAGCACT GGCTGCTGCT AGTCAGATGG GGTAGCGGGC AGGGGCCGGA GGGGCCACCC 1260 

TCCCAGCTGA CCCAGCCTCC TGGGCOGCTT CTTCCAAACC AGCAGGGTAG AAAGATGGGG 1320 

CACCCACCAG TCTCTCCCAG TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATTCCAGACC TTGTOGCCGG GACCCOCTGT GAGTTGTGGG ATTCCCAAGA GGGGTGTGGG 1440 

GATAACCCAG CCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCCT GGCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG OGGGCGTCCT CCTCGTCTCC 1560 

TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC CGCCGCCACA GGAAGAAGCC CAGGGACAAG 1620 

GAGTCOGTGG GTCTGGGCAG TGCCOGCGGC ACCACCACCA CCCACCTGGT GAGGAGOGGC 1680 

TCCTTGCTCA CTCAGTCCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 

GGCGAGTTCA GCCCCAGGGA TGGTTTAACC CCCACAGAGG CAGGGCGTTG AGGACCTTCC 1600 

TGGCAGGGAA AGTGGGTGAA CAGAGGTGAG AAGGAGGCCA TGCAACAGGG GCTGCCCCAT 1860 

GGGCCCGAGG GAGCCACAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 

TTTGGGTGGG TTTGGCCGGT CTCACAGAGC GAAGCGGAOG ATTTGTGCCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGCAAG GGCTGCCCGG GAGCCCAGGG 2040 

CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT CCAGGTGCAA 2100 

CCTGATGTGG ATGGCCTGGA GTCCAGCCCG GGGGATGCTC AGCAATGGGG GTGCCTGCAG 2160 

CTCTCCCTGG AGTTGGACTT TGGAAGCCAG GAGGTGAAGG GCCCOGCTGC GCAGGACCAG 2220 

CGGTTCTGCG AGTTTCCGGA AAGGGTGAOG GGGGAAGGGC AGACCCCATG CCCTGGGTGG 2280 

TGGGGAGCTG ACAGGGCAGG GGCCCTTGGC TGAGCCCACC CCGCTGGCTC CCAGATCAGG 2340 

GTGGGCCTGA GGCAGGCAGC CGACCTGAGG CCTGGGGGCA CCGTGGACCC CTATGCCOGG 2400 

GTCAGCGTCT CCACCCAGGC CGGACACAGA CATGAGACAA AAGTGCACCG AGGCACGCTC 2460 

TGCCCCGTGT TTGACGAGAC CTGCTGCTTC CACGTGAGTC AGGGATGGTC GGCTGGGTGG 2520 

GCCTGGACGG CTGGATGGGC CTGGGCTGGG TGGGCCTGGG CAGCTGGGTG GGCCTGGGCA 2580 

GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTCAC GCOGCTGCCT CAGATCCCGC 2640 

AGGCGGAGCT GCCAGGGGCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGCTTCTOGG 2700 

GGCATGAGCC CCTGGGTGAG CTCCGTCTGC CACTGGGCAC OGTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTG GGCCGGCGGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 

GCCACAGCCC AAGGCAGAGC TGGCAGGGAC CCTGCCCTAT GGGCCATCGG AAAGACAGGC 2880 

CTGATGGGCA GCATTTTCGG GGGTCTGAGC CCCAACTCGG CCAGAATCAC CCTCCCGGGC 2940 

TGAAGCCCCT CTTGCTGCCC ACAGCCCGAG CAGGTCGGGG AGCTGTGCTT CTCTCTCCGG 3000 

TACGTGCCCA GCTCAGGCCG GCTGACCGTG GTGGTGCTGG AGGCTOGAGG CCTGCGTCCA 3060 

GGACTTGCAG AGCCCTACGT GAAGGTCCAG CTCATGCTGA ACCAGAGGAA GTGGAAGAAG 3120 

AGAAAGACAG CCACCAAAAA GGGCAGGGOG GCCCCCTACT TCAATGAGGC CTTCACCTTC 3180 

CTGGTGCCCT TCAGCCAGGT CCAGAATGTG GACCTGGTGC TGGCTGTCTG GGACCGCAGC 3240 

CTGCCGCTCC GAACTGAGCC CGTAGGCAAG GTGCACCTGG GTGCCCGGGC CTCGGGGCAG 3300 

CCCCTGCAGC ACTGGGCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

CCCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3460 
GCACTTGCCC AGGCCGCCCT GCAGGACCAG TGCAATAAAC GCCTTCTCCT GCC , 

Seq ID NO: 197 Protein sequence 
Protein Accession #: XP_028196.1 

1 11 21 31 

I I I I 

KGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQBG 
STPGPRWALI AGALAAGVLL VSCLLCAACC CCRRHRKKPR 
SGSLLTQSRE GLKSRLQSPG QRGEFSPRDG LTPTEAGS 

Seq ID NO: 198 DMA sequence 
Nucleic Acid Accession ft: NM_000612.2 
Coding sequence: 553.. 1095 ~ 

1 11 21 31 41 51 

I 1 I I I I 

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCCGACTCC 60 

CTCCCCCCCT CACGCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

TTCTGTTTCT CTCCGTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 180 

TCTCCCCCTC GCCCTCTCTT CGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA CAACATCTGG CCCGCCCCAG CCCGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 420 

ACATTCGGCC CCCGCGACTC GGCCAGAGOG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 480 

CCAACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 540 

TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GCCTTCGCCT CGTGCTGCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAG CAGGCCCGCA 720 

AGCCGTGTGA GCCGTCGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CAGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCCGCCAAGT CCGAGAGGGA CGTGTOGACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGCCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCCAA ACGTCACOGT 1020 

CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 1140 

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAAATCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGG CCCC CTCCATCGGG CXGAGQAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320 
GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 

Seq ID NO: 199 Protein sequence 
Protein Accession fit NP_000603.1 

1 11 21 31 41 51 



41 51 
I i 

CGDNPAKWGL QLSTDALSLA 60 
DKESVGLGSA RGTTTTHLVR 120 
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I I I I I I 

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YFSRPASRVS 60 
RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 
QSTQRLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 



Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession 8: AK0S7131.1 

Coding sequence : 61 .. 1146 



1 11 21 31 41 SI 

I I 1 I I I 

AGTCTGGGCG TTTAGGTCAG AACTACCCCG GTAGCCTGAC AGCAGGAGCT CGAGAGAAGC 60 

ATGGCTCAGC GGTGCGTTTG CGTGCTGGCC CTGGTGGCTA TGCTGCTCCT AGTTTTCCCT 120 

ACCGTCTCCA GATCGATGGG CCCGAGGAGC GGGGAGCATC AAAGGGCGTC GCGAATCCCT 180 

TCTCAGTTCA GCAAAGAGGA ACGCGTCGCG ATGAAAGAGG CGCTGAAAGG TGCCATCCAG 240 

ATTCCAACAG TGACTTTTAG CTCTGAGAAG TCCAATACTA CAGCCCTGGC TGAGTTGGGA 300 

AAATACATTC ATAAAGTCTT TCCTACAGTG GTCAGCACCA GCTTTATCCA GCATGAAGTC 360 

GTGGAAGAGT ATAGCCACCT GTTCACTATC CAAGGCTCGG ACCCCAGCTT GCAGCCCTAC 420 

CTGCTGATGG CTCACTTTGA TGTGOTGCCT GCCCCTGAAG AAGGCTGGGA GGTGCCCCCA 480 

TTCTCTGGGT TGGAGCGTGA TGGCGTCATC TATGGTTGGG GCACACTGGA CGACAAGAAC . 540 

TCTGTGATGG CATTACTGCA GGCCTTGGAG CTCCTGCTGA TCAGGAAGTA CATCCCCCGA 600 

AGATCTTTCT TCATTTCTCT GGGCCATGAT GAGGAGTCAT CAGGGACAGG GGCTCAGAGG 660 

ATCTCAGCCC TGCTACAGTC AAGGGGCGTC CAGCTAGCCT TCATTGTGGA CGAGGGGGGC 720 

TTCATCTTGG ATGATTTCAT TCCTAACTTC AAGAAGCCCA TCGCCTTGAT TGCAGTCTCA 780 

GAGAAGGGTT CCATGAACCT CATGCTGCAA GTAAACATGA CTTCAGGCCA CTCTTCAGCT 840 

CCTCCAAAGG AGACAAGCAT TGGCATCCTT GCAGCTGCTG TCAGCCGATT GGAGCAGACA 900 

CCAATGCCTA TCATATTTGG AAGCGGGACA GTGGTGACTG TATTGCAGCA ACTGGCAAAT 960 

GAGGTTTATG GAGAGAAATC CCTTAACCAA TGCAATAATC AGGACCACCA CGGCACTCAC 1020 

CATATTCAAA GCAGGGGTCA AGTTCAATGT CATCCCCCCA GTGGCCCAGG CCACAGTCAA 1080 

CTTCCGGATT CACCCTGGAC AGACAGTCCA AGAGGTCCTA GAACTCACGA AGAACATTGT 1140 

GGCTGATAAC AGAGTCCAGT TCCATGTGTT GAGTGCCTTT GACCCCCTCC CCGTCAGCCC 1200 

TTCTGATGAC AAGGCCTTGG GCTACCAGCT GCTCCGCCAG ACCGTACAGT CCGTCTTCCC 1260 

GGAAGTCAAT ATTACTGCCC CAGTTACTTC TATTGGCAAC ACAGACAGCC GATTCTTTAC 1320 

AAACCTCACC ACTGGCATCT ACAGGTTCTA CCCCATCTAC ATACAGCCTG AAGACTTCAA 1380 

ACGCATCCAT GGAGTCAACG AGAAAATCTC AGTCCAAGCC TATGAGACCC AAGTGAAATT 1440 

CATCTTTGAG TTGATTCAGA ATGCTGACAC AGACCAGGAG CCAGTTTCTC ACCTGCACAA 1500 

ACTGTGAGGT CAAGGGGCCT GCTGGGTTAG GCATGCCOGA CCCOGGGACA GGACTAACCC 1560 

AAGGGGGAAA GCTAGTGTTG ATGAAACTTT TGATCAAAAC CACATTGTAA AACATTGCCC 1620 

ATCTGTCTTG CTCACTCTTA AACTCTCCCA AGAACAAGGC CGGGGTAAGG TAAAGTCAGC 1680 

AGAAATCTGG CTTCTCCCTT CCTCCCGACA TCTGCATCCC TTGATCCACT GGCATTTGCT 1740 

GCCCTCTTGT CCCTTATCTG TCTTATGCTG GTTATTTCAC TGCTTCACCT TCCAGGCTTG 1800 

ACTTAACAAA TGTAGATTTG AGAAATCTCA ACCAGTTGTT ACCTGATAGG AGTCTTTAAT I860 

TTAGGGCACT CTTGCTGGGA TGCTTTCTCC AGAGCTTATA TATTTCTTCT TACTAGAACT 1920 

TTCTTCCCCC TTTTATTCCC CTCTCTTCTT GGACTCATGA GCTGTCTCTT CATCTCTCCT 1980 

CTCTCTCCTG CATCTCTCCC CTTACTCTTC AATTTATTCT ACTTCTGGAC CTGGACTTAC 2040 

CCAAACTGTG ATACTACCAT AATTGTCACC ATAATCAGTC AAATAAAGTG ATCTGTGCAT 2100 
C 



Seq ID NO: 201 Protein sequence 
Protein Accession #: BAB71368.1 

1 11 21 31 41 51 

I I I I I ! 

MAQRCVCVLA LVAMLLLVFP TVSRSMGPRS GBHQRASRIP SQFSKEERVA MKEALKGAIQ 60 
IPTVTPSSEK SNTTALAEFG KYIHKVPPTV VSTSFIQHEV VEEYSHLPTI QGSDPSLQPY 120 
LLMAHFDWP APEEGWEVPP PSGLERDGVI YGWGTLDDKN SVMALLQALE LLLIRKYIPR 180 
RSFPISLGHD EESSGTGAQR ISALLQSRGV QLAFIVDEGG PILDDFIPNF KKPIALIAVS 240 
EKGSMNLMLQ VNMTSGHSSA PPKETSIGIL AAAVSRLEQT PMPIIFGSGT WTVLQQLAN 300 
EVYGEKSLNQ CNNQDHHGTH HIQSRGQVQC HPPSGPGHSQ LPDSPWTDSP RGPRTHEEHC 360 
G 



Seq ID NO: 202 DNA sequence 

Nucleic Acid Accession #i NMJD04217.1 

Coding sequence: 58.. 1092 



1 11 21 31 41 51 

I I I I I I 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACOGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGOGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG , 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCT GG T GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 
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TCAGCCCACC CTTGGGTCCX3 GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 10 BO 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATQ TCTQTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 203 Protein sequence 
Protein Accession tt: np_004208 

1 11 21 31 41 51 

1 I I I. I I 

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KBPVTPSALV LMSRSNVQPT AAPGQKVMEN 60 

SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH FIVALKVLFK 5QIEKEGVEH 120 

QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TFDEQRTATI 180 

MEELADALMY CHGKKVIHRD IKPENLLLGL. KGELKIADFG WSVKAPSLRR KTMCGTLDYL 240 

PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA 300 
QDIiISKIJjRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA 

Seq ID NO: 204 DNA sequence 
Nucleic Acid Accession #t AK0S5663 
Coding sequence : 38 . . 1423 

1 11 21 31 41 51 

I I I I I I 

AGAACGGCTT CCGGCGGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 60 

AAAACCACAA AGATCCTTTT TTGGCAAGTT GTTACGGGAA TTTAGACTTG TAGCAGCTGA 120 

CCGAAGGTCC TGGAAGATAC TGCTCTTTGG TGTAATAAAC TTGATATGTA CTGGCTTCCT 180 

GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 240 

TTTTGATCTT TTTAGTTTAA TGACATGTTT AATAAGTTAC TGGGTAACAT TGAGGAAACC 300 

TAGCCCTGTC TATTCATTTG GGTTTGAAAG ATTAGAAGTC CTGGCTGTAT TTGCCTCCAC 360 

AGTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA AGTGCAGAAC GCTTTTTGGA 420 

ACAGCCCGAG ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGTTTCAA 480 

CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTGCTAG 540 

TACGAGCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC 600 

GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 660 

AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTGCCGTAGA 720 

CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 780 

GTACAGTGGG AAAGTCTTAC TCCAGACAAC AOCACCCCAT GTTATTGGTC AGTTGGACAA 840 

ACTCATCAGA GAGGTATCTA CCTTAGATGG AGTTTTAGAA GTCCGAAATG AACATTTTTG 900 

GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA 960 

TGAACAAATG GTTCTTGCTC ATGTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTAAC 1020 

TGTTCAAATT TTCAAGGATG ACTGGATTAG GCCTGCCTTA TTGTCTGGGC CTGTTGCAGC 1080 

CAATGTCCTA AACTTTTCAG ATCATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 1140 

TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAGT AGTCCACCTC CAGAATTTTC 1200 

ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AAACAAGGCC 1260 

TTATGGTTTT GGTCTCAATC ATGGACACAC ACCTXACAGC AGCATGCTTA ATCAAGGACT 1320 

TGGAGTTCCA GGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 1380 

TAGATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 1440 

TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 1500 

TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 1560 

TATGAAACTA TATTTTTGTA AAATGTATTT GTGACAGTGA AATCCTOGTA AATGTTAAAG 1620 

GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG .1680 

GTTTTGTAGT TGACTGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 1740 

AGATCTGTCA CATTACTAAG ATACQATATT TCTTTTTTTT TCCGAGACGG AGTCTTGCTC 1800 

TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 1860 

CTTAGTTTTT GTTTTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 1920 

ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 1980 

TGCCTCAGCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCACGCCC AGCTAATTTT 2040 

TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 2100 

ACCTCATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 2160 

CCTGGCCGAT ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 2220 

GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 2280 

AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 2340 

TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATTTT TAATTATCAA 2400 
GATTTTTGTT AAAGTTTCTC TCCTTTAAAA ATTTTAGTAC ATTTGTAAAT 

Seq ID NO: 205 Protein sequence 
Protein Accession #: BAB70980.1 

1 11 21 31 41 51 

I I I I I I 

MGTIHLFRKP QRSFFGKLLR EFRLVAADRR SWKILLFGVI NLICTGFLLM WCSSTNSIAL 60 

TAYTYLTIFD LFSLMTCLIS YWVTLRKPSP VYSFGFERLE VLAVFASTVL AQXiGALPILK 120 

ESAERFLEQP EIHTGRLLVG TFVALCFNLF TMLSIRNKPF AYVSEAASTS WLQEHVADLS 180 

RSI/CGI IPGL SSIPLPRMNP FVLIDLAGAF ALCITYMLIE INNYFAVDTA SAIAIALMTF 240 

GTMYPMSVYS GKVLLQTTPP HVIGQLDKLI REVSTLDGVL EVRNEHFWTL GFGSLAGSVH 300 

VRIRRDANEQ MVLAHVTNRL YTLVSTLTVQ IFKDDWIRPA LLSGPVAANV LNFSDHHVIP 360 

MPLLKGTDDL NPVTSTPAKP SSPPPEFSFN TPGKNVNPVI LLNTQTRPYG FGLNHGHTPY 420 
SSMLNQGLGV PGIGATQGLR TGFTNIPSRY GTNNRIGQPR P 

Seq ID NO: 206 DNA sequence 

Nucleic Acid Accession 8: NM_016361.1 

Coding sequence: 3 97.. 1662 

1 11 21 31 41 51 



1034 



WO 03/042661 PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GGAACTCAGG 
TTTAATGTTC 
AGGAGOGACC 
TTGTTCTCTC 
TGCTTGCTGA 
TGAGCCCCGG 
CGAGGGCCTA 
GTCCTGACCT 
GAGGCCGATG 
GTGTTTOGAC 
TGGAACCCCC 
CTAGCTGGTG 
AAGGGGGGCA 
GGAGAGAGAC 
CCACAGGAGG 
TTGCTGGCTG 
GCAGATTCAG 
AGAGGCCGGA 
AAGGACAGGA 
GTGGCTGCCG 
ATGATCGAAC 
AGTCTTCAGA 
ATGGACTCTG 
GTGACCTTCA 
GCTGTTGACC 
CTCTATTACC 
GACATGTTCT 
TGCTCTCAAA 
ATGTGTTGAT 



GCCGGCTCCT 
CTCGTCGGGC 
AACCCCGACC 
CCCCGTTCCC 
AGGGCTGGAT 
CGCCCGCAGA 
CCACCATGAT 
CGCTGGCGTA 
GCCAGTGTCC 
ACGGGGCTCG 
AGCTATTAGA 
GTCCGAAACC 
TGTTTGCTGG 
TGAGGAAGAA 
TCTTTATTCX3 
GGCTTTTCCA 
AAGTCTTGTA 
GGCAGACTGC 
TGGGCATTGA 
AGCAGGCACA 
AGAGAGCTGT 
TGGCAGTAGG 
CCACTGOCCC 
TACCGCTCTT 
TGACCATGGA 
ACGGGAAGGA 
TGAATGCCAT 
CTCAGGTGAT 
TTTAAAATAA 



GTTCCTTCAA 
AAAAGATAAG 
CGGGTTAAAA 
ACTCGGGGTC 
GTACGCATCC 
AGACTTGTGT 
CACTGGTGTG 
CTGCCTGCAC 
GGTCGACCGC 
GAGTCCTCTC 
GGTCCCACCC 
ATATTCTCCT 
GCAGCTGACC 
CTATGTGGAA 
TTCCACTAAC 
GTGTCAGAAA 
TCCCAACTAC 
CTCTTTACAG 
CAGTAGTGAT 
CAACCTCCCA 
GGACACATCC 
CCCATTCCTC 
CGACAAGATC 
AATGACCCTG 
ACTTTACCAG 
GCAGGTGCCG 
GTCAGTTTAT 
GGAAGTTGGA 
AGTGCCTTTA 



GAGTGCTGGA 
GATCCGATCT 
CTCCCAGGGA 
TCCCTCAGGG 
GCAGGTTCGC 
TTGOCTCCTG 
TTCAGCATGC 
CAGCGGCGGG 
AGCCTGCTGA 
AAGCCGCTCC 
CAAACTCAGT 
TACGACTCTC 
AAGGTGGGCA 
GACATTCCCT 
ATTTTTCGGA 
GAAGGACCCA 
CAAAGCTGCT 
CCAGGAATCT 
AAAGTGGACT 
AGCTGCCCCA 
TTGTACATAC 
CACATCCTAG 
AGAAAGCTGT 
GGGATTTTTG 
CACCTGGAAT 
AGAGGTTGCC 
ACCTTAAGCC 
AATGAAGAGT 
TACAAAAAAA 



I 

GGCCAAACTT 
CCOCCGGCCC 
CTCTTCGCTG 
CCGGGAGGCA 
GCGGACTTGG 
CAGCCTCAAC 
GCTTGTGGAC 
TGGCCCTGGC 
AGTTGAAAAT 
CGCTGGAGGA 
TTGATTACAC 
AATACCATGA 
TGCAGCAAAT 
TTCTTTCACC 
ATCTGGAGTC 
TCATCATCCA 
GGAGCCTGAG 
CAGAGGATTT 
TCTTCATCCT 
TGCTGAAGAG 
TGCCCAAGGA 
AGAGCAACCT 
ATCTCTATGC 
ACCACAAATG 
CTAAGGAGTG 
CTGATGGGCT 
CAGAAAAATA 
AACTGATTTA 
AAAAAAAAAA 



I 

GAAATACAAG 
GGTGTGCAGC 
CTGCCACCTC 
CAGCGGTCCC 
GGGCGCCCGC 
CCGGAGGCAG 
CCCAGTGGGC 
CGAGCTGCAG 
GGTGCAGGTC 
GCAGGTAGAG 
AGTCACCAAT 
GACCACCCTG 
GTTTGCCTTG 
AACCTTCAAC 
CACCCGTTGT 
CACTGATGAA 
GCAGAGAACC 
GAAAAAGGTG 
CCTGGACAAC 
ATTTGCACGG 
AGACAGGGAA 
GCTGAAAGCC 
GGCTCATGAT 
GCCACCGTTT 
GTTTGTGCAG 
CTGCCCGCTG 
CCATGCACTC 
TAAAAGCAGG 
A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 207 Protein sequence 
Protein Accession ft-. NP 0S7445.1 



MRLWTPVGVL 
LPLBEQVEWN 
GMQQMFALGE 
PIXIHTDEAD 
DPFILLDNVA 
LESNLLKAMD 
ESKEWFVQLY 
E 



11 
I 

TSLAYCLHQR 
PQLLEVPPQT 
RLRKNYVEDI 
SEVLYPNYQS 
AEQAHNLPSC 
SATAPDKIRK 
YKGKEQVPRG 



21 

1 

RVALAEIiQEA 
QFDYTVTNIiA 
PPLSPTFNPQ 
CWSLRQRTRG 
PMLKRFARMI 
LYLYAAHDVT 
CPDGLCPLDM 



31 

I 

DGQCFVDRSL 
GGPKPYSPYD 
EVPIRSTNIP 
RRQTASLQPG 
EQRAVDTSIiY 
FIPLLMTLGI 
FLNAMSVYTL 



41 
I 

LKLKMVQWF 
SQYHETTLKG 
RKLESTRCLL 
ISEDLKKVKD 
ILPKEDRESL 
FDHKWPPFAV 
SPEKYHALCS 



51 
I 

RHGARSPLKP 
GMFAGQLTKV 
AGLFQCQKEG 
RMGIDSSDKV 
QMAVGPFLHI 
DLTMELYQHL 
QTQVMEVGNE 



Seq ID NO: 208 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 
I 

TTTGAGGGGG 
TAACAGCCTG 
TGCTGGGGGC 
TAGTGGCAGA 
TCTGGCTGCT 
GGAGCCCTCC 
CGTCTTTATT 
GCAGGTTCCT 
TGAGCTGGCA 



11 

! 

TGGTGGGGCG 
CCCTTGGAGA 
CAGTGTTAGG 
GCCTGGAGTG 
CTTGCATTTG 
GCAGATCAGT 
GGCAAATGGG 
CACTGCTCCT 
GCGCAG 



21 
I 

AGTTTAATTC 
GAAGTCCTTC 
CTTCAGGCCA 
ATGAGTGGGA 
CATTTGCCAC 
CCGCTCAGCT 
TCATTGGCTT 
TCACCAGTGT 



31 
I 

ATAAAGAAGC 
CTTGAGGATA 
TCCCTGGAGG 
TGGCCTTCTC 
TCAG AACTGC 
GCAGGTTTTT 
CCAAGGCAGT 
CCTGCGAGGT 



41 
1 

CTCCTGATCA 
AGGCCTCCCA 
CCAGTCCTGT 
AGGTACAGGA 
CGCGATCCCA 
CCAGTCATAG 
CAGGCCAACT 
CACCTTGGOG 



Seq ID NOt 209 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..S64 



60 
120 
180 
240 
300 
360 
420 



51 
I 

GAAAGGGGCC 
GGGGAGGAGG 
GCTCAGCAAG 
CTGTGCTGCT 
GCAATGGCCA 
TAGAAGGGAT 
GTGTGACTCT 
AGGGCTCACC 



60 
120 
180 
240 
300 
360 
420 
480 



ATGGAGCCCT 
TCAGATCCGT 
CAGAGTCACA 
GATCCCTTCT 
CTGGCCATTG 
AAGCAGCACA 
ACCTTGGCCA 
AGCTACACAT 
GATGCTGCTG 
GGAACGAAGG 



11 
I 

GGGCGTGGCT 
TCTCTGCGCT 
CAGTTGGCCT 
ACTATGACTG 
CTGGGATCGC 
GTCCTGTACC 
AATCAAATAA 
CAGTCTTCOG 
CCCTCACATC 
GCTGTACCTA 



21 
I 

GCAGGGTTTA 
GCCAGCTCAG 
GACTGCCTTG 
GAAAAACCTG 
GGCAGTTCTG 
TGAGAAGGCC 
ACCTTTATCT 
AGTGCCTCTT 
AGGCCATCCA 
ATGA 



31 
I 

AAGAGCCGAC 
GACACTGGTG 
GAAGCCAATG 
CAGCTGAGCG 
AGTGGCAAAT 
ATCCCACTCA 
CCAAGCACCT 
TCTGCGTCCC 
AGCATGCAGA 



41 
I 

CCACGTGCCC 
AAGGAGCAGT 
ACCCATTTGC 
GACTGATCTG 
GCAAATGCAA 
TCACTCCAGG 
TTGTCTTGGT 
TGTACCCTGC 
ACATAAGCAT 



51 
I 

AGCAGCCTCC 
GAGGAACCTG 
CAATAAAGAC 
CGGAGGGCTC 
GAGCAGCCAG 
CAGATTTCTC 
GTTTGGCATC 
CATTCCTGGT 
GCAGAACACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 210 Protein sequence 
Protein Accession fl: FGENESH predicted 



1 11 21 31 41 51 

1 I I I I I 

MEPWAWLQGL KSRPTCPAAS SDPFSALPAQ DTGEGAVRNL QSHTVGLTAL EANDPFANKD 
DPFYYDWKNL QLSGLICGGL LAIAGIAAVL SGKCKCKSSQ KQHSPVPEKA IPLITPGRFL 



60 
120 



1035 



WO 03/042661 



raVKSNKPLS PSTFVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQNISMQNT 180 
GTKGCT 

Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..318 

] " 21 31 41 51 

I I I | | | 

ATGCCCGGCC ACCCCGTCTG TGAAGTGAGG AGCACCTCTG CCCGGCTGCC CCGTCTGGGA 60 

AGTGAGGAGC GCCTCTGCCC GGCTGCCACC CCGTCTGTGA GTGCCTGCTG CGCTGGGCCC 120 

AGGCCGCCCG TGCCCTGCCA GGCCCTCCGG CCCCCCACCT TCCACCCCAG GGCCTGCTCC 180 

TCACCCCAGG QTTCCATCTC CCTAGTTTCC ACCAGAGACT GGGTCTTCAT TCTCACCCTG 240 

CTACACAGCC CCTACCAGAA CGTTCTGAAA TGCAAACCTA ACAACTGTCT CACCCCAGCA 300 

GGAAACTCCC CAGGGTCCCG GGCCCCCTGC GGGGTTGCAG GCCTCACTCT TCGCGCCCAT 3 SO 

CCCTCCGCCC TGACCGCCCT GAGCTOGCCC CCAGTGCTGG CCCTTCACGT CCAGTTATCC 420 

CTCCCAGCCT CCAAGGTCCC CGTTACOGAA GACCGCCACC ATCACGACAT AGCGCAGCAC 480 

ATATGGGACA CTGGTGAAGG AGCAGTGAGG AACCTGCAGA GTCACACAGT TGGCCTGACT 540 

GCCTTGGAAG CCAATGACCC ATTTGCCAAT AAAGACGATC CCTTCTACTA TGACTGGAAA 600 

AACCTGCAGC TGAGCGGACT GATCTGCGGA GGGCTCCTGG CCATTGCTCG GATCGCGGCA 660 

GTTCTGAGTG GCAAATGCAA ATGCAAGAGC AGCCAGAAGC AGCACAGTCC TGTACCTGAG 720 

AAGGCCATCC CACTCATCAC TCCAGGCAGA TTTCTCACCT TGGCCAAATC AAATAAACCT 780 

TTATCTCCAA GCACCTTTGT CTTGGTGTTT GGCATCAGCT ACACATCAGT CTTCCGAGTG 840 

CCTCTTTCTG CGTCCCTGTA CCCTGCCATT CCTGGTGATG CTGCTGCCCT CACATCAGGC 900 
CATCCAAGCA TGCAGAACAT AAGCATGCAG AACACTGGAA CGAAGGGCTG TACCTAA 

Seq ID NO: 212 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I f | I 

MPGHPVCEVR STSARLPRLG SEERLCPAAT PSVSACCAGP RPPVPCQALR PPTFHPRACS 60 

SPQGSISLVS TRDWVFILTL LHSPYQNVLK CKPNNCLTPA GNSPGSRAPC GVAGLTLRAH 120 

PSALTALSSP PVLALHVQLS LPASKVPVTE DRHHHDIAQH IWDTGEGAVR NLQSHTVGLT 180 

ALEANDPFAN KDDPFYYDWK NLQLSGLICG GLLAIAGIAA VLSGKCKCKS SQKQHSPVPE 240 

KAIPLITPGR FLTLAKSNKP LSPSTFVLVF GISYTSVFRV PLSASLYPAI PGDAAALTSG 300 
HPSMQNISMQ NTGTKGCT 

Seq ID NO: 213 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence : 1 . . 1758 

] « 21 31 41 51 

ATGATGGGGT CTCATGTTGC CCAGGCTGGT CTTGAACTCC TGGGCTCGAG TGACCCTCCT 60 

GCCTTGGCCT CCGAAAGTGC TGGGATTACA GGACTGTTAT TACAGGAATC CATAACACTG 120 

GAGGATGTGG CTGTGGACTT CACTTGGGAG GAGTGGCAAC TCCTGGGCGC TGCTCAGAAG 180 

GACCTGTACC GGGATGTGAT GTTGGAGAAC TACAGCAACC TGGTGGCAGT GGGGTATCAA 240 

GCCAGCAAAC CGGATGCACT CTTCAAGTTG GAACAAGGAG AACAACTGTG GACAATTGAA 300 

GATGGAATCC ACAGTGGAGC CTGTTCAGGT TCTCCAAAGG TCCCGTTCTC CATTTTCTCA 360 

TCTGTGCCCT TCACTCTTCA AAATTGCCTT CATTCTAACA TATGGAAAGT TGATCATGTG 420 

CTGGAGCGCT TGCAGAGTGA AAGCCTGGTG AACAGAAGGA AACCATGTCA TGAACATGAT 480 

GCATTTGAAA ATATTGTTCA TTGCAGCAAA AGTCAGTTTC TGTTAGGGCA AAATCATGAT 540 

ATATTTGACT TACGTGGAAA AAGTTTGAAA TCCAATTTAA CTTTAGTTAA CCAGAGCAAA 600 

GGCTATGAAA TAAAGAACTC TGTTGAGTTT ACTGGAAATG GGGACTCCTT TCTTCATGCT 660 

AACCATGAAC GACTTCATAC TGCAATTAAA TTCCCTGCAA GTCAAAAACT CATCAGCACT 720 

AAGTCCCAAT TCATCAGTCC CAAGCATCAG AAAACACGAA AATTAGAGAA GCATCATGTG 780 

TGCAGTGAAT GTGGGAAAGC CTTCATCAAG AAGTCTTGGC TAACTGATCA CCAGGTAATG 840 

CATACAGGAG AGAAACCCCA CAGATGTAGT CTATGTGAGA AAGCCTTCTC CAGAAAGTTC 900 

ATGCTTACTG AACATCAGCG AACTCATACA GGAGAAAAAC CTTATGAATG CCCTGAATGT 960 

GGCAAAGCCT TTCTCAAGAA ATCACGGCTC AACATACATC AGAAAACACA TACOGGAGAG 1020 

AAACCCTATA TATGCAGTGA ATGTGGAAAA GGCTTCATCC AGAAAGGAAA TCTCATTGTA 1080 

CACCAGCGAA TTCATACAGG TGAGAAACCT TATATATGCA ATGAATGTGG AAAAGGCTTC 1140 

ATTCAGAAGA CGTGTCTCAT AGCACATCAG AGATTTCACA CAGGAAAGAC GCCCTTTGTG 1200 

TGCAGTGAAT GTGGAAAATC CTGTTCTCAG AAATCAGGTC TCATTAAACA TCAAAGAATT 1260 

CACACAGGAG AGAAACCCTT TGAATGTAGT GAATGTGGGA AAGCCTTTAG CACAAAGCAA 1320 

AAGCTCATTG TCCATCAAAG GACTCATACA GGAGAGAGAC CCTATGGCTG TAACGAGTGT 1380 

GGGAAAGCGT TTGCGTATAT GTCGTGTCTG GTTAAGCATA AGAGAATACA CACAAGGGAG 1440 

AAACAAGAGG CAGCCAAGGT GGAAAATCCT CCTGCAGAGA GGCACAGCTC ATTACACACC 1500 

AGTGATGTCA TGCAGGAGAA AAACTCTGCT AAOGGGGCGA CTACACAAGT GCCTTCTGTG 1S60 

GCCCCTCAGA CATCATTAAA CATCAGCGGC CTCCTCGCAA ACAGGAACGT AGTCCTTGTG 1620 

GGACAGCCAG TGGTCAGATG TGCAGCCTCA GGAGATAACA GAGGATTTGC ACAGGACAGA 1680 

AACCTTGTGA ATGCAGTGAA TGTGGTTGTG CCTTCCGTGA TCAATTATGT CTTATTTTAT 1740 
GTTACAGAAA ACCCATAG 

Seq ID NOt 214 Protein sequence 
Protein Accession 8: FGENESH predicted 

) « 21 31 41 si 

MKGSHVAQAG LELLGSSDPP ALASESAGIT GLLLQESITL EDVAVDFTHB EWQLLGAAQK 60 

DLYRDVMLEN YSNLVAVGYQ ASKPDALFKL EQGEQLWTIE DGIHSGACSG SPKVPFSIFS 120 

SVPPTLQNCL HSNIWXVDHV LERLQSESLV NRRKPCHEHD AFENIVHCSK SQFLLGQNHD 180 

IFDLRGKSLK SNLTLVNQSK GYEIKNSVEF TGNGDSFLHA NHERLHTAIK FPASQKLIST 240 

1036 



WO 03/042661 



KSQFISPKHQ KTRKLEKHHV CSECGKAFIK KSWLTDHQVM HTGEKPHRCS LCEKAFSRKF 300 

MLTEHQRTHT GEKPYBCPEC GKAFLKKSRL NIHQKTHTGE KPYICSEOGK GPIQKGNLIV 360 

HQRIHTGEKP YICNECGKGF IQKTCLIAHQ RFHTGKTPFV CSEOGKSCSQ KSGLIKHQRI 420 

HTGEKFFECS ECGKAFSTKQ KLIVKQRTHT GERPYGCNEC GKAFAYMSCL VKHKRIHTRE 480 

KQEAAKVENP PAERHSSLHT SDVMQEKNSA NGATTQVPSV APQTSUIISG LLANRNWLV 540 
GQPWRCAAS GDNRGFAQDR NLVNAVNVW PSVINYVLFY VTENP 

Seq ID NO: 215 DNA sequence 

Nucleic Acid Accession #: NMJ>32190.1 

Coding sequence: 502.. 1332 

1 11 21 31 41 51 

I I 1 I I I 

GATTCCGTGT TCTTGQCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCTGTGCA 60 

GAGGCAAAAA CATATTGGGC ATATGTTCCC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

AGTGACACTC CTCCTAAGAT TTATCATGAT TAAGGAGCAT GGGCTCCAGG ACCCCTAACT 180 

CCACCTGACA TAGAACAGTT AGACTCTCAG AATAATGTCA TTAATTATAC CGCTCCATTG 240 

GAAGGACTTC CTTTGTGTGT CACCACAAAG ACATCACTCA GCCATAGCTC TCTTACAGTT 300 

CAAGCTCACA CATGGTTGAG TCACTA7GGG AAAATCATGT ACTTATTAAG TCTTGGTTAT 360 

ATTAATGTAA CCGGTGTGCT AACCAACCAT TCCTGGCCCA ATOGCCTTCA TTGTGCTGAC 420 

TATACAGAAT GGATTCCCTT CAATAGTTCC TACCCCCCTC CATAGACCGA GTGTCTTGGC 480 

CCACTGGCTA GAAAACAATC TATGTTAACT GGAGACATTG TGGATTGGGG ACCTAAAGGC 540 

CAATTAGATG GAAAAGAAGA AAATCAGAAA TCGTGGCACA AACTTTGCTG GCATTGGTGG 600 

CAAGCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCCGCCCAG 660 

ATTGCTTGGC ATGGAGCAGG CTTTAGCCOG CCTCTTCCTC AGTGGCATTA TCTAGGGAGG 720 

AAAGGACCAA TTCAAAAGAT GATATGGAAG GCAGCATTCC CATTTATGAA TGGCAACATC 780 

TGGGTTGCCA TAATACTATC CAATAATAGC AATAGTAAGC AACACAGTCT TAATGTTACA 840 

TTTGTAAAGA ATATCACCAC TCAATTTACA GTTTGTGTTT TTAATCCTTA TGTGTTTTTG 900 

GCAGCTAAGA AGGACCAGCT CCAGGTAAAC AATACCCAAT TGACCTGTAA ATCTTGCCAG 960 

TTATATCACT GCATTAATCA TAGCACATTG CAAACACATA ATATCTCTAC TTTGATGATT 1020 

TTAGGTTGCA TCCCTGGGCT ATGGATTCCT GTTAATCTGT CTGAGCCATG GGCTGCCACA 1080 

ATTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAGTTTA CTCATTGTGT CCGTAGAGGC 1140 

TTAGGCATGA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGATG 1200 

TCCTCTGTAG CTTTGCATAG TTCTATTCAA ACAGCTCAGT ATGTGGAGAA CTGGACACGC 1260 

ACAGTCAACC AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAGTG 1320 

GCAGTGTTAT AATCCACGAT TCTATGGTTA GGGGAACAAG TACAAAGCTT GCAATTGCAG 1380 

CAGTAATTGT GTTGTCATTT TAATCACACT CATATTTGTG TAACCAACTT AGAATATAAC 1440 

CAAAGTGAGT ATCCATGGGA TCTTGTGAAA GCCCATTTGC AGGGAGCTTT CACATCCGAC 1500 

ATCACCTTTG ATATTGGTGA ATTACAAAAC AAAATTCTTG ATTTAAATAA ACAAATTCCA 1560 

GAGTTTCAGC CTTCTTTAGA AGACTGGACT GAATTCCAGC AAGGCCTGGA GAGCGTCAAC 1620 

CCTTGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTG 1680 

TTTTGTCTCT GTCTTCTGTT CATAGTCTGT AAAATCGGAT GGACTGCCAA TCGGAGAATG 1740 

AAAGCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 

GTTGGGAGCC AAAAAGGCCA AAGGGATGGT GACCAACTCA GCATTCCACT GGAGGCTACA 1860 

TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG GGCAAACTCG CTTCTGTGCC 1920 

TGCCCAGAAG GTTTGCTGAG GGCCATOGCT CCCTGGCCCC GGCTCCTTGA GGTTATCTAC 1980 

TGGGACATCT AGAGCCTATT GTTCGAGGAA TGCAGTCTTG CAAGCCTACT CTGGACCGAG 2040 

CAGCTGACCT CTTCTTCCAC ACCCCTTCTC ACTATCTCTT TTGCCTAATA AATATGGAGG 2100 

GCTGTGTAAA GCTCAGGGCC CTTGTCCACT AGAGGCAAGG TGTCCCCTGA CCCTTCTTCC 2160 
AAACAT 

Seq ID NO: 216 Protein sequence 
Protein Accession 8: NP_115S66.1 

1 11 21 31 41 51 

I I t i I i 

MLTGDIVDWG PRGQLDGKBE NQKSWHKLCW RWWQAFNASS LYNTGIQSQS AAQIAWHGAG 60 

FSPPLPQWHY LGRKGPIQKM IWKAAFPFMN GNIWVAIILS NNSNSKQHSL NVTFVKNITT 120 

QFTVCVFNPY VPLAAKKDQL QVNNTQLTCK SCQLYHCTNH STLQTHNIST LMILGCIPGL 180 

WIPVNLSEPW AATIALHFVK LLLTQFTHCV RRGLGMIIFA IVYLVTLIIS WMSSVALHS 240 
SIQTAQYVEN WTRTVNQGWL LENKINTELQ TEVAVL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession flj FGENBSB predicted 

Coding sequence i 1..1566 

1 11 21 31 41 51 ■ 

lilt I I 

ATGGTGAACC CCAAATCCAC TTCCTCCCTC TTCAGGTTAT GTTTTTTGCT CCTGAGGAGT 60 

CAGAACCTGT GGGTTGAAGA GCAAATTCAA TGCAAAAACA TATTGGGCAT ATGTTCCCAA 120 

TCCCCCAGCA GTATGGCCTA TACTTTGGAG CTCACTCCTC CTGAGATTTA TCACGATCAG 180 

GGAGAGTGGG CTCCAGGACC CCTAACTCCC CGTGACATAG AAAAGTTAGA CTCTCAGAAC 240 

AATGTCATTA ATTATACCAC TCCACTGGAA GGACTCCCTT TGTTTATCAC CACAAAGACG 300 

TCGCTCAGCC ATAGCTGTCT TGCAATTCAA GCTCAAACAT GGTTGAGTCA CTATGGAAAA 360 

ATTATGTACT TATTAGGTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CAATCATTCC 420 

CAGTCCAGTC ACCCTAATTG TGCTGATTAT ACAGAATGGA TTCCATTCAA TAGTTCCTAC 4 B0 

CCCACTCTGT GGACCCAGTG TCTTGATCCA CTGGCTAGTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGGTCAA TTAGATGGAA AAGGTGAAAG TCAGAAATCA 600 

TGGCACAAAC TTCACTGGCA TTGGCGGCAA GCTTTTAATG CTTCTTCTTT ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCAGATT GCTTGGCATG GAGCAGGCTT TAGCCCACCT 720 

CTTCCTCAGT TGCATTATCT GGGGAGGAAA GGACCAATTC AAGAAACTAT ATGGAAGGCA 780 

GCACTCCCAT TTATGAATGG CAACATCTGG ATTGGAACAC TGTCTAATAA TAGCAATAGT 840 

AAGCAACACA GTCTTAATGT TGCATTTGTA AAGAATATCA CCACTCAGTT TACAGTTTGT 900 

GTTTTTAATC CTTATGCCTT TTTGGCAGCT AAGAAGAACC AGCTTCAGGT GGAGAACTGG 960 



1037 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



ACACGCACAG 
GAAGTGGCAA 
TTGCAGCAGC 
TATAACCAAA 
TCCAACATCA 
ACTCAAGAAT 
CTCAACCCTT 
ATGTTGTTTT 
AGAATGAGAQ 
GGATATGCAG 
ATATGA 



CTGACCAAGC 
TGTTGAAATC 
AATTGCGTCA 
GTGAGTATCC 
CCTTTGATAT 
TTCAGCCTTC 
GGACCTATCT 
GTCTCTGTCT 
CCTCCCAGCC 
GGAGCCAAAG 



GAGGCTACTT 
CATGGTTCTG 
TCATTTTAAT 
GTGGGACCTT 
TGGTGAATTA 
TTTAGAAGAC 
AAGGCACCAC 
TCGGTTTATA 
CAGCCTTACA 
GCCTGTGGGA 



CAGAATAAAA 
TGGTTAGGAG 
CACATTCATA 
GTGAAAGCCC 
CAAAACAAAA 
TGGACCGAAT 
ATTAACATCT 
GTCTGTAAAA 
TTCTTTCAAT 
CGTGACCAAC 



TTAACACTGA 
AACAGGTACA 
TTTGCGTAAC 
ATTTGCAAGG 
TTATTGATTT 
TCCAGGAAGG 
TATATGTAGT 
TCGGATGGAC 
TAATACATAA 
TCAGCATTCT 



GTTACAAACT 
AAGCTTGCAG 
TAACTCAGAA 
AGCTTTCACA 
AAATAGGCAA 
CCTGGAGAGC 
TCTTGGAATA 
CACCAATTGG 
ACAGAAAGGG 
GCTGGAGGCT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



Seq ID NO: 218 Protein sequence 
Protein Accession fl: FGENBSH predicted 



MVNPKSTSSL 
GEWAPGPLTP 
IMYLLGLGSI 
DTVDWEPKGQ 
LPQLHYLGRK 
VFNPYAFLAA 
LQQQLRHHFN 
TQEFQPSLED 
RMRASQPSLT 



11 
I 

FRLCFLLLRS 
RDIEKLDSQN 
NVTGVLTNHS 
LDGKGESQKS 
GPIQETIWKA 
KKNQLQVENW 
HIHICVTNSE 
WTEFQEGLES 
FFQLIHKOKG 



21 
I 

QNLWVEEQIQ 
NVINYTTPLE 
QSSHPNCADY 
WHKLHWHWRQ 
ALPFMNGNIW 
TRTADQARLL 
YNQSEYPWDL 
LNPWTYLRHH 
GYAGSQRPVG 



31 
I 

CKNILGICSQ 
GLPLFITTKT 
TEWIPFNSSY 
AFNASSLYNS 
IGTLSNNSNS 
QNKINTELQT 
VKAHLQGAFT 
INILYWLGI 
RDQLSILLEA 



41 

I 

SPSSMAYTLE 
SLSHSCLAIQ 
PTLWTQCLDP 
RIQSQSAAQI 
KQHSLNVAFV 
EVAMLKSMVL 
SNITFDIGEL 
MLFCLCLRFI 
I 



51 
1 

LTPPEXYHDQ 
AQTWLSHYGK 
LASKQYMSTE 
AWHGAGFSPP 
KNITTQFTVC 
WLGEQVQSLQ 
QNKIIDLNRQ 
VCKIGWTTNW 



Seq ID NO: 219 DNA sequence 

Nucleic Acid Accession #: FGENBSH predicted 

Coding sequence: 1..900 



ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
GGCTGCGGCG 
GGACCCGGGG 
CTTCCTAACT 
CCGGTGCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 
GACCAGGGGG 
ATCCGAGCCT 
TTTGACTCGG 
GCCAAAGGTG 



11 

I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTGCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCGA 
TCACAGAGAA 
TGGGCACCCA 
GCCGGGAGGG 
GCTGCTACCT 
CTATTCTCAG 
TGCGCACCCT 



21 
I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT 
GAGCAGGCCC 
CCGCCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTO 
CCCCGTGCCC 
TGAGTGCTCA 
TGCCATTGAG 
CTCCCGCTGC 



31 
1 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
ACGTACGTTC 
GGGGCGGGCG 
CGAGGTGGCG 
CCTGCAGTGC 
GGACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 
CAACCCCAGG 
GCCTTGACGC 
CACAAAGCCC 
CGCTGGAAGA 



41 

I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
AATCGCCCGT 
TCTOGGCGGG 
CTGGTGOGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 
CTCAGGGTCT 
AGAAGAACTT 
GGCTGGAGAA 
AGTTCTTCTG 



1 
I 

ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGGATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCOGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGGGCACCC 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTTCCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 




CGAGATCCGC 
GAGGGACGAT 
CCAACCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 



31 
I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
CAAGTCCTGG 
GATTTTGACC 
AGCGTGGTGC 
ACGCACAACC 
GTCAACGTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAGTTCTTCT 



41 

t 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGGCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTGGTTTG 



60 
120 
180 
240 
300 
360 
420 
480 



51 

i 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
GCGGCCGCGT 
AGGGCGCAGA 
CCAGGACGCT 
GGATGGAGCT 
ACTTCGTTCC 
GCCCAGCTCC 
CCAGGCGCCT 
AATTCAGCTG 
GGCCGAGAAG 
GAAGGAAGTA 
GAAACTGAAT 
CTTCGTTTGA 



Seq ID NO: 220 Protein sequence 
Protein Accession # : FGENESH predicted 

1 11 21 31 41 51 

I I I I 1 I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 
LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSWQPSS 
FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 
IRACCYLECS ALTQKNLKEV FDSAILSAIE HKARLEKKLN AKGVRTLSRC RWKKFFCFV 

Seq ID NO: 221 DNA sequence 

Nucleic Acid Accession #: XM_063832.2 

Coding sequence: 1..711 



51 

1 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTCG 
TGCCAAAGGT 
A 



Seq ID NOs 222 Protein sequence 
Protein Accession #: XP_063832.i 



11 
I 



21 
1 



31 

i 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



1038 



WO 03/042661 



PCT/US02/36810 



MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVQKSSLIVS YTCNGYPARY 60 

RPTALDTFSV QVLVDGAPVR IBLWDTAGQE DPDRLRSLCY PDTDVFLACF SWQPSSFQN 120 

ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLABKIRA 180 
^ CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KPFCFV 

Seq ID NO: 223 DNA sequence 

Nucleic Acid Accession #; Eos sequence 

Coding sequence : 1 . . 1 161 

10 1 11 21 31 41 51 

I I 1 I I I 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCGGGTGT TGGGGCTGGA GTTTATCTTT 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

ID TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGOGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTO CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TOGGTGGACC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 

ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 



20 
25 
30 
35 
40 
45 



Seq ID NO: 224 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I 1 I I I I 

MNRHHLQDHF LEIDKXNCCV FRDDPIAKVL PPVLGLEPIF GLLGNGLALW IFCFHLKSWK 60 

SSRIFLPNIiA VADFLLIICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYPRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

SICHTPRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 240 

VICFLPSWV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO: 225 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..1092 

50 1 11 21 31 41 51 

I I I I ! I 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGT CAAGGTGTTG CCGCGGGTGT TGGGGCTGGA GTTTATCTTC 120 

cc GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

*>D TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTGGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCCTTCCTGA TGGACAACTA TGTGAGGCGT TGGGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGATGC TCTTCATGTT GGCTATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

sry. AATCGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTATTGG CCTGACAGTC 480 

OU CACCTCCTGA AGAAGAAGAT GCCGATCCAG AATGGCGGTG CAAATTTGTG CAGCAGCTTC 540 

AGCATCTGCC ATACCTTCCA GTGGCACGAA GCCATGTTCC TCCTGGAGTT CTTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

- GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

Oj TCGGGCACGC AGAATTGTGA AGTGTACCGC TOGGTGGACC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 



70 
75 
80 



Seq ID NO: 226 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 SI 

I I I I I I 

MNRHHLQDHF LBIDKKNCCV FRDDFIVKVL PPVLGLEPIF GLLGNGLALW IFCFHLKSWK 60 
SSRIFLFNLA VADFLLIICL PFLMDNYVRR WDWKFGDIPC RLMLFMLAMN RQGSIIFLTV 120 
VA VDRY FRW HPHHALNKIS NRTAAIISCL LWGITIGLTV HLLKKKKPIQ NGGANLCSSF 180 
SICHTFQWHE AMFLLEFFLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWATVP 240 
VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 
PPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSP 



1039 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Seq ID NO: 227 DMA sequence 
Nucleic Acid Accession fl: NM_006018 
Coding sequence: 61.. 1224 

1 11 21 31 41 51 

I 1 I I I I 

CGCCACTTTG CTGGAGCATT CACTAGGCGA GGCGCTCCAT CGGACTCACT AGCCGCACTC 60 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGT6TG 120 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 180 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 240 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 300 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 360 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 420 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 480 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 540 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 600 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 660 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 720 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 780 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 840 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 900 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 960 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 1020 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1080 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1140 

ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1200 

CAGTTGGGCT GTTGCATCGA GTAATGTCAC TGGACTCGGC CTAAGGTTTC CTGGAACTTC 1260 

CAGATTCAGA GAATCTGATT TAGGGAAACT GTGGCAGATG AGTGGGAGAC TGGTTGCAAG 1320 

GTGTGACCAC AGGAATCCTG GAGGAACAGA GAGTAAAGCT TCTAGGCATC TGAAACTTGC 1380 

TTCATCTCTG ACGCTCGCAG GACTGAAGAT GGGCAAATTG TAGGCGTTTC TGCTGAGCAG 1440 

AGTTGGAGCC AGAGATCTAC TTGTGACTTG TTGGCCTTCT TCCCACATCT GCCTCAGACT 1500 

GGGGGGGGCT CAGCTCCTCG GGTGATATCT AGCCTGCTTG TGAGCTCTAG CAGGGATAAG 1560 

GAGAGCTOAG ATTGGAGGGA ATTGTGTTGC TCCTGGAGGA AGCCCAGGCA TCATTAAACA 1620 

AGCCAGTAGG TCACCTGGCT TCGGTGGACC AATTCATCTT TCAGACAAGC TTTAGAGAAA 1680 

TGGACTCAGG GAAGAGACTC ACATGCTTTG GTTAGTATCT GTGTTTCCGG TGGGTGTAAT 1740 

AGGGGATTAG CCCCAGAAGG GACTGAGCTA AACAGTGTTA TTATGGGAAA GGAAATGGCA 1800 

TTGCTGCTTT CAACCAGCGA CTAATGCAAT CCATTCCTCT CTTGTTTATA GTAATCTAAG 1860 

GGTTGAGCAG TTAAAACGGC TTCAGGATAG AAAGCTGTTT CCCACCTGTT TCGTTTTACC 1920 

ATTAAAAGGG AAACGTGCCT CTGCCCCACG GGTAGAGGGG GTGCACGTTC CTCCTGGTTC 1980 

CTTCGCTTGT GTTTCTGTAC TTACCAAAAA TCTACCACTT CAATAAATTT TGATAGGAGA 2040 
CAAAAAAAAA A 

Seq ID NO» 228 Protein sequence 
Protein Accession *: NP_006009.1 

1 11 21 31 41 51 

I I I I 1 I 

MNRHHLQDHF LEIDKKNCCV FRDDPIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 60 

SSRIFLFNLA VADFLLIICL PPVMDYYVRR SDWNPGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYPRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TPIMWAIVF 240 

VICPIiPSWV RIRIPWLLHT SGTQNCBVYR SVDLAPPITL SPTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIK RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO: 229 DNA sequence 

Nucleic Acid Accession #: NM_014398.1 

Coding sequence: 64.. 13 14 

1 11 21 31 41 51 

I I I I I I 

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60 

ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTG 120 

CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240 

CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300 

ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC CACCAGCCCA • 360 

ATTACCTACA CCCTGOTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480 

CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540 

ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT GCACAAAAGC 600 

ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720 

CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA 840 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTQTGGCAC CCGAAAATCC 900 

AACCTTCTGT TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA GGATGAAGAA 960 

TCATATTATA TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020 

CAAGGAATCA AACATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080 

GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140 

CTTCAAGCCT TTGATTTTGA AGATGACCAC TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200 

TACACAATTG TGCTTCCTGT GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320 

CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380 

1040 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 
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80 



TTGGGAAATT 
AATGAAGTGA 
GTTTATTTTA 
GCCACTCAAA 
AGCCTTCAAA 
ATTTTATTTT 
GCTTTTACTA 
ACTCCTTTTC 
CAGGCTGGAG 
TGATTCTCCT 
GCTAATTTTT 
CTCTTGACCT 
AGCCATTGCG 
GTTGTCTAAG 
CTTGATGACT 
CTAAACAATA 
TAGGCTAAGC 
ACTGAGACTT 
GAGCTTGAAT 
CCTACAAGAA 
TCACCTTACA 
AGCTTTGCAG 
TGAGGGGCTT 
GTAAAGATGA 
ACTTCGCTAA 
TTCTGCACTT 
GAATTTTATT 
AGAAAAGTCC 
CCATGTTGAC 
TAGTCTTAAT 



CCCTCAGAGT 
GTCATGTGTG 
TGAAAGATAT 
GTCAACATTT 
TTATAAACCA 
ACCCTTGATC 
TCTGTGTTTT 
CACTTTAAAT 
TACAGTGGCA 
GCTTCAGCTT 
GTATTTTTAT 
CAGGTGATCC 
CCOGGCCTTA 
TGTTTTTATO 
CCTGCTCCAQ 
AGCAAGAGAC 
ACTTTATCTA 
AAGGGAACTG 
TCATGTTGGT 
CAATGACACC 
GGGAAATGGG 
ATAACAAAAT 
TGTAAAACAT 
AGGCATCAAA 
CCAACTGTTC 
CATATCCATA 
TCTGCTGTTT 
ACATAACCCT 
TTTCCTCATG 
AAAACATTGA 



GTGGGTCCTT 
ATTTAAGTTC 
AGTGAGCTGT 
GAGATATGTT 
AGGGTCAATT 
TTAACAAAGC 
ATGGTTTCAT 
TTGTTTTTGT 
CGATCTCGGC 
CCCGAGTAGC 
TATAGACGGG 
ACCCACCTCA 
AATGTTTTTT 
TAAAACCAAC 
AATTGCTAGA 
AATAATAATG 
TATCTCATTT 
AATCACTTAA 
CTGACATCAA 
ACACTCTGCC 
TTTATCCAGG 
AGCCTATCCT 
TAGTCAGTTG 
TAAACTCAAA 
TTTCTTGAGT 
TTTCCTATTG 
TTTTTGCTGC 
AGAATTCTTA 
TGTTTCCTTA 
ATTGTAGTAA 



CAAACAATGT 
AGGCAGCACA 
TTATTTTCTA 
GAATTAACAT 
GTAACTAATA 
CTTTGCTTTG 
GTAACATACA 
TTTTTGAGAC 
TTATGGCAAC 
TGGGATTACA 
TTTCACCATG 
GCCTCCCAAA 
TTAATCATCA 
AAAAAGAACA 
CTAAGAATTA 
GCCCTTAATT 
CATTCTCACA 
ATGTCACCTG 
GGTCTTTGGT 
TGAAGGCTCA 
ATCATGAGAC 
TAATAAATCC 
CTCATTTTTA 
GTATTTTTAA 
GTATAGCCCC 
TTCACTTTAT 
TAAAGAAAGG 
GTCAAGGAAT 
TGACTCAGTA 
AGGTTTTTGC 



AAACCACCAT 
TCAATTTCTA 
GTTTCCTTTA 
AATATATGTA 
CTACTGTGTG 
TTATCAAATG 
TATTCCTGGT 
GGAGTTTCAC 
CTCCGCCTCC 
GGCACACACT 
TTGGCCAGAC 
GTGCTGGGAT 
AAAAGAACAA 
AATCAGCTTA 
GGTGGCTACA 
ATTAACAAAG 
ACTTATAAGT 
GCTAACTGAT 
CTTCTCCCTA 
CACCTCATAC 
ATTAGGGTAG 
TCCACTCTCT 
TGGGATTGCT 
ATTTTTTTGA 
ATCTTGTGGT 
TCTGTAGAGC 
AACTAAGTCA 
AATTCAAGTC 
AGTTGGCAAG 
AATAAAAACT 



CTTCTATTCA 
AATACTTTTT 
GAATATTTTA 
AAGTAGAATA 
TGCATTGAAG 
GACTTTCAGT 
GTAGCACTTA 
TCTTGTCACC 
CGGGTTCAAG 
ACCACGCCTG 
TGGTCTTGAA 
TACAGGCATG 
CATATCTCAG 
TATTTTTTAT 
GATGGTAGAA 
TGCCAGAGTC 
GAATGAGTAA 
GGCAGAGCCA 
CACCAAGTTA 
CAGCATACGC 
ATGAAAGGAG 
GGAAGGAGAC 
TAGCTGGGCT 
TAATAGAGAA 
AACTTGCTGC 
AGCCTGCCAA 
GGATGTTAAC 
AGCCTAGAGA 
GTCCTGACTT 
TACTTTGG 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



Seq ID NO: 230 Protein sequence 
Protein Accession #: NP 055213.1 



1 
I 

MPRQLSAAAA 
HQTLAARFMD 
TEVTVGPSLA 
TGQKPDQPTH 
BMGIQLIVQD 
YYISEVGAYL 



11 

i 

LFASLAVILH 
GHITFQTAAT 
PYSLPPTITP 
APGTTAAAEN 
KESVPSPRRY 
TVSDPETVYQ 
GNVDECSSDY 



21 
I 

DGSQMRAKAF 
VKIPTTTPAT 
PAHTAGTSSS 
TTRTAAPAST 
PNIDPNATQA 
GIKHAWMFQ 
TIVLPVIGAI 



31 
I 

PETRDYSQPT 
TKNTATTSPI 
TVSHTTGNTT 
VPGPTLAPQP 
SGNCGTRKSN 
TAVGHSPKCV 
WGLCLMGMG 



41 
I 

AAATVQDIKK 
TYTLVTTQAT 
QPSNQTTLPA 
SSVKTGIYQV 
LLLNFQGGPV 
SEQSLQLSAH 
VYKIRliRCQS 



51 
I 

PVQQPAKQAP 
PNNSHTAPPV 
TLSIALHKST 
LNGSRLCIKA 
NLTFTKDEES 
LQVKTTDVQIi 
SGYQRI 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession gt NM_005409.3 

Coding sequence: 94.. 378 



1 11 21 

! I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT 
CAACAGCACC AGCAGCAACA GCAAAAAACA 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA 
ATAATGTACC CAAGTAACAA CTGTGACAAA 
AAAGGACAAC GATGCCTAAA TCCCAAATCG 
GAAAGAAAGA ATTTTT AAAA ATATCAAAAC 
AACCTAGAAC AAGTTTAACT GTGACTACTG 
AGACTTTTCT ATGGTTTTGT GACTTTCAAC 
GGGTGAAAGG ACCAAAAACA GAAATACAGT 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA 
GGTTACCATC GGAGTTTACA AAGTGCTTTC 
GCATTTCTAG GCTAGAGAAC CTTCTAGATT 
GAGAACATTT CTGTCTCTAG AAGTTATCTG 
CTGTGGTTAC AGTGGAGACA TTGACATTAT 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC 
CCAAATATCA TGTAGCACAT CAATATGTAG 
TTTATAACCA ATTCATTAAA TGTAATTCAT 
TGGGATACTG GCAACAGTGC ACATATTTCA 
GATGTTTTTC AACTTTTATT CATTGAGATG 
TGTACTTTTT GTTTTGATCC GTTTGTATAA 
TACAAAATGT TTTTGTCTAC CAAAGAAAAA 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC 
TTGTTCATGC CTATATACTG TAAAATTTAG 
GTCATTTTTT TCTCTAATAA ACTACCACAA 



60 
120 
180 
240 
300 
360 



31 41 51 

I I I 

CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 
AACATGAGTG TGAAGGGCAT GGCTATAGCC 
CAAGGCTTCC CCATGTTCAA AAGAGGACGC 
GTGAAAGTGG CAGATATTGA GAAAGCCTCC 
ATAGAAGTGA TTATTACCCT GAAAGAAAAT 
AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 
ATATGAAGTC CTGGAAAAGG GCATCTGAAA 
AAATGACAAG AATTCTACAG TAGGAAACTG 
TTTTGTACAG TTATGTGAAG GATGAAAGGT 
CTTCCTGAAT GAATGACAAT CAGAATTCCA 
TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 
ACGTTCTTAC TTGTTGTATT ATACATTCAT 
TGATGCTTAC AACTATTCTG TTGTGACTAT 
TCTGTATTGA TCTTTATGCT ATATTACTAT 
TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 
ATTTTTTCAT GCAAATACAC ACTTCTTTCC 
GGAAACATTC TTATGCATCA TTTGGTTTGT 
AAAATGTACT ATGAAAAAAA TTATACGCTA 
TAAOCAAATT AGCAGCACCG GTCTTAATTT 
TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 
ATGATAGCAA TATCTTGGAC ACATTTGAAA 
TGTTGAAAAA TAAGCAAATG TATACCTAGC 
TTAGAAAAAT ACATAATCTA ATCAATTTCT 
GTATACTCAA GACTAGTTTA AAGAATCAAA 
CCTTTCTTTT TTAAAAAAAA AAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO i 232 Protein sequence 
Protein Accession 8: NP_005400.l 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIBKASI MYPSNNCDKI 
BVIITLKENK GQRCLNPKSK QARLIIKKVE RKNP 



60 



1041 



WO 03/042661 



PCT/US02/36810 



Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: NM_000577.i 

Coding sequence: 41.. 520 

5 1 11 21 31 41 51 

1 I I 1 I I 

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60 

CCGACCCTCT GOGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 

A GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

1U CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240 

CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480 

1 J OGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600 

GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660 

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCT6C CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGGCCTT CCATGTCGCC 780 

2U TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT VSTVLTl ' m CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 1080 

2j ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC aagctccatc TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380 

D\J TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 

D J AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 234 Protein sequence 
Protein Accession #i NP_000568.1 

40 i 11 21 31 41 51 

I I 1 I I I 

MALETICRPS GRKSSKMQAF RIWDVNQKTP YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 

HALFLGIHGG KMCLSCVKSG DETRLQIiEAV NITDLSENRK ODKRFAFIRS DSGPTTSPES 120 
- _ AACPGWPIjCT AMEADQPVSL tnmpdegvmv tkfypqbde 

45 

Seq ID NO: 235 DNA sequence 

Nucleic Acid Accession 8» NM_001840-1 

Coding sequence: 149.. 1567 

50 i 11 21 31 41 51 

1 I I I 1 I 

GGGGACTACG GAGAGCTCTG CAGGGAGCCG AGGCCCCCGC CCGGGCCAAG GGAGCTTCTG 60 

TCCCGAGGAC CAGGGGATGC GAAGGGATTG CCCCCTGTGG GTCACTTTCT CAGTCATTTT 120 

GAGCTCAGCC TAATCAAAGA CTGAGGTTAT GAAGTCGATC CTAGATGGCC TTGCAGATAC 180 

!>J CACCTTOCGC ACCATCACCA CTGACCTCCT GTACGTGGGC TCAAATGACA TTCAGTACGA 240 

AGACATCAAA GGTGACATGG CATCCAAATT AGGGTACTTC CCACAGAAAT TCCCTTTAAC 300 

TTCCTTTAGG GGAAGTCCCT TCCAAGAGAA GATGACTGCG GGAGACAACC CCCAGCTAGT 360 

CCCAGCAGAC CAGGTGAACA TTACAGAATT TTACAACAAG TCTCTCTCGT CCTTCAAGGA 420 

, A GAATGAGGAG AACATCCAGT GTGGGGAGAA CTTCATGGAC ATAGAGTGTT TCATGGTCCT 480 

OU GAACCCCAGC CAGCAGCTGG CCATTGCAGT CCTGTCCCTC ACGCTGGGCA CCTTCACGGT 540 

CCTGGAGAAC CTCCTGGTGC TGTGCGTCAT CCTCCACTCC OGCAGCCTCC GCTGCAGGCC 600 

TTCCTACCAC TTCATCGGCA GCCTGGCGGT GGCAGACCTC CTGGGGAGTG TCATTTTTGT 660 

CTACAGCTTC ATTGACTTCC ACGTGTTCCA CCGCAAAGAT AGCCGCAACG TGTTTCTGTT 720 

c CAAACTGGGT GGGGTCACGG CCTCCTTCAC TGCCTCCGTG GGCAGCCTGT TCCTCACAGC 780 

05 CATCGACAGG TACATATCCA TTCACAGGCC CCTGGCCTAT AAGAGGATTG TCACCAGGCC 840 

CAAGGCCGTG GTGGCGTTTT GCCTGATGTG GACGATAGCC ATTGTGATCG CCGTGCTGCC 900 

TCTCCTGGGC TGGAACTGCG AGAAACTGCA ATCTGTTTGC TCAGACATTT TCCCACACAT 960 

TGATGAAACC TACCTGATGT TCTGGATCGG GGTCACCAGC GTACTGCTTC TGTTCATCGT 1020 

GTATGCGTAC ATGTATATTC TCTGGAAGGC TCACAGCCAC GCCGTCCGCA TGATTCAGCG 1080 

/U TGGCACCCAG AAGAGCATCA TCATCCACAC GTCTGAGGAT GGGAAGGTAC AGGTGACCCG 1140 

GCCAGACCAA GCCCGCATGG ACATTAGGTT AGCCAAGACC CTGGTCCTGA TCCTGGTGGT 1200 

GTTGATCATC TGCTGGGGCC CTCTGCTTGC AATCATGGTG TATGATGTCT TTGGGAAGAT 1260 

GAACAAGCTC ATTAAGACGG TGTTTGCATT CTGCAGTATG CTCTGCCTGC TGAACTCCAC 1320 

CGTGAACCCC ATCATCTATG CTCTGAGGAG TAAGGACCTG CGACACGCTT TCCGGAGCAT 1380 

lO GTTTCCCTCT TGTGAAGGCA CTGCGCAGCC TCTGGATAAC AGCATGGGGG ACTCGGACTG 1440 

CCTGCACAAA CAGGCAAACA ATGCAGCCAG TGTTCACAGG GCCGCAGAAA GCTGCATCAA 1500 

GAGCACGGTC AAGATTGCCA AGGTAACCAT GTCTGTGTCC ACAGACACGT CTGCCGAGGC 1560 

TCTGTGAGCC TGATGCCTCC CTGGCAGCAC AGGAAAAGAA TTTTTTTTTT TAAGCTCAAA 1620 

q~ ATCTAGAAGA GTCTATTGTC TCCTTGGTTA TATTTTTTTA ACTTTACCAT GCTCAATGAA 1680 

oU AAGGTGATTG CCACATGTCA CTTATTTGCT TAGTTTCCGT TTGGGCTAAT CTTCCGGGGT 1740 
TCGTAGGAAA CCTTT 



Seq ID NO: 236 Protein sequence 
Protein Accession §: NP 001831.1 



1042 



WO 03/042661 



1 H 21 31 41 51 

1 I I I I I 

MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYFPQKFPL TSFRGSPFQE 60 

KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA 120 

VLSLTLGTFT VLENLLVLCV I LHSRSLRCR PSYHFIGSLA VADLLGSVIP VYSFIDPHVP 180 

HRKDSRNVFL FKLGGVTASF TASVGSLFLT AIDRYISIHR PLAYKRIVTR PKAWAFCLM 240 

WTIAIVIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 300 

AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDXR LAKTLVLILV VLIICWGPLL 360 

AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYAiiR SKDLRHAFRS MFPSCEGTAQ 420 
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL 



Seq ID NO: 237 DNA sequence 

Nucleic Acid Accession 8s NMJU6083.2 

Coding sequence: 64.. 1482 



1 11 21 31 41 SI 

I I I I I I 

GATTGCCCCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC AAAGACTGAG 60 

GTTATGAAGT CGATCCTAGA TGGCCTTGCA GATACCACCT TCCGCACCAT CACCACTGAC 120 

CTCCTGTACG TGGGCTCAAA TGACATTCAG TAOGAAGACA TCAAAGGTGA CATGGCATCC 180 

AAATTAGGGT ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGGGAAG TCCCTTCCAA 240 

GAGAAGATGA CTGCGGGAGA CAACCCCCAG CTAGTCCCAG CAGACCAGGT GAACATTACA 300 

GAATTTTACA ACAAGTCTCT CTCGTCCTTC AAGGAGAATG AGGAGAACAT OCAGTGTGGG 360 

GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA GCTGGCCATT 420 

GCAGTCCTGT CCCTCACGCT GGGCACCTTC ACGGTCCTGG AGAACCTCCT GGTGCTGTGC 480 

GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACTTCAT CGGCAGCCTG 540 

GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCTTCATTGA CTTCCAOGTG 600 

TTCCACCGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT CACGGCCTCC 660 

CTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATOG ACAGGTACGT ATCCATTCAC 720 

AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC GTTTTGCCTG 780 

ATGTGGACCA TAGCCATTGC GATCGCCGTG CTGCCTCTCC TGGGCTGGAA CTGCGAGAAA 840 

CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT GATGTTCTGG 900 

ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG CGTACATGTA TATTCTCTGG 960 

AAGGCTCACA GCCACGCCGT COGCATGATT CAGCGTGGCA CCCAGAAGAG CATCATCATC 1020 

CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG CATGGACATT 1080 

AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG GGGCCCTTTG 1140 

CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA GACGGTGTTT 1200 

GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT CTATGCTCTG 1260 

AGGAGTAAGG ACCTGCGACA CGCTTTCOGG AGCATGTTTC CCTCTTGTGA AGGCACTGOG 1320 

CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC AAACAATGCA 1380 

GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA OGGTCAAGAT TGCCAAGGTA 1440 

ACCATGTCTG TGTCCACAGA CAOGTCTGCC GAGGCTCTGT GAGCCTGATG CCTCCCTGGC 1500 

AGCACAGGAA AAGAATTTTT TTTTTTAAGC TCAAAATCTA GAAGAGTCTA TTGTCTCCTT 1560 

GGTTATATTT TTTTAACTTT ACCATGCTCA ATGAAAAGGT GATTGTCACC ATGATCACTT 1620 

ATCAGTTTGC TAATGTTTCC ATAGTTTAGG TACTCAAACT CCATTCTCCA GGGGTTTACA 1680 

GTGAAGAAAG CCTGTTGTTT AAGTGACTGA ACGATCCTTC AAAGTCTCAA TGAAATAGGA 1740 

GGGAAACCTT TGGCTACACA ATTGGAAGTC TAAGAACCCA TGGAAAAATG CCATCAAATG 1800 

AATAATGCCT TTGTAACCAC AACTTTCACT ATAATGTGAA ATGTAACTGT CCGTAGTATC 1860 

AGAGATGTCC ATTTTTACAA GTTATAGTAC TAGAGATATT TTGTAAAATG TATTATGTCC 1920 

TGTGAGATGT GTATCAGTGT TTATGTGCTA TTAATATTTG TTTAGTTCAG CAAAACTGAA 1980 

AGGTAGACTT TTATGAGAAC AATGGACAAG CAGTGGATAC GTGTCAATGT GTGCACTTTT 2040 

TTTCTATATT ATTGCCCATG ATATAACTTT AGAAATAAAC CTTAATATTT CTTCAAATAT 2100 

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTGTT ACCTCAACAA 2160 

GAAGAATTTG AAGACTTCAA AATATTGAGC AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACTA TAGCTGTTCT AATGGATTAT 2280 

AATC AGAA TG GAAGAGAGAA AGCATATTGA CTTTTTTTGA GCGACATCTC TGACTTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAAG ACAGCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TCCTATATTC CAGGTTTAAG 2460 

TAGATTTCAT GCCTGGGTGG CCAAACAACA GTCTTCATTT TTTTTAATTG AAAAGAAGTA 2520 

GTGTCTGGAT CAGTAAAATT ATACTGTGTG TGAGTGTGAA TATAAATGTG TGTATGTGTG 2580 

TTTCTGTCCQ TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTGAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTGCACATG GATTCAGTTT CTAAAATTGA GTTCTTCCTG 2700 

TAATCTTGTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGAGATTGG ATCAAGTATT ACTAAATTGA CCTTTAGGTA TTACACAAGA CCAGTGCTTA 2820 

GCAAAAAATA ATGACAGGCA TCCAAGGAAG GGATGTATTT GTAGTGTTAT TGCCAGGAAA 2880 

GGAGAGTACT TTGGTTTCTG AGCACCGAAT ATTGAGCAAT ATGTCAGTCA CTAAAAGGAA 2940 

GACAGTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GCGTGTGTAG ATAGTATGCA 3000 

CTATATACAT CACGTTAAAG TAGGACTATC ACACCCAGCC CATGTGGCTA AAAAAGCTGA 3060 

ATCAGACAGT GGATGAGACA CACAACGGCA GTGAAGAACC GATACACTTG GCATTGACGT 3120 

CTAGCTATGC TGTATCTGTG CTTTGCCCAC ATGCCCTTGG TGACAGCTGA GCACCCAGCT 3180 

CTGTCTTGGT AGGTTTGGGC TAAGGAACAA ATCTCTCCTT TGCTCGTGGT TAGCAAGATA 3240 

CACTCAAGCA TGAAGATAAA CACAGCTGCT TTCTTCTTAC ACCCCGGTCT CATGCTCCTT 3300 

AATGGCGCCA TGGGTGCTTG TTGGGCCTTT TTCCAGTAAG GAATGATATT GCTGAAGAAT 3360 

CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA TACAGATAAA ACATGACTCC 3420 

TACAAGGCCC CAAGGTTTAC ATAGTCTGAA GTGAAGTACA GAGCTGGCAT CTATCTGGTG 3480 

ATTTCTAGCT CTCGAGATAC CCAAGCAGCC TGATGGGGCA GTTCCCCTTC TTACGGTTCA 3540 

OGCTCTAAGG CAGGATGTGG CTTATGAGAT ACTTTGCATT GTCTGTCTGC ACACCTTGAA 3600 

TCTGCCTGCT GGCTCCCTTA CTTTACCTCT CTGTCATGTG CAGATGAAGG CTCAGGGTGC 3660 

TAGAGGATTA GTAAGATCTC TTTCTAAAGA CAGGAGAGAT TATTTACAAG AAGAACTCAC 3720 

CAGGGTTTAG TTTGCATTTA AGAATTGCCA GTCTTTTGTC CTGCATCATC TTGAACATTA 3780 

ATCCACATGT TTCAGAGCTC ACCAGGCAGT ACCAATGCTC TTTTCACAGC TATGAAGAGC 3840 

TAGAGAAATT CTTGTTATGG TAGAAAAATT TCACGGTTCA TTTTTGAAAC TGCATTTGTG 3900 

CGTATGCAGT GTAGATTTTA TAGTGTGTTG TGCTTTCAAG ATCTAAATCA TATATAATAA 3960 



1043 



WO 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



ATTAAGGGAC 
TCTGTGAAAT 
AGATTCACTT 
AATGAATATA 
TAGATCCATT 
GTATCTTACT 
ATGTTCAGAA 
AATCAGTCCT 
TTATTTTGTA 
CCTACACTTG 
AAATGCCAAG 
ATGAGAATTA 
CATTTCATGT 
TAACTGTATA 
ATTAATTGTC 
AAAAACATTC 
ATGTTTGAGC 
CGTGGCATCT 
AGTTCCCAAG 
AGGGGGAAAC 
GAATATTGTA 
TTCGTTAGTA 
AGCTTCAGTC 
TTAAGCCCAC 
ACATCTCATC 
ATGAAGAACA 



AATGGGGCTG 
ATCATCAOGT 
TGACACTTTT 
CTCATTAGAA 
AATTTAAATG 
TTCCTCTGAG 
AGGGCCAAAT 
AGAATGTTTC 
TGAAATATAT 
AGATGATTTC 
GACCTTTTGC 
GGCCCTGGAT 
CCATAAAAGA 
TTTGTATGAT 
TTGCACTGGC 
TATCTACTGA 
AGTGGCCTAC 
TAAAAGCTAT 
AAAGGAAACC 
ATGACTCTTT 
TTGTTAGATG 
ATCTGTATAA 
ATTAAATTAC 
AGTATATTTT 
ATGTCTATTG 
TA 



ACAGCACTAA 
ATGTTATACA 
CATATCATTT 
TTACCATTTG 
ATTTAAATTT 
GATGATGTAC 
TCCCAACCTG 
ATTTGCACAA 
GTGAAAGGAT 
CAAGCCTCTC 
TGCCTAAATC 
GAGTAGCGCT 
GACCACCCAT 
ATTTTAAAAT 
CTTCTGATGA 
TTTGGGCTGA 
AAATCAGTAA 
CTTCATGTAA 
TTTTCTTGTA 
ATTGTCTGTA 
TTTGTTGAGC 
TACTTTGTAT 
TATAGCAAAG 
ATTTCAATGT 
TCCAAAGTTA 



ACTTGGTGCT 
ACCTTCATTT 
CTTAACCCAA 
TTAATATCAC 
AAGTAAGTTT 
TTGCCCTGAC 
CTCATTTTTT 
GTAGGGCTGC 
ATGAATCTGA 
TGGCACTTTG 
CACTCTGCAG 
ATAGTTACTG 
ATCATGCACA 
CTCCTAAATG 
AATGTTAACA 
ATGTATGTAA 
TTTTOGGGTG 
ATTGACTGTA 
TCTGGATTCA 
AATCTAACAT 
TGGTAACATC 
ACAAGTACTG 
TAGTACTTCT 
AATTAAACTG 
CCTGGAATCA 



TATTGATATT 
AAAAAGGTTT 
GTGACGAAAA 
TCATTAATTA 
TATAAGGTCT 
CATGCATTTT 
TTTTTATCAG 
CTCCAAGAGG 
GAGATGCTGT 
AGTTAAGTCT 
GAAATAGGCC 
TCCTGTTGAT 
CAATTAGATT 
CTGGGCAATG 
ATGCCTATTG 
ATAGGTTTCT 
GGAGAGTTTC 
CTAGGCCTAC 
AATTTATTTC 
TATTACTTTT 
GTTGCAACCA 
GTAAGATTGT 
TCTGTAATAT 
TTAACTTATT 
AATAAAAATT 



CTAAGAAATA 
AAAACTAGTT 
CATTGTCCCC 
ACCCCATAAT 
GACATCAGAG 
ACCATCACAC 
AGTCATGATG 
AACCTCTGAT 
AGACATCTGT 
ATCTGGTATT 
CAACCACCAG 
TAATTTCTGC 
TCTCACACTC 
GCTATTAACA 
TAATATAGAA 
AAAAAGTCAG 
TTTACATTGC 
TGGGGATCAG 
CAATGTTTCA 
CCTCTTAGAA 
CTGCAATATC 
TATTAAATGT 
TTACAATGTA 
CAAAGAGAAA 
CTAGATTACC 



4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4S00 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 



Seq ID NO: 238 Protein sequence 
Protein Accession 8: NP 057167.1 



MKSILDGLAD 
KMTAGDNPQL 
VLSLTLGTPT 
HRKDSRNVFL 
WTIAIAIAVL 
AHSHAVRMIQ 
AIMVYDVFGK 
PLDNSMGDSD 



11 
I 

TTPRTITTDL 
VPADQVNITE 
VLENLLVLCV 
FKLGGVTASL 
PliLGWNCBKL 
RGTQKSIIIH 
MNKLIKTVFA 
CLHKHANNAA 



21 
I 

LYVGSNDIQY 
FYNKSLSSFK 
ILHSRSLRCR 
TASVGSLPLT 
QSVCSDIFPH 
T3BDGKVQVT 
PCSMLCLIiNS 
SVHRAAESCI 



31 
I 

EDIKGDMASK 
ENEENIQCGE 
PSYHFIGSIiA 
AIDRYVSIHR 
IDETYLMFVJI 
RPDQARMDIR 
TVNPIIYALR 
KSTVKIAKVT 



41 

I 

LGYFPQKFPL 
NFMDIECFMV 
VADLLGSVIF 
PLAYKRIVTR 
GVTSVLLLFI 
LAKTLVLILV 
SKDLRHAFRS 
MSVSTDTSAE 



51 
I 

TSFRGSPFQE 
LNPSQQLAIA 
VYSFIDFHVF 
PKAWAFCLM 
VYAYMYILWK 
VLIICWGPLIi 
MPPSCEGTAQ 
AL» 



Seq ID NO: 239 DMA sequence 

Nucleic Acid Accession #t NM_033181.1 

Coding sequence: 17.. 1252 



1 
I 

ATGAAGTCGA 
CTGTACGTGG 
CCAGTGTGGG 
GCTGGCCATT 
GGTGCTGTGC 
CGGCAGCCTG 
CTTCCACGTG 
CAOGGCCTCC 
ATCCATTCAC 
GTTTTGCCTG 
CTGCGAGAAA 
GATGTTCTGG 
TATTCTCTGG 
CATCATCATC 
CATGGACATT 
GGGCCCTCTG 
GACGGTGTTT 
CTATGCTCTG 
AGGCACTGCG 
AAACAATGCA 
TGCCAAGGTA 



11 
I 

TCCTAGATGG 
GCTCAAATGA 
GAGAACTTCA 
GCAGTCCTGT 
GTCATCCTCC 
GCGGTGGCAG 
TTCCACOGCA 
TTCACTGCCT 
AGGCCCCTGG 
ATGTGGACCA 
CTGCAATCTG 
ATGGGGGTCA 
AAGGCTCACA 
CACAOGTCTG 
AGGTTAGCCA 
CTTGCAATCA 
GCATTCTGCA 
AGGAGTAAGG 
CAGCCTCTGG 
GCCAGTGTTC 
ACCATGTCTG 



21 

1 

CCTTGCAGAT 
CATTCAGTAC 
TGGACATAGA 
CCCTCAOGCT 
ACTCCCGCAG 
ACCTCCTGGG 
AAGATAGCCG 
CGGTGGGCAG 
CCTATAAGAG 
TAGCCATTGT 
TTTGCTCAGA 
CCAGCGTACT 
GCCACGCCGT 
AGGATGGGAA 
AGACCCTGGT 
TGGTGTATGA 
GTATGCTCTG 
ACCTGCOACA 
ATAACAGCAT 
ACAGGGCCGC 
TGTCCACAGA 



31 



ACCACCTTCC 
GAAGACATCA 
GTGTTTCATG 
GGGCACCTTC 

GAGTGTCATT 
CAACGTGTTT 
CCTGTTCCTC 
GATTGTCACC 
GATOGCOGTG 
CATTTTCCCA 
GCTTCTGTTC 
CCGCATGATT 
GGTACAGGTG 
CCTGATCCTG 
TGTCTTTGGG 
CCTGCTGAAC 
CGCTTTCCGG 
GGGGGACTOG 
AGAAAGCTGC 
CACGTCTGCC 



41 
I 

GCACCATCAC 
AAGGAGAATG 
GTCCTGAACC 
ACGGTCCTGG 
AGGCCTTCCT 
TTTGTCTACA 
CTGTTCAAAC 
ACAGCCATCG 
AGGCCCAAGG 
CTGCCTCTCC 
CACATTGATG 
ATCGTGTATG 
CAGCGTGGCA 
ACCCGGCCAG 
GTGGTGTTGA 
AAGATGAACA 
TCCACOGTGA 
AGCATGTTTC 
GACTGCCTGC 
ATCAAGAGCA 
GAGGCTCTGT 



Seq ID NO: 240 Protein sequence 
Protein Accession fl: NPJ149421.1 



60 
120 
180 
240 
300 
360 
420 



51 
I 

CACTGACCTC 
AGGAGAACAT 
CCAGCCAGCA 
AGAACCTCCT 
ACCACTTCAT 
GCTTCATTGA 
TGGGTGGGGT 
ACAGGTACAT 
CCGTGGTGGC 
TGGGCTGGAA 
AAACCTACCT 
CGTACATGTA 
CCCAGAAGAG 
ACCAAGCCCG 
TCATCTGCTG 
AGCTCATTAA 
ACCCCATCAT 
CCTCTTGTGA 
ACAAACACGC 
CGGTCAAGAT 
GA 



MALQIPPSAP 
LSLTLGTPTV 
RKDSRNVFLF 
TIAIVIAVLP 
HSHAVRMIQR 
IMVYDVFGKM 



11 
I 

SPLTSCTWAQ 
LENLLVLCVI 
KLGGVTASFT 
IiLGHNCEKLQ 
GTQKSIIIHT 
NKLIKTVFAF 
LHKHANNAAS 



21 
I 

MTFSTKTSKE 
LHSRSLRCRP 
ASVGSLFLTA 
SVCSDIFPHI 
SEDGKVQVTR 
CSMIiCLUfST 
VHRAAESCIK 



31 
I 

NEENIQCGEN 
SYHFIGSLAV 
IDRYISIHRP 
DETYU4FHIG 
PDQARMDIRL 
VNPIIYALRS 
9TVKIAKVTM 



41 
I 

FMDIECFMVL 
ADLLGSVIFV 
LAYKRIVTRP 
VTSVLLLFIV 
AKTLVLILW 
KDIiRHAFRSM 
SVSTDTSAEA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



51 
I 

NPSQQLAIAV 
YSPIDFHVFH 
KAWAFCLMW 
YAYMYILWKA 
HICWGPLLA 
FPSCEGTAQP 
L 



60 
120 
180 
240 
300 
360 



Seq ID NOt 241 DNA sequence 

Nucleic Acid Accession #: NM_003596.l 



1044 



WO 03/042661 



Coding sequence: 82.. 11 94 



1 11 21 

I I I 

GTAGACTGTC CATGGCCTGA ACATTTTCCG 
TAACAAGATA ACCACATCAA GATGGTTGGA 
CTGGTGATTA GTTCTGTGAC TGTGTTTTAC 
CGGATAGAGG AACGTAGCCA GCCAGTCAAA 
GGCCTGGACC TCAAAGCCAA CAAAACCTTT 
ATTGGRGGTG TGCCTCGGAG TGGAACCACA 
GACATTCGCT GTGGAGAGGA AACCAGGGTC 
TGGTCACGGT CAAGTAAAGA GAAGATCCGC 
CTGGATTCTG CCATGCAAGC CTTCTTACTA 
CCTTATTTAT GTAATAAAGA TCCTTTTGCC 
TTCCCCAATG CCAAATTTCT CCTGATGGTC 
ATTTCTCGAA AAGTTACTAT AGCTGGATTT 
AAGTGGAATC GTGCTATAGA GACCATGTAT 
TGCATGTTGG TTCACTATGA ACAACTTGTC 
TTAAAGTTCC TCCAGATTCC ATGGAACCAC 
AAAGCTGGGG GAGTGTCTCT GTCAAAAGTG 
GTCAATGTAG GAGCTCTATC AAAATGGGTT 
ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG 
AACTACGQAA AACCTGATCC CAAAATTATT 
TTCCAACTAC CTGACTTTCT TAAAGAAAAA 
CAGGAGCCTC TTCCATACAT GAGGAAAGAT 
AGGATTGGCT GTCCCCTGCC AAGCTTGGTG 
GCATTTGCCA GTTTCCTCCC ACTGAGAGGA 
GAGGGATCTG CCTCCTGAGC AAAGAGCTCT 
AAGGAGCCCA GAAGGAACAT GTGTTTCCTG 
TTATGACGTT TGTTTTCAAG GAGAGGGTTT 
AGTCTCCTTT TGAAATAGGT TGTCTGTACA 
TGTTTAAGTG TATTGATGTG AATAATATTA 
TTTCTGAGAA GTTGGGAAAT TACCATTATA 
TTTTTCAAAA TAAAAGCTTT CAATGTGA 



31 41 51 

I I I 

AAAATCATTT TGAGCAAAAT ATCTGTTTAA 60 

AAGCTGAAGC AGAACTTACT ATTGGCATGT 120 

CTGGGCCAGC ATGCCATGGA ATGCCATCAC 180 

TTGGAGAGCA CAAGGACCAC TGTGAGAACT 240 

GCCTATCACA AAGATATGCC TTTAATATTT 300 

CTCATGAGGG CCATGCTGGA CGCACATCCT 360 

ATTCCCCGAA TCCTGGCCCT GAAGCAGATG 420 

CTGGATGAGG CTGGTGTTAC TGATGAAGTG 480 

GAAATTATCG TTAAGCATGG GGAGCCAGCC 540 

CTGAAATCTT TAACTTACCT TTCTAGGTTA 600 

CGAGATGGCC GGGCATCAGT ACATTCAATG 660 

GATCTGAACA GCTATAGGGA CTGTTTGACA 720 

AACCAGTGTA TGGAGGTTGG TTATAAAAAG 780 

TTACATCCTG AACGGTGGAT GAGAACACTC 840 

TCAGTATTGC ACCATGAAGA GATGATTGGG 900 

GAGAGATCTA CAGACCAAGT AATCAAGCCA 960 

GGGAAGATAC CGCCAGATGT TTTACAAGAC 1020 

CTTGGATATG ACCCATATGC CAACCCACCT 1080 

GAAAACACTC GAAGGGTCTA TAAGGGAGAA 1140 

CCACAGACTG AGCAAGTGGA GTAGCAGAAC 1200 

TGCTGCCTTT TCAGCAGAAG GGAAATTCCT 1260 

GAGCGTCTGC ACCTTGGCTG CGCCGCCTGT 1320 

TGGAGGTGTC CGCACAGCTT TGGGCCTCGT 1380 

TGATCCCGAT TTCATGCACA GCCCTGCAGT 1440 

TTAAAACTCC TCTTGTTCTC TTTTCTTACA 1500 

AAAAATGGGA TCCTGTAAGC AGACTTGGGC 1560 

TGTTCTAATG TTTTGTAGAA CACGTGTGCC 1620 

AATATCCTAA TTATTTAATT CATTGTATTG 1680 

CATTTACAAC CTAATGACTT TTGTATTTTA 1740 



Seq ID NO: 242 Protein sequence 
Protein Accession #: NP_003587.1 

} 11 21 31 41 51 

I I I I i | 

MVGKLKQNLL LACLVISSVT VFYLGQHAME CHHRIEERSQ PVKLESTRTT VRTGLDLKAN 60 
KTFAYHKDMP LIFIGGVPRS GTTLMRAMLD AHPDIRCGEE TRVIPRILAL KQMWSRSSKE 120 
KIRLDEAGVT DEVLD SAMOA PLLEIIVKHG EPAPYLCNKD PPALKSLTYL SRLFPNAKFL 180 
LMVRDGRASV HSMISRKVTI AGFDLNSYRD CLTKWNRAIE TMYNQCMEVG YKKCMLVHYE 240 
QLVLHPERWM RTLLKFLQIP WNHSVLHBEE MIGKAGGVSL SKVBRSTDQV IKPVNVGALS 300 
KWVGKIPPDV LQDMAVIAPM LAKLGYDPYA NPPNYGKPDP KIIEHTRRVY KGEFQLPDFL 360 
KEKPQTEQVE 



Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #t NMJJ01492.3 

Coding sequence: 1395.. 2513 



1 11 21 31 41 51 

I I i I I i 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGGGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGOGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360 

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC OGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TOGTCTCCTC CTACGCCTTC 660 

CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCCCGTGGAC CCGGCCCCAC CCCGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCT6GGAG GAAGATGCCA COGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TOGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGCCGGTTCC CCCGGTCATG TGGOGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG .GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 1800 



1045 



WO 03/042661 



PCT/US02/36810 



10 
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TGGAACCCGC 
CGGCAGCCCC 
CGGACCCCGG 
CGGAGCTGCT 
OGCTGGOGCT 
TGCTGGTGAC 
AACCOGTGTT 
TCCGOGAGGT 
GCCAGGGTCA 
ACCAOGCTGT 
GCTGC6TGCC 
TGGTGCTGCG 
CGGGCAGGGA 



TGAGCGCCCG 
GGAGGGOGGC 
GCCGGTGCTG 
GGGOGCOGCT 
ACGCCCCCGG 
CCTCGACCCG 



GGGCTGGCAC 
GTGCGCGCTG 
GCTGCGCGCG 
CGCGCGCCTG 
GCAGTATGAG 
CGOGGGCCCA 



AGCCGGGCCC 
TGGGAGCTGA 
CTCCGCCAGT 
TGGGCTCGCA 
GCCCCTGCOG 
CGCCTGTGCC 

CGCTGGGTCA 
CCCGTCGCGC 
CTCATGCACG 
TCGCCCATCT 
GACATGGTGG 
ACAATAAATG 




Seq ID NO: 244 Protein sequence 
Protein Accession #: NP 001483.2 



MPPPQQGPCG 
VMWRLFRRRD 
AGHCPEWTW 
VliLRQLVPAL 
DPRLCHPIAR 
ALPVALSGSG 
YEDMWDECG 



11 
I 

HHLLLLLALL 
PQETRSGSRR 
PDLSAVEPAE 
GPPVRAELLG 
PRRDAEPVLG 
GPPALNHAVL 
CR 



21 

I 

LPSLPLTRAP 
TSPGVTLQPC 
RPSRARLELR 
AAWARNASWP 
GGPGGACRAR 
RALMHAAAPG 



31 
1 

VPPGPAAALL 
HVEELGVAGN 
FAAAAAAAPE 
RSLRLALALR 
RLYVSFREVG 
AADLPCCVPA 



41 
I 

QALGLRDEPQ 
IVRHIPDRGA 
GGWELSVAQA 
PRAPAACARL 
WHRWVIAPRG 
RLSPISVLFF 



51 
I 

GAPRLRPVPP 
PTRASEPVSA 
GQGAGADPGP 
AEASLLLVTL 
FIiRNYCQGOC 
DNSDNWLRQ 



60 
120 
180 
240 
300 
360 



Seq ID NO: 245 dna sequence 

Nucleic Acid Accession #: NM_021267,1 

Coding sequence : 17 . . 1125 

1 11 21 31 41 51 

I I I I I I 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGGGACGG GCACGGCGGG 
CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 
CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGGGGGGC 
TGCACGGACT GCGGCTGGGG GCTGGCGCGT GGCGGCCTGG CTGAGCACGC GCACCTGGCG 
CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 
GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 
GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 
GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 
GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 
AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 
GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 
CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 
GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 
GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 
TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 
GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 
TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 
CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 
GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 
CCCCGTGGAC CCGGCCCCAC CCCGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 
TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 
CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 
GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 
CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 
TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 
CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 
GGCCGGTTCC CCCGGTCATG TGGGGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 
CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 
GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGOGG TGCGCCCACC CGGGCCTCGG 
AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 
TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GOGGCGGCGG 
CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGOG 
CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 
CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 
CGCTGGCGCT ACGCCCCCGG GCCCCTGCOG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 
TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 
AACCOGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 
TCCGOGAGGT GGGCTGGCAC CGCTGGGTCA TCGOGCCGCG CGGCTTCCTG GCCAACTACT 
GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 
ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 
GCTGOGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 
TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCOGC TAACCCGGGG 



60 
120 
180 
240 
300 
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420 
480 
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600 
660 
720 
780 
840 
900 
960 
1020 
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1140 
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1260 
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1440 
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Seq ID NO: 246 Protein sequence 
Protein Accession #: NP_067090.1 

1 11 21 31 41 51 

I I I I I I 

MAAAGPAAGP TGPEPMPSYA QLVQRGWGSA LAAARGCTDC GWGLARRGLA EHAHLAPPBL 
LLLALGALGW TALRSAATAR LPRPLAKRCC LQPRDAAKMP ESAHKPLPYL GSWSYSAYIX 
FGTDYPFFHD PPSVFYDWTP GMAVPRDIAA AYLLQGSPYG HSIYATLYMD TWRKDSWML 

1046 



60 
120 
180 



WO 03/042661 



PCT/US02/36810 



LHHWTLILI VSSYAFRYHN VGILVLPLHD ISDVQLEFTK LNIYFKSRGG SYHRLHALAA 240 

DI/3CLSFGFS WFWFRLYWFP LKVLYATSHC SLRTVPDIPP YFFFNALLLL LTLMMLYWFL 300 
YIVAFAAKVL TGQVHELKDL REYDTAEAQS LKPSKAEKPL RNGLVKDKRF 

5 Seq ID NO: 247 DNA sequence 

Nucleic Acid Accession #: NM_002081-1 
Coding sequence: 222.. 1898 

1 11 21 31 41 51 

10 i I i i I I 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCOGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTOGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCOG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

15 GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCOG GAGCTGCGGC GAGGTCCGCC AGATCTAGGG AGCCAAGGGC TTCAGCCTGA 360 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACGG CAGCCATGCC GAGCTGGAGA 480 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 

20 TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACAOGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTAOCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCOGA 840 

25 GAGAGCTGCG CCTGCGOGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 

TGGGCGTGGC CAGCGACGTG GTCGGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTOTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

30 CATCGGGTGT GGAGAGTGTC ATOGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG COGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

35 TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCOG 1500 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC OGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACX3GC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

40 GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCOCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGAOG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

45 GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GOCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

50 CCTCCAGAGA AGCCCOGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

55 GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA OCCOGAGGGC TGAGGAGCAG CCAGQACCOG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

, A CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCOCGAC CCAGCCAGCT GCACTGCAGG 2940 

60 GCACOGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC COGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

65 TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACOGTCC 3300 

CGCACAGTGG AOGGAGGTCC CCGGTTGCIG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCOGGGC GCCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 

70 TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 248 Protein sequence 
75 Protein Accession #: NPJ)02072.1 

1 11 21 31 41 51 

till!) 

MELRARG WW L LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60 
oO ICPQGYTCCT SEMEENLANR SHABLETALR DSSRVLOAML ATQLRSFDDH FQHLLNDSER 120 
TLQATFPGAF GELYTQNARA FRDLYSBLRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180 
QLLLPDDYIiD CLGXQABALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDWRKVAQV 240 

PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300 
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360 

1047 
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15 
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25 



RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKHALSTA SDDRCWNGMA 420 

RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480 
ASDDQSGSGS 

5 Seq ID NO: 249 DMA sequence 

Nucleic Acid Accession #: NMJ>01492.3 
Coding sequence: 8.. 18 64 

, - 1 11 21 31 41 SI 

10 | i i i i i 

GAAGGCCATG GTCTCCCCAC GQATGTCCGG OCTCCTCTCC CAGACTGTGA TCCTAGCGCT 60 

CATTTTCCTC CCCCAGACAC GGCCCGCTGG CGTCTTCGAG CTGCAGATCC ACTCTTTCGG 120 

GCCGGGTCCA GGCCCTGGGG CCCCGCGGTC CCCCTGCAGC GCCCGGCTCC CCTGCCGCCT 180 

CTTCTTCAGA GTCTGCCTGA AGCCTGGGCT CTCAGAGGAG GCCGCCGAGT CCCCGTGCGC 240 

CCTGGGCGCG GCGCTGAGTG CGCGCGGACC GGTCTACACC GAGCAGCCCG GAGCGCCCGC 300 

GCCTGATCTC CCACTGCCCG ACGGGCTCTT GCAGGTGCCC TTCOGGGACG CCTGGCCTGG 360 

CACCTTCTCT TTCATCATCG AAACCTGGAG AGAGGAGTTA GGAGACCAGA TTGGAGGGCC 420 

CGCCTGGAGC CTGCTGGCGC GCGTGGCTGG CAGGCGGCGC TTGGCAGCCG GAGGCCCGTG 480 

GGCCCGGGAC ATTCAGCGCG CAGGCGCCTG GGAGCTGCGC TTCTCGTACC GCGCGCGCTG 540 

CGAGCCGCCT GCCGTCGGGA CCGCGTGCAC GCGCCTCTGC CGTCCGCGCA GOGCCCCCTC 600 

GCGGTGCGGT CCGGGACTGC GCCCCTGCGC ACCGCTCGAG GACGAATGTG AGGCGCCGCT 660 

GGTGTGCCGA GCAGGCTGCA GCCCTGAGCA TGGCTTCTGT GAACAGCCCG GTGAATGCCG 720 

ATGCCTAGAG GGCTGGACTG GACCCCTCTG CACGGTCCCT GTCTCCACCA GCAGCTGCCT 780 

CAGCCCCAGG GGCCCGTCCT CTGCTACCAC CGGATGCCTT GTCCCTGGGC CTGGGCCCTG 840 

TGACGGGAAC CCGTGTGCCA ATGGAGGCAG CTGTAGTGAG ACACCCAGGT CCTTTGAATG 900 

CACCTGCCCG CGTGGGTTCT ACGGGCTGOG GTGTGAGGTG AGCGGGGTGA CATGTGCAGA 960 

TGGACCCTGC TTCAACGGCG GCTTGTGTGT CGGGGGTGCA GACCCTGACT CTGCCTACAT 1020 

CTGCCACTGC CCACCTGGTT TCCAAGGCTC CAACTGTGAG AAGAGGGTGG ACCGGTGCAG 1080 

CCTGCAGCCA TGCCGCAATG GCGGACTCTG CCTGGACCTG GGCCACGCCC TGCGCTGCCG 1140 

30 CTGCCGCGCC GGCTTCGOGG GTCCTCGCTG CGAGCACGAC CTGGACGACT GCGCGGGCCG 1200 

CGCCTGCGCT AACGGCGGCA CGTGTGTGGA GGGCGGCGGC GCGCACCGCT GCTCCTGCGC 1260 

GCTGGGCTTC GGCGGCCGCG ACTGCCGCGA GCGCGCGGAC CCGTGCGCCG CGCGCCCCTG 1320 

TGCTCACGGC GGCCGCTGCT ACGCCCACTT CTCCGGCCTC GTCTGOGCTT GCGCTCCCGG 1380 

CTACATGGGA GCGCGGTGTG AGTTCCCAGT GCACCCCGAC GGCGCAAGCG CCTTGCCCGC 1440 

GGCCCCGCCG GGCCTCAGGC CCGGGGACCC TCAGCGCTAC CTTTTGCCTC CGGCTCTGGG 1500 

ACTGCTCGTG GCCGCGGGCG TGGCCGGCGC TGOGCTCTTG CTGGTCCACG TGCGCCGCCG IS 60 

TGGCCACTCC CAGGATGCTG GGTCTCGCTT GCTGGCTGGG ACCCCGGAGC CGTCAGTCCA 1620 

CGCACTCCCG GATGCACTCA ACAACCTAAG GACGCAGGAG GGTTCGGGGG ATGGTCCGAG 1680 

CTOGTCCGTA GATTGGAATC GCCCTGAAGA TGTAGACCCT CAAGGGATTT ATGTCATATC 1740 

40 TGCTCCTTCC ATCTACGCTC GGGAGGTAGC GACGCCCCTT TTCCCCCCGC TACACACTGG 1800 

GCGCGCTGGG CAGAGGCAGC ACCTGCTTTT TCCCTACCCT TCCTCGATTC TGTCOGTGAA 1860 
ATGAATTGGG TAGAGTCTCT GGAAGGTTTT AAGCCCATTT TCAGTTCTAA CTTACTTTCA 

Seq ID NO: 250 Protein sequence 
45 Protein Accession #: NP_058637.1 

1 11 21 31 41 51 

I I I I II 

MVSPRMSGLL SQTVILALIF LPQTRPAGVP BLQIHSFGPG PGPGAPRSPC SARLPCRLPP 60 

50 RVCLKPGLSE EAAESPCALG AALSARGPVY TBQPGAPAPD LPLPDGLLQV PFRDAWPGTF 120 

SFIIETWREE LGDQIGGPAW SLLARVAGRR RLAAGGPWAR DIQRAGAWEL RFSYRARCEP 180 

PAVGTACTRIi CRPRSAPSRC GPGLRPCAPL EDECEAPLVC RAGCSPEHGF CEQPGECRCL 240 

EGWTGPLCTV PVSTSSCLSP RGPSSATTGC LVPGPGPCDG NPCANGGSCS ETPRSFECTC 300 

PRGFYGLRCE VSGVTCADGP CFNGGLCVGG ADPDSAYICH CPPGFQGSNC EKRVDRCSLQ 360 

PCRNGGLCLD LGHALRCRCR AGFAGPRCEH DLDDCAGRAC ANGGTCVEGG GAHRCSCALG 420 

FGGRDCRERA DPCAARPCAH GGRCYAHFSG LVCACAPGYM GARCEFPVHP DGASALPAAP 480 

PGLRPGDPQR YLLPPALGLL VAAGVAGAAL LLVHVRRRGH SQDAGSRLLA GTPEPSVHAL 540 

PDALNNLRTQ EGSGDGPSSS VDWNRPEDVD POGIYVISAP SIYAREVATP LFFPLKTGRA 600 
GQRQHLLFPY PSSILSVK 
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Seq ID NO: 251 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

65 | | | | | | 

GAAATATAAC CATTGCAATT AGAAAATATC CAAAATAGCC TGTTATCTTC CACGTGGCCT 60 

AGATTATTGA CAATCCCAAA TATACAATTT TTCTTTAAAA GTAGTACAAT TTCCTTTGTA 120 

GCTTCAATTC CTTATATGAC TTCAGACTGG AGAAGCCTGT TAAACCACTG TTAGTTTCAG 180 

TTAGAAAGTC TGAGAGACTT TATACATAAA TTCTCAATTT GGCTGCTGTA CACGTGCCAG 240 

70 AGTTTTACTA CTGTAGTGAC CGTTGAGAAG ACCCTTGTTT ATTTACATTT GAAGCACTGT 300 

TTGTGCAAAC AACCTTTCAT TGTTAAGTGC CTGTATTCCT TTCATTTACT TCATGTCCAG 360 

GGGTGCTATT TACCTAGAAC CATTGTCTAC TACAATTAAC ATTTACATTA CAAAGTGTGT 420 

GGTTTTCTTT TTCAAGGAGG TTCAATTAAG GCAATAAGAT GTTTGCTGGA GAAACCTATT 480 

GTTTACTGAA AGCACTCAAT GAAGTCAAAT TACTGAAGCT TTTGCCTACA TCTTGGTCTT 540 

75 TTATGTAAAT ATGTTAAATA TAACATCTAA GGAAAATAAA CAATATTATA ATTATGTGTT 600 

TGCCATTGTC ATATCAAACT TGCTTTGTAT CATACTAATG TTACATAACT TATCGATCAA 660 
TAAAAATACA TTTCAATGTT AAAAAAAAAA AAAAAAAA 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession §: Eos sequence 

1 11 21 31 41 51 

1 I I I 1 I 

AGGTACTGCC AGAAAGGATC AGGACCTGGA GTCTGGCAAG AGGAAGACAG AGGCCTGTGT 60 
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10 



GGGAAGCGAG TTGTTATCTT TGGTTATCTA GCTGTATGAG TGTATTGGTC TTCATAAAGC 120 

TAGATAACOG AAAGTAAAAA CTTCTTCAAG ATOGCCGGGG AGOGTGTGAG AATGAAAGAC 180 

TACAGCOGAG AGACAGTAAA AACCAGAAAG GTCAGGAATA CTTATTGAAT CTAACTTTGT 240 

TTTTGTTTTG TTTTTTTCCT TATGATTAAA GGTGGGATGA GAGAAAATTA AATGACACAC 300 

ACATGCTAAA ATATCAAGGT TCCAGATATG TCTTGAGAGG GGTTGTTGCA GCTGCAAAGG 360 

AGAAGTGTAT AGTGATAATG AGTAAAGATG CATGTGCAGT TTGTTCTATT TTAAGGCAAA 420 

AGTTATATCA GGGATTTTTT TCTTAGAAAG GTGTTGCAGA GATGTCTGGT ACCTAGTTTA 480 

AAAATGATTC CATAATATGT AGACTTGGGC AGTTCCTTTG GGAGGCACCT CCCTCTCAAA 540 

ATTTGAAGAT TGTGCTTGGA AATTACTTTA CATGTATTTG GGCTGTATGT CAATTTGTAC 600 

GAAATTAGAC TTTCAGAAAA GTTTATACTG GAAGGTTAAT AATTTGTATC TACTGAGGAC 660 

TTAGAGCTAG CAGGCAAAAT GAAAAAAAAA AAAAAAAARG CAAGGGCTGA TTTTTATTCT 720 

TCTATTCAAA ATACAAGGAC AGATGCTTCT CTGTTCCAAG AGGGTTTCCT TGAGGAAGCT 780 

ACTGAAGCAG AAAGACATGA TGGAGACGAG ATCGCCTCCC CTCTTGTCAA AGTGTTAAAA 840 

AAATGTTCTG TCTTACTCTG CGCCTAGCAT TGGAAATGAA AGTGACATTT AOGCCACAAC 900 

1 J CCACGTGTGC GCCTCCTCTC TTTTTGTTTA AGGATGATGA GGTCTATCCA GGAAACAGCT 960 

CTGGCATCCC AAACTGAAAT AATTAGGACG TATATAGACC TGACAAAAAT GGAAAGGGGG 1020 

TGGGGAATCT GAGGGTCTGT CTTGCCTAAT TGATTCCGCT AAACGGAATG CAGGAGATGT 1080 

GAACGGCAGG ACGCTCCGAT TCCCACGCTC GGGGGCAAGT GATAAAGCGG GGCOGGGCAG 1140 

CCTATGACAG ACAGCCCTGT TGGGGGGTGG GGGTATGAAA AAAACATCAA GTGCACACAC 1200 

CATACTCATC TCCATCGCTT AAGAAAGTAA AGGCATTTCC CACCCACAGC CATCTGCAGC 1260 

TTCCCAAATG GCAGCACCAA CTGGTCTGTA GCTGCTACAT AGTCTGCTTC TGTTAATTTT 1320 

TTAACCACTG TTTAATCTGG CCATAATTAA GTTTGGCTTT CTTOGTTGTT TGAGATTTTC 1380 

AGAATTCAAG GCAAGCTAGT AGAAAGCAAT TCCAAGAAAG TCCCATGACT GCCTGCOCCT 1440 

AATGTCAAAA TCTCAGTCCA TGAGATTATG G CC TTGT G AC CACATTTTTG CTTTTGTTTT 1500 

GGGTGGGCAA ATGTGTATAG AGATAAAATA CATATCTCTA TATAACAGTC GTTATTATAA 1560 

TTTCATGAGG CTTTTTACCT CTTAACATGA TACATCTAGG AACTTGGTCT AATTGTGCTA 1620 

GTAGATATAC ACTAGAAGTA AAACTATAAA GTCATTAGCC TGTAGAAAGT TGATTATGAT 1680 

AACAATATGA TAAAAAGTTT GTTTTGGAAT AOTCTCAGCT AGATGG3TCT AAATAGCCAT 1740 

TTTAATGTAA TCTAAAAATA ACACTATGCC TAGCAGAAAC TTTGGCGCTT TGGAGGTCCC 1800 

CATTGTCGCC TTTTCATAAA AGTCCTTAAG TTTTCCATAT GTCACCAAGC AAACATTTGT 1860 

ATAGGCATTT ACACAGAAAT ATAACCATTG CAATTAGAAA ATATCCAAAA TAGCCTGTTA 1920 

TCTTCCACGT GGCCTAGATT ATTGACAATC CCAAATATAC AATTTTTCTT TAAAAGTAGT 1980 

ACAATTTCCT TTGTAGCTTC AATTCCTTAT ATGACTTCAG ACTGGAGAAG CCTGTTAAAC 2040 

CACTGTTAGT TTCAGTTAGA AAGTCTGAGA GACTTTATAC ATAAATTCTC AATTTGGCTG 2100 

CTGTACACGT GCCAGAGTTT TACTACTGTA GTGACCGTTG AGAAGACCCT TGTTTATTTA 2160 

CATTTGAAGC ACTGTTTGTG CAAACAACCT TTCATTGTTA AGTGCCTGTA TTCCTTTCAT 2220 

TTACTTCATG TCCAGGGGTG CTATTTACCT AGAACCATTG TCTACTACAA TTAACATTTA 2280 

CATTACAAAG TGTGTGGTTT TCTTTTTCAA GOAGGTTCAA TTAAGGCAAT AAGATGTTTG 2340 

CTGGAGAAAC CTATTGTTTA CTGAAAGCAC TCAATGAAGT CAAATTACTG AAGCTTTTGC 2400 

CTACATCTTG GTCTTTTATG TAAATATGTT AAATATAACA TCTAAGGAAA ATAAACAATA 2460 

TTATAATTAT GTGTTTGCCA TTGTCATATC AAACTTGCTT TGTATCATAC TAATGTTACA 2S20 
TAACTTATCG ATCAATAAAA ATACATTTCA ATGTT 
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Seq ID NO: 253 DMA sequence 
45 Nucleic Acid Accession 8: NM_0O1650.2 
Coding sequence: 40.1011 

1 11 21 31 41 51 

fA I I I I I I 

DU GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60 

AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120 

GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGOGG AATTTCTGGC CATGCTTATT 180 

TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 240 

GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 300 

TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360 

AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 420 

ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 480 

ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 540 

TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA AACGGACTGA TGTCACTGGC 600 

TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC AATCAATTAT 660 

ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG CAGTTATCAT GGGAAATTGG 720 

GAAAACCATT GGATATATTG GGTTGGGCCC ATCATAGGAG CTGTCCTCGC TGGTGGCCTT 780 

TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA AGCCTTCAGC 840 

AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 900 

GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG TGATTGACGT TGACCGGGGA 960 

GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT CTTCAGTATG ACTAGAAGAT 1020 

CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080 

GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140 

GTCTAAACAA TAAATATTTC ATAATTTACA AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200 

70 TCCAAATCTA AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260 

TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320 

CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTOCTT CTCTACTGGA ATATTGGTAT 1380 
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C ' 
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Seq ID NO: 254 Protein sequence 
Protein Accession #: NP_001641.1 

1 11 21 31 41 51 

I I I I I I 

MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EPLAMLIFVL LSLGSTINWG 60 
GTEKPLPVDM VLISLCFGLS IATKVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVPYIA 120 
AQCLGAIIGA GILYLVTPPS WGGLGVTMV HGNLTAGHGL LVELI ITFQL VFTIPASCDS 180 
KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240 
AVLAGGLYEY VFCPDVEPKR RPKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGWH 300 
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Seq ID NO: 2S5 DMA sequence 
Nucleic Acid Accession ft: U26742.1 
Coding sequence: 325.-1449 
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i 
1 

CAGGAAACCC 
GGCAGCGGAC 
CTCAAACCTC 
GCGCCCGTCC 
TCTTTAAGTG 
GCTGTCCCTT 
GCAGAAAGAA 
TCCACCTACA 
GTGGACATAT 
AACACTGAAC 
AACAAACGGA 
AACTTCCTGC 
AAAATGGCTT 
TCAATGATTT 
GAAGTTCTCA 
TCAGCCAGAT 
CTTATGTCAG 
AATGTGGAAA 
GGATTTCGCT 
AGGGGACATG 
AAATCACCTG 
CGTGAACCTT 
ATGGTTGATA 
TCTGTTCCCT 
TGCGTCTAGA 
TGTTCTGGTT 
TTTCCCATTT 
TCACATTTCT 
CAATAAAGCT 



11 
I 

TGGTACTGGC 
CCGGCACTTC 
CCTGCAGACC 
ATTCCAAGAT 
CCATAATCAA 
CATTGAATTT 
GACAGCTGTT 
GAACAGCATG 
GGAATGTCAT 
TCAAOGTGTC 
TGCCAACCAC 
TTGCAGCGTT 
TAGCCACATT 
CTGACTCCAG 
AACTACCCAC 
CCTGTTTCTC 
ATCCTCCCCC 
ATGTCTTCCA 
ACCGATGCCA 
CCGGTGGTTC 
CTAAGAAGCT 
TGCACCCCAT 
CTTGGCCTCC 
CCTCAGGAAG 
TGGATAACAT 
CCTCCAGGGT 
TCAGTCATTT 
TTCTGTCACC 
GTGTACACTA 



21 
I 

AGCAGCCAGC 
CAACATTATT 
AATGGACACC 
AAATAGGATT 
CTGCCATTTC 
TAGAATGATT 
TGCAGAGATG 
CAAGCTTAGG 
AGAAGCATTG 
CCGCTTAGAG 
TCACCAAATC 
TGATCCGGAA 
GTGTQGAGGG 
TGGGGTGATG 
GGCAGTTTTT 
CCAACAGAAA 
GCAGTGTCTG 
TCCGGTTGAG 
ACAGTGTCAC 
TCATAGCAAC 
GACTAATGCA 
GTTCCCAGAT 
CAGACCTGTA 
TCCTTTTATT 
GACTTCTTCT 
GCATGGTACC 
TGGAATGTTC 
ACAGAGATTG 
AATGTCC 



31 
I 

CTCTGCTGTG 
AAATAATAAG 
TTCTAAGAGT 
TACCAATCCT 
AAAGAATATA 
GAAGATAGTG 
AGGGCTCAAG 
TTTGTTCAGA 
CGGGAAAATG 
GCTGTGCTCT 
CATGTGGAGC 
GGCCATGGTA 
AAGATCATGG 
GTTTATGGAC 
GAAGGTCCTT 
AAAGTCACGT 
GTCTGGTTGC 
TGTTCCTACT 
AATTACCAGC 
CAGCACCAAA 
TTAAGCAAGT 
CAGCCTGAGA 
ACCAGCATGA 
ACCAGGAGCT 
ACCCTAAAAT 
CATTAACCCA 
TCTGTGAACC 
GCCTACGGTT 



41 
I 

CCCACATGAC 
AAAGCGGCTC 
TTGGCGAGTC 
TGGATGAAGT 
GATGGTTTTG 
GGAAAAGAGG 
ATCTGGATCG 
AGAAATGCAA 
CTCTGAACAA 
OCACTATTTT 
AGTCCATCAG 
AAATTTCAGT 
ACAAATTAAG 
GATATGACCA 
CATTTGGTTA 
TAAATGGTTT 
CTCTTCTGCA 
GCCACAGTGA 
TCTGTCAGGA 
TGAAAGAGTA 
CCCTGAGCTG 
AGCCACTCAA 
ACGACACCCT 
CGGACGGTGC 
ATTCCTATAA 
AAATATGATT 
ACAGTTGGGT 
TCTGTTTTGA 



51 
I 

CCACAACTCT 
CTACTCCAGG 
AGTGACTGAA 
GCTTGGGAAG 
AAAAGTTCAT 
AAATACCATG 
CATCCGACTC 
TTTGCACCTG 
CCTGGACCCA 
TTACCAGCTC 
CCTCCTCCTT 
ATTTGCTGTC 
ATATATTTTC 
ATTCCTTGGG 
CACAGAACAG 
CTTGGACAOG 
TCGACTAGCA 
GAGTATGATG 
CTGCTTCTGG 
CAOGTCATGG 
TGCTTCCAGC 
CTTGGCTCAC 
GTTCTCCCAC 
TTTTGGTGGA 
TACTTTGAGC 
ATTTCCCTTT 
TGTTTAAAGC 
GGGTGCTGTT 



Seq ID NO: 256 Protein sequence 
Protein Accession fti AAC50424.1 



MIEDSGKRGN 
ALRENALNNL 
PEGHGKISVF 
VFEGPSPGYT 
VECSYCHSES 
NALSKSLSCA 
FITRSSDGAF 



11 

I 

TMAERRQLPA 
DPNTELNVSR 
AVKMALATLC 
EQSARSCFSQ 
MMGFRYROQQ 
SSREPLHPMF 
GGCV 



21 
I 

EMRAQDLDRI 
LEAVLSTIFY 
GGKIMDKLRY 
QKKVTLNGFL 
CHNYQLCQDC 
PDQPEKPLNL 



31 



41 
I 



RLSTYRTACK LRFVQKKCNL 
QLNKRMPTTH QIHVEQSISL 
IFSMISDSSG VMVYGRYDQF 
DTLMSDPPPQ CLVWLPLLHR 
FWRGHAGGSH SNQHQMKEYT 
AHIVDTWPPR PVTSMNDTLF 



51 
I 

HLVDIWNVIE 
LLNFLLAAFD 
LREVLKLPTA 
LANVENVFHP 
SWKSPAKKLT 
SHSVPSSGSP 



Seq ID NO: 257 DNA sequence 

Nucleic Acid Accession #: NM_004172.1 

Coding sequence: 179.. 1807 



1 
I 

GCGGATTGTT 
CACCCCTTAC 
TGTGGGTGAT 
GACTAAAAGC 
AGTCCGTAAA 
TAAAAGTTAC 
TACAATCCTT 
CTCCTTTCCT 
CTCCAGTCTT 
GCGAGCTGTA 
TGTCATCATC 
TGTACGAGTG 
TCTGGTAGAA 
GCCCATCCAG 
GGAGACTCTT 
CAATGCCCTG 
GGAACAGGGG 
GGTAGCAGTA 
TGTGGAGATG 
CATTGTTGGC 
ACGGAAAAAC 
GACCTCTTCA 
CGTGGACAAG 
GACTGCCCTC 
GAACTTCGGA 
TGOAATTCCT 
CACTGACGAC 



11 
I 

GCTCCGTTGT 
AAAATCAGAA 
TCCCAGACAC 
AATGGAGAAG 
CGCACACTTT 
CTGTTTCGGA 
GGATTTACCC 
GGGGAACTTC 
GTCACAGGAA 
GTCTATTATA 
ATCCATCCTG 
ACAGCTGCAG 
GCCTGCTTTA 
GCCAACGAAA 
ACCCGAATCA 
GGTCTAGTTG 
CAGGCCCTGA 
ATAATOTGGT 
GAAGACATGG 
TTACTCATTC 
CCTTGGGTTT 
AGTTCTGCCA 
CGCGTCACCA 
TATGAGGCTT 
CAAATTATTA 
CAGGOGGGCC 
ATCACGCTCA 



21 
I 

ACCTGCTGGG 
AAGTTGTGTT 
TGAAGTGCAA 
AGCCCAAGAT 
TGGCCAAGAA 
ATGCTTTTGT 
TCCGACCATA 
TGATGAGGAT 
TGGCGGCGCT 
TGACTACCAC 
GGAAGGGCAC 
ATGCCTTCCT 
AACAGTTTAA 
CGCTTGTGGG 
CAGAGGAGCT 
TCTTCTCCAT 
GAGAGTTCTT 
ATGCCCCCGT 
GTGTGATTGG 
ACGCAGTCAT 
TTATTGGAGG 
CCCTACCCAT 
GATTCGTGCT 
TGGCTGCCAT 
CAATCAGCAT 
TGGTCACTAT 
TCATCGCOGT 



31 
I 

GAATTCACCT 
TTCTAATACC 
AGAAGAGACC 
GGGGGGCAGG 
GAAAGTGCAG 
GCTGCrCACA 
CAGAATGAGC 
GTTAGAGATG 
AGATAGTAAG 
CATCATTGCT 
AAAGGAAAAC 
GGACTTGATC 
AACCAACTAT 
TGCTGTGATA 
GGTCCCAGTT 
GTGCTTCGGT 
TGATTCTCTT 
GGGTATTCTC 
GGGGCAGCTT 
CGTCTTGCCA 
GTTGCTGCAA 
CACCTTCAAG 
CCCCGTAGGA 
TTTCATTGCT 
CACAGCCACA 
GGTCATTGTG 
GGACTGGTTC 



41 

I 

CGTTACTGCT 
AAAGAGGAGG 
CTCCTAGAAA 
ATGGAGAGAT 
AACATTACAA 
GTCACCGCTG 
TACCGGGAAG 
CTGGTCTTAC 
GCATCAGGGA 
GTGGTGATTG 
ATGCACAGAG 
AGGAACATGT 
GAGAAGAGAA 
AACAATGTGT 
CCAGGATCTG 
TTTGTGATTG 
AACGAAGCCA 
TTCCTGATTG 
GCCATGTACA 
CTOCTCTACT 
GCACTCATCA 
TGCCTGGAAG 
GCCACCATTA 
GAAGTTAACA 
GCTGCCAGTA 
CTGACATCTG 
CTGGATGGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



60 
120 
180 
240 
300 
360 



51 

I 

TGATATCTTC 
TTTGGCTTTC 
AGTAAAATAT 
TCCAGCAGGG 
AGGAGGATGT 
TCATTGTGGG 
TCAAGTACTT 
CACTTATCAT 
AGATGGGAAT 
GCATAATCAT 
AAGGCAAAAT 
TCCCTCCAAA 
GCTTTAAAGT 
CTGAGGCCAT 
TGAATGGAGT 
GAAACATGAA 
TCATGAGACT 
CTGGGAAGAT 
CCGTGACTGT 
TCTTGGTAAC 
CCGCTCTGGG 
AGAACAATGG 
ACATGGATGG 
ACTTTGAACT 
TTGGGGCAGC 
TCGGCCTGCC 
TCCGGACCAC 



60 
1 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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CACCAACGTA CTGGGAGACT CCCTGGGAGC 
ACTGAAGAAC AGAGATGTTG AAATGGGTAA 
ACCATATCAA CTGATTGCAC AGGACAATGA 
GATGTAGACT AACATAAAGA AACACTTTCT 
ATCTTTCCAT CTCATTACAG CTCATTCGCT 
CTTCTGATAA GACTGGAAAA TAGTCCTCCA 
GTGTACAATT TTCATCCCAC AATTGAAATT 
TAAGGTACCA AGATCACAAA TAGTGTTGAT 
TCCTATAAAT GTGATTTTTT TATATAAGTT 
TAAAATCAAC TTTTGAAATT TAAAAATCTT 
TGTTAACAAC TTGAATTACA ACCGGTTATC 
TTCTGACTGG TATACCTATT TCATTAGTAG 
TGAGGACAGA CAGAAGGCAA AGTTTCCATG 
GGCCTCAGTG TCCTCATCTA TAAAATGAGG 
CAGCCCAGAC ATCCTGTGAT GTCATGAAAG 
CTGTACCATC CATGGAGCAC GAGGCCTTCA 
CTAACTAAGG ACAGQATGGC TGTGTGTGGT 
TTATGCAAGT CGCCAGGCAG GAGGCCATTC 
CCCAGTTCAT CCCAATGGGG GAAGTATTCC 
TGCCACTCCT CAACTGATGA TAGACTTCGA 
GGTGCTTGTA GTCACACTGT GGAACACTAA 
ATTACAAACA AAGGATTTGA AAACACCAAG 
AAGAAGAGGT TCATTTTTCT GGCTTTTTTT 
AATCATTCCC CCCGTGAAGT CTGCTTACCA 
AGTCAAATGA ATGAGCTCTC TAATAGAAGT 
AAGTGTTTTT CTTTAATCAA AAGTCCTTAG 
GGAATCCCAC TTATCAAATC ATTCAAAACT 
GTTTGTGTCC ATAAGAGAAA TGGTAGAAGA 
TTATGAGAAA AAAACAGCAG GGGCATTAGT 
TTAATTTTTA CCCCCTAAGG AATATCCAGT 
TAGCAGCTGT GTTTGAGTTT CTGGCTGAAA 
AAACTGAAAA ATCTAGACAT AGATCCTCTG 
CCCAAGCATT CTGTGCATAA AAGTTAACAT 
GAGGCTCTAT TTCGGAAATA CACTACAAAT 
CTAGTAGAGC AAAGACTTAA TCATATCAAC 
ATATACTAAC CATTTCTTAT GGAAAGGTCC 
CACAGGCTCT GCATACACAT GCACTCAGTG 
TTTTCAATAA AGAAAAAAAT AGTTTTACAT 

Seq ID NO; 258 Protein sequence 
Protein Accession #: NP 004163.1 



TGGGATTGTG GAGCACTTGT CACGACATGA 1680 

CTCAGTGATT GAAGAGAATG AAATGAAGAA 1740 

AACTGAGAAA CCCATCGACA GTGAAACCAA 1800 

TGAGCACCAG GTGTTAAAAA CCATTATAAA 1860 

CCAGCAAGCC CGTCATCTTC CCTTTCCTCC 1920 

AAACACAAGG GAGGATTTTG GGTGGCCAAA 1980 

TTTAAATCAT TTCATGTTAG TCTTACCGAA 2040 

CAGATCTTAC AAGTTTATGT GGCACACAAT 2100 

AAAGAGACAA ATAGTAGGCT AAAAACATTT 2160 

TCAGAATACA ATTCAGTTTT AGTTTCAAAA 2220 

AGTTGGACAG TAAGATTTTA TCCCTTTCTC 2280 

CTAGGTGCAC ATATACATCT AGCACAGCTG 2340 

TGGCCTTGAG CAAGTCCCAT CTCACCTCTA 2400 

GACTTCCCTA GAAGTCTTCA TGGTCTCTTC 2460 

CACCTGCCCT CTGTTTOCCC TCAGAACACC 2520 

GAAAAGACAC TTCAATGGGA GTGAACATTT 2580 

GGTCACCAGG TCCTGTGAGC AAAGTGCAGG 2640 

CAGGAGTGGG ATTATTCATC AAACTCTTTG 2700 

CTJ'CriTCCT ACTCTGGGAA GAATGTCTCC 2760 

AAACAGATGA GAAGACTAGC AGCTAGCAAG 2820 

AGAGCTAGGA AAGAGTTGAG CACAGGCAAC 2880 

AGTACAGGTC TTCTTTAAGG AAGAATAAAA 2940 

TTCACCTGAA ACACTTTTTC TCGAGTCCAA 3000 

AAACATAAGA CGACTTATAT ATTTGAAAGA 3060 

CCATGAGTTG AGTGGGTATT TCTTATTTGA 3120 

AATGAGGGAA ACAAAATATT TATTTGTTTT 3180 

TTCAGCTGGA GTGGGGTTTG CTTTTGTTTT 3240 

TGAATCAGTA TGAAGACACT GTCAATGAGG 3300 

TTCAGGCAAG GCAGCTCCCA GGTTTAGAGA 3360 

CAAAGACGCT GAGTGGGAGC TGTCAGGCAG 3420 

ATGGTGAAGA ATGGACTTAA TTATGCTAAC 3480 

ATATACAATT AGAGATATTT TTATATAGAC 3540 

TAGGCTGTGG TGCAGTAACC ATTTAATGTC 3600 

GTTAAAGTAC GTGGCTGTCC TCTTAAGACA 3660 

TTAATTCTGT TACACAATAT GTGTTTTTTA 3720 

TGTGGGGAGC CCATCCTCTC GCCAAGCCAT 3780 

TGGACTGGGA AGCATTACTT TGTAGATGTA 3840 
T 



GTILGPTLRP 
MRAWYYKTT 
NLVEACFKQP 
VNALGLWFS 
IVEMEDMGVI 
GTSSSSATIiP 
LNFGQIITIS 
TTNVLGDSLG 
KM 



11 
I 

MGGEMERFQQ 
YRMSYREVKY 
TIIAWIGII 
KTNYEKRSFK 
MCFGFVIGNM 
GGQLAMYTVT 
ITPKCLEENN 
ITATAASIGA 
AGIVEHLSRH 



21 
I 

GVRKRTLLAK 
FSFPGELLMR 
IVIIIHPGKG 
VPIQANETLV 
KEQGQALREF 
VIVGLLIHAV 
GVDKRVTRFV 
AGIPQAGLVT 
BLKNKDVEMG 



31 

I 

KKVQNITKED 
MLQMLVLPLI 
TKENMHREGK 
GAVINNVSKA 
FDSLNEAIMR 
IVLPLLYFLV 
LPVGATIKMD 
MVIVLTSVGL 



41 
I 

VKSYLFRNAF 
ISSLVTGMAA 
rVRVTAADAF 
METLTRITEE 
LVAVIMWYAP 
TRKNPWVFIG 
GTALYEALAA 
PTDDITLIIA 
KPYQLIAQDN 



51 
I 

VLLTVTAVIV 
LDSKASGKMG 
LDLIRNMFPP 
LVPVPGSVNG 
VGILFLIAGK 
GLLQALITAL 
IFIAQVNNFE 
VDWFLDRIiRT 
ETEKPIDSET 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 259 DNA sequence 

Nucleic Acid Accession #: NMJ>21948.1 

Coding sequence: 48.. 2783 



1 
I 

TGTGGCACTG 
TCCTGCCCCT 
TGGAAGGAGA 
TGCAGGGCGT 
CGCCGAGCCG 
GCCGGGAGGC 
GGTTCCGCGT 
GCGAGCTGCG 
ACAGCAGCGA 
CTGCCCGCTA 
ACATCGCCAC 



ACGGAGACAT 
TCTATGATGT 
CAGAGAAGCT 
CCACCACGGG 
GGCTAGCTGA 
GCTTGCCTGG 
ACAGCCGCTT 
CCTCCAACCC 
AGACCCTGGA 
TCTACTCCAT 
CAGAGGCCGC 



11 
I 

CCTGCGTACC 
GCTGGCAGCC 
CAGCTCAGAG 
GCTCGGCGGC 
CCGGGCTGTG 
AGAGGTGCTG 
GGCACTGCCT 
CCCCAACGAC 
CGCTGTGGAG 
TGCTTTCTOC 
CCCGGAGCAG 
GTCGGATCAG 
GGATGGCTTC 
GTACTGTTAT 
GACATTGGAG 
CCAACTGTAT 
TGGCAGTGTG 
TGTCAAGACT 
CAACGTCTAC 
AGCCTCCAAC 
GGAACTGCAG 
CCCCATCATG 
TAGGACGCTC 




GACCGCGCTT 
GCCCTCACCA 
CTGGGCTCTC 



GCGTACCCAG 
TCAGGTATCT 
GTCAAGGTCA 
TTTTCTGGGG 
CTCTATGCOG 
ACCGTGAGGT 
CCCGGGGTCC 
GCTGAAGACC 
GAAGCACGGG 
GCAGCCTGGG 
CGCTACCCCA 
CTCTTCCTCT 
TGCTTCOGAG 
CCAGCCTCTG 
CTGCCTCAGG 
GAGGAOGGAG 
CTAGAATTTG 



31 
I 

CCTGGGTAGC 
CCCAGGCTCC 
TTCGCGTGCG 
TCCCTTGCCA 
OGOGGGTCAA 
GAGTGCGCGT 
OGTCGCTCAC 
ATCGCTGTGA 
AAGGGGTCGT 
CCCAGGAGGC 
CCTACCTTGG 
ATCOCATCCA 
GGAACTATGG 
TAAATGGAGA 
OGTACTGCCA 
ATGGTGGCCT 
TCGTCACACC 
TCCCCAACCA 
ACTCGGCCCA 
ATGGACTAGA 
AAGCCACAGA 
GAGGTGGAAG 
AAACACAATC 



41 
I 

CTGCAGCATG 
TGCAGCTTTA 
CATCGOGGGC 
CGTCCACTAC 
GTGGACTTTC 
CAAGGTGAAC 
CGACGTCTCC 
GGTCCAGCAC 
CTTTCTCTAC 
CTGTGCCCGC 
GGGCTATGAG 
GACCCCACGA 
TGTGGTGGAC 
ATTGTTCCTG 
GGAGCGGGGT 
GGACCACTGC 
CAGCCAGCGC 
GACTGGCTTC 
GCCTTCTGCC 
GGCTATOGTC 
GAGTGAATCC 
CTCCACTCCA 
CATGGTACCG 



51 
I 

GCCCAGCTGT 
GCAGATGTTC 
GACGCGCCAC 
CTGCGGCCAC 
CTGTCCCGGG 
GAGGCCTACC 
CTGGOGCTGA 
GGCATCGATG 
CGAGAGGGCT 
ATTGGAGCCC 
CAATGTGATG 
GAGGCCTGTT 
CCGGATGACC 
GGTGACCCTC 
GCAGAGATTG 
AGCCCAGGGT 
TGTGGTGGGG 
CCCAATAAGC 
ATCCCTGAGG 
ACAGTGACAG 
CGTGGGGCCA 
GAAGACCCAG 
CCCAOGGGGT 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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1380 
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TCTCAGAAGA GGAAGGTAAG GCATTGGAGG AAGAAGAGAA ATATGAAGAT GAAGAAGAGA 1440 

AAGAGGAGGA AGAAGAAGAG GAGGAGGTGG AGGATGAGGC TCTGTGGGCA TGGCCCAGCG 1500 

AGCTCAGCAG CCCGGGCCCT GAGGCCTCTC TCCCCACTGA GCCAGCAGCC CAGGAGGAGT 1560 

- CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TGCAGCCTGG TGCATCACCA CTTCCTGATG 1620 

GAGAGTCAGA AGCTTCCAGG CCTCCAAGGG TCCATGGACC ACCTACTGAG ACTCTGCCCA 1680 

CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 1740 

TGGGGGAGGC AACTGGTGGT CCTGAGCTAT CTGGGGTCCC TCGAGGAGAG AGCGAGGAGA 1800 

CAGGAAGCTC CGAGGGTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA I860 

„ GGGAGCTGGA GGCCCCCTCT GAAGATAATT CTGGAAGAAC TGCCCCAGCA GGGACCTCAG 1920 

1U TGCAGGCOCA GCCAGTGCTG CCCACTGACA GCGCCAGCCG AGGTGGAGTG GCCGTGGTCC 1980 

CCGCATCAGG TGACTGTGTC CCCAGCCCCT GCCACAATGG TGGGACATGC TTGGAGGAGG 2040 

AGGAAGGGGT CCGCTGCCTA TGTCTGCCTG GCTATGGGGG GGACCTGTGC GATGTTGGCC 2100 

TCCGCTTCTG CAACCCCGGC TGGGACGCCT TCCAGGGCGC CTGCTACAAG CACTTTTCCA 2160 

i c CACGAAGGAG CTGGGAGGAG GCAGAGACCC AGTGCCGGAT GTAOGGOGGG CATCTGGCCA 2220 

13 GCATCAGCAC ACCCGAGGAA CAGGACTTCA TCAACAACCG GTACCGGGAG TACCAGTGGA 2280 

TCGGACTCAA CGACAGGACC ATCGAAGGCG ACTTCTTGTG GTCGGATGGC GTCCCCCTGC 2340 

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTGGA GAGAACTGCG 2400 

TGGTCATGGT GTGGCATGAT CAGGGACAAT GGAGTGACGT GCCCTGCAAC TACCACCTGT 2460 

OH CCTACACCTG CAAGATGGGG CTGGTQTCCT GTGGGCCGCC ACCGGAGCTG CCCCTGGCTC 2520 

ZU AAGTGTTCGG CCGCCCACGG CTGCGCTATG AGGTGGACAC TGTGCTTCGC TACCGGTGCC 2580 

GGGAAGGACT GGCCCAGCGC AATCTGCCGC TGATCCGATG CCAAGAGAAC GGTCGTTGGG 2640 

AGGCCCCCCA GATCTCCTGT GTGCCCAGAA GACCTGCCCG AGCTCTGCAC CCAGAGGAGG 2700 

ACCCAGAAGG ACGTCAGGGG AGGCTACTGG GACGCTGGAA GGCGCTGTTG ATCCCCCCTT 2760 

0 CCAGCCCCAT GCCAGGTCCC TAGGGGGCAA GGCCTTGAAC ACTGCCGGCC ACAGCACTGC 2820 
Z ^ CCTGTCACCC AAATTTTCCC TCACACCCTG CGCTCACCAC AGGAAGTGAC AACATGAC 

Seq ID NO: 260 Protein sequence 
Protein Accession #: NP_068767.l 

30 1 11 21 31 41 51 

III III 

MAQLFLPLLA ALVLAQAPAA LADVLEGDSS EDRAPRVRIA GDAPLQGVLG GALTIPCHVH 60 

YLRPPPSRRA VLGSPRVKWT FLSRGREAEV LVARGVRVKV NEAYRFRVAL PAYPASLTDV 120 

_ - SLALSELRPN DSGIYRCEVQ HGIDDSSDAV EVKVKGWFL YREGSARYAF SFSGAQEACA 180 

J O RIGAHIATPE QLYAAYLGGY EQCDAGWLSD QTVRYPIQTP REACYGDMDG PPGVRNYGW 240 

DPDDLYDVYC YAEDLNGELP LGDPPEKLTL EEARAYCQER GAEIATTGQL YAAWDGGLDH 300 

CSPGWLADGS VRYPIVTPSQ RCGGGLPGVK TLFLFPNQTG FPNKHSRFNV YCFRDSAQPS 360 

AIPEASNPAS NPASDGLEAI VTVTETLEEL QLPQEATESE SRGAIYSIPI MEDGGGGSST 420 

. - PEDPAEAPRT LLEFETQSMV PPTGPSEBEG KALEEEEKYE D EBBKEBBEE EEEVEDEALW 480 

AWPSELSSPG PEASLPTEPA AQBESLSQAP ARAVLQPGAS PLPDGESEAS RPPRVHGPPT 540 

ETLPTPRERN LASPSPSTLV EARBVGEATG GPELSGVPRG ESEETGSSEG APSLLPATRA 600 

PEGTRELEAP SEDNSGRTAP AGTSVQAQPV LPTDSASRGG VAWPASGDC VPSPCHNGGT 660 

CLEEEEGVRC LCLPGYGGDL CDVGLRFCNP GWDAFQGACY KHFSTRRSWE EAETQCRMYG 720 

AHLAS1STPE EQDFINNRYR EYQWIGLNDR TIEGDPLWSD GVPLLYENWN PGQPDSYPLS 780 

HO GENCWMVWH DQGQWSDVPC NYHLSYTCKM GLVSCGPPPE LPLAQVFGRP RLRYEVDTVL 840 

RYRCREGIAQ RNLPLIRCQE NGRWEAPQIS CVPRRPARAL BPBEDPEGRQ GRLLGRWKAL 900 
LIPPSSPMPG P 

-~ Seq ID NO: 261 DNA sequence 
JU Nucleic Acid Accession #i NMJ)04386.l 
Coding sequence i 2.. 3 967 

* 11 21 31 41 51 

« 1 I i I I I 

J J GATGGGGGCC CCGTTTGTCT GGGCCTTGGG CCTTTTGATG CTGCAGATGC TGCTCTTTGT 60 

GGCTGGGGAA CAGGGCACAC AGGATATCAC CGATGCCAGC GAAAGGGGGC TCCACATGCA 120 

GAAGCTGGGG TCTGGGTCAG TGCAGGCTGC GCTGGOGGAG CTGGTGGCCC TGCCCTGTCT 180 

CTTTACCCTG CAGCCAOGGC CAAGCGCAGC CCGAGATGCC CCTCGGATAA AGTGGACCAA 240 

s-r\ GGTGOGGACT GCGTCGGGCC AGCGACAGGA CTTGCCCATC CTGGTGGCCA AGGACAATGT 300 

DU CGTGAGGGTG GCCAAAAGCT GGCAGGGACG AGTGTCACTG CCTTCCTACC CCCGGCGCCG 360 

AGCCAACGCC ACGCTACTTC TGGGGCCACT GAGGGCCAGT QACTCTGGGC TGTACOGCTG 420 

CCAGGTGGTG AGGGGCATCG AGGATGAGCA GGACCTGGTG CCCTTGGAGG TGACAGGTGT 480 

TGTGTTCCAC TACCGATCAG CCCGGGACCG CTATGCACTG ACCTTOGCTG AGGCCCAGGA 540 

GGCCTGCCGT CTCAGCTCAG CCATCATTGC AGCCCCTCGG CATCTACAGG CTGCCTTTGA 600 

GGATGGCTTT GACAACTGTG ATGCTGGCTG GCTCTCTGAC CGCACTGTTC GGTATCCTAT 660 

CACCCAGTCC CGTCCTGGTT GCTATGGCGA CCGTAGCAGC CTTCCAGGGG TTCGGAGCTA 720 

TGGGAGGCGC AACCCACAGG AACTCTACGA TGTGTATTGC TTTGCCCGGG AGCTGGGGGG 780 

CGAGGTCTTC TACGTGGGCC CGGCCCGCCG CCTGACACTG GCCGGCGCGC GTGCACAGTG 840 

_~ CCGCCGCCAG GGTGCCGCGC TGGCCTCGGT GGGACAGCTG CACCTGGCCT GGCATGAGGG 900 

/U CCTGGACCAG TGCGACCCGG GCTGGCTGGC CGACGGCAGC GTGCGCTACC CGATCCAGAC 960 

GCCGCGCCGG CGCTGCGGGG GCCCAGCCCC GGGCGTGCGC ACCGTCTACC GCTTCGCTAA 1020 

CCGGACCGGC TTCCCCTCAC CCGCCGAGCG CTTCGACGCC TACTGCTTCC GAGCTCATCA 1080 

CCCCACGTCA CAACATGGAG ACCTAGAGAC CCCATCCTCT GGGGATGAGG GGGAGATTCT 1140 

GTCAGCAGAG GGGCCCCCAG TTAGAGAACT GGAGCCCACC CTGGAGGAGG AAGAGGTGGT 1200 

/ J CACCCCTGAC TTCCAGGAGC CTCTGGTGTC CAGTGGGGAA GAAGAAACCC TGATTTTGGA 1260 

GGAGAAGCAG GAGTCTCAAC AQACCCTCAG CCCTACCCCT GGGGACCCCA TGCTGGCCTC 1320 

ATGGCCCACT GGGGAAGTGT GGCTAAGCAC GGTGGCCCCC AGCCCTAGOG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACACGGAGGT GGCOCCAACT GACCCTATGC CTAGGAGAAG 1440 

on GGGGCGCTTC AAAGGGTTGA ATGGGOGCTA CTTCCAGCAG CAGGAACCGG AGCCGGGGCT 1500 

OU GCAAGGGGGG ATGGAGGCCA GCGCCCAGCC CCCCACCTCA GAGGCTGCAG TGAACCAAAT 1560 

GGAGCCTCCG TTGGCCATGG CAGTCACAGA GATGTTGGGC AGTGGCCAGA GCCGGAGCCC 1620 

CTGGGCTGAT CTGACCAATG AGGTGGATAT GCCTGGAGCT GGTTCTGCTG GTGGCAAGAG 1680 

CTCCCCAGAG CCCTGGCTGT GGCCCCCTAC CATGGTCCCA CCCAGCATCT CAGGCCACAG 1740 

CAGGGCCCCT GTCCTGGAGC TAGAGAAAGC CGAGGGCCCC AGTGCCAGGC CAGCCACCCC 1800 
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AGACCTGTTT TGGTCCCCCT TGQAGGCCAC TGTCTCAGCT CCCAGCCCTG CCOCCTGGGA 1860 
GGCATTCCCT GTGGCCACCT CCCCAGATCT CCCTATGATG GCCATGCTGC GTGGTCCCAA 1920 
AGAGTGGATG CTACCACACC CCACCCCCAT CTCCACCGAG GCCAATAGAG TTGAGGCACA 1980 
TGGTGAGGCC ACCGCCACGG CTCCACCCTC CCCTGCTGCA GAGACCAAGG TGTATTCCCT 2040 
GCCTCTCTCT TTGACCCCAA CAGGACAGGG TGGAGAGGCC ATGCCCACAA CACCTGAGTC 2100 
CCCCAGGGCA GACTTCAGAG AAACTGGGGA GACCAGCCCT GCTCAGGTCA ACAAAGCTGA 2160 
GCACTCCAGC TCCAGCCCAT GGCCTTCTGT AAACAGGAAT GTGGCTGTAG GTTTTGTCCC 2220 
CACTGAGACT GCGACTGAGC CAAOGGGCCT CAGGGGTATC CCGGGGTCTG AGTCTGGGGT 2280 
CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGGA 2340 
CCCCTGGCCC TCAGTGTACA GCAAAGGGCT GGATGCAAGT TCCCCATCTG CCCCCCTGGG 2400 
GAGCCCTGGA GTCTTCTTGG TACCCAAAGT CACCCCAAAT TTGGAGCCTT GGGTTGCTAC 2460 
AGATGAAGGA CCCACTGTGA ATCCCATGGA TTCCACAGTC ACGCOGGCCC CCAGTGATGC 2520 
TAGTGGAATT TGGGAACCTG GATCCCAGGT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 
CCCTCAGGTG GCCCTGGATA CAAGCATTGT GACGCCCCTC ACGACCCTGG AGCAGGGGGA 2640 
CAAGGTTGGA GTTCCAGCCA TQTCTACACT GGGCTCCTCA AGCTCCCAAC CCCACCCAGA 2700 
GCCAGAGGAT CAGGTGGAGA CCCAGGGAAC ATCAGGAGCT TCAGTGCCTC CGCATCAGAG 2760 
CAGTCCCCTA GGGAAACCGG CTGTTCCTCC TGGGACACCG ACTGCAGCCA GTGTGGGCGA 2820 
GTCTGCCTCA GTTTCCTCAG GGGAGCCTAC GGTACCGTGG GACCCCTCCA GCACCCTGCT 2880 
GCCTGTCACC CTGGGCATAG AGGACTTCGA ACTGGAGGTC CTGGCAGGGA GCCCGGGTGT 2940 
AGAGAGCTTC TGGGAGGAGG TGGCAAGTGG AGAGGAGCCA GCCCTGCCAG GGACCCCTAT 3000 
GAATGCAGGT GCGGAGGAGG TGCACTCAGA TCCCTGTGAG AACAACCCTT GTCTTCATGG 3060 
AGGGACATGT AATGCCAATG GCACCATGTA TGGCTGTAGC TGTGATCAGG GCTTOGCCGG 3120 
GGAGAACTGT GAGATTGACA TTGATGACTG CCTCTGCAGC CCCTGTGAGA ATGGAGGCAC 3180 
CTGTATTGAT GAGGTCAATG GCTTTGTCTG CCTTTGCCTC COCAGCTATG GGGGCAGCTT 3240 
TTGTGAGAAA GACACCGAGG GCTGTGACCG CGGCTGGCAT AAGTTCCAGG GCCACTGTTA 3300 
CCGCTATTTT GCCCACCGGA GGGCATGGGA AGATGCCGAG AAGGACTGCC GCCGCCGCTC 3360 
OGGCCACCTG ACCAGCGTCC ACTCACCGGA GGAACACAGC TTCATTAATA GCTTTGGGCA 3420 
TGAAAACACG TGGATCGGCC TGAACGACAG GATCGTGGAG AGAGATTTCC AGTGGACGGA 3480 
CAACACCGGG CTGCAATTTG AGAACTGGCG AGAGAACCAG COGGACAATT TCTTCGOGGG 3540 
TGGCGAGGAC TGTGTGGTGA TGGTGGCGCA TGAAAGCGGG CGCTGGAACG ATGTCCCCTG 3600 
CAACTACAAC CTACCCTATG TCTGCAAGAA GGGCACAGTG CTCTGTGGTC CCCCTCCGGC 3660 
AGTGGAGAAT GCCTCACTCA TCGGTGCCCG CAAGGCCAAG AACAATGTCC ATGCCACTGT 3720 
AAGGTACCAG TGCAATGAAG GATTTGCCCA GCACCATGTG GTCACCATTC GATGCCGGAG 3780 
CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGAGGTT CACATCGGAT 3840 
GCGGGGACAC CACCACCACC ACCAACACCA CCACCAGCAT CACCACCACA AATCCCGCAA 3900 
GGAGCGCAGA AAACACAAGA AACACCCAAC GGAGGACTGG GAGAAGGACG AAGGGAATTT 3960, 
TTGCTGAAGA ACCAGAAAAA AGAAAGCACA ACACCTTTCC CATGCCTCCT CTGGAGCCTT 402 o' 
CGCCTGGGGA GACAGAACCC AGAGAGAAAC AAGAGAGTCC AGAAGTCCCT GAACCCCAAA 4080 
CTGTTCTCGC AAAAAAAATA TTCCTTTGAA CAAAGGTCTT CTTTTCCTTT TTTTACATAC 4140 
ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200 
CTCTGGGAGT GTGTCCCAGC TGAGGGAAGC ACAAGTAGCA AAGCTCATTG GTCTGGTCTC 4260 
TTGTTTGCCA GGCTGATTGA AGCAGGCCTT GATGAGGGTG CATGAGTGTA TGTTTGCATT 4320 
CACATGAAGG AATTGCTTTT CACACCAGAA ATTCAGACTT AGTCAATGTT GGCTGAATTC 4380 
CTAAATCCAG GAAGAAGCCT GGACGTAGGG TCATTAGCTT TGGGAATAGA AGGCTACACA 4440 
GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTCCCTTT ACCCTGGACT TCAGCCCAAG 4500 
TTCCGTCTTT GGTCTTGGTG GATAAACACA CAGTGTGGAG ATCCCACGTA CTGCATTTTA 4S60 
GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAGGAGTGAG AATGATCAAA 4620 
GCAATATGTA GGTGATGGAG GGAGAGTGTA TTGCTAACCC TTCCAGGTCT AGTCCAGCGC 4680 
TGAGATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAGA CGAGAGGATA 4740 
TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAGA CAAAGAACCT 4800 
CCTTCCTGGA CCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AGTGCACATA CCCAGTCCTT 4860 
AAGCAGACAT TGGTAGTGCC CCTGCCCTGa GTCCCACTCC TGCCCCACCC CACCCTTGTC 4920 
CCTGGCGATT GCCTGGTGGT CTAGAAACAC TTAAAACTTG AAGTAGTGAC ACCTACCTGC 4980 
GGT CATAT TG TAGAGAGATG CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 
ACATTTTTCT CTTGTAGATT TTAATTTTTT AAGTGGGAAA GAACTCACCT TGCCTTCCTC 5100 
CCCCAAATGT GCAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 
ATCTCTGGCA GGAAAGATCC ACAGCTTTTC CTCCATGTCT GTTACTCACT TTCAGCAGTC 5220 
CGGGTAAAAT CTGTGGATCA GGGTTAAAAA AGCACCGTGG AGAATGGCCC TCTTCAGGAA 5280 
AGAAAAATAA GCAAATGAAT GGTCCACCTA GGGGTTCAGT AAAGAAAGAA ATGTGTTAAC 5340 
TGAGCCTGAA TCCCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAOAA GTAGACACCA 5400 
TGCTAAAGAC TTACATACAA TCTCCTTGAA TCTTCTCAAT AGCCCATTQA CTTAGAAACT 5460 
GTTACTTTCC CATTTTACAC ACAGTGAAAC TGAGGCTCAG ATATAAAGGA AAGGTACTGG 5520 
CTTGAAGTCA CAACCACGAC AGGAGTAAGG ATTTGGAATA AGGATTTGGT CCTGTTTTCT 5580 
GGACCAAATC CTTACTCTGG CTCTGCTTAC ACTTTCTCTC CATCACCAAA TCCTTACTCC 5640 
AAATCCAGAA GTCAGAGCCA ACXCCCATCT TGGTTCTGAC CCAAATCCTG CTCTGGACTC 5700 
TGGAGAGGAG ATTGAAATAT AATTGCACCC TCATACACAT TTAGGAAATG GTTAAGAAGT 5760 
GTAAACTGAA CCCTTATCCT TGTCTTCAAT CTTCCTCCCT GTAGACATCT ATCTTATTAT 5820 
GGTTATTATT CAGAAAACCC AGGGATACAQ G TTTGTCTTC TTACTTTGAT AACTCTTCTT 5880 
AGTTTAAAAT AATAATAATA ACACATCTTT GGTCATCTAT GTCACACAAA AATTTTCCTT 5940 
TGTTTGCGGG GGGCTGGGGA TGCAGTGTTT TTTGGGGGGT CTTGGTTTAT GCTCCCTGCC 6000 
CTTGAGCCCC TCAGCOGTTT GCOCTGCCCC CACCTCGGCT CCATGGTGGG AGGGGGCTCT 6060 
GGTCTTTTCT AAAGTGGGCG tflTMtTrr TGATCTTTCC CTTTTGGATG TGCGTGTGTG 6120 
TCTGCGTGTG CCATGTGCGT GGCACGCATA TGAGTGTGTG TGCGTGTGAA CGGCTTTGGG 6180 
TCCTGCTGGT TTTGCTGTGA GCTGCAGTGT TCTGTGGGTC TGTGGTATCT GACACTGTGG 6240 
ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA CAGTTCAAAA AAAAAAAAAA 6300 
AAAAAAAAAA 

Seq ID NO: 262 Protein sequence 
Protein Accession tt: np 004377.1 



1 11 21 31 41 51 
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MGAPFVWALG LLMLQMLLFV AGEQGTQDIT DASERGLHMQ KLGSGSVQAA LAELVALPCL 
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FTLQPRPSAA RDAPRIKKTK VRTASGQRQD LPILVAKDNV VRVAKSWQGR VSLPSYPRKR 120 

ANATLLLGPL RASDSGLYRC QWRG;EDEQ DLVPLEVTGV VFHYRSARDR YALTFAEAQE 180 • 

ACRLSSAIIA APRHLQAAFE DGPDNCDAGW LSDRTVRYPI TQSRPGCYGD RSSLPGVRSY 240 

GRRNPQELYD VYCPARELGO EVFYVGPARR ^TLAGARAQC RRQGAALASV GQLHLAWHEG 300 

5 LDQCDPGWLA DGSVRYPIQT PRRRCGGPAP jpVRTVYRFAN RTGPPSPAER FDAYCFRAHH 360 

PTSQHGDLBT PSSGDEGEIL SAEGPPVREL. EPTLEEEEW TPDFQEPLVS SGEEETLILE 420 

EKQESQQTLS PTPGDPMLAS WPTGEJ7WL9* VAPSPSDMGA GTAASSHTEV APTDPMPRRR 480 

GRFKGLNGRY FQQQEPEPGL, QGGMEASAQP PTSEAAVNQM EPPLAMAVTE MLGSGQSRSP S40 

HADLTNEVDM PQAGSAGGKS SPEPWLWPPT MVPPSISGHS RAPVLELEKA EGPSARPATP 600 

10 DLFWSPLEAT VSAPSPAPWE AFPVATSPDL PftflAMLRGPK EWMLPHPTPI STBANRVEAH 660 

GEATATAPPS PAAETKVYSI* PLSLTPTGQG GEAMPTTPES PRADFRETGE TSPAQVNKAE 720 

HSSSSPWPSV NRNVAVGFVP TETATEPTGL" RGIPGSESGV FDTAESPTSG LQATVDEVQD 780 

PWPSVYSKGL DASSPSAPLG SPGVFLVPKV TPNLEPWVAT DEGPTVNPMD STVTPAPSDA 840 

SGIWEPGSQV FEEAESTTLS PQVALDTSIV TPLTTLEQGD KVGVPAMSTL GSSSSQPHPB 900 

15 PEDQVETQGT SGASVPPHQS SPLGKPAVPP GTPTAASVGE SASVSSGBPT VPWDPSSTU* 960 

PVTLGIEDFE LEVLAGSPGV ESFWEEVASG EEPALPGTPM NAGABEVHSD PCENNPCLHG 1020 

GTCNANGTMY GCSCDQGFAG ENCEIDIDDC LCSPCENGGT CIDEVNGFVC LCLPSYGGSF 1080 

CEKDTEGCDR GWHKFQGHCY RYFAHRRAWE DAEKDCRRRS GHLTSVHSPE EHSFINSFGH 1140 

ENTWIGLNDR IVERDFQWTD NTGLQFENWR ENQPDNPFAG GEDCWMVAH ESGRWNDVPC 1200 

20 NYNLPYVCKK GTVLCGPPPA VENASLIGAR KAKNNVHATV RYQCNEGFAQ HHWTIRCRS 1260 

KGKWDRPQIV CTKPRRSHRM RGHHHHHQHH HQHHHHKSRK ERRKHKKHPT EDWEKDEGNF 1320 
C 

Seq ID NO: 263 DMA sequence 
25 Nucleic Acid Accession 8* Eos sequence 
Coding sequence : 7 . . 2 0 8 5 

1 11 21 31 41 51 

o n 1 1 I I I I 

3K) GCCGCGATGG CCAGCACCAG GAGTATCGAG CTGGAGCACT TTGAGGAACG GGACAAAAGG 60 

CCGCGGCCGG GGTCGCGGAG AGGGGCCCCC AGCTCCTCCG GGGGCAGCAG CAGCTCGGGC 120 

CCCAAGGGGA ACGGGCTCAT CCCCAGTCOG GCGCACAGTG CCCACTGCAG CTTCTACCGC 180 

ACGCGGACCC TGCAGGCCCT CAGCTOGGAG AAGAAGGCCA AGAAGGCGCG CTTCTACCGG 240 

AACGGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACCG CTTCCGGTCC 300 

35 TTCGATGCGC TCCTCATAGA GCTCACCOGC TCCCTGTCGG ACAACGTGAA CCTGCCCCAG 360 

GGTGTCCGCA CTATCTACAC CATCGACGGC AGCCGGAAGG TCACCAGCCT GGACGAGCTG 420 

CTGGAAGGTG AGAGTTAOGT GTGTGCATCC AATGAACCAT TTCGTAAAGT CGATTACACC 480 

AAAAATATTA ATCCAAACTG GTCTGTGAAC ATCAAGGGTG GGACATCCCG AGOGCTGGCT 540 

GCTGCCTCCT CTGTGAAAAG TGAAGTAAAA GAAAGTAAAG ATTTCATCAA ACCCAAGTTA 600 

40 GTGACTGTGA TTCGAAGTGG AGTGAAGCCT AGAAAAGCCG TGCGGATCCT TCTGAATAAA 660 

AAGACTGCTC ATTCCTTTGA ACAAGTCTTA ACAGATATCA CCGAAGCCAT TAAACNAGCC 720 

TCAGGAGTOG TCAAGAGGCT CTGCACCCTG GATGGAAAGC AGGTGAGAGT TACGTGTGTG 780 

CATCTGCCAG ACTTTTTTGG TGATGACGAT GTTTTTATTG CATGTGGACC AGAAAAATTT 840 

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTCGTGTCCT GAAGTCATCT 900 

45 TATTCTCGAT CCTCAGCTGT TAAGTATTCT GGATCCAAAA GCCCTGGGCC CTCTCGACGC 960 

AGCCAGATTT CTGCTCATGG CAGATCTTCT TCCAATGTAA AOGGTGGACC TGAGCTTGAC 1020 

CGTTGCATAA GTCCTGAAGG TGTGAATGGA AACAGATGCT CTGAATCATC AACTCTTCTT 1080 

GAGAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGCC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

50 GGAAAGGAAC ACCTGATTGA GAATGAAGTG TCAATACTGC GCCGAGTGAA ACATCCCAAT 1260 

ATCATTA7GC TGGTCGAGGA GATGGAAACA GCAACTGAGC TCTTTCTGGT GATGGAATTG 1320 

GTCAAAGGTG GAGATCTCTT TGATGCAATT ACTTCGTOGA CCAAGTACAC TGAGAGAGAT 1380 

GGCAGTGCCA TGGTGTACAA CTTAGCCAAT GCCCTCAGGT ATCTCCATGG CCTCAGCATC 1440 

GTGCACAGAG ACATCAAACC AGAGAATCTC TTGGTGTGTG AATATCCTGA TGGAACCAAG 1500 

55 TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGCCCTTT ATACACAGTC 1560 

TGTGGCACAC CCACTTATGT GGCTCCARAA ATCATTGCTG AAACTGGCTA TGGCCTGAAG 1620 

GTGGACATTT GGGCAGCTGG TGTGATCACA TACATACTTC TCTGTGGATT CCCACCATTC 1680 

CGAAGTGAGA ACAATCTCCA GGAAGATCTC TTCGACCAGA TCTTGGCTGG GAAGCTGGAG 1740 

TTTCCGGCCC CCTACTGGGA TAACATCACG GACTCTGCCA AGGAATTAAT CAGTCAAATG 1800 

60 CTTCAGGTAA ATGTTGAAGC TCGGTGTACC GOGGGACAAA TCCTGAGTCA CCCCTGGGTG 1860 

TCAGATGATG CCTCCCAGGA GAATAACATG CAAGCTGAGG TGACAGGTAA ACTAAAACAG 1920 

CACTTTAATA ATGCGCTCCC CAAACAGAAC AGCACTACCA CCGGGGTCTC CGTCATCATG 1980 

GTGAGTGGAA GGCGGCAGGT CTGGCCTGAC TGOGGAGCCG GCCTTGAAGT TTTTGAATTA 2040 

GGTAGCCGGG AGCTGCCCTC ACATGGAAGT TGGTGCCTTC OGTAGTCCTA TTTCATATGA 2100 

65 AGATTGGCTT GGCATGTGGA GGGCACTCAT TCGGCAACTC CCAGGCTTTG GGCACTGTGT 2160 

GGAGGGGCT7 GTGTAGGGAC CAGCAGGCCT GGTGTGAGGG GTCCAGGOGT CAAGGAGCTC 2220 

CTGGCTGGGC CCTCTGGGCA GCTGCTTCCA CTCTTGTCTC TGCCTTCTCA TCTAGAGAGA 2280 

CTCCCAAGCC CTGGAGGGGT GTGTTGTGTT AGGAATTAAC TCCCTGCCTA CCCCAAGGCC 2340 

TCAGAAATAG ATTATTAGAG ATGTGAATTA TTCTTTGAGA CTTGGGATAA GAAACAGCCA 2400 

70 AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT TTGGG GCTTT 2460 

TAAAGGTACA TAATCAAGGA AAAAAATATA TATTCATTTT TCAGGGTTGG TAACATTTTA 2520 

TGAGATGTCA GTGACAAOGA TGGCCTTATT TTTTTCAGCC TTTTCTTCTT CCAAAATGTT 2580 

TCTTAAGGCA ACTCTCCTAA ATACATAAAC ACAACAAATT AAAATGAAAA GTGACATGAG 2640 

AGTAAATGAA TCAAAAGGAA AAAACATTGA ACCAGAGGTG AGGGCAGCAC ACCOGCAGCA 2700 

75 GCTGTCCAGG CCTGAGCCAA TGCAACCCTG GGCGGGAAGG CCAGCTCACC GTGAGCAGGT 2760 

AGAAGCCAGC CAGCCACCCA GGCAGGGAOC TTGGTTCTCC CCACACACTC CCAGGAGCAG 2820 

GGAACAGGGG TGGAGTGGCC TTTCCCAGAG CTGGAGTTGG CTGCAGCAGC TTTCGAATCA 2880 

GACCTGCCAA GGTGATGGGC GTCTGAGTTT CACATCTGGG CCCCCCGTGA CCCCACTGAG 2940 

_ _ TCCTGACAGC TAAGGATGGG CCACCTCCAC AGCTCCGTCA CTCGTACTTG GGACAGGCCT 3000 

80 CTCATCCTCT GGGAAGGTCC TCCTTGTTTC CTACCCAACT AGAAGGGAAA CAGTGGCATA 3060 

TTCTCATGGT ACATGGTTGT CTGAAAGCCT TACCTAGGAA GAOGCAGGGT CTAGATAGAA 3120 

GCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATCC CCCACACTCC 3180 
CCACACCCCC CACACCCCCC ACATCCCCAC CATAATTACC CCCACCTCCA AATATCTCAT 
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Seq ID NO: 264 Protein sequence 
Protein Accession #: Eos sequence 

- 1 11 21 31 41 51 
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MASTRSIELE HFEERDKRPR PGSRRGAPSS SGGSSSSGPK GNGLIPSPAH SAHCSFYRTR 60 

TLQALSSEKK AKKARFYRNG DRYFKGLVFA ISSDRFRSFD ALLIBLTRSL SDNVNLPQGV 120 

RTIYTIDGSR KVTSLDELLB GESYVCASNE PPRKVDYTKN INFNWSVNIK GGTSRALAAA 180 

SSVKSEVKES KDFIKPKLVT VIRSGVKPRK AVRILLNKKT AHSFEQVLTD ITEAIKXASG 240 

10 WKRLCTLDG KQVRVTCVHL PDFFGDDDVP IACGPEKFRY AQDDFVLDHS ECRVLKSSYS 300 

RSSAVKYSGS KSPGPSRRSQ ISAHGRSSSN VNGGPELDRC ISPEGVNGNR CSESSTLLBK 360 

YKIGKVIGDG NFAWKECID RSTGKEFALK IIDKAKCCGK EHLIENEVSI LRRVKHPNII 420 

MLVEEMETAT ELFLVMELVK GGDLFDAITS STKYTERDGS AMVYNLANAL RYLHGLSIVH 480 

RDIKPENLLV CEYPDGTKSL KLGDFGLATV VEGPLYTVCG TPTYVAPXII AETGYGLKVD 540 

15 IWAAGVITYI LLCGFPPFRS ENNLQEDLFD QILAGKLEFP APYWDNITDS AKELISQMLQ 600 

VNVRARCTAG QILSHPWVSD DASQENNMQA EVTGKLKQHF NNALPKQNST TTGVSVIMVS 660 
GRRQVWPDCG AGLEVFELGS RELPSHG5WC LP 

Seq ID NO: 26S DNA sequence 
20 Nucleic Acid Accession ft: AB020684.1 
Coding sequence : 1 . . 1744 

1 11 21 31 41 51 

0 c I I 1,1 I I 

ZD CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60 

TTGGTACCGG ATTTATACCA AAATAATGGA* CTTGATTGGT ATTCAAACCA AGATATGTTG 120 

GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180 

TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240 

CATATATGGC ACATATTTAA GTGGCAGCCO ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300 

30 CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT .360 

CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420 

ACTTTATAGA GGAAGCTTGA TTOCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480 

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540 

CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600 

35 ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660 

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720 

TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780 

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900 

40 TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960 

TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080 

ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140 

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200 

45 CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320 

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

50 AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

AGAGTCATGG TGTGTAAGAA GATCGAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620 

AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800 

55 GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860 

AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATOACTT ACACCTTTAT 1920 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG I960 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040 

£ GGGTTGTGAC AAGGATQACA TQAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100 

60 TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220 

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280 

CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 

CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATXATC 2400 

65 GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCOCATA TTTTCTGTCA 2460 

AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520 

ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640 

A ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700 

70 TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATA AAACA 2760 

TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820 

CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880 

TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940 

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3000 

75 ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3060 

ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGGGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180 

TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240 

c GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300 

80 CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360 

CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTG GAGG 3420 

CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480 

CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3S40 

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600 
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CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660 

TGCTTA TCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3640 

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA G C TTTTTCCA 3900 

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960 

TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020 

TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140 

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260 

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG 

Seq ID NO: 266 Protein sequence a 
Protein Accession : BAA74900.lv 

1 11 21 31 41 51 

1 I I I I I 

PLVINTLKRF NLYPEVILAS ^ tyYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60 

ACPYVAVIFI LNGLMMALFF * I YGTYL5GSR LGGLVTVLCP FFNHGECTRV MWTPPLRESF 120 

SYPFLVLQML LVTHILRATK LYRGSLIA^C* ISNVPFMLPW QPAQFVLLTQ IASLFAVYW 180 

GYIDICKLRK IIYIHMISLA LCFVLMFGNS MLLTSYYASS LVIIWGILAM KPHPLKTNVS 240 

ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFPSYKDFD TLLYTCAAEF 300 

DPMEKETPLR YTKTLLLPW LWFVATVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA 360 

LQLLAYTALG ILIMRLKLPL TPHMCVMASI* ICSRQIiFGWL FCKVHPGAIV FAILAAMSIQ 420 

GSANLQTQWN IVGBFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 480 

YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV S40 
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEWKE 

Seq ID NO* 267 DNA sequence 
Nucleic Acid Accession #: U26744.1 
Coding sequence: 59 -.1600 

1 11 21 31 41 51 

I I I I I I 

CTTCAAAGAA TATAGATGGT TTTGAAAAGT TCATGCTGTC CCTTCATTGA ATTTTAGAAT 60 

GATTGAAGAT AGTGGGAAAA GAGGAAATAC CATGGCAGAA AGAAGACAGC TGTTTGCAGA 120 

GATGAGGGCT CAAGATCTGG ATCGCATCCG ACTCTCCACC TACAGAACAG CATGCAAGCT 180 

TAGGTTTGTT CAGAAGAAAT GCAATTTGCA CCTGGTGGAC ATATGGAATG TCATAGAAGC 240 

ATTGCGGGAA AATGCTCTGA ACAACCTGGA CCCAAACACT GAACTCAACG TGTCCCGCTT 300 

AGAGGCTGTG CTCTCCACTA TTTTTTACCA GCTCAACAAA CGGATGCCAA CCACTCACCA 360 

AATCCATGTG GAGCAGTCCA TCAGCCTCCT CCTTAACTTC CTGCTTGCAG CGTTTGATCC 420 

GGAAGGCCAT GGTAAAATTT CAGTATTTGC TGTCAAAATG GCTTTAGCCA CATTGTGTGG 480 

AGGGAAGATC ATGGACAAAT TAAGATATAT TTTCTCAATG ATTTCTGACT CCAGTGGGGT 540 

GATGGTTTAT GGACGATATG ACCAATTCCT TCGGGAAGTT CTCAAACTAC CCACGGAAGT 600 

TTTGGAGGGT CCTTCATTTG GTTACACAGA ACAGTCAGCC AGATCCTGTT TCTCCCAACA 660 

GAAAAAAGTC ACGTTAAATG GTTTCTTGGA CACGCTTATG TCAGATCCTC CCCCGCAGTG 720 

TCTGGTCTGG TTGCCTCTTC TGCATCGACT AGCAAATGTG GAAAATGTCT TCCATCCGGT 780 

TGAGTGTTCC TACTGCCACA GTGAGAGTAT GATGGGATTT CGCTACCGAT GCCAACAGTG 840 

TCACAATTAC CAGCTCTGTC AGGACTGCTT CTGGAGGGGA CATGCCGGTG GTTCTCATAG 900 

CAACCAGCAC CAAATGAAAG AGTACACGTC ATGGAAATCA CCTGCTAAGA AGCTGACTAA 960 

TGCATTAAGC AAGTCCCTGA GCTGTGCTTC CAGCCGTGAA CCTTTGCACC CCATGTTCCC 1020 

AGATCAGCCT GAGAAGCCAC TCAACTTGGC TCACATCGTT GATACTTGGC CTCCCAGACC 1080 

TGTAACGAGC ATGAACGACA CCCTGTTCTC CCACTCTGTT CCCTCCTCAG GAAGTCCTTT 1140 

TATTACCAGG AGCATGCTTG AGAGTTCAAA CCGGCTTGAT GAAGAACACA GGCTAATTGC 1200 

CAGGTATGCG GCAAGGCTGG CAGCAGAGTC CTCTTCGTCT CAGCCACCTC AGCAGAGAAG 1260 

TGCTCCTGAC ATCTCTTTCA CCATCGATGC GAATAAGCAG CAAAGGCAGC TGATTGCTGA 1320 

GCTAGAAAAC AAGAACAGAG AAATCTTACA GGAGATCCAG AGACTTOGGC TAGAGCATGA 1380 

ACAAGCTTCT CAGCCCAOGC CAGAGAAGGC ACAGCAAAAC CCCACCCTGC TGGCAGAACT 1440 

CCGGCTCCTC AGACAGCGCA AAGATGAGCT GGAACAGAGA ATGTCTGCTC TCCAGGAGAG 1500 

CCGGAGAGAG CTAATGGTCC AGTTGGAGGG TCTCATGAAG CTACTAAAGG AAGAAGAACT 1560 

GAAGCAGGGA GTAAGTTATG TCCCCTACTG CAGGTCTTAA CTAACAGTGG AGGGGCCTGC 1620 

CGACCTGCGG TTTTCTCATT GCTTTTGCTC TAATGTATGT TCATGCTTCA GTTTGGAAAG 1680 

AGAAAAAAGT CATACTAATT TGCTTCTTTT TCAATGTAGT GCTTGAATTG AGATATATAA 1740 

ATTTAGCATT TTTTATAACT ATCACTACTA TCCACATCAA AAGAAGAACT ATGACATCTT 1800 

TTAGAAAAGG GAACGAATTG TCATTTATTG GAAACATTTT AGATCCCCAG AGGTATAAGT 1860 

TTCAAACCAG TCTTAGCTTT TCAAGTTGTT GATCAGACCC TTCTCTTAAC AGAGAGATAC 1920 

CACAGTCACT AGAGATACCC TGAGGTTCAT GTCATCCCAA AACCCACAGC ACTCAGAAGC 1980 

TAACCTCTAC ACCCACTCAC ACTGTGAGTA TTCAGTTCGG TTTCATTTTA CTGAAAACCT 2040 

GTGAAACCTC TTTTTATAAA AATCAGGCAA TTAAATCCCT TTTCATCACA CAATTATTGA 2100 

GCCTTGTTCC CCATGGCTCA CCAAAATGTG CTCAATTTTG TGAGAGAAAG ACTGTACTCC 2160 

ATAACTGACT ATTCAOGTCC CATCTTTTTG GCTCTTCCCC AAAGCAGAAT CCTTACTGTT 2220 

GGTTGACAGT AATCTCTTTT TAAAAAGTAA CTCTCAGCTT TTTCCTTAGC ACCAGAGCCT 2280 

TTCGGCTCCG GGAGACGAGA GGGTCATTAC ATACTTTTTT TTTTTTCTGG AAATAGGGGC 2340 

ATTGTGACTT TATAGCCTAA ACTGGAGCTG TCTGAACCTG TGGTCAGGCT CAAGAGCCAG 2400 
CAGGGGGAGC AGCAAACTC 

Seq ID NO: 268 Protein sequence 
Protein Accession #s AAC50426.1 

1 11 21 31 41 51 

I I I I I I 



1056 



WO 03/042661 



MIEDSGKRGN TMAERRQLFA EMRAQDLDRI RLSTYRTACK LRPVQKKCNIi HLVDIWNVIE 60 

ALRENALNNL DPNTELNVSR LEAVLSTIFY QLNKRMPTTH QIHVEQSISL LLNFLLAAPD 120 

PEGHGKISVF AVKMALATLC GGKIMDKLRY IFSMISDSSO VMVYGRYDQF LREVLKLPTE 180 

VLEGPSFGYT EQSARSCFSQ QKKVTIAKSFL DTU4SDPPPQ CLVWLPLLHB LANVENVFHP 240 

VECSYCHSES MMGFRYRCQQ CHKYQLOQDC FWRGHAGGSH SNQHQMKEYT SWKSPAKKLT 300 

NALSKSLSCA SSREPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLP SHSVPSSGSP 360 

FITRSMLESS NRLDEEHRLI ARYAARLAAE SSSSQPPQQR SAPDISFTID ANKQQRQLIA 420 

ELENXNREIL QEIQRLRLEH EQASQPTPEK AQQNPTLLAE LRLLRQRKDE LEQRMSALQE 480 
SRRELMVQLE GLMKLUCEEE LKQGVSYVPY CRS 



Seq ID NO: 269 DMA sequence 

Nucleic Acid Accession ft: NMJ>01276.1 

Coding sequence: 127.. 1278 



1 11 21 31 41 51 

I I I I I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 60 

TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 120 

GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC 180 

TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACOG GGAAGGCGAT 240 

GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 300 

GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 360 

ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTTGTC TGTCGGAGGA 420 

TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA ACACCCAGAG TCGCCGGACT 480 

TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGGGCT GGACCTTGCC 540 

TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600 

GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC TCCTGCTCAG CGCAGCACTG 660 

TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG CCAAGATATC CCAACACCTG 720 

GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780 

CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG ACAGATTCAG CAACACTGAC 840 

TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900 

CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960 

TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020 

ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 1140 

CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200 

TTCCAGGGCT CCTTCTGCGG CCAGGATCTG CGCTTCCCTC TCACCAATGC CATCAAGGAT 1260 

GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC GGGGGCCAAG GATGCCCCGT 1320 

CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380 

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440 

GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT CGCAATGTAA 1500 

GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560 

TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC "' CCTAGCCCTC CTTATCAAAG GACACCATTT 1620 

TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680 

TACCCCCTGC AAAGOCAGCT TGAAACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740 

ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1800 

CGCTTTGCrr TGGTCTATCT TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860 

TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920 
ATGTT 



Seq ID NO: 270 Protein -sequence 
Protein Accession #: NP_001267.l 

1 11 21 31 41 51 

MGVKASQTGP WLVLLQCCS AYKLVCYYTS WSQYREGDGS CFPDALDRFL CTHIIYSFAN 60 

ISNDHIDTWE WNDVTLYGML NTLKNRNPNL KTLLSVGGWN FGSQRFSKIA SNTQSRRTFI 120 

KSVPPFLRTH GFDGLDLAWL YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 180 

GKVTIDSSYD IAKISQHLDF ISIMTYDFHG AWRGTTGHHS PLFRGQEDAS PDRFSNTDYA 240 

VGYMLRLGAP ASKLVMGIPT PGRSFTIiASS ETGVGAPISG PGIPGRFTKE AGTLAYYEIC 300 

DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQLAGAM VWALDLDDFQ 360 
G5FCGQDLRF PLTNAIKDAL AAT 



Seq ID NO; 271 DMA sequence 

Nucleic Acid Accession §: NM_006474.1 

Coding sequence: 181.. 669 

1 11 21 31 41 SI 

| | | I I I 

GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG C TCCCA CCCC 60 

TCCGGCCCCC CCACCGTCGC GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT 120 

TTCCCCCAGC TCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC AACGGGAACG 180 

ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG CGTCOCTCTQ GGTCCTGGCA 240 

GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCGGC 300 

GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG GAACCAGOGA AGACCGCTAT 360 

AAGTCTGGCT TGACAACTCT GGTGGCAACA AGTGTCAACA GTGTAACAGG CATTCGCATC 420 

GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGCGCAAG AACAAAGTCC AAGCGCCACA 480 

GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 540 

GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 600 

GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 660 

TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTGCTTTAAA AAAAG ACCGT 720 

TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 780 

CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG GAAAGACCTT 840 



1057 



WO 03/042661 



TCACCAGATT TGGTTCTTAA ACTTT 

Seq ID NO: 272 Protein sequence 
Protein Accession 8: NP_006465.1 

1 11 21 31 41 51 

I I I I 1 I 

MWKVSALLFV LGSASLWVIA EGASTGQPED DTBTTGLEGG VAMPGAEDDV VTPGTSEDRY 60 

KSGLTTLVAT SVNSVTQIRI EDLPT9BSTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 120 

VEKDGLSTVT LVGIIVGVLL AIGPIGGIIV WMRKMSGRY SP 

Seq ID NO: 273 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCCA GCTTGCAAAG CCGAAGTCTG GCCGCGCTCT TCGACTCGCT GCGCCACGTC 60 

CCCGGGGGTG COGAGCOGGC GGGGGGTGAG GTGGCTGCGC CGGCGGCOGG GCTAGGAGGT 120 

GCGGGCACTG GGGGOGCGGG AGGGGACGTG GCAGGCCCCG CGGGGGCCAC GGCGATCCCA 180 

GGGGCCAGGA AGGTCCCGCT GCGGGCACGC AATCTGCCTC CGTCCTTCTT CACGGAGCCG 240 

TCCCGGGCAG GCGGCGGCGG GTGTGGCCCG TCGGGGCCGG ACGTGAGCTT GGGCGACCTG 300 

GAGAAGGGCG CGGAGGCCGT GGAGTTCTTT GAGCTGCTGG GGCCCGACTA CGGCGCCGGC 360 

ACGGAGGCGG CAGTCTTGCT TGCCGCCGAG CCTCTCGACG TGTTCCCCGC CGGAGCCTCC 420 

GTACTGCGGG GACCCCCGGA GCTGGAGCCC GGCCTCTTTG AGCCGCCGCC GGCAGTGGTG 480 

GGAAACCTAC TGTACCCCGA GCCCTGGAGC GTCCCGGGCT GCTCCCCGAC CAAAAAGAGC 540 

CCCTTGACTG CCCTCCGCGG CGGGTTGACC TTGAACGAGC CCTTGAGCCC CCTGTACCCC 600 

GCCGCTGCGA ATTTCTCCCG GCGGGGAGGA CGGGCCGGGC CATTTGGCTT CTTTCGCCCC 660 
CTTCTTTCCA GACTGCGCTT TGC* 

Seq ID NO; 274 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

1 11 21 31 41 51 

I ! I • I I I 

CAAAGAGGCC GGGCTCCAGC TCCGGGGGTC CCCGCAGTAC GGAGGCTCCG GCGGGGAACA 60 

CGTCGAGAGG CTCGGCGGCA AGCAAGACTG " CCGCCTCCGT GCCGGCGCCG TAGTCGGGCC 120 

CCAGCAGCTC AAAGAACTCC ACGGCCTCCG' CGCCCTTCTC CAGGTCGCCC AAGCTCACGT 180 

CCGGCCOCGA CGGGCCACAC CCGCCGCCGC. CTGCCCGGGA CGGCTCCGTG AAGAAGGACG 240 

GAGGCAGATT GCGTGCCCGC AGCGGGACCT TCCTGGCCCC TGGGATCGCC GTGGCCCCOG 300 

CGGGGCCTGC CACGTCCCCT COOGCGCCCC CAGTGCCCGC ACCTCCTAGC CCGGCCGCCG 360 

GCGCAGCCAC CTCACCCCCC GCCGGCTCGCf CACCCCCGGG GACGTGGCGC AGCGAGTCGA 420 
AGAGCGCGGC CAGACTTCGG CTTTGCAAGC ■ TGGCGGCCGC 

Seq ID NO: 275 DNA sequence 

Nucleic Acid Accession #s NM_00111B,1 

Coding sequence : 74 .. 1651 

1 11 21 31 41 51 

I I I I I I 

AGCCCAGAGA CACATTGGGG CTGACCTGCC GCTGCTGTCA GTGGGAGGCC AGTGGTGCTG 60 

GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GCGGGGCCTG 120 

TCCGTGGGGC CGGGGCAGAC TCOGCAAAGG ACGCGCAGCC TGCAAGTCCG CGGCCCAGAG 180 

ACACATTGGG GCTGACCTGC CGCTGCTGTC AGTGGGAGGC CAGTGGTGCT GGCCAAGAAG 240 

TGTCATGGCT GGTGTCGTGC AOGTTTCCCT GGCTGCTCTC CTCCTGCTGC CTATGGOCCC 300 

TGCCATGCAT TCTGACTGCA TCTTCAAGAA GGAGCAAGCC ATGTGCCTGG AGAAGATCCA 360 

GAGGGCCAAT GAGCTGATGG GCTTCAATGA TTCCTCTCCA GGCTGTCCTG GGATGTGGGA 420 

CAACATCACG TGTTGGAAGC CCGCCCATGT GGGTGAGATG GTCCTGGTCA GCTGCCCTGA 480 

GCTCTTCOGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAACCATTG GAGAGTCTGA 540 

TTTTGGTGAC AGTAACTCCT TAGATCTCTC AGACATGGGA GTGGTGAGCC GGAACTGCAC 600 

GGAGGATGGC TGGTCGGAAC CCTTCCCTCA TTACTTTGAT GCCTGTGGGT TTGATGAATA 660 

TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA GTGAAGGCCC TCTACACGGT 720 

TGGCTACAGC ACATCCCTCG TCACCCTCAC CACTGCCATG GTCATCCTTT GTCGCTTCCG 780 

GAAGCTGCAC TGCACACGCA ACTTCATCCA CATGAACCTG TTTGTGTCGT TCATGCTGAG 840 

GGCGATCTCC GTCTTCATCA AAGACTGGAT TCTGTATGCG GAGCAGGACA GCAACCACTG 900 

CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATGGTTTTC TTCCACTACT GTGTTGTGTC 960 

CAACTACTTC TGGCTGTTCA TCGAGGGCCT GTACCTCTTC ACTCTGCTGG TGGAGACCTT 1020 

CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT GGCTGGGGGA CCCCAACTGT 1080 

GTGTGTGACA GTGTGGGCTA OGCTGAGACT CTACTTTGAT GACACAGGCT GCTGGGATAT 1140 

GAATGACAGC ACAGCTCTGT GGTGGGTGAT CAAAGGOCCT GTGGTTGGCT CTATCATGGT 1200 

TAACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG CAGAAACTTC AGTCTCCAGA 1260 

CATGGGAGGC AATGAGTCCA GCATCTACTT GCGACTGGCC CGGTCCACOC TGCTGCTCAT 1320 

CCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC CCAGAGAATG TCAGCAAAAG 1380 

GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG GGCTTTGTGG TGGCTGTTCT 1440 

CTACTGTTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG OGAAAATGGC GAAGCTGGAA 1500 

GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC CCGTCTCTGG CCAGCAGTGG 1560 

GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC AGCTCCCAAA TCCGCATGTC 1620 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC CCCT 

Seq ID NO: 276 Protein sequence 
Protein Accession 8: NP_001109.1 

1 11 21 31 41 51 

1 I I I I I 

MAGWHVSLA AHCGACPWGR GRLRKGRAAC KSAAQHHIGA DfcPLLSVGGQ WCWPRSVMAG 60 
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WHVSLAALL LLPMAPAMHS DCIFKKEQAM CLEKIQRANE LMGFNDSSPG CPGMWDNITC 120 

WKPAHVGEMV LVSCPELPRI FNPDQVWETE TIGESDFGDS NSLDLSDMGV VSRNCTEDGW 180 

SEPPPHYFDA CGFDEYESET GDQDYYYLSV KALYTVGYST SLVTLTTAMV ILCRFRKLKC 240 

TRNFIHMNLF VSFMLRAISV FIKDpiLYA* QDSNHCFIST VECKAVMVFF HYCWSNYFW 300 

LFIEGLYLFT LLVETFFPBR RYFYWYTIIG WGTPTVCVTV WATLRLYFDD TGCWDMNDST 360 

ALWWVIKGPV VGSIMVNFVL FIGIIVILVQ KLQSPDMGGN ESSIYLRLAR STLLLIPLFG 420 

IHYTVFAFSP ENVSKRERLV FBLGLGSFQG FWAVLYCFL NGEVQAEIKR KWRSWKVNRY 480 
FAVDPKKRHP SLASSGVNGG TQLSILSKSS SQIRNSGLPA DMLAT 

Seq ID NO: 277 DMA sequence 

Nucleic Acid Accession #: NM_004000.1 

Coding sequence: 3 6.. 11 93 



1C 1 11 21 31 41 51 

15 | | | | | | 

AGAAGAAGCT GGCCAAGGAT ATGGGAGCAA CCACCATGGA CCAGAAGTCT CTCTGGGCAG 60 

GTGTAGTGGT CTTGCTGCTT CTCCAGGGAG GATCTGCCTA CAAACTGGTT TGCTACTTTA 120 

CCAACTGGTC CCAGGACCGG CAGGAACCAG GAAAATTCAC CCCTGAGAAT ATTGACCCCT 180 

OA TCCTATGCTC TCATCTCATC TATTCATTCG CCAGCATCGA AAACAACAAG GTTATCATCA 240 

ZU AGGACAAGAG TGAAGTGATG CTCTACCAGA CCATCAACAG TCTCAAAACC AAGAATCCCA 300 

AACTGAAAAT TCTOTTGTCC ATTGGAGGGT ACCTGTTTGG TTCCAAAGGG TTCCACCCTA 360 

TGGTGGATTC TTCTACATCA CGCTTGGAAT TCATTAACTC CATAATCCTG TTTCTGAGGA 420 

ACCATAACTT TGATGGACTG GATGTAAGCT GGATCTACCC AGATCAGAAA GAAAACACTC 480 

ATTTCACTGT GCTGATTCAT GAGTTAGCAG AAGCCTTTCA GAAGGACTTC ACAAAATCCA 540 

CCAAGGAAAG GCTTCTCTTG ACTGOGGGCG TATCTGCAGG GAGGCAAATG ATTGATAACA 600 

GCTATCAAGT TGAGAAACTG GCAAAAGATC TGGATTTCAT CAACCTCCTG TCCTTTGACT 660 

TCCATGGGTC TTGGGAAAAG CCCCTTATCA CTGGCCACAA CAGCCCTCTG AGCAAGGGGT 720 

GGCAGGACAG AGGGCCAAGC TCCTACTACA ATGTGGAATA TGCTGTGGGG TACTGGATAC 780 

ATAAGGGAAT GCCATCAGAG AAGGTGGTCA TGGGCATCCC CACATATGGG CACTCCTTCA 840 

CACTGGCCTC TGCAGAAACC ACCGTGGGGG CCCCTGCCTC TGGCCCTGGA GCTGCTGGAC 900 

CCATCACAGA GTCTTCAGGC TTCCTGGCCT ATTATGAGAT CTGCCAGTTC CTGAAAGGAG 960 

CCAAGATCAC GCGCCTCCAG GATCAGCAGG TTCCCTACGC AGTCAAGGGG AACCAGTGGG 1020 

TGGGCTATGA TGATGTGAAG AGTATGGAGA CCAAGGTTCA ' GTTCTTAAAG AATTTAAACC 1080 

TGGGAGGAGC CATGATCTGG TCTATTGACA TGGATGACTT CACTGGCAAA TCCTGCAACC 1140 

AGGGCCCTTA COCTCTTGTC CAAGCAGTCA AGAGAAGCCT TGGCTCCTTG TGAAGGATTA 1200 

ACTTACAGAG AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT 1260 

CTCATGTGGG ATTCCCCTTG CCAGGCTGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA 1320 

CTTCCTCTTA GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGTT GACTTGTTGC 13 80 
CCTGAAGTAC AATAAAAAAA ATTCATTTTG CTCCAGTA 



Seq ID NO: 278 Protein sequence 
Protein Accession #: NP_003 991.1 

1 11 21 31 41 51 

I I I I I I 

MDQKSLWAGV WLLLLQGGS AYKLVCYFTN WSQDRQEPGK FTPENIDPPL CSHLIYSFAS 60 
IENNKVIIKD KSEVMLYQTI NSLKTKNPKL KILLS IGGYL FGSKGFHPMV DSSTSRLBFI 120 
NSIILFLRNH NPDGLDVSWI YPDQKENTHF TVLIHELAEA FQKDFTKSTK ERLLLTAGVS 180 
AGRQMIDNSY QVEKLAKDLD FINLLSFDFH GSWEKPLITG HNSPLSKGWQ DRGPSSYYNV 240 
EYAVGYWIHK GMPSEKWMG IPTYGHSFTL ASAETTVGAP ASGPGAAGPI TESSGFLAYY 300 
EICQFLKGAK ITRLQDQQVP YAVKGNQWd YDDVKSMETK VQFLKNLNLG GAMIWSIDMD 360 
DFTGKSCNQG PYPLVQAVKR SLGSL 



_ Seq ID NO: 279 DNA sequence 

Nucleic Acid Accession tf: NW_015166.1 
Coding sequence: 116.. 1249 



1 11 21 31 41 51 

I I II 1 i 

TGCTGGAAGT CCCTCACCCA GAGACCAGTG CTCCCAACGG CAGAGCAGCG GGGGAGATAA 60 

AGAACTGGTG ACACGTGGCT GTACATTCAG CACAGCTGTG GTGTCCCCAA GTGCCATGAC 120 

CCAGGAGCCA TTCAGAGAGG AGCTGGCCTA TGACCGGATG CCCACGCTGG AGCGGGGCCG 180 

GCAAOACCCC GCCAGCTATG CCCCAGACGC GAAOCCGAGC GACCTGCAGC TGTCGAAGAG 240 

ACTGCCCCCC TGCTTCAGCC ACAAGACGTG GGTCTTCTCT GTGCTGATGG GGAGCTGCCT 300 

CCTGGTGACC TCGGGGTTTT CGCTGTACCT GGGGAACGTG TTCCCGGCTG AGATGGATTA 360 

CTTGCGCTGT GCTGCAGGCT CTTGCATCCC CTCGGCAATT GTGAGCTTCA CCGTCTCCAG 420 

GAGGAACGCC AATGTGATTC CCAACTTTCA GATATTGTTT GTTTCCACGT TTGCTGTGAC 480 

CACTACGTGT TTAATTTGGT TTGGATGCAA ACTAGTCCTG AACCCATCAG CAATAAACAT 540 

CAACTTCAAC CTCATCCTGC TGCTCCTGCT GGAGCTGCTC ATGGOGGCCA CGGTGATCAT 600 

CGCTGCACGG TCCAGCGAGG AGGACTGCAA GAAAAAGAAG GGCTCCATGT CTGACAGCGC 660 

CAACATTCTG GACGAAGTGC CATTTCCTGC TCGGGTCCTG AAATCTTACT CAGTCGTCGA 720 

GGTAATCGCA GGCATCTCTG CCGTCCTCGG GGGGATCATT GCCCTGAACG TGGATGACTC 780 

AGTTTCAGGC CCACACCTCT CAGTGACGTT CTTTTGGATC CTAGTGGCCT GCTTTCCAAG 840 

TGCCATTGCC AGTCATGTGG CAGCAGAGTG TCCCAGCAAG TGTCTGGTGG AGGTCCTGAT 900 

ID TGCCATAAGC AGCCTCACGT CTCCGCTGCT GTTCACAGCC TCTGGATATC TGTCATTCAG 960 

CATCATGAGA ATCGTGGAGA TGTTTAAGGA TTACCCGCCA GCCATAAAAC CATCCTACGA 1020 

TGTGCTGCTG CTGCTGCTGC TGCTAGTGCT CCTGCTGCAG GCCGGCCTCA ACAOGGGCAC 1080 

CGCCATCCAG TGCGTGCGCT TCAAGGTCAG TGCAAGGCTG CAGGGTGCAT CCTGGGACAC 1140 

CCAGAACGGC CCGCAGGAGC GCCTGGCTGG GGAGGTGGCC AGGAGCCCCC TGAAGGAGTT 1200 

CGACAAGGAG AAAGCCTGGA GAGCGGTCGT GGTGCAAATG GCCCAGTGAC CCCCAGACGC 1260 

GGAAACCGGG TGOCAGCGCC CAGCCTGGCC CCAAGCATGG AAACGCACAA CCCCTAATCG 1320 

CCCT GAGCTA CTGCTTCTAA CACCTCTTTT CCCTTGTGTG AGGGCAAACC AGGCTGCAGG 1380 

TGGGGTTTTC ACTTCCTAGG GTAGTTTAAT TTTAAAATAG GCCAATGTTG GCTAGTCTGT 1440 

GCCTCAGTGA GATCAGTCAG CTCOGAGTGG CTCCCGTGTC GTAACAGCAG GAGCATGGCC 1500 
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GCAACTTCCC 
CTGGCCCCAG 
CTGACCAGAC 
GCTTAGGGTC 
GGTGCACCTG 
TTGGCGCCTC 
GCGACACAGG 
TGCCCTGTCT 
CCCACGTCCT 
CGGGCACTGT 
TCATGCCCAG 
GGGCTGCATC 
GGGAAGCTGA 
GGTCAGAGCC 
TTGTCTTGAT 
GGGGTCTTAG 
GGTQTGAGAA 
TCCTCGTCCC 
GGGCGACTGT 
TTCACAATGG 
CGCCGTGGGG 
GTGGCCTGGG 
CGGCCTGCAG 
GAGTQACTCT 
CAACCATTGA 
CAGGCCCTCT 
CTTTGGCCAC 
CCGGATGCAG 
GCGGGGTCTG 
TCTTCTTGGG 
TTTTTGCTGG 
ATGGGGTTTA 
AAACGCTGAC 



AGGCCGAGGA 
CTCCTCTTCC 
TGTGGGTCCC 
TGTCCCGGGT 
CCCCAGGCTC 
TGTCCTGGCC 
CTGCCGCTGC 
CCCTACTTAG 
GGGTGCTGTC 
CATGCCCAGC 
CTTGGAGTGC 
CAAGGCTTTG 
GCCCCTCCCT 
CAGGAATTCT 
CCTCACTGAG 
GGTACAGCCC 
TAACACTGGC 
GAGGGOGCCT 
GAAACTGGAA 
TTTTGTTTCT 
TGTATTGACT 
GCCATCAAGA 
ACCCTGCCAG 
GGCTGCCAAG 
AGGGGGATGC 
CTGCAGTCCT 
AGCCCCAAAC 
ACCCCACCCT 
CTOCTGGGAA 
GGCAGACCAG 
CTTGCACTTT 
AAAGAACTGT 
TTGCAGAOGT 



AGGGCCCCCG 
TGCCTCTCTC 
TGOGTCTCCT 
ACTCAGTCAG 
AGGCTGCCCA 
CGGGAGCCGC 
TCTCCCAGGT 
ACCCAGTCAT 
ATGCCCAGCT 
TTGGAGTGCC 
CAOGTGGCCG 
TAAAOCGGCT 
CCTGTGTTTG 
CACAGGTTCA 
AAGGCCCCAG 
AGGCGGTCAC 
TTTGGGTAGT 
GGGTCCTCCC 
TGCTGTTACT 
TTCCTTCTGA 
GGTCCCCCAT 
TCAAAGAACC 
GGGGTTTGAG 
ATACCTTCAT 
AGGCAAGACA 
OGGCAGCTCC 
AAGCGCCACC 
GTCTGCAGGC 
TGGGGTGGGA 
AGAGCTCAAG 
CTTATTTTCT 
CTCTTTCAGT 
G 



GCTCGGCCTC 
ATGGCTTGGG 
GCCCACTCTG 
CCCAGTGGGA 
GCGGCTCTTC 
AGGGGCCCCT 
GAAATCCACA 
TCTAGAGGGA 
TGGAGTGCCA 
ACATGGCOGC 
CTGCTGTGAC 
GGACCACGTC 
CTCCCATTAC 
CCCAGCGCCC 
GGCAGCGGTC 
TGCCCACCTG 
GCCATGGCCA 
AGCTGACGGC 
TTGATAATTA 
TCTGAGAAGA 
GGGCTGCTGG 
AGGAGGCCTG 
GACCCTCCCA 
GGTGTTCATG 
CCTTCCCAGC 
GGAAGGACAC 
GTGGGAGAGG 
CACCCCCACC 
GCCACAGGGA 
TTTCAGAGTC 
TATTTTAGAG 
CTACATTTTT 



TTGAGAGCCC 
CTGGAGTGGG 
ACCGGGCTTC 
TCTTACCCAC 
CTGGACAGTG 
CCTCCAGAGC 
CCAGTCCAOG 
TCCACCGCCA 
CGTGGCCGCT 
TGCCCACGTC 
AGGCAGTGTT 
TCCCTGGCCC 
TCAAAATGCA 
TCTACCTCCT 
TTCTCCATCT 
CCAGGCTGCA 
GGAGTGGGTT 
AGTAAATCCA 
CTTTCCAGCA 
CATGAACGTT 
AAAGGCCCGG 
GGAGATGCAG 
GGTTTCCCAC 
ACAAGTGGAA 
TGCTCCTAGA 
AGTCAGGGGC 
AGA6GCTGCT 
TCCCTGCAGC 
CGACCGGGGG 
AGAATTAGGC 
CGCTTAAAAA 
GTTTAATACG 



CACCCCTGAA 
CTCTCTGGAC 
CTCCCTCCAC 
TTCCCTGCAA 
AGAGCAGGGC 
CTGGGCGCAA 
CCGGGTCGCC 
CACTGGCGGG 
GCCCACGTCC 
CCGGGCACTG 
CTTGGGGGTG 
CAGTGACCGG 
GGACAGATCA 
AGCAAGTACT 
CCGCTGTTTT 
GGGACAGTTG 
TCCCTGCGTC 
CAGTGAGTTG 
GGTGTTTTCC 
TTCTCTTCAC 
AGATGCATCT 
CTGGATGGGG 
TGCGGAACAG 
TCATTATTTT 
GGGGACAAGC 
CGGGCAAACA 
GTCACTGGTA 
TTTGAGGCTG 
GGGGCTGATG 
ACTTGGAACG 
ATCCGGAAAA 
CTTGAGCAAT 



51 



Seq ID NO: 281 DNA sequence 

Nucleic Acid Accession #t NM_004518. 

Coding sequence: 43.. 2 57 7 



1 
I 

GCTGAGCCTG 
AACGGCGGCG 
CTGGACCCCG 
GCCCCCAAGC 
CCCCCCAAGC 
CGGCCGCGCG 
CTCGTGCTGT 
TACATCCTGG 
GCCGCAGGCT 
CCGTTCTGTG 
TCCCAGGGCA 
CGGATGATCC 
GCCCACAGCA 
TCGTTCCTGG 
GCACTCTGGT 
ACCTGGAACG 
CTGCCTGCAG 
AAGCACTTTG 
TACGCCACCA 
GTCACOGTGC 
CTCAAGAGTA 
CAGAAGGTCA 
AAGGGGTCCC 
G ACAG CCCCA 
GCTTTCCGCA 
GAGGACATTG 
GGCCTCAAAG 
TTCAAGGAGA 
CACCTGGACA 
CGGGGCCCAG 
GAGGACCCCA 



11 
I 

AGCCCGACCC 
TATACCCCGG 
GCGCGCCCGA 
GCGGCAGCAT 
GCAACGCCTT 
GCTGGGCGTT 
CTGTGTTTTC 
AAATCGTGAC 
GCTGCTGCCG 
TGATTGACAT 
ACGTCTTTGC 
GCATGGACCG 
AGGAGCTGGT 
TGTACTTGGC 
GGGGCCTGAT 
GCAGGCTCCT 
GCATCTTGGG 
AGAAGAGGCG 
ACCTCTCGCG 
CCATGTACAG 
AATCTGGACT 
GTTTGAAAGA 
CGCAGGCCCA 
GCAAGGTGCC 
TCAAGGGTGC 
TGGATGACAA 
TCAGCATCAG 
GCCTGCGGCC 
TGCTGTCCCG 
OGATCACGGA 
GCATGATGGG 



21 
I 

GGGGCGCCTC 
CCCGAGCGGG 
CTCCACCCGG 
CCTCAGCAAA 
CTACCGCAAG 
CATCTACCAC 
CACCATCAAG 
TATCGTGGTG 
GTACCGTGGC 
CATGGTGCTC 
CACATCTGCG 
GCGGGGAGGC 
CACTGCCTGG 
AGAGAAGGGG 
CAOGCTGACC 
TGCGGCAACC 
GTCTGGGTTT 
GAACCCGGCA 
CACAGACCTG 
ACTTATCCCC 
CGCTTTCAGG 
TOGTGTCTTC 
GACTGTGAGG 
CAAGAGCTGG 
CGOGTCACGG 
GAGCTGCCOC 
AGCCGTGTGT 
CTACGACGTG 
AATTAAGAGC 
CAAGGACCGC 
AOGGCTCGGG 



31 

1 

CCGCCAGGCA 
GAGAAGAAGC 
GACGGGGCGC 
CCTCGCGCGG 
CTGCAGAATT 
GCCTACGTGT 
GAGTATGAGA 
TTTGGCGTGG 
TGGAGGGGGC 
ATCGCCTCCA 
CTCCGGAGCC 
ACCTGGAAGC 
TACATCGGCT 
GAGAACGACC 
ACCATTGGCT 
TTCAOCCTCA 
GCCCTGAAGG 
GCAGGCCTGA 
CACTCCACGT 
CCGCTGAACC 
AAGGACCCCC 
TCCAGCCCCC 
CGGTCACCCA 
AGCTTOGGGG 
CAGAACTCAG 
TGCGAGTTTG 
GTCATGCGGT 
ATGGAOGTCA 
CTGCAGTCCA 
ACCAAGGGCC 
AAGGTGGAGA 



41 

I 

CCATGGTGCA 
TGAAGGTGGG 
TGCTGATCGC 
GCGGCGCGGG 
TCCTCTACAA 
TCCTCCTGGT 
AGAGCTOGGA 
AGTACTTCGT 
GGCTCAAGTT 
TTGOGGTGCT 
TGCGCTTCCT 
TGCTGGGCTC 
TCCTTTGTCT 
ACTTTGACAC 
ACGGGGACAA 
TCGGTGTCTC 
TTCAGGAGCA 
TCCAGTCGGC 
GGCAGTACTA 
AGCTGGAGCT 
CGCCGGAGCC 
GAGGCGTGGC 
GCGCCGACCA 
ACCGCAGCCG 
AAGAAGCAAG 
TGACOGAGGA 
TCCTGGTGTC 
TCGAGCAGTA 
GAGTGGACCA 
CGGCCGAGGC 
AGCAGGTCTT 



1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
<2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Seq ID NO: 280 Protein sequence 
Protein Accession #: NP_055981.1 

1 11 21 31 41 

I I I I 1 I 

MTQEPFREEL AYDRMPTLER GRQDPASYAP DAKPSDLQLS KRLPPCFSHK TWVFSVLMGS 
CLLVTSGFSL YLGNVFPAEM DYLRCAAGSC IPSAIVSFTV SRRNANVIPN FQILPVSTPA 
VTTTCLIWFG CKLVLNPSAI NINFNLILLL LLELLMAATV IIAARSSEED CKKKKGSMSD 
SANILDEVPP PARVLKSYSV VBVIAGISAV LGGIIALNVD DSVSGPHLSV TFFWILVACF 
PSAIASHVAA ECPSKCLVEV LIAISSLTSP LLFTASGYLS FSIMRTVEMP KDYPPAIKPS 
YDVLLLLLLL vlllqaglnt GTAIQCVRFK VSARLQGASW DTQNGPQERL AGEVARSPLK 
EFDKEKAWRA VWQMAQ 



60 
120 
180 
240 
300 
360 



51 
I 

GAAGTCGCGC 

CTTOGTGGGG 

CGGCTCCGAG 

CGCCGGGAAG 

OGTGCTGGAG 

TTTCTCCTGC 

GGGGGCCCTC 

GCGGATCTGG 

TGCCCGGAAA 

GGCCGCCGGC 

GCAGATTCTG 

TGTGGTCTAT 

CATCCTGGCC 

CTACGCGGAT 

GTACCCCCAG 

CTTCTTCGCG 

GCACAGGCAG 

CTGGAGATTC 

CGAGCGAACG 

GCTGAGGAAC 

GTCTCCAAGC 

TGCCAAGGGG . 

GAGCCTOGAG 

GGCACGCCAG 

CCTCCCCGGA 

CCTGACCCCG 

CAAGCGGAAG 

CTCAGCOGGC 

GATCGTGGGG 

GGAGCTGCCC 

GTCCATGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 



1060 



WO 03/042661 



PCT/US02/36810 



AAGAAGCTGG ACTTCCTGOT GAATATCTAC ATGCAGCGGA TGGGCATCCC CCCGACAGAG 1920 
ACCGAGGCCT ACTTTGGGGC CAAAGAGCCG GAGCCGGCGC OGCCQTACCA CAGCCCGGAA 1980 
GACAGCCGGG AGCATGTCGA CAGGCACGGC TGCATTGTCA AGATCGTGCG CTCCAGCAGC 2040 
TCCACGGGCC AGAAGAACTT CTCGGCGCCC CCGGCOGOGC CCCCTGTCCA GTGTCCGCCC 2100 
5 TCCACCTCCT GGCAGCCACA GAGCCACCCG CGCCAGGGCC ACGGCACCTC CCCCGTGGGG 2160 
GACCACGGCT CCCTGGTGCG CATCCCQCCG CCGCCTGCCC ACGAGOGGTC GCTGTCOGCC 2220 
TACGGOGGGG GCAACCGCGC CAGCATGGAG TTCCTGCGGC AGGAGGACAC CCCGGGCTGC 2280 
AGGCCCCCCG AGGGGACCCT GCGGGACAGC GACACGTCCA TCTCCATCCC GTCCGTGGAC 2340 
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCAGCATCT CCCAGTCCAA GGAGAACCTG 2400 
10 GATGCTCTCA ACAGCTGCTA CGCGGCCGTG GCGCCTTGTG CCAAAGTCAG GCCCTACATT 2460 
GCGGAGGGAG AGTCAGACAC CGACTCCGAC CTCTGTACCC CGTGCGGGCC CCCGCCACGC 2520 
TCGGCCACCG GGGAGGGTCC CTTTGGTGAC GTGGGCTGGG CCGGGCCCAG GAAGTGAGGC 2580 
GGCGCTGGGC CAGTGGACCC GCCCGOGGCC CTCCTCAGCA OGGTGCCTCC GAGGTTTTGA 2640 
GGOGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCGGGA AGGGTGGGCC 2700 
15 CTGGAGGGGC CCATGTGGGC TGAAGGATGG GGGCTCCTGG CAGTGACCTT TTACAAAAGT 2760 
TATTTTCCAA CAGGGCACTC CCAGGCCCTG TCGCCATTGA GGTGCCTCCG CTGGGCTGTC 2820 
TCCTCACCCC TCCCTGTGCT GGAGCCTGTC CCAAAAAGGT GCCAACTGGG AGGCCTCGGA 2880 
AGCCACTGTC CAGGCTCCCA CTGCCTGTCT GCTCTGTTCC CAAAGGCAGC GTGTGTGGCC 2940 
TCGGGCCCTG CGGTGGCATG AAGCATCCCT TCTGGTGTGG GCATCGCTAC GTGTTTTGGG 3000 
20 GGCAGCGTTT CACGGCGGTG CCCTTGCTGT CTCCCTTGGG CTGGCTCGAG CCTGGGGTCC 3060 
ATGTCCCTTT GCCGTCCCGT CATGGGGCAG GGAATCCATA GCGGGGCCCA CAGGCAGGGG 3120 
TATGAGTGCG TCCCACCCAA CGCAGCACCA GCCCCGGCCA CCGCTCCCCG TGTCCCCAGT 3180 
TCCGTCTCAG CTACCTGGAC TCCAGGACCC TGGAGAAGGG AGACCTGGCA GTGGAGGGAG 3240 
GCTGTGCTGT GTGTCCCCCT GCAGGTGTGA CtCCGCCTGC TCTTTCCTCC CCCGCCAGGT 3300 
25 GTGGCCCCGC CTGCTCTTTC CT(XCCCACC AGTATGGCCC CACCTGCTCT TTCCTKCCCC 3360 
CCCAAGGTGT GGCCCCACCT G 'fTCTTTCCT CCCCTGCCGA GGTGTGACCC CACCTGCTCT 3420 
TTCCTCCCTC CCAGTATGGC CCCACCTGCT CTTTCCTCCC CCGAGGTGAG GCCCCGCCTG 3 4 BO 
CTCTTTCCTC CCATGGGAGC CGCTGAGGCG TGCGCACCTG GGCACAGGTT GGGGCTCTGC 3540 
AGGATGAGGA AGACAGGCCA ATCCCTTCCC TCCCAGAAGC TGGCCGCCCA GCAGGAGGGA 3600 
30 CTGAGGCCAG ACTCATGTCC AGCAAGGAAC J3TGTGGTGTG TCCCCTGGGA AGTCTCTGGG 3660 
CCCTGGGAAG AGGGAAGGTG CAOGTCCTGG GATGGTTGCG GGGCCCTGTT TTGGGAGACA 3720 
AAGGGGTAGA GGGTCTGTCT TGMGCCCCCC CAGACTCTAG CCCGAGCAGT GCAGCCACCT 3780 
ACTGCCCCAC CTCAGAGAAG TGCAGCGGGA AGGAGGCTGG AGGTGGTGCG GCGCTGCCTC 3840 
GGGTGTCTGC GTGAATGAGC GTGGCCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3900 
35 GCAGTGTTTG TTAGAGTGCA CATCCTACGT. GCCCACTGGC ACACACACGT GCTCACATAC 3960 
ANGTCNGNGT ACAGGNGTAC ACATGCANGC TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC TGACCACACC TGTATAGACC ATGCACAGTA CACATACGTG CATACACATG 4080 
CCTGCATACA GGCATACACA TGCACGCTTA. CATGTACACG TGCACAGATC ACACACATGC 4140 
ACACACGTGT AGCTCACACA CAGTATACAC , ATACACAAGT GCACAGACCA CACACAGCAC 4200 
40 TAACACATGC ACACACAAAG TGCATAGGCC : ACACAGCACA TGCACACAGG TGCACAGACC 4260 
ACACAGCACA CACAAGTGCA CAGAGCACAC. TGCACACATG CACACACACA CGCGTGCATG 4320 
CACACTCCTC GCACTTCCAG CCTTGGAGCC CTTCTGTCTC TGGTCTTTCT CTTTGACCCT 4380 
GCTGAGTGTA AGCTGCCTGG GGAGGGGCTA CAAGGAGTAA TTGTGGCTTT AGGGGTCGTG 4440 
GTGATGCTGG AATGTCAAGC GCCGTGGTGG GGTATCCGAC TGTCCGGGCT CCTGGTCCGC 4500 
45 AGTGGCAGAG CGCCAGGCAG AGCCAATCrfB GGTCTCGTGC TGCCCTTCCC CCCCACAGCC 4S60 
TGGCAGCCAT CCAGAGGAGG GGCTCTACCA GATGCCAAGG TGCCCOGGTG TCTGTATGGG 4620 
TGTCCGGTTG GGTCCTGTGT TTGGTCTGCC CTGGAGGTGG CTGGGCCCTC CTGGGATGGG 4680 
TGGCTCAGCC TCGAATCCCA GGCCCCAGCC CAGGCAGGTG CTGCTGCCTG TTGTGGTTTC 4740 
CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAATCACAG TCCGTGAGTC TTCCAGCTGC 4800 
50 CACCACGGCT GGGACACGCT GGGGGAGGGC TCCTCCCATG CCTCCTGCAC ACAGCCGTCT 4860 
GAGCAGGGCA GGTGCCAACA CCCCCCACCG GAGACACGCT GCCCCTCAGC GATGCCCCTA 4920 
CCTTTTGGGG GGCCTOGTCT CAAGCCCCCC! CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980 
GAGGGCTTCT CCAGGGGGAC ACCCTTTGAG CTGTGGGTCT GATCACCCCA AGTCCCGCAC 5040 
ACGGAGGAGA GGCACAGCCA GGGOGTGTGG TTTAATGTTT GCCCCTTCGG GGCTGGAGGT 5100 
55 CTCAGTGTTT CTAGATTCCA GACCCTGCTG CCAGAGAGAC CTGCTGCCGG AGAGAAGGGG 5160 
AGGAGGACTC CAGCTGGGCT CGGTCCCCCA CAGTCAGGGA CCCCCATAAA GGACACCCCC 5220 
TTCTCTCTAG AAAGAGCTGG GCTCTCAGCT ATTTCTAGTT GCTTCCCAGA AGCCGAGGAG S280 
CAGAAGGAGC TGTGAGAGCT TTGCAGAAAC GCCCTTGTCC CCGCCCTCCT GAGCTATGAA 5340 
TGCCGTACAG AGCAGAGGCT GGGGCATTGG CAAGATCACA GGTTGATGCT GCACAGCCCC 5400 
60 ATTGACACAA ACCCTCAAAG CAGACGTGAG AGGGACGGTT CACAAAGCTT GGACCTGCCG 5460 
TGGAGGGTGC CCGGCAGACG TGGCGTGAGA GGGACGGCTC ACGAGGCTTG GACCTGCTGT 5520 
GGAGGGTGCC CAGCAGACGT GGTGTGAGAG GAACGGCTCA CGAGACTTGG ACCTGGTGGA 5580 
GGGTGCCCAG CAGACGTGGT GTGAGAGGGA CGGCTCACAG GGCTTGGACC GGAGAGAGAT 5640 
GGCTCATGAG ACTTGGACCT GCCGTGGAGG GTGCCCAGCA GACGTGGTAT GAGAGGGATG 5700 
65 GCTCAOGAGG CTTGGACCTG GTGGAGGGTG OCCGGCAGAC GTGTGAGAGG GAOGGTTCAC 5760 
AAGGCTTGGA CCTGCCATGG AGGGTGCCCA GCAGACGTGG TGTGAGAGGG ACAGCTCACG 5820 
AGGCTTGGAC CTGCCGTGGA GGGTGCCCAG CAGGGGGCTG AGCTCTGAGG GGTGGGTGCT 5880 
CAGTGCACGG GTGCCCCCAG TGTCCTCTGA TCCTGTCCGG TGCCTCCCCC AACCCCCACA , 5940 
CCCATGCAGA ACTCCCAGGT CACATGCACG TATGTCCAGG GCATGGGGGT GGCGTGAAGA 6000 
70 GGCCTGGTCA GGGCCTTTAG GGGCTGCAGG ACGGAATGGC CACCTGGGGA GCCTGTGTGG 6060 
CTGTGCCGGG CAGCCATCCT GCATTOCCAC CCAGCGCGCA GTCTCCACCT CGGCCCCAGC 6120 
AAAGCGCTAA GCAGCCGGAG AGACAGCCAG GGCGGCTTCC TGAAGGATGT GGGATGGTGG 6180 
ACTCCGGGGT CGAGGGAATA CGCAGGTTCC TGTCCTCCGG GAGACCTAGA GAAGCTGCAC 6240 
ACCCAGGAGC TTTCCATGAC CCGGGAGCAT GAGTGAATGG GGGGTTCCAG TTTGCTGAAC 6300 
75 TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCCGACCCTG GGAGGAGGTG ACACCGCAGG 6360 
GGGAGGTTGT GGGCAACGGT GGAGGAGGAG AGACGGGAGG GGACCATTTG GGATGGAGGG 6420 
GCCTCTTCAG AGTTTTAAAA GGCGTTTGTG GGGTGGAGTT GAGTGTGCTC TGGGCTTGGA 6480 
CACTTGCOGT GGTGCCCCTG GCTGGCOSAG GAGACTGGCT CTGGCCAGGG CCCCGTCCTG 6S40 
on AGAGGTCCTC AGOGTCTGAC TCTCGGCCAG GCGCCAGCAA GGAGGGGCCG GT CCCCGG GG 6600 
80 CTACCAGGCA GGCAOGTGCA CATCGCCATC GCCACACGCC AACTCCGCCT GGGTTTTACA 6660 
AAGTCGTTGC CTTAATGCAT GTGGACAGGA ACTCCCTGAG GTCGCCCCAT GCCCOCTGGC 6720 
TGTGCCAGGT ACGGACGCCC TGGACCCTGC GAACAGGTGG GGCGGGCGAG GGGOCCAAGG 6780 
GACGGGCTCC AGAGACACGC GCAGGGCAGG AGGGGTCTCA CGGAGGGGTC TCGCACTGAG 6840 
GCGCCCAGAG CTGGTGGTCC CGCTGGACGC CATCCCTCTG CCCGGGATCC ACACGGCCCA 6900 



1061 



WO 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CGTGTGCCCG 
GTGGCTGCAG 
CCAAAGAACA 
TCCTTGTGCC 
CTCAACTTTG 
TCTAACAGAT 
AGAATGCGCC 
COGTGGAAGG 
TGTGCACTCA 



CCATGCCCGC 
CTTCCCCATT 
TTTCATAATG 
GTAGTGACCA 
TGCACCGTCC 
TAGCTGAGAT 
AGTGCGGGTT 
TTCTGGGGAT 
GTAAAACGCA 



GCCCCAOGCC 
TGOGCCGTTG 
CCTTCAGTAC 
GGCCTTTTTT 
ATTTTCTAGG 
ATTAACTTAC 
TGCGGGGGAC 

GTGATTCAAA 



ATTGCAGTCT 
CCTCTGGCTG 
CGACGTACAC 
TGGTGGATGT 
GATAGACGCC 
TCACACGGAC 
TTCGGGTGTG 
ACGGGGACGA 
AAAAAAAAAA 



TCCATCCTCT 
TCTGCACTTT 
TTCTGACCAT 
GTTACCCCGC 
CAGGGAATGA 
AGGTTGATGC 
GGGTCCIGCG 
GTTCGGACGC 



GGCCGTGACG 
TGTTCATGCT 
TTTGTATGTG 
ACACTTCAAT 
ACTCTAGTTT 
CAGAGCCGTA 
GCOGCGATGG 
CAGGTGGACC 



6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 



Seq ID NO: 282 Protein sequence 
Protein Accession #: NP_004509.1 



MVQKSRNGGV 
GAGAGKPPKR 
SSEGALYILE 
AVLAAGSQGN 
LCLILASFLV 
GVSPFALPAG 
QYYERTVTVP 
GVAAKGKGSP 
EASI*PGEDIV 
EQYSAGHLDM 
QVLSMEKKLD 
IVRSSSSTGQ 
ERSLSAYGGG 
QSKENLDALN 
GPRK 



11 

I 

YPGPSGEKKL 
NAFYRKLQNP 
IVTIWPGVE 
VPATSALRSL 
YLAEKGENDH 
ILGSGFALKV 
MYRLI PPLNQ 
QAQTVRRSPS 
DDKSCPCEFV 
LSRIKSLQSR 
PLVNIYMQRM 
KNFSAPPAAP 
NRASMEFLRQ 
SCYAAVAPCA 



21 
I 

KVGFVGLDPG 
LYNVLERPRG 
YFVRIWAAGC 
RFLQILRM1R 
FDTYADALWW 
QEQHRQKHPE 
LELLRNLKSK 
ADQSLEDSPS 
TEDLTPGliKV 
VDQIVGRGPA 
GIPPTETEAY 
PVQCPPSTSW 
EDTPGCRPPE 
KVRPYIAEGE 



31 
I 

APDSTRDGAL 
WAFIYHAYVF 



MDRRGGTWKL 
GLITLTTIGY 
KRRNPAAGLI 
SQLAFRKDPP 
KVPKSWSFGD 
SIRAVCVMRF 
ITDKDRTKGP 
FGAKEPEPAP 
QPQSHPRQGH 
GTLRDSDTSI 
SDTDSDLCTP 



41 

I 

LIAGSBAPKR 
LLVFSCLVLS 
LKFARKPFCV 
LGSWYAHSK 
GDKYFQTHKG 
QSAWRFYATN 
PEPSPSQKVS 
RSRARQAFRI 
LVSKRKFKES 
AEAELPEDPS 
PYHSPEDSRE 
GTSPVGDHGS 
SIPSVDHEEL 
CGPPPRSATG 



51 
I 

GSILSKPRAG 
VFSTIKEYEK 
IDIMVLIASI 
ELVTAWYIGP 
RLLAATFTLI 
LSRTDLHSTW 
LKDRVFSSPR 



Seq ID NO: 2838 DNA sequence 
Nucleic Acid Accession #: AF152496. 
Coding sequence: 1..2391 



LRPYDVMDVI 
MMGRLGKVEK 
HVDRHGCIVK 
LVRIPPPPAH 
ERSFSGFSIS 
EGPFGDVGWA 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



1 11 21 31 41 

t I I I I j 

ATGGAGGCGG GAGGAGAGCG ATTTCTTAGA CAAAGGCAAG TCTTGCTTCT CTTTGTTTTT 60 

CTGGGAGGGT CTCTGGCTGG GTCCGAGTCA AGACGCTATT CTGTGGCTGA GGAAAAAGAG 120 

AAGGGCTTTT TAATAGCCAA CCTAGCAAAG GATCTGGGAC TAAGGGTAGA GGAACTGGCC 180 

GCGAGGGGGG CCCAAGTTGT GTCCAAAGGG AACAAACAGC ATTTTCAGCT CAGTCATCAG 240 

ACAGGTGATT TGCTCCTGAA TGAGAAATTG GACCGGGAGG AGCTATGCGG CCCCACAGAA 300 

CCATGCATAC TACATTTTCA GATATTACTG CAAAACCCTT TGCAATTCGT TACAAACGAG 360 

CTCCGTATCA TAGATGTAAA TGACCATTCT CCGGTATTCT TTGAAAATGA AATGCATCTG 420 

AAAATCCTAG AAAGCACTCT GCCAGGAACA GTAATTCCTT TGGGAAATGC TGAGGACTTG 480 

GATGTGGGAA GAAACAGCCT CCAAAACTAC ACTATCACTC CGAATTCCCA CTTCCACGTA 540 

CTCACTCGCA GTCGTAGGGA CGGAAGGAAG TACCCGGAAC TAGTACTGGA TAAAGCGCTC 600 

GATCGGGAGG AGCAGCCGGA ACTCAGCTTA ACGCTCACCG CGCTGGACGG CGGCTCTCCC 660 

CCTCGGTCTG GGACAGCCCA GATAAACATC CAGGTCTTAG ATATAAACGA CAATGCACCA 720 

GAATTTGCAC AGCCGCTCTA TGAGGTTGCA GTTCTAGAGA ATACCCCCGT TAACTCTGTC 780 

ATTGTCACTG TCTCGGCTTC TGACTTAGAT ACAGGAAGTT TTGGGACAAT ATCATATGCA 840 

TTTTTTCATG CTTCTGAAGA AATTCGCAAA ACTTTTCAGC TAAATCCAAT TACTGGTGAT 900 

ATGCAACTGG TCAAATATTT GAATTTTGAA GOGATTAATA GTTATGAAGT CGACATCGAG 960 

GCCAAGGATG GCGGAGGCCT ATCCGGAAAG TCTACAGTCA TAGTCCAGGT GGTTGATGTC 1020 

AACGACAACC CACCGGAACT GACCTTGTCT TCAGTAAACA GCCCTATTCC TGAGAACTCG 1080 

GGAGAGACTG TACTGGCTGT TTTCAGTGTT TCTGATCTAG ACTCTGGAGA CAACGGAAGA 1140 

GTGATGTGTT CCATTGAGAA CAATCTCCCC TTCTTCCTGA AACCATCTGT AGAGAATTTT 1200 

TACACCCTAG TGTCAGAAGG CGCGCTGGAC AGAGAGACCA GATCCGAGTA CAACATTACC 1260 

ATCACTATCA CTGACCTGGG GAGACCCAGG CTGAAAACCA AGTACAACAT AACCGTGCTG 1320 

t GTCTCCGACG TCAATGACAA OGCCCCCGCC TTCACCCAAA TCTCCTACAC CCTGTTCGTC 1380 

CGCGAGAACA ACAGCCCCGC CCTGCACATC GGCAGTGTCA GCGCCACAGA CAGAGACTCA 1440 

GGCACCAACG CCCAGGTAAC CTACTCGCTG CTGCCGCCCC AGGACOCGCA CCTGCCCCTC 1500 

TCTTCCCTGG TCTCCATCAA CGCGGACAAC GGCCACCTGT TTGCCCTCAG GTCGCTGGAC 1560 

TACGAGGCCC TGCAGGOGTT CGAGTTCCGC GTGGGCGCCA CAGACCGTGG CTCCCCGGCT 1620 

TTGAGCAGOG AGGCGCTGGT GCGCGTGCTG GTGCTGGACG CCAACGACAA CTCGCOCTTC 1680 

GTGCTGTACC CGCTGCAGAA CGGCTCCGCG CCCTGCACCG AGCTGGTGCC CCGGGCGGCT 1740 

GAGCCGGGCT ACCTGGTGAC CAAGGTGGTG GCGGTGGACG GCGACTCGGG CCAGAACGCC 1800 

TGGCTGTCGT ACCAGCTGCT CAAGGCCACG GAGCCCGGGC TGTTCGGCGT GTGGGCGCAC 1860 

AATGGCGAAG TGCGCACCGC CAGGCTGCTG AGCGAGOGCG ACGCGGCCAA GCACAGGCTG 1920 

GTGGTGCTGG TCAAGGACAA TGGCGAGCCT CCGCGCTCGG CCACCGCCAC GCTGCATGTG 1980 

CTCCTGGTGG ACGGCTTCTC CCAGCCCTAC CTGCCTCTCC CGGAGGCGGC ACCGGCCCAG 2040 

GCCCAGGCCG ACTTGCTCAC CGTCTACCTG GTGGTGGCAT TGGCCTCGGT GTCTTCGCTC 2100 

TTCCTCTTTT CGGTGCTCCT GTTCGTGGCG GTGCGGCTGT GCAGGAGGAG CAGGGOGGCC 2160 

TCGGTGGGTC GCTGCTCGGT GCCCGAGGGC CCCTTTCCAG GGCAGATGGT GGACGTGAGC 2220 

GGCACCGGGA CCCTGTCCCA GAGCTACCAG TACGAGGTGT GTCTGACTGG AGGCTCCGGG 2280 

ACAAATGAGT TCAAGTTCCT GAAGCCAATT ATCCCCAACT TCGTTGCTCA GGGTGCAGAG 2340 
AGGGTTAGCG AGGCAAATCC CAGTTTCAGG AAGAGCTTTG AATTCAGTTA A 

Seq ID NO: 284 Protein sequence 
Protein Accession fl: AAD43757.1 



1 11 21 

I I I 

MEAGGERFLR QRQVLLLPVF 



31 41 51 

1 1 1 

RRYSVAEEKE KGFLIANLAK DLGLRVEEIiA 



60 



1062 



WO 03/042661 



ARGAQWSKG NKQHFQLSHQ TGDLLLNEKL DREELCGPTE PCILHFQILL QNPLQFVTNB 120 

LRIIDVNBHS PVFPSNEMHL KILESTLPGT VIPLGNAEDL DVGRNSLQNY TITPNSHFHV 180 

LTRSRRDGRK YPELVLDKAL DREEQPELSL TLTALDGGSP PRSGTAQINI QVLDIKDMAP 240 

EFAQPLYEVA VLENTPVNSV IVTVSASDLD TGSFGTISYA FFHASEEIRX TFQLNPITGD 300 

MQLVKYLNFE AINSYEVDIE AKDGGGLSGH STVIVQWDV NDNPPELTLS SVNSPIPENS 360 

GETVLAVPSV SDLDSGDNGR VMCSIENNLP' FPLKPSVENF YTLVSEGALD RETRSEYNIT 420 

ITITDLGTPR LKTKYNITVL VSDVNDNAPA FTQISYTLFV REKNSPALHI GSVSATDRDS 480 

GTNAQVTYSL LPPQDPHLPL SSLVSINADN GHLFALRSLD YEALQAFEFR VGATDRGSPA S40 

LSSEALVRVL VLDANDNSPF VLYPLQNGSA PCTELVPRAA EPGYLVTKW AVDGDSGQNA 600 

WLSYQLLKAT EPGLFGVWAH NGEVRTARLL SERDAAKHRL WLVKDNGEP PRSATATLHV 660 

LLVDGPSQPY LPLPEAAPAQ AQADLLTVYL WALASVSSL FLFSVLLFVA VRLCRRSRAA 720 

SVGRCSVPEG PFPGQMVDVS GTGTLSQSYQ YEVCLTGGSG TNEFKFLKPI IPNFVAQGAE 780 
RVSEANPSFR KSFEFS 

Seq ID NO: 285 DNA sequence 

Nucleic Acid Accession ft: NM_001794.2 

Coding sequence: 15.-2765 

1 11 21 31 41 51 

I. I I I I I 

CGCCGGCGGG GAAGATGACC GCGGGCGCCG GCGTGCTCCT TCTGCTGCTC TCGCTCTCCG 60 

GCGCGCTCCG GGCCCATAAT GAGGATCTTA CAACTAGAGA GACCTGCAAG GCTGGGTTCT 120 

CTGAAGATGA TTACACGGCA TTAATCTCCC AAAATATTCT AGAAGGGGAA AAGCTACTTC 180 

AAGTCAAGTT CAGCAGCTGT GTGGGGACCA AGGGGACACA ATATGAGACC AACAGCATGG 240 

ACTTCAAAGT TGGGGCAGAT GGGACAGTCT TCGCCACCCG GGAGCTGCAG GTCCCCTCCG 300 

AGCAGGTGGC GTTCACGGTG ACTGCATGGG ACAGCCAGAC AGCAGAGAAA TGGGACGCCG 360 

TGGTGCGGTT GCTGGTGGCC CAGACCTCGT CCCCGCACTC TGGACACAAG CCGCAGAAAG 420 

GAAAGAAGGT CGTGGCTCTG GACCCCTCTC CGCCTCCGAA GGACACCCTG CTGCCGTGGC 480 

CCCAGCACCA GAACGCCAAC GGGCTGAGGC GGCGCAAACG GGACTGGGTC ATCCCGCCCA 540 

TCAACGTGCC CGAGAACTCG OGCGGGCCCT TCCCGCAGCA GCTCGTGAGG ATCGGGTCCG 600 

ACAAAGACAA TGACATCCCC ATCCGGTACA GCATCACGGG AGTGGGCGCC GACCAGCCCC 660 

CCATGGAGGT CTTCAGCATT GACTCCATGT CCGGCCGGAT GTACGTCACA AGGCCCATGG 720 

ACCGGGAGGA GCACGCCTCT TACCACCTCC GAGCCCACGC TGTGGACATG AATGGCAACA 780 

AGGTGGAGAA CCCCATCGAC CTGTACATCT ACGTCATCGA CATGAATGAC AACCGCCCTG 840 

AGTTCATCAA CCAGGTCTAC AACGGCTCCG TGGACGAGGG CTCCAAGCCA GGCACCTACG 900 

TGATGACCGT CACGGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG ATGGTGOGGT 960 

ACCGGATCGT GACCCAGACC CCACAGAGCC CGTCCCAGAA TATGTTCACC ATCAACAGCG 1020 

AGACTGGAGA TATOGTCACA GTGGCGGCTG GCCTGGACCG AGAGAAAGTT CAGCAGTACA 1080 

CAGTCATCGT TCAGGCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

CCACAGCCAT CATCAOGGTQ ACAGATGTGA ATGACAACCC GCCAGAATTT ACCGCCAGCA 1200 

CGTTTGCAGG GGAGGTCCCC GAAAACCGCG TGGAGACCGT GGTOGCAAAC CTCACGGTGA 1260 

TGGACCGAGA TCAGCCCCAC TCTCCAAACT GGAATGCCGT TTACCGCATC ATCAGTGGGG 1320 

ATCCATCOGG GCACTTCAGC GTCCGCACAG ACCCCGTAAC CAACGAGGGC ATGGTCACCG 1380 

TGGTGAAGGC AGTCGACTAC GAGCTCAACA GAGCTTTCAT GCTGACAGTG ATGGTGTCCA 1440 

ACCAGGCGCC CCTGGCCAGC GGAATCCAGA TGTCCTTCCA GTCCACGGCA GGGGTGACCA 1500 

TCTCCATCAT GGACATCAAC GAGGCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 1560 

TGGAGGAGGG CGTGCCCCCC GGCACCGTGC TGACCACGTT TTCAGCTGTG GACCCTGACC 1620 

GGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGCGAGC TGGCTGCACA 1680 

TCAATGCCAC CAACGGCCAG ATCACCACGG CGGCAGTGCT GGACCGTGAG TCCCTCTACA 1740 

CCAAAAACAA CGTCTACGAG GCCACCTTCC TGGCAGCTGA CAATGGGATA CCCCCGGCCA 1800 

GCGGCACCGG GACCCTCCAG ATCTATCTCA TTGACATCAA CGACAACGCC CCTGAGCTGC 1860 

TGCCCAAGGA GGCGCAGATC TGCGAGAAGC CCAACCTGAA CGCCATCAAC ATCACGGCGG 1920 

CCGACGCTGA CGTCGACCCC AACATCGGCC CCTACGTCTT CGAGCTGCCC TTTGTCCCGG 1980 

CGGCCGTGCG GAAGAACTGG ACCATCACCC GCCTGAACGG TGACTATGCC CAACTCAGCT 2040 

TGCGCATCCT GTACCTGGAG GCCGGGATGT ATGACGTCCC CATCATCGTC ACAGACTCTG 2100 

GAAACCCTCC CCTGTCCAAC ACGTCCATCA TCAAAGTCAA GGTGTGCCCA TGTGATGACA 2160 

ACGGGGACTG CACCACCATT GGCGCAGTGG CAGCGGCTGG TCTGGGCACC GGTGCCATCG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCATGGT CCTGCTGTTT GTCATGTGGA 2280 

TGAAGCGGCG AGAGAAGGAG CGOCACACGA AGCAGCTGCT CATTGACCCC GAGGACGACG 2340 

TCCGCGACAA CATCCTCAAG TATGACGAGG AAGGCGGTGG CGAGGAGGAC CAGGACTACG 2400 

ACCTCAGCCA GCTGCAGCAG CCGGAAGCCA TGGGGCACGT GCCAAGCAAA GCCCCTGGCG 2460 

TGCGTCGCGT GGATGAGCGG CCGGTGGGOG CTGAGCCCCA GTACCCGATC AGGCCCATGG 2520 

TGCCGCACCC AGGCGACATC GGTGACTTCA TCAATGAGGG ACTCOGCGCT GCTGACAACG 2580 

ACCCCACGGC ACCCCCCTAT GACTCCCTGC TGGTCTTCGA CTAOGAGGGG AGCGGCTCCA 2640 

CCGCAGGCTC CGTCAGCTCC CTGAACTCAT CCAGTTCCGG GGACCAAGAC TACGATTACC 2700 

TCAACGACTG GGGGCCCAGA TTCAAGAAGC TGGCGGACAT GTATGGAGGT GGTGAAGAGG 2760 

ATTGACTGAC CTCGCATCTT CGGACCGAAG TGAGAGCCGT GCTCGGACGC CGGAGGAGCA 2820 

GGACTGAGCA GAGGCGGCCG GTCTTCCCGA CTCCCTGCGG CTGTGTCCTT AGTGCTGTTA 2880 

GGAGGCCCCC CAATCCCCAC GTTGAGCTGT CTAGCATGAG CACCCACCCC CACAGCGCCC 2940 

TGCACCCGGC CGCTGCCCAG CACCGCGCTG GCTGGCACTG AAGGACAGCA AGAGGCACTC 3000 

TGTCTTCACT TGAATTTCCT AGAACAGAAG CACTGTTTTT AAAAAAAAAA AAAAAAAAAG 3060 
AAG 

Seq ID NO: 286 Protein sequence 
Protein Accession ft: NF_001785.2 

1 11 21 31 41 SI 

I I I I I I 

MTAGAGVLLL LLSLSGALRA HNEDLTTRET CKAGPSEDDY TALISQNILE GEKLLQVKFS 60 

SCVGTKGTQY ETNSMDFKVG ADGTVFATRE LQVPSEQVAF TVTAWDSQTA EXWDAWRLL 120 

VAQTSSPHSG HKPQKGKKW ALDPSPPPKD TLLPWPQHQN ANGLRRRKRD WVIPPINVPB 180 

NSRGPFPQQL VRIRSDKDND IPIRYSITGV GADQPPMEVF SIDSMSGRMY VTRPMDREEH 240 

ASYHLRAHAV DMNGNKVENP IDLYIYVIDM NDNRPEFINQ VYNGSVDEGS KPGTYVMTVT 300 

ANDADDSTTA NGMVRYRIVT QTPQSPSQNM FTINSETGDI VTVAAGLDRB KVQQYTVIVQ 360 
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ATDMEGNLNY GLSNTATAII TVTDVNDNPP EFTASTFAGE VPENRVETW ANLTVMDRDQ 420 

PHSPNWNAVY RIISGDPSGH PSVRTDPVTN EGMVTWKAV DYELNRAFNL TVMVSNQAPL 480 

ASGIQMSFQS TA6VTZSIMD INEAPYPPSN HKLIRLEEGV PPGTVLTTFS AVDPDRFMQQ 540 

AVRYSKLSDP ASWLHINATN GQITTAAVLD RESLYTKNNV YEATFLAADN GIPPASGTGT 600 

LQIYliIDIND NAPELLPKEA QICEKPNLNA INITAADADV DPNIGPYVFE LPFVPAAVRK 660 

NWTITRLNGD YAQLSLRILY LEAGMYDVPI IVTDSGNPPL SNTSIIKVKV CPCDDNGDCT 720 

TIGAVAAAGL GTGAIVAILI CILILLTMVL LFVMWMKRRE KERHTKQLLI DPEDDVRDNI 780 

LKYDEEGGGE EDQDYDLSQL QQPEAMGHV^ SKAPGVRRVD ERPVGAEPQY PIRPMVPHPG 840 

DIGDFINEGL RAADNDPTAP PYDSLLVFDJT EGSGSTAGSV SSLMSSSSGD QDYDYLNDWG 900 
PRFKKLADMY GGGEED V 

Seq ID NO: 287 DNA sequence • 
Nucleic Acid Accession #: AF1S2495.1 
Coding sequence 1 . . 23 97 

1 11 21 31 41 51 

I I I I I i 

ATGGAGGCCG GAGAGGGGAA GGAGCGCGTT CCGAAACAAA GGCAAGTCCT GATATTCTTT 60 

GTTTTGCTGG GCATAGCTCA GGCTAGTTGC CAGCCTAGGC ACTATTCAGT GGCCGAGGAA 120 

ACGGAGAGTG GCTCCTTTGT GGCCAA7TTG TTAAAAGACC TGGGGCTGGA GATAGGAGAA 180 

CTTGCTGTGA GGGGGGCCAG GGTCGTTTCC AAAGGAAAAA AAATGCATTT GCAGTTCGAT 240 

AGGCAGACGG GGGATTTGTT GTTAAATGAG AAATTGGACC GGGAGGAGCT GTGCGGCCCC 300 

ACAGAGCCCT GTGTCCTACC TTTCCAGGTG TTACTAGAAA ATCCCTTGCA GTTTTTTCAG 360 

GCGGAGCTAC GGATTAGGQA CGTAAATGAT CATTCCCCAG TTTTCCTAGA CAAAGAAATA 420 

CTTTTGAAAA TTCCAGAAAG TATCACTCCT GGAACTACTT TCTTAATAGA ACGTGCCCAG 480 

GACTTGGATG TAGGAACCAA CAGTCTCCAA AATTACACAA TCAGTCCCAA TTTCCACTTT 540 

CATCTTAATT TACAAGACAG TCTCGATGGC ATAATATTAC CACAGCTGGT GCTGAACAGA 600 

GCCCTGGATC GCGAGGAGCA GCCTGAGATC AGGTTAACCC TCACAGCGCT AGATGGCGGG 660 

AGTCCACCCA GGTCCGGCAC GGCCCTGGTA CGGATTGAAG TTGTGGACAT CAATGACAAC 720 

GTCCCAGAGT TTGCAAAGCT GCTCTATGAG GTGCAGATCC CGGAGGACAG CCCCGTTGGA 780 

TCCCAGGTTG CCATCGTCTC TGCCAGGGAT TTAGACATTG GAACTAATGG AGAAATATCT 840 

TATGCATTTT CCCAAGCATC TGAAGACATT CGCAAAACGT TTCGATTAAG TGCAAAATCG 900 

GGAGAACTGC TTTTAAGACA GAAACTGGAT TTCGAATCCA TCCAGACATA CACAGTAAAT 960 

ATTCAGGCGA CAGATGGTGG GGGCCTATCT GGAACTTGTG TGGTATTTGT CCAAGTGATG 1020 

GATTTGAATG ACAATCCTCC GGAACTAACT ATGTCGACAC TTATCAATCA GATCCCAGAA 1080 

AACTTGCAGG ACACCCTCAT TGCTGTATTC AGCGTTTCAG ATCCTGACTC CGGAGACAAC 1140 

GGAAGGATGG TGTGCTCCAT CCAAGATGAT CTTCCTTTTT TCTTGAAACC TTCTGTTGAG 1200 

AACTTTTACA CTCTGGTGAT AAGCACGGCC CTGGACCGGG AGACCAGATC CGAATACAAC 1260 

ATCACCATCA CCGTCACCGA CTTCGGGACA CCCAGGCTGA AAACCGAGCA CAACATAACC 1320 

GTGCTGQTCT CCGACGTCAA TGACAACGCC CCCGCCTTCA CCCAAACCTC CTACACCCTG 1380 

TTCGTCCGCG AGAACAACAG CCCCGCCCTG CACATCGGCA GCGTCAGCGC CACAGACAGA 1440 

GACTGGGGCA CCAACGCCCA GGTCACCTAC TCGCTGCTGC CGCCCCAGGA CCCGCACCTG 1500 

CCCCTCGCCT CCCTGGTCTC CATCAACGCG GACAACGGCC ACCTGTTCGC TCTCCAGTCG IS 60 

CTGGACTACG AGGCCCTGCA GGCGTTCGAG TTCCGCGTGG GCGCCGCAGA CCGCGGCTCC 1620 

CCGGCGTTGA GCAGCGAGGC GCTGGTGCGC GTGCTGGTGC TGGAOGCCAA CGACAACTCG 1680 

CCCTTCGTGC TGTACCCGCT GCAGAACGGC TCCGCGCCCT GCACCGAGCT GGTGCCCCGG 1740 

GCGGCCGAGC CGGGCTACCT GGTGACCAAG GTGGTGGCGG TGGACGGCGA CTCGGGCCAG 1800 

AACGCCTGGC TGTCGTACCA GCTGCTCAAG GCCACGGAGC CCGGGCTGTT CGGCGTGTGG 1860 

GCGCACAATG GCGAGGTGCG CACCGCCAGG CTGCTGAGGG AGCGCGACGC TGCCAAGCAG 1920 

AGGCTGGTGG TGCTGGTCAA GGACAATGGC GAGCCTCCGC GCTCGGCCAC CGCCACGCTG 1980 

CACGTGCTCC TGGTGGACGG CTTCTCCCAG CCCTACCTGC TGCTCCCGGA GGCGGCACCG 2040 

GCCCAGGCCC AGGCOGACTT GCTCACCGTC TACCTGGTGG TGGCGTTGGC CTCGGTGTCT 2100 

TCGCTCTTCC TCTTCTCGGT GCTCCTGTTC GTGGCGGTGC GGCTGTGCAG GAGGAGCAGG 2160 

GCGGCCTCGG TGGGTCGCTG CTCGGTGCCC GAGGGCCCCT TTCCAGGGCA GATGGTGGAC 2220 

GTGAGCGGCA CCGGGACCCT GTCCCAGAGC TACCAGTACG AGGTGTGTCT GACTGGAGGC 2280 

TCGGGGACAA ATGAGTTCAA GTTCCTGAAG CCAATTATCC CCAACTTCGT TGCTCAGGGT 2340 
GCAGAGAGGG TTAGCGAGGC AAATCCCAGT TTCAGGAAGA GCTTTGAATT CACTTAA 



Seq ID NO: 288 Protein sequence 
Protein Accession #i AAD43756.1 



1 11 21 31 41 51 

I I I I I I 

MEAGEGKERV PKQRQVLIFP VLLGIAQASC QPRHYSVAEE TESGSPVANL LKDLGLEIGE 60 

LAVRGARWS KGKKHHLQFD RQTGDLLLNE FXDRBELCGP TEPCVLPFQV LLENPLQFFQ 120 

AELRIRDVND HSPVFLDKEI LLKIPESITP GTTPLIERAQ DLDVGTNSLQ NYTISPNFHF 180 

HLNLQDSLDG IILPQLVLNR ALDREEQPEI RLTLTALDGG SPPRSGTALV RIEWDINDN 240 

VPEPAKLLYE VQIPEDSPVG SQVAIVSARD LDIGTNGEIS YAPSQASEDI RKTFRLSAKS 300 

GELLLRQKLD FESIQTYTVN IQATDGGGLS GTCWFVQVM DLNDNPPELT MSTLINQIPE 360 

NLQDTLIAVF SVSDPDSGDN GRKVCSIQDD LPPPLKPSVE NPYTLVISTA LDRETRSEYN 420 

ITITVTDFGT PRLKTEHNIT VLVSDVNDNA PAPTQTSYTL PVRENNSPAL HIGSVSATDR 480 

DSGTNAQVTY SLLPPQDPHL PLASLVSINA DNGHLFALQS LDYKALQAFE FRVGAADRGS 540 

PALSSEAIiVR VLVUJANDNS PPVLYPLQNG SAPCTELVPR AAEPGYLVTK WAVDGDSGQ 600 

NAWLSYQLLK ATEPGLFGVW AHNGEVRTAR LLRERDAAKQ RLWLVKDNG EPPRSATATL 660 

HVLLVDGPSQ PYLLLPEAAP AQAQADLLTV YLWALASVS SLPLPSVLLP VAVRLCRRSR 720 

AASVGRCSVP EGPFPGQMVD VSGTGTLSQS YQYEVCLTGG SGTNEFKFLK PIIPNFVAQG 780 
AERVSEANPS FRKSFEPT 



Seq ID NO: 289 DNA sequence 

Nucleic Acid Accession ft: NM_018674.l 

Coding sequence: 390.. 2009 

1 11 21 31 41 51 

I I I I I I 
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CGGAGCACAT GCTGAGCGGA GOGGCTGGGG CTGCGCGGCG TGGCGGAGCA GOGCTCGCTC 60 

CCTCGCTCAC TCGCTOGCTC GCAGGGACAC ACGCAGGGGC TGACAGCTGT GCTGGTGCTG 120 

ATAAGGGAAG CCACAAGGAG ACGATCGAGG AGAGAGACAA GCGGCAGCAG AGGCAGCAGC 180 

GGCAGAGGCA GCACCAGGGC TGCGGAGCTG CTGGGAGTGG GAGTGACTCC CCCACCTCGG 240 

GCCCCCACCC TGTCCCTGTC CTCTTCCCGC TTGCCCTGAG TTTAGAAGAG CAGCCGCTGC 300 

CACCACTGCC ACTCGGGAGG GCACCAGGGC TGCTGGCTAG GGAGGGACAG GGCAGGGAGG 360 

CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTGTGC AAAATCAAAT 420 

TTGCTGAGGA GGATGCGAAA CCCAAGGAGA AGGAGGCAGG GGATGAGCAG AGCCTCCTCG 480 

GGGCTGTTGC CCCTGGAGCA GCCCCCCGAG ACCTGGCCAC CTTTGCCAGC ACCAGCACCC 540 

TGCATGGACT GGGCCGGGCC TGTGGCCCAG GCCCCCACGG ACTGCGCAGA ACCCTGTGGG 600 

CACTGGCCCT ACTCACCTCG CTGGCTGCCT TCCTGTACCA GGCGGCTGGC CTGGCCCGGG 660 

GCTACCTGAC CCGGCCTCAC CTGGTGGCAA TGGACCCCGC TGCCCCAGCC CCAGTGGCGG 720 

GCTTCCCGGC TGTCACCCTC TGCAATATCA ACCGCTTCCG GCATTCGGCA CTCAGCGATG 780 

CCGACATCTT CCACCTGGCC AATCTGACAG GGCTGCCCCC CAAAGACCGG GATGGGCACC 840 

GTGCGGCTGG CCTGCGCTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC CGCACTGGCC 900 

ACCAGCTCGC CGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TCCGCCAGCA 960 

ACTTCTCTGT GGTCTATACT CGCTATGGGA AGTGTTACAC CTTCAAOGCG GACCCGCGGA 1020 

GCTCGCTGCC CAGCCGGGCA GGGGGCATGG GCAGTGGCCT GGAGATCATG CTGGACATCC 1080 

AGCAGGAGGA GTACCTGCCC A7CTGGAGGG AGACAAATGA GACGTCGTTT GAGGCAGGTA 1140 

TTCGGGTGCA GATCCACAGC CAGGAGGAGC CGCCCTACAT CCACCAGCTG GGGTTCGGGG 1200 

TGTCCOCAGG CTTCCAGACC TTTGTGTCCT GCCAGGAACA GCGGCTGACC TACCTGCCCC 1260 

AGCCCTGGGG CAACTGCCGC GCAGAGAGTG AGCTCAGGGA GCCTGAGCTT CAGGGCTACT 1320 

CGGCCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GGAGGCCGTG CTTCAGCGCT 1380 

GCCACTGCCG GATGGTGCAC ATGCCAGGCA ATGAGACCAT CTGCCCACCA AATATCTACA 1440 

TCGAGTGTGC AGACCACACA CTGGACTCCC TGGGTGGGGG CCCTGAGGGC CCGTGCTTCT 1500 

GCCCCACCCC CTGCAACCTG ACACGCTATG GGAAAGAGAT CTCCATGGTC AGGATCCCCA 1560 

ACAGGGGCTC AGCCCGGTAC CTGGCGAGGA AGTACAACOG CAACGAGACC TACATACGGG 1620 

AGAACTTCCT GGTCCTAGAT GTCTTCTTTG AGGCCCTGAC CTCTGAAGCC ATGGAGCAGC 1680 

GAGCAGCCTA TGGCCTGTCA GCCCTGCTGG GAGACCTCGG GGGACAGATG GGCCTGTTCA 1740 

TTGGGGCCAG CATCCTCACG TTGCTGGAGA TCCTCGACTA CATCTATGAG GTGTCCTGGG 1800 

ATCGACTGAA GOGGGTATGG AGGCGTCCCA AGACCCCCCT GCGGACCTCC ACTGGGGGCA 1860 

TCTCCACTTT GGGGCTTCAG GAGCTGAAGG AACAGAGTCC CTGCCCGAGC CTGGGCCGAG 1920 

CGGAGGGTGG GGGGGTCAGC AGTCTGCTCC CCAATCACCA CCACCCCCAC GGTCCCCCAG 1980 

GAGGTCTCTT TGAAGATTTT GCTTGCTAGG ACGGTGCTGT GACTGAAAGG ACCCAGGAGT 2040 

CTGGGACCCC TCCTGGGATC CCCAGCACAT TCTCCTGCTC CTGGGAGAGG CCTGGGGGCG 2100 

GTGCTCACTG GGAGGGCCAG GACTCAGTTC CTGCTCTCAT CCTCCCCTGC CCTGATGTCA 2160 

GCTGCTTTGC ACAAAGGTCC TTCTTGTCCA CACCCCTTAT CCCCAGGCTG GTGCCCOGGG 2220 

AGGGCTGGAG ACCAGGCCAT GGGCCCTCAC GGAGAGGAAG GGAAGGAAGG AGAGGGAGGG 2280 

GGAGGATAGA GCCCATCCCA GCCGGGGAGG GGGAGCCCTC TGTACATTTG TAAATATTTA 2340 

GGGAAAGCCG GGTGGGGGGA GGGGATACAG ATGTAGAAGG TGGGTAGGGC TACAGGGGTG 2400 

GGTGATTTAG GGACAGCCAG GGTCCCAGCC CCAATGTCAG CAGGATAGGG AGAGCCCCAG 2460 

GACTCAGGAG TGCTGGGCTG GTCCTACTTC CTGCCCCTCT CCAGGCCCAG CTCCCCTCTT 2520 

GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC CCAGTTCCCC CTGCACCAGC 2580 

CCCACCCCTA GAGTCCCTTC TATAGGGAGG GGGCAGGAGA CCTTCCAGAC TTCGGCTGAG 2640 

CTTGGAGGGT GGGAAGGGAG CCTTCTCAGT CCTCTCTCCC TCCAGTCTGA TTTTATAAAG 2700 
TGCTGACGAG 

Seq ID NO: 290 Protein sequence 
Protein Accession 8: NP_061144.1 

1 11 21 31 41 51 

I I I I I I 

MPIEIVCKIK FABEDAKPKE KEAGDEQSLL GAVAPGAAPR DLATPASTST LHGLGRACGP 60 

GPHGLRRTLW ALALLTSLAA FLYQAAGLAR GYLTRPHLVA MDPAAPAPVA GFPAVTLCNI 120 

NRFRHSALSO ADIFHLANLT GLPPKDRDGH RAAGLRYPBP DMVDILNRTG EQLADMLKSC 180 

NFSGHHCSAS NFSWYTRYG KCYTFNADPR SSLPSRAGGM GSGLEIMLDI QQSEYLPIWR 240 

BTNETSFEAG IRVQIHSQEE PPYIHQLGFG VSPGFQTFVS CQEQRLTYLP QPWGNCRAES 300 

ELREPELQGY SAYSVSACRL RCEKEAVJJQR CHCRMVHMPQ NETICPPNIY IECAD3TLD5 360 

LGGGPEGPCF CPTPCNLTRY GKEISMVRIP NRGSARYLAR KYNRNETYIR ENFLVLDVFP 420 

EALTSEAMEQ RAAYGLSALL GDLGGQMGLF IGASILTLLB ILDYIYEVSW DRLKRVWRRP 480 
KTPLRTSTGG ISTLGLQELK EQSPCPSLGR AEGGGVSSLL PNHHHPEGPP GGLFEDFAC 

Seq ID NO: 291 DNA sequence 
^ Nucleic Acid Accession #: Eos sequence 

65 Coding sequence: 62.. 895 

1 11 21 31 41 51 

I I I I I I 

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 60 

70 TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG GATGGGGATT 120 

CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG TGTACCACAG 180 

AGAAGCACGG TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG TGTGTGAATT 240 

TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 300 

_ _ TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360 

75 GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAA TAGG AG 420 

TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 480 

AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTAOGAAG ATAACCAAAT 540 

CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGOGTATT CACCTGAGTT TTTTAGATTT 600 

TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660 

OU TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCA TCAG 720 

TACAGGAAAX GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAGTGACAG CTGGAGGTTT 780 

CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840 

TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA 900 

AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960 
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CTCACTGTTA 
TAGGGAAAAT 
ACTGCATAGA 
TTTGTGGTAT 
TCTATGTACA 
TCATTGATTA 
TGTTTTATGC 
ATTGTTGCAA 



TTATTAACAT 
TGGAAAATAT 
AATAACAAGC 
ATCTATATAT 
GTTTTGTATT 
TTCTACAAAA 
ATTATTTAAG 
TAAATATCCT 



TTATTTATTA 
AG6AAACTTT 
GTTAACATTT 
GTACCTATAT 
ATACTTTTTA 
ACATGATTTT 
CCTGTCTCTA 
TQAACACACA 



TTTTTCTAAA 
AAAGGAGAAA 
TCATATTTTT 
GTATTTGCAT 
AATCTTGAAC 
AAACAGCTGT 
TTGTTGGAAT 
AAAAAAAAAA 



TGTGAAAGCA 
ATGAAACCTC 
TTCTTTCAGT 
TTQAAATTTT 
TTTATAAACA 
AAAATATTCT 
TTCAGGTCAT 
AA 



ATACATAATT 1020 

TCATAATCCC 1080 

CATTTTTCTA 1140 

GGAATCCTGC 1200 

TTTTCTGAAA 1260 

ATGATATGAA 1320 

TTTCATAAAT 1380 



Seq ID NO: 292 Protein sequence 
Protein Accession fl: Eos sequence 

11 



51 



1 11 21 31 41 

I I I I I I 

MIILIYLFLL LWEDTQGWGP KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKEOGGVFT DPKQIFKSPG PPNEYEDNQI CYWHIRLKYG QRIHI>SFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL 8DASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 

Seq ID NO: 293 DNA sequence 

Nucleic Acid Accession #: NM_0071lS.l 

Coding sequence: 69., 902 



GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 
GTTAOGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
CCTGCTCTAT 
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 
I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 
I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 
I 

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTGTT 
ATTC 



41 

i 

AGGCTGTTAC 
ATGGGAAGAC 
TGAACGAGCA 
CGCAGAAGCT 
AGAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 
AGATGAGCTT 
TGATGCTTCA 
CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTTCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



51 

1 

TTCACTACAA 
ACTCAAGGAT 
GCOGGTGTGT 
AAGGCGGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCCGTCTCA 
TGTGGTGGCG 
TACGAAGATA 
CTGAGTTTTT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACTCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



Seq ID NO: 294 Protein sequence 
Protein Accession «: NP_00 9046.1 

1 11 21 31 41 SI 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGP 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



Seq ID NO: 295 DNA sequence 

Nucleic Acid Accession 8» NMJJ01218.2 

Coding sequence: 116.. 1180 



1 
I 

GTACTCGCCA 
CGGOCACCGG 
CCGGCGCAGC 
CAGCCCGGCC 
CTGGTCCAAG 
TGACATCCTC 
TGCCAACAAG 
GGACATGCAC 
GGGGAACCCG 
CGAGCTGCAC 
CAAGTCAGAA 
CTATGACAAG 
CCCGGGATTC 
GGGGTCCCTG 
CGTGCAAATT 



11 
I 

CGGCACCCAG 
GAGCCCCCGG 
CTGCACGCGG 
CCAGTGAACG 
AAGTACCCGT 
CAGTATGACG 
CAGTTTCTCC 
ATCCAGGGCC 
AATGACCCGC 
ATTGTCCATT 
GGCCTCGCTG 
ATCTTCAGTC 
AACATTGAAG 
ACCACACCCC 
TCCCAGGAGC 



21 
I 

GCTGCGCGCA 
CACAGCCCGC 
CGGCCGTGCT 
GTTCCAAGTG 
CGTGTGGGGG 
CCAGCCTCAC 
TGACCAACAA 
TCCAGTCTCG 
ACGGCTCTGA 
ATAACTCAGA 
TCCTGGCTGT 
ACCTTCAACA 
AGCTGCTTCC 
CTTGCAACCC 
AGCTGCTGGC 



31 
I 

CGOGGTCCCG 
GCCCGCCCCG 
CCTGCTGGTG 
GACTTATTTT 
CCTGCTGCAG 
GCCCCTCGAG 
TGGCCATTCA 
CTACAGTGCC 
GCACACCGTC 
CCTTTATCCT 
TCTCATTGAG 
TGTAAAGTAC 
GGAGAGGACC 
CACTGTGCTC 
TTTGGAGACA 



41 

I 

GTGTGCAGCT 
CAGGAGCCCG 
ATCTTAAAGG 
GGTCCTGATG 
TCCCCCATAG 
TTCCAAGGCT 
GTGAAGCTGA 
ACGCAGCTGC 
AGCGGACAGC 
GACGCCAGCA 
ATGGGCTCCT 
AAAGGCCAGG 
GCTGAATATT 
TGGACAGTTT 
GCCCTGTACT 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 



51 
1 

GGAGAGCGAG 
CGAAGATGCC 
AAGAGCCTTC 
GGGAGAATAG 
ACCTGCACAG 
ACAATCTGTC 
ACCTGCCCTC 
ACCTGCACTG 
ACTTCGCCGC 
CTGCCAGCAA 
TCAATCCGTC 
AAGCATTCGT 
ACCGCTACCG 
TCCGAAACCC 
GCACACACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1066 



WO 03/042661 



PCT/US02/36810 



GGACGACCCT TCCCCCAGAG AAATGATCAA CAACTTCCGG CAGGTCCAGA AGTTOGATGA 960 

GAGGCTGGTA TACACCTCCT TCTCCCAAOT GCAAGTCTGT ACTGCGGCAG GACTGAGTCT 1020 

GGGCATCATC CTCTCACTGG CCCTOGCTOG CATTCTTGGC ATCTGTATTG TGQTGGTGGT 1080 

GTCCATTTGG CTTTTCAGAA GGAAGAGTAT CAAAAAAGGT GATAACAAGG GAGTCATTTA 1140 

5 CAAGCCAGCC ACCAAGATGG AGACTGAGGC CCAOGCTTGA GGTCCCCGGA GCTCCCGGGC 1200 

ACATCCAGGA AGGACCTTGC TTTGGACCCT ACACACTTCG GCTCTCTGGA CACTTGCGAC 1260 

ACCTCAAGGT GTTCTCTGTA GCTCAATCTG CAAACATGCC AGGOCTCAGG GATCCTCTGC 1320 

TGGGTGCCTC CTTGCCTTGG GACCATGGCC ACCCCAGAGC CATCCGATCG ATGGATGGGA 1380 

TGCACTCTCA GACCAAGCAG CAGGAATTCA AAGCTGCTTG CTGTAACTGT GTGAGATTGT 1440 

10 GAAGTGGTCT GAATTCTGGA ATCACAAACC AAGCCATGCT GGTGGGCCAT TAATGGTTGG IS 00 

AAAACACTTT CATCCGGGGC TTTGCCAGAG OGTGCTTTCA AGTGTCCTGG AAAGTCTGCT 1560 

GCTTCTCCAA GCTTTCAGAC AAGAATGTGC ACTCTCTGCT TAGGTTTTGC TTGGGAAACT 1620 

CAACTTCTTT CCTCTGGAGA CGGGGCATCT CCCTCTGATT TCCTTCTGCT ATGACAAAAC 1680 

CTTTAATCTG CACCTTACAA CTCGGGGACA AATGGGGACA GGAAGGATCA AGTTGTAGAG 1740 

15 AGAAAAAAGA AAACAAGAGA TATACATTGT GATATATTAG GGACACTTTC ACAGTCCTGT 1800 

CCTCTGGATC ACAGACACTG CACAGACCTT AGGGAATGGC AGGTTCAAGT TCCACTTCTT 1860 

GGTGGGGATG AGAAGGGAGA GAGAGCTAGA GGGACAAAGA GAATGAGAAG ACATGGATGA 1920 

TCTGGGAGAG TCTCACTTTG GAATCAGAAT TGGAATCACA TTCTGTTTAT CAAGCCATAA 1980 

TGTAAGGACA GAATAATACA ATATTAAGTC CAAATCCAAC CTCCTGTCAG TGGAGCAGTT 2040 

20 ATGTTTTATA CTCTACAGAT TTTACAAATA ATGAGGCTGT TCCT TGAAA A TGTGTTGTTG 2100 

CTGTGTCCTG GAGGAGACAT GAGTTCCGAG ATGACCCAAT CTGCCTTTGA ATCTGGAGGA 2160 

AATAGGCAGA AACAAAATGA CTGTAGAACT TATTCTCTGT AGGCCAAATT TCATTTCAGC 2220 

CACTTCTGCA GGATCCCTAC TGCCAACCTG GAATGGAGAC TTTTATCTAC TTCTCTCTCT 2280 

CTGAAGATGT CAAATCGTGG TTTAGATCAA ATATATTTCA AGCTATAAAA GCAGGAGGTT 2340 

25 ATCTGTGCAG GGGGCTGGCA TCATGTATTT AGGGGCAAGT AATAATGGAA TGCTACTAAG 2400 

ATACTCCATA TTCTTCCCOG AATCACACAG ACAGTTTCTG ACAGGCGCAA CTCCTCCATT 2460 

TTCCTCCCGC AGGTGAGAAC CCTGTGGAGA TGAGTCAGTG CCATGACTGA GAAGGAACCG 2520 

ACCCCTAGTT GAGAGCACCT TGCAGTTC<CC CGAGAACTTT CTGATTCACA GTCTCATTTT 2580 

GACAGCATGA AATGTCCTCT TGAAGCATAG CTTTTTAAAT ATCTTTTTCC TTCTACTCCT 2640 

30 CCCTCTGACT CTAAGAATTC TCTCTTCTGG AATCGCTTGA ACCCAGGAGG CGGAGGTTGC 2700 

AGTAAGCCAA GGTCATGCCA CTGCAC^CTA GCCTGGGTGA CAGAGCGAGA CTCCATCTCA 2760 
AAAAAAAAAA AAAAA f 

Seq ID NO: 296 Protein sequence 
35 Protein Accession #: NP_001209.1 

1 11 21 31 41 51 

11)111 
MPRRSLHAAA VLLLVILKEQ PSSPAPWGS KWTYFGPDGE NSWSKKYPSC GGLLQSPIDL 60 
40 HSDILQYDAS LTPLEFQGYN LSANKQFLLT NNGHSVKLNL PSDMHIQGLQ SRYSATQLHL 120 
HWGNPNDPHG SEHTVSGQHP AAELHIVHYN SDLYPDASTA SNKSEGLAVL AVLIEMGSFN 180 
PSYDKIFSHL QKVKYKGQEA FVPGFNIBEL LPERTAEYYR YKGSLTTPPC NPTVLWTVFR 240 
NPVQISQEQL LALETALYCT HMDDPSPREM INNFRQVQKF DERLVYTSFS QVQVCTAAGL 300 
SLGIILSLAL AGILGICIW WSIWLPRRK SIKKGDNKGV IYKPATKMET BAHA 

45 

Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession #: NM_006632.1 

Coding sequence : 377 . . 1582 

50 1 11 21 31 41 51 

ACGCGTCCGC CCACGCGTCC GCCCACGOGT COGGTCGGGG CCAGAGCGCA GGTGTACCTG 60 

GCGGCCGTGC TGGAGCACCT GACGGCCGAG ATOCTGGAGC TGGCTGGCAA CCCGGCCCGC 120 

GACAAGAAGA CCCGCATCAT CCTGCGCCAC CTGTAGCTGG CCATTCGCAA CGGCGAGGAG 180 

55 CTTAACAAGC TGCTGGGCGA AGTCACCATC GCGCAGGGCG GTGTCCTGCC CAACATTCAG 240 

GGCGTGCTTC TGCCCCAGAA GACCAAGAGC CACCACAAGG CCAAGGGTGA AAACCATTCA 300 

CTAGGAGAGG AGAAACACAA TGGCCACCAA GACAGAGTTG AGTCCCACAG CAAGGG AGAG 360 

CAAGAACGCA CAAGATATGC AAGTGGATGA GACACTGATC CCCAGGAAAG GTCCAAGTTT 420 

ATGTTCTGCT CGCTATGGAA TAGCCCTCGT CTTACATTTC TGCAATTTCA CAACGATAGC 480 

60 ACAAAATGTC ATCATGAACA TCACCATGGT AGCCATGGTC AACAGCACAA GCCCTCAATC 540 

CCAGCTCAAT GATTCCTCTG AGGTGCTGCC TGTTGACTCA TTTGGTGGCC TAAGTAAAGC 600 

CCCAAAGAGT CTTCCTGCAA AGTCCTCAAT ACTTGGGGGT CAGTTTGCAA TTTGGGAAAA 660 

GTGGGGCCCT CCACAAGAAC GAAGCAGACT CTGCAGCATT GCTTTATCAG GAATGTTACT 720 

GGGATGCTTT ACTGCCATCC TCATAGGTGG CTTCATTAGT GAAACOCTTG GGTGGCCCTT 780 

65 TGTCTTCTAT ATCTTTGGAG GTGTTGGCTG TGTCTGCTGC CTTCTCTGGT TTGTTGTGAT 840 

TTATGATGAC CCCTTTTCCT ATCCATGGAT AAGCACCTCA GAAAAAGAAT ACATCATATC 900 

CTCCTTGAAA CAACAGGTCG GGTCTTCTAA GCAGCCTCTT CCCATCAAAG CTATGCTCAG 960 

ATCTCTACCC ATTTGGTCCA TATGTTTAGG CTGTTTCAGC CATCAATGGT TAGTTAGCAC 1020 

AATGGTTGTA TACATACCAA CTTACATCAG CTCTGTGTAC CATGTTAACA TCAGAGACAA 1080 

70 TGGACTTCTA TCTGCCCTTC CTTTTATTGT TGCCTGGGTC ATAGGCATGG TGGGAGGCTA 1140 

TCTGGCAGAT TTCCTTCTAA CCAAAAAGTT TAGACTCATC ACTGTGAGGA AAATTGCCAC 1200 

AATTTTAGGA AGTCTCCCCT CTTCAGCACT CATTGTGTCT CTGCCTTAOC TCAATTCCGG 1260 

CTATATCACA GCAACTGCCT TGCTQAOGCT CTCTTGCGGA TTAAGCACAT TGTGTCAGTC 1320 

AGGGATTTAT ATCAATGTCT TAGATATTGC TCCAAGGTAT TCCAGTTTTC TCATGGGAGC 1380 

75 ATCAAGAGGA TTTTOGAGCA TAGCACCTGT CATTGTACCC ACTGT CAGCG GATTTCTTCT 1440 

TAGTCAGGAC CCTGAGTTTG GGTGGAGGAA TGTCTTCTTC TTGCTGTTTG CCGTTAACCT 1500 

GTTAGGACTA CTCTTCTACC TCATATTTGG AGAAGCAGAT GTCCAAGAAT GGGCTAAAGA 1560 

GAGAAAACTC ACTCGTTTAT GAAGTTATCC CACCTTGGAT GGAAAAGTCA TTAGGCACCG 1620 

OA TATTGCATAA AATAGAAGGC TTCCGTGATG AAAATACCAG TGAA AAGATT TTTTTTTCCT 1680 

80 GTGGCTCTTT TCAATTATGA GATGAGTTCA TTATTTTATT CAGACTTTTT TTTGAGAGAA 1740 
ATGTAAGATG AATAAAAATT CAAATAAAAT GATAACTAAG AAAAAAAAAA AAAAA 



Seq ID NO: 298 Protein sequence 
Protein Accession #: NP_006623.1 
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1 11 21 31 41 SI 

I I I I I I 

MQVDETLIPR KGPSLCSARY GIALVLHFCN FTTIAQNVIM NITMVAMVNS TSPQSQLNDS 60 

SEVLPVDSPG GLSKAPKSLP AKSSILGGQF AIWEKWGPPQ ERSRLCSIAL SGMLLGCFTA 120 

ILIGGFISET LGWPFVFYIF GGVGCVCCLL WFWIYDDPF SYPWISTSEK EYIISSLKQQ 180 

VGSSKQPLPI KAMLRSLPIW SICLGCFSHQ WLVSTMWYI PTYISSVYHV NIRDNGLLSA 240 

LPFIVAWVIG MVGGYLADFL LTKKFRLITV RKIATILGSL PSSALIVSLP YUISGYITAT 300 

ALLTLSCGLS TLCQSGIYIN VLDIAPRYSS PLKGASRGFS SIAPVIVPTV SGFLLSQDPS 360 
FGWRNVFFIiL FAVNLLGLLF YLIFGEADVQ EWAKERKLTR L 

Seq ID NO: 299 DNA sequence 

Nucleic Acid Accession ft: NMJ>03058.1 

Coding sequence: 14 S.. 1812 

1 11 21 31 41 51 

1 I I I 1 I 

GGCCCTGCCC TGAAGGCTGG TCACTTGCAG AGGTAAACTC CCCTCTTTGA CTTCTGGCCA 60 

GGGTTTGTGC TGAGCTGGCT GCAGCCGCTC TCAGCCTCGC TCCGGGCACG TCGGGCAGCC 120 

TCGGGCCCTC CTGCCTGCAG GATCATGCCC ACCACCGTGG AOGATGTCCT GGAGCATGGA 180 

GGGGAGTTTC ACTTTTTCCA GAAGCAAATG TTTTTCCTCT TGGCTCTGCT CTCGGCTACC 240 

TTCGCGCCCA TCTACGTGGG CATCGTCTTC CTGGGCTTCA CCCCTGACCA CCGCTGCCGG 300 

AGCCCCGGAG TGGCCGAGCT GAGTCTGCGC TGCGGCTGGA GTCCTGCAGA GGAACTGAAC 360 

TACACGGTGC CGGGCCCAGG ACCTGCGGGC GAAGCCTCCC CAAGACAGTG TAGGOGCTAC 420 

GAGGTGGACT GGAACCAGAG CACCTTTGAC TGCGTGGACC CCCTGGCCAG CCTGGACACC 480 

AACAGGAGCC GCCTGCCACT GGGCCCCTGC CGGGACGGCT GGGTGTACGA GACGCCTGGC 540 

TCGTCCATCG TCACCGAGTT TAACCTGGTA TGTGCCAACT CCTGGATGTT GGACCTATTC 600 

CAGTCATCAG TGAATGTAGG ATTCTTTATT GGCTCTATGA GTATCGGCTA CATAGCAGAC 660 

AGGTTTGGCC GTAAGCTCTG CCTCCTAACT ACAGTCCTCA TAAATGCTGC AGCTGGAGTT 720 

CTCATGGCCA TTTCCCCAAC CTATACGTGG ATGTTAATTT TTCGCTTAAT CCAAGGACTG 780 

GTCAGCAAAG CAGGCTGGTT AATAGGCTAC ATCCTGATTA CAGAATTTGT TGGGCGGAGA 840 

TATCGGAGAA CAGTGGGGAT TTTTTACCAA GTTGCCTATA CAGTTGGGCT CCTGGTGCTA 900 

GCTGGGGTGG CTTACGCACT TCCTCACTGG AGGTGGTTGC AGTTCACAGT TGCTCTGCCC 960 

AACTTCTTCT TCTTGCTCTA TTACTGGTGC ATACCTGAGT CTCCCAGGTG GCTGATCTCC 1020 

CAGAATAAGA ATGCTGAAGC CATGAGAATC ATTAAGCACA TCGCAAAGAA AAATGGAAAA 1080 

TCTCTACCCG CCTCCCTTCA GCGCCTGAGA CTTGAAGAGG AAACTGGCAA GAAATTGAAC 1140 

CCTTCATTTC TTGACTTGGT CAGAACTCCT CAGATAAGGA AACATACTAT GATATTGATG 1200 

TACAACTGGT TCACGAGCTC TGTGCTCTAC CAGGGCCTCA TCATGCACAT GGGCCTTGCA 1260 

GGTGACAATA TCTACCTGGA TTTCTTCTAC TCTGCCCTGG TTGAATTCCC AGCTGCCTTC 1320 

ATGATCATCC TCACCATCGA CCGCATGGGA CGCCGTTACC CTTGGGCTGC ATCAAATATG 1380 

GTTGCAGGGG CAGCCTGTCT GGCCTCAGTT TTTATACCTG GTGATCTACA ATGGCTAAAA 1440 

ATTATTATCT CATGCTTGGG AAGAATGGGG ATCACAATGG CCTATGAGAT AGTCTGCCTG 1500 

GTCAATGCTG AGCTGTACCC CACATTCATT AGGAATCTTG GCGTCCACAT CTGTTCCTCA 1560 

ATGTGTGACA TTGGTGGCAT CATCACGCCA TTCCTGGTCT ACCGGCTCAC TAACATCTGG 1620 

CTTGAGCTCC CGCTGATGGT TTTCGGCGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 1680 

TTGCTTCCAG AAACTAAAGG GAAAGCTTTG CCTGAGACCA TCGAGGAAGC CGAAAATATG 1740 

CAAAGACCAA GAAAAAATAA AGAAAAGATG ATTTACCTCC AAGTTCAGAA ACTAGACATT 1800 

CCATTGAACT AAGAAGAGAG ACCGTTGCTG CTGTCATGAC CTAGCTTTGA TGGCAGCAAG 1860 

ACCAAAAGTA GAAATCCCTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTACCC 1920 

CTGAGCCCTA TCAACCTAGG TCTACAGCCA GTGGAGTCTA TTGTACACTG TGGAAAAATA 1980 

CCCATGGGAC CAOATCCTGC CAAATTCTTC CAGCTCACTT TATTCTCAGC ATTCCTAGGA 2040 

CATTGGACAT TGGTTTTCTG GAGGGTTTTT TTTCCGATCT TTGTATTTTT TTAAATTTGA 2100 

TTCTTTTCTT TGCAATGCTA GCAACCAGAA TACATAGGGG AACTGTGGGC TAGGCAAANA 2160 

AAATAGAAAA AGTGTGAAAA ACAGTAAAGT TGGGAGAGGA GCATCTATTT TCTTAAAGAA 2220 
ATAAAACACC NAAAACAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 300 Protein sequence 
Protein Accession #: NPJJ03049.1 

1 11 21 31 41 51 

i I I I I I 

MPTTVDDVLE HGGEFHFFQK QMFFLLALLS ATFAPIYVGI VFLGFTPDHR CRSPGVABLS 60 

LRCGWSPAEE LNYTVPGPGP AGBASPRQCR RYEVDWNQST FDCVDPLASL DTNRSRLPLG 120 

PCRDGWVYET PGSSIVTBFN LVCANSWMLD LFQSSVNVGF FIGSMSIGYI ADRFGRKLCL 180 

LTTVLZNAAA GVLMAISPTY TWMLIFRLIQ GLVSKAGWLI GYIIiITEFVQ RRYRRTVGIF 240 

YQVAYTVGLL VLAGVAYALP HWRWLQFTVA LPNPFFLLYY KCIPESPRWL ISQNKNAEAM 300 

RIIKHIAKKN GKSLPASLQR LRLEEETGKK LNPSFLDLVR TPQIRKHTMI LMYNWFTSSV 360 

LYQGLIMHMG LAGDNIYLDF FYSALVEFPA AFMIILTIDR IGRRYPWAAS NMVAGAACLA 420 

SVFIPGDLQW LKIIISCLGR MGITMAYEIV CLVNAELYPT FIRNLGVHIC SSMCDIGGII 480 

TPFLVYRIiTN IWLBLPLMVF GVLGLVAGGL VLLLPBTKGK ALPETIEEAE NMQRPRKNKE 540 
KMIYLQVQKL DIPMJ 



Seq ID NO: 301 DNA sequence 
„ Nucleic Acid Accession #: NM_012206.1 
/J Coding sequence: 52.. 1131 



1 11 21 31 41 51 

1 I I I I I 

GTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCTGAACG CTGATCCCAT AATGCATCCT 60 

CAAGTGGTCA TCTTAAGCCT CATCCTACAT CTGGCAGATT CTGTAGCTGG TTCTGTAAAG 120 

GTTGGTGGAG AGGCAGGTCC ATCTGTCACA CTACCCTGCC ACTACAGTGG AGCTGTCACA 180 

TCAATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 240 

ACCAATGGAA CCCACGTCAC CTATCGGAAG OACACACGCT ATAAGCTATT GGGGGACCTT 300 

TCAAGAAGGG ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG TGGCGTATAT 360 



1068 



WO 03/042661 



TGTTGCCGTG TTGAGCACCG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG 420 

ATTGTGCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTGTTCCAAC CGTCAOGACT 480 

GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC CAACGACAAC TCTTCCAACA 540 

ACAATGAGCA TTCCAAOGAC AACGACTGTT CCGACGACAA TGACTGTTTC AACGACAACG 600 

AGCGTTCCAA CGACAACGAG CATTCCAACA ACAACAAGTG TTCCAGTGAC AACAACGGTC 660 

TCTACCTTTG TTCCTCCAAT GCCTTTGCCC AGGCAGAACC ATGAACCAGT AGOCACTTCA 720 

CCATCTTCAC CTCAGCCAGC AGAAACCCAC CCTACGACAC TGCAGGGAGC AATAAGGAGA 780 

GAACCCACCA GCTCACCATT GTACTCTTAC ACAACAGATG GGAATGACAC CGTGACAGAG 840 

TCTTCAGATG GCCTTTGGAA TAACAATCAA ACTCAACTGT TCCTAGAACA TAGTCTACTG 900 

ACGGCCAATA CCACTAAAGG AATCTATGCT GGAGTCTGTA TTTCTGTCTT GGTGCTTCTT 960 

GCTCTTTTGG GTGTCATCAT TGCCAAAAAG TATTTCTTCA AAAAGGAGGT TCAACAACTA 1020 

AGTGTTTCAT TTAGCAGCCT TCAAATTAAA GCTTTGCAAA ATGCAGTTGA AAAGGAAGTC 1080 

CAAGCAGAAG ACAATATCTA CATTGAGAAT AGTCTTTATG CCAOGGACTA AGACCCAGTG 1140 

GTGCTCTTTG AGAGTTTACG CCCATGACTG CAGAAGACTG AACAGGTATC AGCACATCAG 1200 

ATGTCTTTTA GACTCCAAGA CAATTTTTCT GTTTCAGTTT CATCTGGCAT TCCAACATGT 1260 

CAGTGATACT GGGTAGAGTA ACTCTCCCAC TCCAAACTGT GTATAGTCAA CCTCATCATT 1320 

AATGTAGTCC TAATTTGTTT TGCTAAAACT GGCTCAATCC TTCTGATCAT TGCAGAGTTT 1380 
TCTCTCAAAC ATGAACACTT TAGAATTGTA TGTTCTCTTT AGACCOCATA AATCCTGTAT 

Seq ID NO: 302 Protein sequence 
Protein Accession #: NP_036338.1 

1 11 21 31 41 51 

MHPQWILSL ILHLADSVAG SVKVGGEAdP SVTLPCHYSG AVTSMCWNRG SCSLFTCQNG 60 

IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIKNTAVSDS GVYCCRVEHR GWFNDMKITV 120 

SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPTTMSIPTT TTVPTTMTVS 180 

TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 240 

IRREPTSSPL YSYTTDGNDT VTESSDGLWN NNQTQLFLEH SIiLTANTTKG IYAGVCISVL 300 
VLLALLGVII AKKYFFKKEV QQI.SVSPSSL QIKALQNAVE KEVQAEDNIY IEMSLYATD 

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: NMJ)01044.1 

Coding sequence: 129.. 1991 

1 11 21 31 41 51 

I I I.I I i 

ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAACGCTCT CGGCGCCAGG ACTCGCGTGC 60 

AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120 

GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCCGTG GTGGCCCCGG 180 

CTAAGGAGCC CAATGCCGTG GGCCGGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 240 

ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGOG GCAGAGCCCC GTGGAGGCCC 300 

AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360 

TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT 420 

TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCAOGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCGGT 840 

GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020 

TGAGCGTTGA CTTCTACCGG CTCTGCGAGG CGTCTGTTTG GATTQACGOG GCCACCCAGG 1080 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATOGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1380 

TGGGTATCGA CAGCGCCATG GGTGGTA7GG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT I860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTAOGCC ATTGCACCCG 1920 

AGAAGGACOG TGAGCTGGTG GACAGAGGGG AGGTGOGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC ' 2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC ACCTGCTGAG AATCCCCGTG 2400 
CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 
GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 
GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA GTTGACACAT TGCACACACA 2580 
GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 
ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 
GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGOGTGT CCTATCCCCG 2760 
GAOGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 
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AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880 

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300 

ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360 

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420 

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3600 

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3900 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NOt 304 Protein sequence 
Protein Accession ft; MP_001035.1 

1 11 21 31 41 51 

I I I I i I 

MSKSKCSVGL MSSWAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAGDR 60 

ETWGKKIDPL LSVIGFAVDLi ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120 

GQFNREGAAG VWKICPILKG VGPTVILISL YVGPFYNVII AWALHYLFSS PTTELPWIHC 180 

NNSVJNSPNCS DAHPGDSSGD SSGLNDTPGT TPAAEYPERG VLHLHQSHGI DDLGPPRWQI* 240 

TACLVLVIVL LYFSLWRGVK TSGKWWITA TMPYWLTAL LLRGVTLPGA IDGIRAYLSV 300 

DFYRLCEASV WIDAATQVCF SLGVGFGVLI APSSYNKFTN KCYRDAIVTT SINSLTSPSS 360 

GFWFSFLGY MAQKHSVPIG DVAKDGPGLI PIIYPEAIAT LPLSSAWAW FFIKLLTLGI 420 

DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VPTLLDHFAA 480 

GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFVWVSI 540 

VTFRPPHYGA YIPPDWANAI* GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPBKD 600 
RELVDRGEVR QFTLRHWLKV 

Seq ID NOt 305 DNA sequence 

Nucleic Acid Accession NM_001216.1 

Coding sequence: 43.. 1422 

1 11 21 31 41 51 

I I I I 1 I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACOCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGAGGCAGC CTTTQAATGG GCGAGTGATT 1200 

GAGGCCTOCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO: 306 Protein sequence 
Protein Accession ft: NP_001207.1 

1 11 21 31 41 51 

I I I I I I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSBED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 
VEGHRFPAEI HWHLSTAFA RVDEAIiGRPG GLAVLAAFLB EGPEENSAYE QLLSRLEEIA 300 
EEGSETQVPG LDISALLPSD PSRYFQYEGS LTTPPCAQGV XWTVFNQTVM LSAKQLHTLS 360 
DTLWGPGDSR LQLNFRATQP I27GRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 
QLIiFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 
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Seq ID NO: 307 DMA sequence 

Nucleic Acid Accession #: NMJ)03039.1 

Coding sequence: 76.. 1581 

5 1 11 21 31 41 51 

I 1 I I I I 

CTTCTCTCTC CATTCAGTGC ACGCGTTACT TTGGCTAAAA GGAGGTGAGC GGCACTCTGC 60 

CCTTCCAGAG CAAGCATGGA GCAACAGGAT CAGAGCATGA AGGAAGGGAG GCTGACGCTT 120 

GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGGGTCAT CCTTCCAGTA TGGGTACAAC 180 

10 GTGGCTGCTG TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 240 

GGTAGGACCG GTGAATTCAT GGAAGACTTC CCCTTGACGT TGCTGTGGTC TGTAACCGTG 300 

TCCATGTTTC CATTTGGAGG GTTTATCGGA TCXXTCCTGG TCGGCCCCTT GGTGAATAAA 360 

TTTGGCAGAA AAGGGGCCTT GCTGTTCAAC AACATATTTT CTATCGTGCC TGCGATCTTA 420 

ATGGGATGCA GCAGAGTCGC CACATCATTT GAGCTTATCA TTATTTCCAG ACTTTTGGTG 480 

15 GGAATATGTG CAGGTGTATC TTCCAACGTG GTCCCCATGT ACTTAGGGGA GCTGGCCOCT 540 

AAAAACCTGC GGGGGGCTCT CGGGGTGGTG CCCCAGCTCT TCATCACTGT TGGCATCCTT 600 

GTGGCCCAGA TCTTTGGTCT TCGGAATCTC CTTGCAAACG TAGATGGCTG GCCGATCCTG 660 

CTGGGGCTGA CCGGGGTCCC CGCGGCGCTG CAGCTCCTTC TGCTGCCCTT CTTCCOCGAG 720 

AGCCCCAGGT ACCTGCTGAT TCAGAAGAAA GACGAAGCGG CCGCCAAGAA AGCCCTACAG 780 

20 ACGCTGCGCG GCTGGGACTC TGTGGACAGG GAGGTGGCCG AGATCCGGCA GGAGGATGAG 840 

GCAGAGAAGG CCGCGGGCTT CATCTCCGTG CTGAAGCTGT TCCGGATGCG CTCGCTGCGC 900 

TGGCAGCTGC TGTCCATCAT GGTCCTCATG GGOGGCCAGC AGCTGTCGGG CGTCAAOGCT 960 

ATCTACTACT ACGCGGACCA GATCTACCTG AGOGCOGGCG TGCCGGAGGA GCACGTGCAG 1020 

TACGTGACGG CCGGCACCGG GGCCGTGAAC GTGGTCATGA CCTTCTGCGC CGTGTTCGTG 1080 

25 GTGGAGCTCC TGGGTOGGAG GCTGCTGCTG CTGCTGGGCT TCTCCATCTG CCTCATAGCC 1140 

TGCTGCGTGC TCACTGCAGC TCTGGCACTG CAGGACACAG TGTCCTGGAT GCCATACATC 1200 

AGCATCGTCT GTGTCATCTC CTACGTCATA GGACATGCCC TCGGGCCCAG TCCCATACCC 1260 

GCGCTGCTCA TCACTGAGAT CTTCCTGCAG TCCTCTCGGC CATCTGCCTT CATGGTGGGG 1320 

GGCAGTGTGC ACTGGCTCTC CAACTTCACC GTGGGCTTGA TCTTCCCGTT CATCCAGGAG 1380 

30 GGCCTCGGCC CGTACAGCTT CATTGTCTTC GCCGTGATCT GCCTCCTCAC CACCATCTAC 1440 

ATCTTCTTGA TTGTCCCGGA GACCAAGGCC AAGACGTTCA TAGAGATCAA CCAGATTTTC 1S00 

ACCAAGATGA ATAAGGTGTC TGAAGTGTAC COQGAAAAGG AGGAACTGAA AGAGCTTCCA 1560 

CCTGTCACTT CGGAACAGTG ACTCTGGAGA GGAAGCCAGT GGAGCTGGTC TGCCAGGGGC 1620 

TTCCCACTTT GGCTTATTTT TCTGACTTCT AGCTGTCTGT GAATATOCAG AAATAAAACA 1680 

35 ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCCCA CCCCAGTGGG AACTGTGCAA 1740 

AGGGCTGCCT TGCTGTTCTT GAAGCTGGGC TGTCTCTCTC CATGTTGGCC TGTCAGCAGA 1800 

CCOGAGTCAA TTAAACAGCT GGTCCTCCAC TTTGCTGGTT CAGCCTTCGT GTGGCTCCTG 1860 

GTAACGTGGC TCCACCTTGA TGGGTCAACC TTTGTGTGGC TCCTGGTAAC ATAACAACAA 1920 

CAGTTACTAT AGTGGTGAGA TGGAAGGAAT CAAATTTTGC CAGAGAAACT AACTCGGTGG 1980 

40 CCCCAACAGG TCTTCOGGGG CCATGGGCAT TTGTTTAGAG CCAAATTCAT CCTCTTACCA 2040 

GATCCTTTTC CAGAAATACC TGTCTAGGAA GGTGTGATGT CAGAAACAAT GACATCCAGA 2100 

AAGCTGAGGA ACAGGTTCCT GTGGAGACAC TGAGTCAGAA TTCTTCATCC AA ATTATT TT 2160 
GTTAGTGGAA AATGGAATTG CTTCTGTGTA GTCAATAAAA TGAACCTGAT CACTTTTC 

45 Seq ID NO: 308 Protein sequence 
Protein Accession #: NP_Q03030.1 

1 11 21 31 41 51 

r n I I I 1 I I 

DO MEQQDQSMKE GRLTLVLAIA TLIAAFGSSF QYGYNVAAVN SPALLMQQFY NETYYGRTGE 60 

FMBDFPLTLL WSVTVSMFPF GGFIGSLLVG PliVNKFGRKG ALLFNNIFSI VPAILMGCSR 120 

VATSFELIII SRLLVGICAG VSSNWPMYL GELAPKNLRG ALGWPQLFI TVGILVAQIF 180 

GLRNUANVD GWPILLGLTG VPAALQLLLL PFFPESPRYL LIQKKDBAAA KKALQTLRGW 240 

DSVDRBVAEI RQEDEABKAA GFISVLKLFR MRSLRWQLLS IIVLMGGQQL SGVNAIYYYA 300 

55 DQIYLSAGVP EEHVQYVTAG TGAVNWMTF CAVPWELLG RRLLLLLGFS ICLIACCVLT 360 

AAIiALQDTVS WMPYISIVCV ISYVIGHALG PSPIPALLIT EIPLQSSRPS AFMVGGSVHW 420 

LSNFTVGLIF PFIQEGLGPY SFIVFAVICL LTTIYIFLIV PETKAKTFIE INQIPTKMNK 480 
VSEVYPEKEE LKELPPVTSE Q 

60 Seq ID NO; 309 DNA sequence 

Nucleic Acid Accession #: NM_001252.1 
Coding sequence: 138.. 719 

1 11 21 31 41 51 

65 | | | | | | 

GGCTGGTCCC CTGACAGGTT GAAGCAAGTA GACGCCCAGG AGCCCCGGGA GGGGGCTGCA 60 

GTTTCCTTCC TTCCTTCTOG GCAGCGCTCC GCGCCCCCAT CGCCCCTCCT GCGCTAGCGG 120 

AGGTGATCGC CGCGGCGATG COGGAGGAGG GTTCGGGCTG CTCGGTGCGG CGCAGGCCCT 180 

ATGGGTGCGT CCTGCGGGCT GCTTTGGTCC CATTGGTCGC GGGCTTGGTG ATCTGCCTCG 240 

70 TGGTGTGCAT CCAGOGCTTC GCACAGGCTC AGCAGCAGCT GCCGCTCGAG TCACTTGGGT 300 

GGGACGTAGC TGAGCTGCAG CTGAATCACA CAGGACCTCA GCAGGACOCC AGGCTATACT 360 

GGCAGGGGGG CCCAGCACTG GGCCGCTCCT TCCTGCATGG ACCAGAGCTG GACAAGGGGC 420 

AGCTACGTAT CCATCGTGAT GGCATCTACA TGGTACACAT CCAGGTGACG CTGGCCATCT 480 

_ _ GCTOCTCCAC GACGGCCTCC AGGCACCACC CCACCACCCT GGCCGTGGGA ATCTGCTCTC 540 

75 CCGCCTCOCG TAGCATCAGC CTGCTGCGTC TCAGCTTCCA CCAAGGTTGT ACCATTGCCT 600 

CCCAGCGCCT GACGCCCCTG GCCCGAGGGG ACACACTCTG CACCAACCTC ACTGGGACAC 660 

TTTTGCCTTC CCGAAACACT GATGAGACCT TCTTTGGAGT GCAGTGGGTG CGCCCCTGAC 720 

CACTGCTGCT GATTAGGGTT TTTTAAATTT TATTTTATTT TATTTAAGTT CAAGAGAAAA 780 

0 _ AGTGTACACA CAGGGGCCAC CCGGGGTTGG GGTGGGAGTG TGGTGGGGGG TAGTGGTGGC 840 
80 AGGACAAGAG AAGGGATTGA GCTTTTTCTT TCATTTTCCT ATTAAAAA 

Seq ID NO: 310 Protein sequence 
Protein Accession #: NP 001243.1 
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1 11 21 31 41 51 

I I I I I I 

MPEEGSGCSV RRRPYGCVXiR AALVPLVAGL VICLWCIQR FAQAQQQLPL ESLGWDVABL 60 

QLNHTGPQQD PRLYWQGGPA LGRSFLHGPE LDKGQLRIHR DGIYMVHIQV TLAICSSTTA 120 

SRHHPTTLAV GICSPASRSI SULRLSFHQG CTIASQRLTP LARGDTLCTN LTGTLLPSRN 180 
TDBTPFGVQW VRP 

Seq ID KO: 311 DMA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..3978 

1 11 21 31 41 SI 

I I I I I I 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCOGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACOG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCGTGT TGATGGACAT OGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCC CTTGCC T GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTOCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900 

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGTGCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACOG CACCOGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCOCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1S60 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

C TCCT TGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCGGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GOGCTGTCTA CTCCGACOGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCr GGTTGGACAA GGGCTCAOGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640 

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACOGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CAOGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTOGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG GCATCATTCA OGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060 

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180 

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTOC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTOGG GATTGTTGGA 3360 

AGAACAGGTT C CGGAA AGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATOCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3S40 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840 

CTCAACACAG TTCTCAACTG CGATCAOGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960 
GCAGAAGTCA GATTGTAG 

Seq ID NO: 312 Protein sequence 
Protein Accession ft: Eos sequence 
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1 U 21 31 41 51 

MVGEGPYLIS DLDQRGRRRS PAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSPATPSWL 60 

TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVQPEKAS LSHWWKFQR 120 

TRVLMDIVAN ILCIIMAAIQ PTVLIHQILQ QTERTSGKVW VGIGLCIALP ATEFTKVFFW 180 

ALAWAIHYRT AIRLKVALST LVPENLVSFK TLTHISVGEV LNILSSDSYS LPEAALFCPL 240 

PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAPRRS AILVTDKRVQ 300 

TMNEFLTCIR LIKMYAWEKS PTNTIQDIRR RERKLLEKAG FVQ5GNSALA PIVSTIAIVL 360 

TLSCHILLRR KLTAFVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 420 

PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 4 BO 

GATGPEEQSD SLKSVLHSIS PWRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGF6AKD 540 

KDESRRLLTW PQEVDRTQRA AKYLGKILGI OGNVGSGKSS LtAALLGQMQ LQKGWAVNG 600 

TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660 

NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTWLVTHQ 720 

LQFLESCDEV ILLEDGEICE KGTHKELMEB RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780 

FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840 

MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGPVFTK TTIHASSSLH DTVFDKILKS 900 

PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MWFILVILA AVFPAVLLW 960 

ASLAVGPFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGI IHAYG KKESCITYTS 1020 

SKGLSLSY1I QLSGIiLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 1080 

PSOGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 1140 

SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200 

MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKI ILLDEAT ASMDSKTDTL 1260 

VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVTE FDKPEVLAEK PDSAFAMLLA 1320 
AEVRL 

Seq ID NO: 313 DNA sequence 
Nucleic Acid Accession #t Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

i I I I I I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCG6CCTGG GOGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGGGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GOGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAAOGGCT CGCCCACCTA CAGCATGTCC TACTOGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCGGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCOGGCAOG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCOGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 314 Protein sequence 
Protein Accession #: CAA8343S 

1 11 21 31 41 51 

I I I I I I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120 

KTliMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGM SNGSYSMMQD 180 

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240 

LGSMGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300 
HYQSGPVPGT AINGTLPLSB M 

Seq ID NO: 315 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequence: 2 9.. 541 

1 11 21 31 41 51 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 160 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AA7AAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 
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Seq ID NO i 316 Protein sequence 
Protein Accession #: AAB50564 

1 11 21 31 41 51 

I. I I I I I 

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN HHTSKISKAH VPSWKMTLLN 60 

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGPSLEA MLTIYQLHKI CHSRAFQHWE 120 
LIQBDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 31*7 DNA sequence 

Nucleic Acid Accession ft: NM_006536.2 

Coding sequence: 10 9.. 2 940 

1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CAT CTG CATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAOTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTJfTTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CAC^GGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATA<$TGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT S40 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GT TTGT CCAT 600 

GAATGGGCCC ACCTCCGTTG GGGT3TGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAM3TGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCOCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGQA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAA AA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG AT TAGA AAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TT AAAQTA AT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG ClXJiTiACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 31B Protein sequence 
Protein Accession ft: NP_006527.l 

1 11 21 31 41 51 

MTQR3IAGPI CNLKPVTLLV ALSSBLPFLG AGVQLQDNGY NGLL1AINPQ VPENQNLISN 60 
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PCT/US02/36810 



IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYBKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LPKEGCTPIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

5 TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLN QIVKIHTPVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGP EWEKLNGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEBLSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMPLVTH QASGPPEIIL S40 

FDPDGRKYYT NNPITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

10 AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLiLDDG 660 

AGADVIKNDG IYSRYPFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDGGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AG I RE I FTPS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 

15 LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 

Seq ID NO: 319 DNA sequence 
Nucleic Acid Accession 8: NM_000228-1 
2 g Coding sequences 82.. 3600 

1 11 21 31 41 SI 

I I I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

25 CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCOGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

30 TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

35 ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 
CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 
GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 
CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

40 CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

45 AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAAOG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

50 GCAGCCATCC GCCAGTGTCC AGAOCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCOGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGOGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

55 GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

60 GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACOGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

65 TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

70 GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 
CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 
75 AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GOGGAAGGTG CCAGOGAGCA GGCATTGAGT 3300 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 
80 GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 
CTGCGGGGCA GCCAGGCCAT CATGCTGOGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 
GTGGAGCAGA TCOGTGACCA CATCAATGGG CGOGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660 
GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 
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4 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 
GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 
GGAATCCTGG ACCAAGCACA AAAACTTAAC * AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID NO: 320 Protein sequence 
Protein Accession fl: NP_000219.1' 

1 11 21 31 41 51 

I I I I I ! 

MRPFFLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRASSTCGLT KPETYCTQYG 60 

BWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRPQLQEVM 120 

MEFQGPMPAG MLIERSSOFG KTWRVYQYLA ADCTSTPPRV RQGRPQSWQD VRCQSLPQRP 180 

NARLNGGKVQ I2JLMDLVSGI PATQ9QKIQ3, VGEITNLRVN FTRLAPVPQR GYHPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PWRPAEGQDA HECQRGDCNG HSETCHFDPA VFAASQGAYG GVCDNCRDHT EGKNCERCQL 360 

HYPRNRRPGA SIQETCISCE CDPDQAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNWGP KCDQCAPYHW KLASGQGCEP 480 

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQOQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG 600 

RLRNATASLW SGPGLEDRGL ASRILDAKSK IEQIRAVLSS PAVTEQKVAQ VASAILSLRR 660 

TLQGLQLDLP LEEETLSLPR DLESLDRSFN GLLTMYQRKR EQPEKISSAD PSGAFRMLST 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTAOGSRCRG VLPRAGGAPL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRD7LT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQD1ARAR 960 

RLQAEAEEAR SRAHAVEGQV EDWGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020 

VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 1080 

IKQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 1140 

MXjRSADLTGL ekrvbqirdh ingrvlyyat ck 

Seq ID NO: 321 DNA sequence 

Nucleic Acid Accession #: NM_001944.1 

Coding sequence: 84.. 3083 

1 11 21 31 41 51 

I 1 i II I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTOGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGOG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCOGTGGGTT 2580 
GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 
TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 
CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 
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CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCOGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2 8 BO 

TAACGGAGAC TTACTOGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

5 CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

• TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 

10 TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 322 Protein sequence 
Protein Accession ft: NP_001935.1 

15 1 11 21 31 41 51 

I I I I I I 

MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDBEEMTM QQAKRRQKRE HVKPAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIP WDKNTGDIN ITAIVDREET 120 

PSFLITCRAL NAOGLDVEKP I*ILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

20 ATDADBPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSBLLRFOV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

25 VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYT PALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITMICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

30 DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV OOTGFVKOCT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

35 Seq ID NO: 323 DNA sequence 

Nucleic Acid Accession ft: XM_058069.2 
Coding sequence: 1..1413 

1 11 21 31 41 51 

40 i i l l I 1 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCXACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

45 ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC OGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

50 CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATAOGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

55 CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAAT TAGC AA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

60 GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 



Seq ID NO: 324 Protein sequence 
Protein Accession ft: P39900 



1 11 21 31 41 51 

70 | | | | I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LPGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED BFW T TH SGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

75 KYVDINTFRL SADDIRGIQS LYGDPKSNQR LPNPDNSEPA LCDFNIiSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

80 Seq ID NO: 325 DNA sequence 

Nucleic Acid Accession ft: NM_024423.1 
Coding sequence: 64.. 2590 

1 11 21 31 41 SI 
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GGCAGGTCTC GCTCTCGGCA CCCTCCCQGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 
CCGATGGCCG CCGCTGGGCC COQGOGCTCC GTGCGOGGAG CCGTCTGCCT GCATCTGCTG 120 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 
TCTGCAGACC TCATCOGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGG TCAGT G 300 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAA GAGGA G ATGGGCACCT 480 
ATTCCTTGCT CTATGCAAGA GA AT TCC T TG GGCCCTTTCC CATTGTTTCT TCAACAAGTT S40 
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTOGG 660 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
GTAAAGCCAC TGAATTATGA AGAAAACGGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCOGTCTC GGTGAAQAAT CCATTAGAGG ACACACTGGT 2580 
TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 
CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTO GTTCTGTGGG 2700 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 
ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 
TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 
TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 
AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 
GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 
ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 
AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 
ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 
GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 
GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3 540 
GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 
ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 
ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 
TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 
AGGCCTTGTG GGCCCCCTTC TTT 0GGCT T T CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 
CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 
AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 
AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 
TGCATGATGA GTCTGAAGGC ATTTGGAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAG AAGQT 4680 
CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 
TTGAGACGGA GTCTCGCTCT GAOGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 
GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 
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TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAAOGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCOG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 326 Protein sequence 
Protein Accession ft: NP_077741.1 

1 11 21 31 41 SI 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIPSRDGE ACKKVTLNVP SKLEADKIIG RVNLEECFRS 60 

ADLXRSSDPD PRVLNDGSVY TARAVALSDtC KRSFTIWLSD KRKOTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVP YSISGRGVDK 180 

EPLNLFYIER DTGNLPCTRP VDREEYDVFD L1AYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNP EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLPSVHPS 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDIilNTANWR VNFTILKGNE NGHPKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APPARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENIAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTDILAVDPD 600 

EPVHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FWTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NOt 327 DMA sequence 

Kucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.. 2754 

1 11 21 31 41 SI 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGOGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCOGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTQ TGCCACAGAC AGAGATGAAC CGGACACAAT GCATAOGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
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GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGOGG 1S00 
c ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAAOGGCT ATAAGGCATA TGACCCCGAA 1S60 
J AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
10 GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
• ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
15 ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 
20 ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATGGATG TAATCAGAAT 2580 
GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 
AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 
25 AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 
TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 
AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 , 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 
30 ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 
35 ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 
40 GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 
45 GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 
50 AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 
GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 
55 GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 
CCCCCCCCCC TTTTTTTTTG AGAOGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 
rA GCTCOGATCT CTGCTCACTG AAAGCTCOGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 
60 GCCTCCTGAG TAGCTGGGAC TACAGGGGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 
TTTAATAGAG AOGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC OGCTCCOGGC 5040 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
TGATCATAOG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTOG AGAAGCCAGG 5160 
65 GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 
TTGCTGAAAT TTCCTGCTGT AACCAGAA6C CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 
ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 
TAACCATCTC TTTQTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATG CCOQA A 5460 
70 TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA S520 
CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 
75 TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 
80 GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA A AGAAA AGCA 6240 
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 
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AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 
AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 
CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 
ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 
5 TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 
TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 
GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 
ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG G TAAA ATTAT 6840 
GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
10 TATATATAAT CCCGAAACAT G 

Seq ID NO: 328 Protein sequence 
Protein Accession 8: NP_001932.l 

15 1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIPSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD PRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQEMSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

20 EPLNLPYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DBPDTMHTRL KYSILQQTPR SPGLFSVHPS 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APPARDIPRV TALNRALVTV EVRDLDEGPE CTPAAQYVRI 480 

25 KEMLAVGSKI NGYKAYDPEN RNGNGIiRYKK LHDPKGHITI DEISGSIITS KILDREVETP 540 

KNBLYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTD I LAVDPD 600 

EPVHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTOTTNNSS QGFCGTMGSG MKNGGQETIB 780 

30 MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSPTQPRLG EKLHRCNQNB 840 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 

Seq ID NO: 329 DNA sequence 
35 Nucleic Acid Accession #: NM_016583.2 
Coding sequence : 72 . . 842 

1 11 21 31 41 51 

Ati I I I I I I 

4U GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

45 TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTQACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATAOG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

50 TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

55 GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 330 Protein sequence 
60 Protein Accession #: NPJ)57667.1 

1 11 21 31 41 51 

MFQTGGLIVP YGLLAQTMAQ PGGUPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60 
65 SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVU5DC 180 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 
DIVNMLIHGL QFVIKV 



70 



Seq ID NO: 331 DNA sequence 

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115.. 2223 



75 1 11 21 31 41 51 

III!!! 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

0 _ TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGOC AGAGGCTCCT GCTCACAGCC 180 

80 TCACTTCTAA CCTTCTGGAA CCCGCCCACC' ACTGCCAAGC TCACTATTGA ATCCACGCOG 240 

TTCAATOTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TACCGCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 
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CACGTCATAA AGTCAGATCT TGTGAATGAA 
GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC 
GTGGCCTTCA CCTGTGAACC TGAGACTCAG 
CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG 
5 TTCAATGTCA CAAGAAATGA CACAGCAAGC 
GCCAGGCGCA GTGATTCAGT CATCCTGAAT 
TCCCCTCTAA ACACATCTTA CAGATCAGGG 
TCTAACCCAC CTGCACAGTA CTCTTGGTTT 
GAGCTCTTTA TCCCCAACAT CACTGTGAAT 
10 AACTCAGACA CTGGCCTCAA TAGGACCACA 
CCCAAACCCT TCATCACCAG CAACAACTCC 
TTAACCTGTG AACCTGAGAT TCAGAACACA 
CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC 
GTCACAAGGA ATGATGTAGG ACCCTATGAG 
15 CACAGCGACC CAGTCATCCT GAATGTCCTC 
TCATACACCT ATTACCGTCC AGGGGTGAAC 
CCACCTGCAC AGTATTCTTG GCTGATTGAT 
TTTATCTCCA ACATCACTGA GAAGAACAGC 
GCCAGTGGCC ACAGCAGGAC TACAGTCAAG 
20 CCCTCCATCT CCAGCAACAA CTCCAAACCC 
TGTGAACCTG AGGCTCAGAA CACAACCTAC 
GTCAGTCCCA GGCTGCAGCT GTCCAATGGC 
AGAAATGACG CAAGAGCCTA TGTATGTGGA 
GACCCAGTCA CCCTGGATGT CCTCTATGGG 
25 TCGTCTTACC TTTCGGGAGC GAACCTCAAC 
CCGCAGTATT CTTGGCGTAT CAATGGGATA 
GCCAAAATCA CGCCAAATAA TAACGGGACC 
GGCCGCAATA ATTCCATAGT CAAGAGCATC 
CTCTCAGCTG GGGCCACTGT CGGCATCATG 
30 TAGCAGCCCT GGTGTAGTTT CTTCATTTCA 
TAAAGCATTT GCAACAGCTA CAGTCTAAAA 
AGACTCTGAC CAGAGATCGA GACCATCCTA 
AAATACAAAA ATGAGCTGGG CTTGGTGGCG 
TGAGGCAGGA GAATCGCTTG AACCCGGGAG 
35 ACTGCACTCC AGTCTGGCAA CAGAGCAAGA 
TCTGACCTGT ACTCTTGAAT ACAAGTTTCT 
AACTTTAATG AACTAACTGA CAGCTTCATG 
TAATTAATTT CATGGGACTA AATGAACTAA 
TTCCCAGATT TCAGGAAACT TTTITTCTTT, 
40 AAATATACTT TTGTGAACAA AAATTGAGAC 
AGACTTGGGA AACTATTCAT GAATATTTAT 
TCAATAAAAA TCTGCTCTTT GTATAACAGA 



GAAGCAACTG GCCAGTTCCG GGTATACCCG S40 

AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

dJaSCAACCT ACCTGTGGTG GGTAAACAAT 660 

•CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

"TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

TATGGCCCAG ACGACCCCAC CATTTCOCCC 1380 

CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

GGGAACATCC AGCAACACAC ACAAGAGCTC 1S0O 

GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

AACAGGACCC TCACTCTATT CAATGTCACA 1800 

ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

COGGACACCC CCATCATTTC CCCCCCAGAC 1920 

CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
AAAA 



Seq ID NO: 332 Protein sequence 
45 Protein Accession #: NP_004354.1 

1 11 21 31 41 51 

I I ! I I I 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

50 HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI lYPNASIiLIQ NIIQNDTGFY 120 

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WPVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPPITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

55 QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFTSNITEK MSGLYTOQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
60 ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 



Seq ID NOt 333 DNA sequence 

Nucleic Acid Accession #: NM_0069S2.1 

Coding sequence: 11.. 793 - 

65 9 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

70 ATCTGACCAA CACAGCCTCT ACCCACTGCT TQAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCT AGGC AT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC T GATGTTTA T 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAAOGAG ACTTTTTCAC 360 

■ ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

75 TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATO GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAA TCAGGG 660 

_ CTGCTATGAA CTGATCTCTG GTCCAATGAA COGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

80 ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAAT AT TAAGAA 



Seq ID NO: 334 Protein sequence * 
Protein Accession tt: NP_008883 /i 
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WO 03/042661 
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1 11 21 31 41 51 

I I I I I I 

MAKDNSTVRC FQGLLIFQJV IIGCCGIALT AECIPFVSDQ HSLYPLLEAT DNDDIYGAAW 
IGIFVGICLP CLSVU3IVGI MKSSRKILLA YPILMFIWA FBVASCITAA TQRDFFTPNL 
PLKQMLBRYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWPGFAI 
LCWTFWVLLG THFYWSRIEY 

Seq ID NO: 335 DNA sequence 

Kucleic Acid Accession #: NM_002638.1 

Coding sequence: 120.. 473 



60 
120 
160 
240 



CAATACAGCT 
GCTGGACTGC 
TGAGGGCCAG 
AGGCAGCTGT 
TCAATGGACA 
CGCAAGAGCC 
TCCGGTGCGC 
TCAAGAAGTG 
OGGTCCTTGC 
TGCTGCCCTT 
GAGCTGCCTC 



11 
I 

AAGGAATTAT 
ATAAAGATTG 
CAGCTTCTTG 
CACGGGAGTT 
AGATCCCGTT 
AGTCAAAGGT 
CATGTTGAAT 
CTGTGAAGGC 
TGCACCTGTG 
CCCCTTCCCA 
TCTCATCCAC 



21 
I 

CCCTTGTAAA 
GTATGGCCTT 
ATCGTGGTGG* 
CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCOCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTCCAT 
TTTCCAATAA, 



31 
I 

TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 
TSGCCTGTTT 
ASCTACAGGC 
TCTTCCTCCC 



41 

I 

COGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCCC 
AGATACTGAC 
CGTTCCCCAG 
CCCATCTGGT 
ATTCAGGATG 



51 
I 

GCCAGGCCAA 
CCTGACACCA 
CTGGTTCTAG 
CGTGTTCCAT 
AAAGTCAAAG 
ATTATCTTGA 
TGCCCAGGAA 
TGAAGGGAGC 
CCTAAGTCCC 
CCCAOGGCTG 



Seq ID NO: 336 Protein sequence . 
Protein Accession #: NP_002629.1 



41 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



1 11 21 31 

I I I I I I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCJLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 337 DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71.. 2560 

1 11 21 31 41 51 

I I I I 1 I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 
CTGGCTGCAG TGCGCGGCCT OCGAGCOGTG COGGGCGGTC TTCAGGGAGG CTGAAGTGAC 
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 
CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 
TGGOGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACOGGGAGGA 
GATTGCCAAG . TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 
GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGAOGACG GGG ACCAT TT 
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 
TGTGGGCACC CTCGACCGTG AGGATQAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 
ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATAXGACGT 
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG AOGGTGATCA GGGCCA CTGT 
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 
GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 
CGTCTTCTAC TATGGOGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 
CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 
TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACQACACCCT 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
,1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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1920 
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2040 
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2160 
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2340 
2400 
2460 
2520 
2580 
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GGGACCAAAC 
GACTTCGGAG 
ACGTTAGAGT 
AGCACTGAAA 
TCTTAOCTGC 
TACAGTGGAC 
TTTTTTTAAT 
GCTGGGCCCA 
TGGATCTCTG 
GTTGCGTTGC 
TAAAGAAACT 



GTCAGGCCAC 
CTTGTCAGGA 
GGTTGCTTCC 
ACCTCTCCAC 
CGTAAAATGC 
TTTCTCTCTG 
GCTATCTTCA 
CTGGCCGTCC 
CGTTTTTATA 
TATAGATGAA 
TTTCCCAGAA 



AGAGCATCTC 
AGTGGCCGTA 
TTAGCCTTTC 
CTGGGCCAGG 
TCAACCCTGT 
GAATGGAACC 
AAACGTTAGA 
TGCATTTCTG 
CTGAGTGTGC 
GGGTGAGGAC 



CAAGGGGTCT 
GCAACTTGGC 
AGGATGGAGG 
GTTGCCTCAG 
GTCCTGGGCC 
TTCTTAGGCC 
GAAAGTTCTT 
GTTTCCAGAC 
CTAGGTTGCC 
AATCGTGTAT 



CAGTTCCCCC 
GGAGACAGGC 
AATGTGGGCA 
AGGCCAAGTT 
TGGGCCTGCT 
TCCTGGTGCA 
CAAAAGTGCA 
CCCAATGCCT 
CCTTATTTTT 
ATGTACTAGA 



TTCAGCTGAG 
TATGAGTCTG 
GTTTGACTTC 
TCCAGAAGCC 
GTGACTGACC 
ACTTAATTTT 
GCCCAGAGCT 
CCCATTCGGA 
TATTTTCCCT 
ACTTTTTTAT 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



Seq ID NO: 338 Protein sequence 
Protein Accession #: NPJJ017B4.2 



1 
I 

MGLPRGPLAS 
QEPALFSTDN 
KGPFPGRLNQ 
YBLFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFIi 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 
I 

LLLLQVCWLQ 
DDFTVRMGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHPTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 
VQERRSLKER 
IPYSITGPGA 
ISIIVTDQND 
EPKDPHDLMP 
ANDKAPKFDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRQVLNITDK 



31 



RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



I 

PREABVTLEA 
NPLK1FPSKR 
DSPPEGVFAV 
HKPKPTQDTP 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYBVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNPIIE 
EWGSRFKKLA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 



NLKAANTDPT 
DMYGGGEDD 



51 
I 

LGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLiiTLID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 339 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence : 1 . . 672 



1 
I 

ATGAGGCTCC 
CGGGGCTCCC 
AAGGGCGGGG 
CTGCTCGCCT 
GCGAGACAAC 
TGTCATGTTT 
ACAGAGCCAT 
AAGCAGTGCT 
CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
I 

AAAGACCCCG 
CCTACCGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
GAGATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT 
CCGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 
I 

ACAGGCCCCG 
AGACCCGGGG 
GCGGGCTGAC 
CGTGGCCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCCGTG 
TGCAGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 
I 

GOGGGTGGGA 
AGAGGCGCGC 
CCTCCCTGGG 
CCGGGGGTGT 
CGAACGGACG 
GAGTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CTCAAGTGTT 
TTCAAAGAAT 
CTGCTGCTGG 



41 
I 

GGCGCGCGCC 
GGAGGCTGCG 
CACCGCTGGG 
GQACAGACGC 
AGGGTGACAA 
ACCCAAGGAG 
CACGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCTCCATTGC 



Seq ID NO: 340 Protein sequence 
Protein Accession #: Eos sequence 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GACGATGGCG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGCCGGCCTC 



51 



1 11 21 31 41 

I I I I I I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF EOQNPRRCKW 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LEGPPINSSV FKEYAGSMGE SCGGLMLAIIi LLLASIAAGL SLS 

Seq ID NO: 341 DNA sequence 

Nucleic Acid Accession 8: XM_035292.2 

Coding sequence i 53.. 1576 



1 
I 

GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 



11 
I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CGGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 



31 
I 

AGGCCCGGCC 
GCCGGCGGCC 
OGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 



41 

I 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTACA 
CTGCTCATCA 
CrCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1084 



WO 03/042661 



GTCCTGGATC ATCCOCGTCT TCGTGGGCCT GTCCTQCTTC GGCTCCGTCA ATGGGTCCCT 
GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTOOGTC ATCAACTTCT TCAGCTTCTT 
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGQGCC ACAGAAAGCC 
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 
CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTOGGGGTC TGGTGGAAAA ACAAGCCCAA 
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 

Seq ID NO: 342 Protein sequence 
Protein Accession ft: XP_035292.2 

1 11 21 31 41 51 

I I I I I I 

MAGAGPKRRA LAAPAABEKE EAREKMLAAK S ADO SAP AGE GEGVTLQRKI TLLNGVAIIV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA AOGVPSIVGA LCYABLGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 

GNIVLALYSG LFAYGGWNYL NPVTEEMINP YRNLPLAIII SLPIVTLVYV LTNIAYFTTL 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTIiLYAFS KDIPSVINPP SFFNWLCVAL AIIGMIHLRH 420 

RKPELERPIK VNLALPVFFI LACLFLIAVS PWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 
KPKWLLQGIF STTVLCQKLM QWPQET 

Seq ID NO: 343 DNA sequence 

Nucleic Acid Accession #: NM_00526B.l 

Coding sequence: 168.. 989 

1 11 21 31 41 51 

I I I I I I 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACAOGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGAOGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGOGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAA GCCA 840 

TGTGCACAGG TCATCACCGC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 

Seq ID NO: 344 Protein sequence 
Protein Accession ft* NP_0 0525 9.1 

1 11 21 31 41 SI 

I I I I I I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVPI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 

SNVCFDEPPP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 

GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: 345 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 26.. 457 

1 11 21 31 41 51 

1 I I I I I 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
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Seq ID NO: 346 Protein sequence * 
Protein Accession ft: NP_002382.1 

1 11 21 31 41 51 

I I I I I I 

MQHRGFUiLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 
CGAQTQRIRC RVPQJWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
RVTKPCTPKT KAKAKAKKGK GKD 

Seq ID NO: 347 DNA sequence 

Nucleic Acid Accession ft: NM_006783.1 

Coding sequence: 1..786 



1 11 21 31 41 51 

I I 1 I I I 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

20 CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 

AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 

CTGATCTTGG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTOG GATAGASGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 

25 TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCOCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

30 CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780 
AGCTAA 



Seq ID NO: 348 Protein sequence 
Protein Accession ft: NP 006774-1 



1 11 21 31 41 51 

I I I I I I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIPI FRVMILWAA QBVWGDBQED FVCNTLQPGC 60 
KNVCYDHPFP VSHIRLWALQ LIPVSTPALL YAMHVAYYRH ETTRKFRRGS KRNDFKDIED 120 
40 IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF. MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 
FISRPTEKTV FTIFMISASV ICMLLNVAEfc CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 349 DNA sequence 
45 Nucleic Acid Accession ft: NM_002571.1 
Coding sequence: 9 9.. 5 87 

1 11 21 31 41 51 

C A I I I 1 1 1 

DU CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60 

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120 

AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180 

ACATCTCCCT CATGGCGACA CTGAjfeGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240 

CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300 

55 AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAA GATC AACTATACGG 360 

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420 

AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600 

60 CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720 

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 

65 Seq ID NO: 350 Protein sequence 
Protein Accession ft: NP_002562.l 

1 11 21 31 41 51 

70 MDIPQTKQDL BLPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 60 
WENNSCVBKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 351 DNA sequence 
75 Nucleic Acid Accession fts NM_006SOO.l 
Coding sequence : 2 7 . . 1 967 

1 11 21 * 31 41 51 

80 ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 
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TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
GCATCTTCTT GTGCCAGGGC AAQOGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 
5 TCATCTGGTA CAAGAATGGC CQGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 
10 GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 
15 ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 
CCAGCATACC CGGCCTGAAC OGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAAGGGC ACGGCAAGTG 1440 
20 AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 
25 TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 
GCTCAGGGAA GCAGGAGATC AOGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 
30 CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGOGGCTTT TCAGGGACCA 2160 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 
35 CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCOGA GGCGGGOGGA 2640 
40 TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 
CACTGCACTC CAGCCTGGGC AACACAGOGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 
45 TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 
AGAATGGTAC TTAGGGATGG AAAAOGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 
50 TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTC TACATG G GTACCACAGG 3360 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTAOCA GAGCAGACAG 3480 
CTACCCTACT TTTCAGCAGC AAAACGTGCC GTATGAOGCA GCACGAAGGG CCTGGCAGGC 3540 
55 TGTTAGCAGG AGCTATGTCC CTTCCTATOO TTTCCGTCCA CTT 

Seq ID NO: 352 Protein sequence 
Protein Accession #; NPJ>06491.1 

60 1 11 21 31 41 51 

1 t I I I I 

GLPRLVCAFL LAACCCCPRV AGVPGEABQP APELVBVEVG STALLKCGLS QSQGNLSHVD 
WFSVHKEKRT LIFRVRQGQQ QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLOQGKRP 
RSQEYRIQLR VYKAPEEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL 

65 KEBKNRVHIQ SSQTVESSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GKEMKESREV 
- TVPVPYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST REAEEETTND 
NGVLVLEPAR KEHSGRYECQ AHNLDTMISL LSEPQELLVN YVSDVRV8PA APBRQEGSSL 
TLTCEABSSQ DLBFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ 
LVKLAIPGPP WMAPKERKVW VKENMVLNLS CEASGHPRPT ISWNVNGTAS EQDQDPQRVIi 

70 STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG IiSTSTASPHT 
RANSTSTBRK LPEPESRGW IVAVIVCILV IAVW5AVLYF LYKKGKLPCR RSGKQEITLP 
PSRKTELWE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY 1DLRH 

Seq ID NO: 353 DNA sequence 
75 Nucleic Acid Accession ft: NMJ>03183.3 
Coding sequence: 165.. 2639 

1 ll 21 31 41 SI 

80 TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 
GCTAGGCOGG GOGGATCCCO TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 
CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCOGG GAACATGAGG CAGTCTCTCC 
TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCOGGAT GACCCGGGCT 
TCGGCCCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 
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CTTTATCTAA 
TAGAAACACT 
GTACTGAACG 
AGTACACTGC 
GGGTTCTAGC 
AATATAACAT 
TTTATAAATC 
ATTTAAAAGT 
AAGAGCTTGT 
AATTATTGGT 
CAACTACAAA 
CATGGGATAA 
AGTCTCCACA 
ATGAAGAAAA 
CTGAGGAAGC 
GAACTCTTGG 
CAAAGGCTTA 
GCACAAAGAA 
AATTGGGACA 
ATGAGGACCA 
ACAATAAGAT 
CCCAGGAGTG 
GAGAAGAGTG 
GCACGTTGAA 
AGTTTGAGAC 
CCTACTGCAC 
TTTGCTTGGA 
AGCAGCTGGA 
ACCTTTCTGG 
AAGGAAAGCC 
AGGATGTAAT 
AGTTTTTAGC 
CTTTCAGCAT 
CTCTGTTTCA 
TTATCAAACC 
CTTCGGCGCC 
CCAGCACAGA 
GCACAGCTGC 
CTGCCTCCTT 
TTAGTTCTCA 
TCAATCACAG 
TTTGAACTTC 
TAAGGTGAAA 
TTGACCTGTG 
TGTAAATTAA 
ATACACTGTA 
ATAGTTTTTT 
GTATACATTG 
CAGGGAAAAT 
TAAGACTAAT 
TTGAATTTAT 
TTTTATAAGT 
CTATTGGCTG 
CGCCACTACA 



TATCCAGCAG 
ACTAACTTTT 
TTTTTCACAA 
AAAATGGCAG 
CCACATAAGA 
AGAGCCACTT 
TGAAGATATC 
GGATAATGAA 
TCATCGAGTG 
GGTAGCAGAT 
TTACTTAATA 
TGCAGGTTTT 
AGAGGTAAAA 
GGATGCTTGG 
ATCTAAAGTT 
ATTAGCTTAT 
TTATAGCCCA 
TTATGGTAAA 
TAATTTTGGA 
GGGAGGGAAA 
GTTTTCAAAC 
TTTTCAAGAA 
TGATCCTGGC 
GGAAGGTGTC 
TGCCCAGAAG 
AGGTAATAGC 
TCTTGGCAAG 
GTOCTGTGCA 
CCGCTGTGTG 
CTGTACAGTA 
TGAACGATTT 
AGACAACATC 
TCTTGTCCAT 
CCCCAGTAAC 
CTTTCCTGCG 
AGCAGCTCCA 
CTCCCATATG 
CAAGTCATTT 
TAAACTGCAG 
GCTCTTCTGA 
CTTGTATTTT 
CTGCAGGTAA 
GTGAATCTAC 
GTGCAAAAGC 
TCTTTTATAT 
ATGAACCGCT 

r mm T 'i' T 

TATCTAAATT 
ATATATCTAA 
GCTCAGAACC 
GGAATCTACC 
ATCTTCATGT 
GGAGTGGTGG 
CTCCAGOCTG 



CATTCGGTAA 
TCAGCTTTGA 
AATTTCAAGG 
GACTTCTTCA 
GATGATGATG 
TGGAGATTTG 
AAGAATGTTT 
GAGTTGCTCC 
AAAAGAAGAG 
CATCGCTTCT 
GAGCTAATTG 
AAAGGCTATG 
CCTGGTGAAA 
GATGTGAAGA 
TGCTTGGCAC 
GTTGGCTCTC 
GTTGGGAAGA 
ACCATCCTTA 
GCAGAACATG 
TATGTCATGT 
TGCAGTAAAC 
CGCAGCAATA 
ATCATGTATC 
CAGTGCAGTG 
AAGTGCCAGG 
AGTGAGTGCC 
TGTAAGGATG 
TGTAATGAAA 
CCCTATGTCG 
GGATTTTGTG 
TGGGATTTCA 
GTTGGGTCTG 
TGTGTGGATA 
GTCGAAATGC 
CCCCAGACTC 
AAACTGGACC 
GACGAGGATG 
GAGGATCTCA 
CGTCAGAATC 
CTTAAGTGTG 
GTGAAGACTG 
ACAGTTCTTG 
TTATTTTGAG 
AGAAAATACA 
TGATAACAGC 
GAATATGAAG 
TTTTTTTTGC 
GTGGGTCTAT 
ATTTAGAAAT 
TAACCACTAC 
AACTGTTTAG 
ATCCCTGTTA 
CTCATGCCTG 
GGTGACAGAG 



CTGGACACGT 
TTATAATCAG 
TTAATGATAC 
CACGTTTGCA 
CAAAAGGGTT 
CTGACCCAGA 
ACAGATACAT 
ACAGAGTTGA 
GAATACAGAT 
AGCACTACAA 
TGTTGCTAGA 
ACCTTTTCAC 
CCAGAGCAAA 
AAAATATCTA 
CAAAGGAAGC 
ATCCGGATGG 
ATCCCATAGC 
AATCAATCTA 
AAGTTTGTGG 
TGAACAACGA 
ACAGGAACAG 
AGGCGATTAA 
CGCCTCCAGG 
GGAAATGCAT 
CTGACAACTC 
ATGCTGAACA 
ACATGAATGG 
TTGACCAGCT 
TCCTGGTTTT 
AGAAATTGGA 
TGAGCAGCAT 
CAGGCCGCCT 
ACCAGAGAAT 
GGTTTGAGAA 
CGGACCATCC 
GTGTTAACAG 
CAAAATATTT 
GGAAGTGACT 
TGTGGTTTGG 
GCTTTCAGGT 
GCTGGATTGG 
ACTGACTAGG 
CATTTGGCAT 
CTTCAACTAA 
TTCTAGTTAT 
CATTTGGGTT 
CTTACAGTGA 
GGCCCTGATT 
CTGATAGGGA 
TAATCCCAGC 
TGAGATCTGC 



TCTACAGACT 
TAAATTATAC 
GGATGGTAAA 
GGTTGGTGAG 
AATCAACACA 
CAAAGACAAA 
GTCTCCAAAA 
AGTAGACAGA 
TCCCATGAAG 
GGGCAGAGGG 
TGACATCTAT 
AGAGCAGATT 
CATGGCAAAA 
GCAATTTAGC 
ATACCAAOAT 
CAGCCATGGA 
TTTGAATAGT 
TGACCTGGTT 
TCTAGCAGAA 
TGTGAGTGGC 
TAAGACCATT 
GAACTCGAGG 
CACCTGCTGC 
TCCTTGCTGT 
TGCTACTTGC 
AAATGCTGAA 
CCCTTTCTGC 
CTGCAAGGTG 
AAAGAACTTA 
CAAATGTGAG 
GAGCATCAAT 
CTCCTTGATA 
TAAACAGTAT 
GGATTCTGCA 
GCAGCCTGCC 
GGACACCATC 
GGACCCCTTC 
GGTOGCCAGA 
CAAAGAAACA 
TTATAGATTT 
TAGCAGATGC 
CCCTTCTCCT 
TTTAGTTTTT 
GTTATGAATA 
GAAATGATCA 
TTATTTGTGA 
AAACAAAGGA 
TACCCAGAGT 
AATATGGCTC 
GGGCTATACA 
TGCTGGGCAG 
TACATGTCTT 
ACTTGGAGAG 
CTC 



TCAACACATG 
CTGACATCAA 
AACGAAAGCG 
CCTGACTCTA 
GATGGGGCOG 
AGAATGTTAG 
GTGTGTGGTT 
GAACCACCTG 
AACACGTGTA 
GAAGAGAGTA 
CGGAACACTT 
CGCATTCTCA 
AGTTACCCAA 
TTTGATATAG 
TTTGATATGG 
GGTGTTTGTC 
GGTTTGACGA 
ACAACTCATG 
TGTGCCCCGA 
GATCACGAGA 
GAAAGTAAGG 
GTGGATGAAG 
AACAGCGACT 
AAAAACTGTC 
AAAGGCGTGT 
AATGACACTG 
GAGAGGGAAC 
TGCTGCAGGG 
TTTTTGAGGA 
AAACGAGTAC 
ACTTTTGGAA 
TTTTGGATTC 
GAATCTCTGT 
TCGGTTCGCA 
CCTGTGATCC 
CAGGAAGACC 
CCAAATAGCA 
AGTGAAAAGG 
GAGTGCTAAT 
GACCTACAAA 
TGGTCATGTG 
TTTGAAAAGG 
AAAATATCTT 
TTTACGTTTT 
GTTTTTTTTT 
GAAAAGTGGA 
GATAAATTTA 
TTTTATGTAG 
TTCATAATTC 
TGGTAGCCAG 
7TTTTCTGTA 
AGAAAATTCA 
GCTGAGGTTG 



360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 354 Protein sequence. 
Protein Accession #: NPJJ03174.2 



1 
I 

MRQSLLFLTS 
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNEDQG 



11 
I 

WPPVLAPRP 
TFSALKRHPK 
IRBDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYL 
GKYVMYPIAV 
PGIMYLNNDT 



21 
I 

PDDPGFGPHQ 
LYLTSSTERF 
NTDOAEYNIE 
DREPPEBLVH 
IYRNTSWDNA 
FSPDIAEEAS 
NSGLTSTKNY 



TCKGVSYCTG 
KVCCRDLSGR 
INTFGKFLAD 
SASVRIIKPP 
PFPNSSTAAK 



CVPYVDAEQK 
NIVGSVLVFS 
PAPQTPGRLQ 
SFEDLTDHPV 



CCNSDCTLKE 
AENDTVCLDL 
NLFLRKGKPC 
LIFWIPFSIL 
PAPVIPSAPA 
ARSEKAASFK 



31 
I 

RLEKLDSLLS 
SQNFKVWVD 
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIE 
KVCLAHLFTY 
GKTILTKEAD 
SHCSKQSXYK 
GVQCSDRKSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCVDKKU)K 
APKLDHQRMD 
LQRQNRVNSK 



41 
I 

DYDILSLSNI 
GKNESEYTAK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPQB 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNCQFETA 
FCEREQQLES 
CEKRVQDVIE 
QYESLSIiFHP 
TIQEDPSTDS 
ETEC 



51 
I 

QQHSVRKRDL 
WQDFFTGHW 
DIKNVSRLQS 
ADHRFYRYMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNBTDNSC 
RFWDPIDQLS 
SNVEMLSSMD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 355 DNA sequence 

Nucleic Acid Accession ft: NM_021832.1 

Coding sequence: 164.. 2248 

1 11 21 31 41 51 

11)111 
TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGAAAAGAG GATTGAGGGG 
CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 
GAAGGCTGCC CAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 
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ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTOCGGATG ACCCGGGCTT 240 

CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG ATATTCTCTC 300 

TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 360 

AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 

5 TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACAOGTG GTTGGTGAGC CTGACTCTAG S40 

GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA 600 

ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 

10 TTTAAAAGTG- GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 

ATTATTGGTG GTAGCAGATC ATOGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AGCTAATTGA CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA X020 

15 GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 

20 CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCGGGATGGT CTAGCAGAAT GTGCCCOGAA 1440 

TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1S00 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

25 AGAAGAGTGT GATCCTGGCA TCATOTATCT GAACAAOGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

30 GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 

35 TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

OGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG AOGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2S20 

40 GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATOG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGAT GCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 

45 AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880 

TTAOGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTT TTT TTGC CTTCAACTAA AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 

50 TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 
TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360 
AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
5 5 3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 356 Protein sequence 
Protein Accession #: NP_068604.1 

60 1 11 21 31 41 51 

! | I 1 1 I 

MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLEKLDSIiLS DYDIL9LSNI QQHSVRKRDL 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKWWD GKNESBYTVK WQDPFTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGABYN1B PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

65 PKVCGYLKVD NEELLPKGLV D REP PEEL VH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300 

AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLPTY QDFDMGTLGL AYVGSPRANS 360 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420 

_^ AECAPNEDQG GKYVMYPIAV SGDREjNKKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480 

70 SRVDEGEECD PGIMYLKNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLE9 CACNETDNSC 600 

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660 
INTFGKFLAD NIVGSVLVFS LIFW1PPSIL VHCV 

75 Seq ID NO: 357 DNA sequence 

Nucleic Acid Accession ft: NM_O04994.1 
Coding sequence: 20.. 2143 

0rt 1 11 21 31 41 51 

80 | | | | | i 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCt GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 
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CCAGAAGCAA 
GCGAACCCCA 
CAAGTGGCAC 
GGGGOTQATT 
CACCTTCACT 
GCACGGAGAC 
TGGCCCOGGC 
GGGCGTCGTG 
CATCTTCGAG 
CTGGTGCAGT 
GAGACTCTAC 
CCAAGGCCAA 
CGCCACCACC 
CTCGACGGTG 
GGGTAAGGAG 
TACCACCTCG 
TTTGTTCCTC 
GCCGGAGGCG 
CGACGTGAAT 
AACCACCACC 
TGTCCACCCC 
AGGTCCCCCC 
TGCCTGCAAC 
CAAGGATGGG 
CCTTATCGCC 
GCTCTCCAAG 
GGTGCTGGGC 
CGGGGCCCTC 
GTTCGACGTG 
CCCOGGGGTG 
CCAGGACCGC 
GGGCTACGTG 
GCAGTGCCAT 
CAAACTGGTA 
TCACCTTTGT 



CTGTCCCTGC 
CGGTGCGGGG 
CACCACAACA 
GACGACGCCT 
CGCGTGTACA 
GGGTATCCCT 
ATTCAGGGAG 
GTTCCAACTC 
GGCCGCTCCT 
ACCACGGCCA 
ACCCGGGACG 
TCCTACTCCG 
GCCAACTACG 
ATGGGGGGCA 
TACTCGACCT 
AACTTTGACA 
GTGGCGGOGC 
CTCATGTACC 
GGCATCCGGC 
ACACCGCAGC 
TCAGAGCGCC 
ACTGCTGGCC 
GTGAACATCT 
AAGTACTGGC 
GACAAGTGGC 
AAGCTTTTCT 
CCGAGGCGTC 
CGGAGTGGCA 
AAGGCGCAGA 
CCTTTGGACA 
TTCTACTGGC 
ACCTATGACA 
GTAAATCCCC 
TTCTGTTCTG 
TTTTTGTTGG 



CCGAGACCGG 
TCCCAGACCT 
TCACCTATTG 
TTGCCOGCGC 
GCCGGGACGC 
TOGAGGGGAA 
ACGCCCATTT 
GGTTTGGAAA 
ACTCTGCCTG 
ACTACGACAC 
GCAATGCTGA 
CCTGCACCAC 
ACCGGGACAA 
ACTCGGCGGG 
GTACCAGCGA 
GCGACAAGAA 
ATGAGTTCGG 
CTATGTACCG 
ACCTCTATGG 
CCACGGCTCC 
CCACAGCTGG 
CTTCTACGGC 
TCGACGCCAT 
GATTCTCTGA 
CCGCGCTGCC 
TCTTCTCTGG 
TGGACAAGCT 
GGGGGAAGAT 
TGQTGGATCC 
CGCACGACGT 
GCGTGAGTTC 
TCCTGCAGTG 
ACTGGGACCA 
GAGGAAAGGG 
AGTGTTTCTA 



TGAGCTGGAT 
GGGCAGATTC 
GATCCAAAAC 
CTTCGCACTG 
AGACATCGTC 
GGACGGGCTC 
CGACGATGAC 
CGCAGATGGC 
CACCACCGAC 
CGACGACCGG 
TGGGAAACCC 
GGACGGTCGC 
GCTCTTCGGC 
GGAGCTGTGC 
GGGCCGCGGA 
GTGGGGCTTC 
CCACGOGCTG 
CTTCACTGAG 
TCCTCGCCCT 
CCOGACGGTC 
CCCCACAGGT 
CACTACTGTG 
CGCGGAGATT 
GGGCAGGGGG 
CCGCAAGCTG 
GCGCCAGGTG 
GGGCCTGGGA 
GCTGCTGTTC 
CCGGAGCGCC 
CTTCCAGTAC 
CCGGAGTGAG 
CCCTGAGGAC 
ACCCTGGGGA 
AGGAGTGGAG 
ATAAACTTGG 



AGCGCCACGC 
CAAACCTTTG 
TACTCGGAAG 
TGGAGCGCGG 
ATCCAGTTTG 
CTGGCACACG 
GAGTTGTGGT 
GCGGCCTGCC 
GGTCGCTCCG 
TTTGGCTTCT 
TGCCAGTTTC 
TCCGACGGCT 
TTCTGCCCGA 
GTCTTCCCCT 
GATGGGCGCC 
TGCCCGGACC 
GGCTTAGATC 
GGGCCCCCCT 
GAACCTGAGC 
TGCCCCACCG 
CCCCCCTCAG 
CCTTTGAGTC 
GGGAACCAGC 
AGCCGGCCGC 
GACTCGGTCT 
TGGGTGTACA 
GCCGACGTGG 
AGCGGGCGGC 
AGCGAGGTGG 
CGAGAGAAAG 
TTGAACCAGG 
TAGGGCTCCC 
AGGAGCCAGT 
GTGGGCTGGG 
ATTCTCTAAC 



TGAAGGCCAT 
AGGGCGACCT 
ACTTGCCGCG 
TGACGCCGCT 
GTGTCGCGGA 
CCTTTCCTCC 
CCCTGGGCAA 
ACTTCCCCTT 
ACGGCTTGCC 
GCCCCAGCGA 
CATTCATCTT 
ACCGCTGGTG 
CCCGAGCTGA 
TCACTTTCCT 
TCTGGTGCGC 
AAGGATACAG 
ATTCCTCAGT 
TGCATAAGGA 
CACGGCCTCC 
GACCCCCCAC 
CTGGCCCCAC 
CGGTGGACGA 
TGTATTTGTT 
AGGGCCCCTT 
TTGAGGAGCC 
CAGGCGCGTC 
CCCAGGTGAC 
GCCTCTGGAG 
ACCGGATGTT 
CCTATTTCTG 
TGGACCAAGT 
GTCCTGCTTT 
TTGCCGGATA 
CCCTCTCTTC 
CTTT 
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MSLWQPLVLV 



ITYWIQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 



PMYRPTEGPP 
PTAGPTGPPS 
RFSKGRGSRP 
LDKLGLGADV 
THDVFQYREK 



I 

LLVLGCCFAA 
LLLLQKQLSL 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPPIiIADKW 
AQVTGALRSG 
AYFCQDRFYYJ 



21 
1 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 
SDKKHGPCPD 
HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 



31 
I 

PPGDliRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGVWPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLPLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



41 

! 

DRQLAEEYLY 
VPDLGRFQTF 
SRDADIVIQF 
RFGNADGAAC 
GNADGKPGQF 
NSAGELCVFP 
HEFGHALGLD 
PTAPPTVCPT 
FDAIAEIGKQ 
FFSGROVWVY 
MVDPRSASEV 
ILQCPED 
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300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



51 
I 

RYGYTRVAEM 
EGDLKWHHHN 
GVAEHGDGYP 
HFPFIFEGRS 
PFIFQGQSYS 
FTFLGKEYST 
HSSVPEALMY 
GPPTVHPSBR 
LYLFKDGKYW 
TGASVLGPRR 
DRHPPGVPXiD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



CGCCCGCGCG 
CCCCGAGGTA 
AAGAGGATGG 
AGCGTCAGCC 
ACGGAGTGTG 
CGGCGCTGCA 
GTCATGGAGA 
AGCCAGATGT 
GAGCTGGAGG 
TCCAACTCCA 
GTCCTGAGCC 
AGCGTCCCGC 
CCCCCCTTCT 
AAACTGCAGG 
ATCCTGCAGA 
CTGGTCTTCT 
GGCATCATGA 
TACAGGACAC 
ATCATCCCCA 
TATTTCCCTG 
CTGGAGGAGG 
CGAGGCCTTC 
CACATCCGGC 
GATGGGAOGC 



11 

I 

CTGCAGCCCC 
GGTCCAGGAC 
CAGGGCCACG 
TCTCTGGGAC 
TCCGTGTGGA 
ACACCCAGGC 
GCAGCTTCCA 
CCCCCCAAGG 
TGTTTGAGCC 
TGTCCGATGA 
AGCTCACCAG 
AGACGGACAT 
CCTTCAAGAA 
GAGAGCGGAT 
CAGCTGTGTG 
CCACCGAGTC 
GCCGCAACGA 
AGGACTACCC 
TCTTTGCTGT 
TCTCCTCACT 
CCTTCAATCG 
GGACAGAGGT 
GGGGGGAAGT 
ACGTGTGCCA 



21 
I 

ATCTCCTAGC 
GGGCGCACAG 
CCCCAGCCCA 
CTTGGCAAAC 
TAAGGACTGC 
GGAGCTGCTG 
AATCACAGAG 
CCTGCGGGTC 
ACTGGAGAGC 
TCTGGACAAC 
CGACTACACT 
GAGGCCTGAG 
CGTCATCAGC 
CTCAGGCAAC 
CACGAGGGAC 
AGCCTTCCAC 
TGAACGGTGC 
GTCGGTGCCC 
CACCAACTAC 
GGGGGTGCTG 
GATCOGCTCC 
CACCTCCAAG 
GGGTATATAC 
GCTGCOGGAG 



31 
I 

GGCAGCCCAG 
CAGCAGCCGA 
TGGGCCAGGC 
OGCTGCAAGA 
GCCTACTGCA 
GCCGCGGGCT 
GAGACCCAGA 
OGTCTGCGGC 
CCCGTGGACC 
CTCAAGAAGA 
ATTGGATTTG 
AAGCTGAAGG 
CTGACAGAAG 
CTGGATGCTC 
ATTGGCTGGC 
TATGAGGCTG 
CACCTGGACA 
ACCCTGGTGC 
TCCTATAGCT 
CAGGAGGACT 
AACCTGGACA 
ATGTTCCAGA 
CAGGTGCAGC 
GACCAGAAGG 



41 
I 

GCGOGGAGGG 
GGCTGGCCGG 
TGCTCCTGGC 
AGGCCCCAGT 
CAGACGAGAT 
GCCAGCGGGA 
TTGACACCAC 
CCGGTGAGGA 
TGTACATCCT 
TGGGGCAGAA 
GCAAGTTTGT 
AGCCCTGGCC 
ATGTGGATGA 
CTGAGGGCGG 
GCCCGGACAG 
ATGGCGCCAA 
CCACGGGCAC 
GCCTGCTCGC 
ACTACGAGAA 
CGTCCAACAT 
TCOGGGCCCT 
AGACGAGGAC 
TGOGGGCCCT 
GCAACATCCA 



SI 
I 

AGCGAGTCCG 
GAGAGGGAGG 
AGCCTTGATC 
GAAGAGCTGC 
GTTCAGGGAC 
GAGCATCGTG 
CCTGCGGCGC 
GCGGCATTTT 
CATGGACTTC 
CCTGGCTOGG 
GGACAAAGTC 
CAACAGTGAC 
GTTCCGGAAT 
CTTOGATGCC 
CACCCACCTG 
CGTGCTGGCT 
CTACaCCCAG 
CAAGCACAAC 
GCTTCACACC 
CGTGGAGCTG 
AGACAGOCCC 
TGGGTCCTTT 
TGAGCACGTG 
TCTGAAACCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAO 1S00 
CTGCAAAAAQ AGGT6CGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 
TGTGTGTGCA GCGAGQGCTG GAGTGGCCAG ACCTGCAACT GCT CCAC OGG CTCTCTGAGT 1620 
GACATTCAGC CCTGCCTGCG GGAGGGOGAG GACAAGCCGT GCTCOGGCCG TGGGGAGTGC 1680 
5 CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCOGCTACG AGGGTCAGTT CTGOGAGTAT 1740 
GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG AOGCTGCTCC 1800 
ATGGGCCAGT GTGTGTGTGA GCCTOGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GAOGTGGCCA CTGTGAGTGT 1920 
GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCA ACTA C 1980 
10 TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 
GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCOGGGA CGAGGATGAC 2160 
GACTGCACCT ACAGCTACAC CATGGAAGGT GAOGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 
CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 
15 CTCCTCCTCC TGCOGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 
TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACOGAG GTCACATGGT GGGCTTTAAG 2400 
GAAGACCACT ACATGCTGOG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 
CTGCGCAGCG GGAACCTCAA GGGCOGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 
CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 
20 GGGCTGTCCT TGCGCCTGGC COGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTOGG 2640 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA AOGAGGTCTA CAGGCAGATC 2700 
TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC OGGGAAAAAG 2760 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 
25 GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 
GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 
CAGCTGCTGG TGGAGGCCAT CGAOGTGCOC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 
GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACOTGG TGTCCTTTGA GCAGCCTGAG 3120 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 
30 GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA OOGOSCAGGG CAACCGGGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 
ACCATCATCA TCAGGGACCC AGATGAACTG GACOGGAGCT TCAGGAGTCA GATGTTGTCA 3480 
35 TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 
GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 
CCCTCAGTGG AGCTCAGCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 
TAOGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 
40 GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 
CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 
AAGGCGCGCA AOGGGGCCGG CTGGGGGCCT GAGOGGGAGG CCATCATCAA CCTGGCCACC 4080 
45 CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 
AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 
. TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGOGTGC TAAGCACATC CTCCACCCTC 4380 

50 ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 
GACACGCCCA CCOGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGAGCCGC GGTGCGAGCG GCOGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 
55 GACCTCCTGC CCAACCACTC CTAOGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 
GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 
TTCACTGCCC TGAGCCCAGA CTGGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 
GGGGATATCG TCGGCTACCT GGTGACCTGT OAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 
60 GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 
GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 
AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 
65 GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA S400 
CCCCCGCCAT GTCCCACTAG GCOTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 
TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 
70 AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

Seq ID NO: 360 Protein sequence 
^ Protein Accession #: NP_000204.1 

1 11 21 31 41 51 

I I I I I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 
CNTOAELLAA GCQRESIWM ESSFQITEBT QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 
80 EVFEPLESPV DLYILMDFSN SMSDDLDNLK KKGQNLARVL SQLTSDYTIG FGKPVDKVSV 
PQTDMRPEKL KEPWPNSDPP FSPKNVISLT EDVDKFRNKL QGERISGNLD APBGGPDAIL 
QTAVCTRDIG WRPDSTHLLV PSTESAPHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 
TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYP PVSSLGVLQE DSSNIVELLE 
EAPNRIRSNL DIRAIiDSPRG LRTEVTSKMP QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 

1091 



60 
120 
180 
240 
300 
360 
420 



/ 



WO 03/042661 
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THVCQLPEDQ KGNIHLKPSF SDGLKNDAGI ICDVCTCELQ KEVRSARCSP NGDFVOGQCV 480 

CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGBCQC GHCVCYGEGR YEGQFCBYDN S40 

FQCPRTSGPL CNDRGRCSMG QCVCEPGWTQ PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600 

CHCKQQSLYT DTICEINYSA IHPQLCEDLR SCVQCQAWGT GEKKGRTCBB CNFKVKMVDE 660 

5 LKRAEEVWR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV RKKKDCPPGS FWWLIPLLLL 720 

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 

SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840 

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900 

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLPI RPEDDDEKQL 960 

10 LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSPEQPEFS VSRGDQVARI PVIRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI PVEGBLLFQP GBAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260 

15 NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380 

PPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPOT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLiLNGG IS 00 

ELHRLNIPNP AQTSWVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560 

20 LPGSAPTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG I1TIESQDGG PPPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740 
LSTHMDQQFF QT 

25 Seq ID NO: 361 DMA sequence 

Nucleic Acid Accession #: NM_013332.1 
Coding sequence : 1 . . 63 

1 11 21 31 41 SI 

30 | | | | II 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCOGGC GATCCACTCC 120 

CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180 

GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240 

35 GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 

GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 

CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 

ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 

TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG S40 

40 CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600 

CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660 

CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 

GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGOGGGAGA ATTGCTTGAA CCCGGGGACG 780 

GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 

45 CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900 

TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCOGGG GTCTGGTGTA TGCTGTGCTT 960 

TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020 

TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 

TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140 

50 CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260 

GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CGACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

55 Seq ID NO: 362 Protein sequence 
Protein Accession 8: NP_037464.1 

1 11 21 31 41 51 

60 MKHVLNLYLL GWLTLIiSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 60 
RSM 



Seq ID NO: 363 DNA sequence 
Nucleic Acid Accession #: NM_023 915.1 
65 Coding sequence: 250.. 1326 

1 11 21 31 41 51 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 

70 TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 

_ CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

75 AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA OGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

rtrk TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

80 GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAOGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 
55 
60 
65 
70 
75 
80 



AGGCAATTCA 
GTGGCTGTGT 
AGTCACTTAG 
ATTACACTTT 
TGTAGGTCAT 
ATCAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



TAA6TCAGTC 
TTTTTACCTG 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATCG 
GTTGGAATCG 
AA 



CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 



ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



CAGGGTTGTT 1020 

TTTTACTTTT 1080 

CTGCAAAGAA 1140 

CTTTTTCATG 1200 

GAGTGAAAGC 1260 

TTACACTGAT 1320 

AATGTTTCTT 1380 



Seq ID NO: 364 Protein sequence 
Protein Accession 8: NP 076404 



1 
I 

MGFNLTLAKL 
GLAVWIFPHI 
FYANMYTSIV 
NGQPTEDNIH 
ISQSSRKRKH 
FLSAOJVCLD 



11 
I 

PNNELHGQES 
RNKTSFIFYL 
FLGLISIDRY 
DCSKLKSPLG 
NQSIRVWAV 
PIIYFFMCRS 



21 
I 

HNSGNRSDGP 
KNIWADLIM 
LKWKPFGDS 
VKWHTAVTYV 
FPTCFLPYHL 
PSRRLFKKSN 



31 
I 

GKNTTLHNEF 
TLTPPPRIVH 
RMYSITFTKV 
NSCLFVAVLV 
CRIPPTPSHL 
IRTRSESIRS 



41 

I 

DTIVLPVLYL 
DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDESAQK 
LQSVRRSEVR 



51 
I 

IIFVASILLN 
FILCRYTSVL 
VLSLPNIILT 
RYIHKSSRQF 
ILYYCKEITL 
IYYDYTDV 



60 
120 
180 
240 
300 



Seq ID NO: 365 DNA sequence 

Nucleic Acid Accession #: NM_005365-1 

Coding sequence: 1..948 



1 
I 

ATGTCTCTCG 
GAGGACTTGG 
TCCTCTGACA 
CCTCAGGGAG 
GAGGGCTCCA 
GAGTTCATGT 
CACAAATATC 
AATTACAAGC 
TTTGGCACTG 
CTTGGCCTCT 
CTGATCATTG 
TGGGAAGCGT 
CCCAGGAAGC 
CCCGGCAGTG 
AGCTATGAGA 
CCATCCCTTT 



11 
I 

AGCAGAGGAG 
GCCTGATGGG 
GCAAGGAGGA 
GCGCTTCCTC 
GCAGTCAAGA 
TCCAAGAAGC 
GAGTCAAGGA 
GCTACTTTCC 
ATGTGAAGGA 
OGTGCGATAG 
TCCTGGGTGT 
TGAGTGTGAT 
TGCTCACCCA 
ATCCTGCGCA 
AGGTCATAAA 
ATGAAGAGGT 



21 
I 

TCCGCACTGC 
TGCACAGGAA 
GGAGGTGTCT 
CTCCATTTCC 
AGAGGAAGAG 
ACTGAAATTG 
GCCGGTCACA 
TGTGATCTTC 
GGTGGACCCC 
CATGCTGGGT 
GATCCTAACC 
GGGGGTGTAT 
AGATTGGGTG 
CTACGAGTTC 
TTATTTGGTC 
TTTGGGAGAG 



31 
I 

AAGCCTGATG 
CCCACAGGCG 
GCTGCD3GGT 
GTCTACTACA 
CCAAGCTCCT 
AAGGTGGCTG 
AAGGCAGAAA 
GGCAAAGCCT 
GCCGGCCACT 
GATGGTCATA 
AAAGACAACT 
GTTGGGAAGG 
CAGGAAAACT 
CTGTGGGGTT 
ATGCTCAATG 
GAGCAAGAGG 



41 

i 

AAGACCTTGA 
AGGAGGAGGA 
CATCAAGTCC 
CTTTATGGAG 
CGGTCGACCC 
AGTTGGTTCA 
TGCTGGAGAG 
CCGAGTTCAT 
CCTACATCCT 
GCATGCCCAA 
GOGCCCCTGA 
AGCACATGTT 
ACCTGGAGTA 
CCAAGGCCCA 
CAAGAGAGCC 
GAGTCTGA 



Seq ID NO: 366 Protein sequence 
Protein Accession #: NP_005356.1 



41 



51 
I 

AGCCCAAGGA 
GACTACCTCC 
TCCCCAGAGT 
CCAATTCGAT 
AGCTCAGCTG 
TTTCCTGCTC 
CGTCATCAAA 
GCAGGTGATC 
TGTCACTGCT 
GGCCGCCCTC 
AGAGGTTATC 
CTACGGGGAG 
CCGGCAGGTG 
CGCTGAAACC 
CATCTGCTAC 



51 



1 11 21 31 

I I I I I i 

MSLEQRSPHC KPDEDLEAQG EDLGXKGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 
PQGGASSSIS VYYTLWSQPD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 
HKYRVKEPVT KAEMLESVIK NYKRYFPVIP GKASEFMQVI FGTDVKEVDP AGHSYILVTA 
LGLSCDSMLG DGHSMPKAAL LIXVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMPYGE 
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYBKVINYLV MLNAREPICY 
PSLYEEVLGE EQEGV 

Seq ID NO: 367 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence: 86.. 1126 



1 
I 

GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT 
CCCTCGAATC 
CACATCTGTC 
GCCAGOGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 



11 
I 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TGGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 



21 

I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GOGCOGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 



31 
I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 



41 

I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCGAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 



51 

I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCOQGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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15 

20 

25 

30 
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60 
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70 
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80 



GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTQTATAGTG 



GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 



GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 
AQAGTGAGAG AAGTCAGCTG TCAOGGGQAA GGTGAGAGAG 
CACTTTCTCC TAGCCAGCCT GQACTTTGGA GOGTGOGGTG 
CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 
CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 
GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 



1360 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 368 Protein sequence 
Protein Accession g : NP_055215 



1 11 

I I 
MDPARKAGAQ AMIWTAGWLL 
CTEAVGAVET IHGQPSLAVX 
SRALDPAGNE SAYPPNGVEC 
AANVTVSLPV RGCVQDEFCT 
LPPPEPTTVA STTSVTTSTS 
AGHQDRSNSG QYPAKGGPQQ 



21 31 41 51 

till 
LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 
GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 
YSCVGLSREA CQGTSPPWS CYNASDHVYK GCFDGNVTLT 
RDGVTGPGPT IiSGSCCQGSR QJSDLRNKTY FSPRIPPLVR 
APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 
PHNKGCVAPT AGLAALLLAV AAGVLL 



60 
120 
180 
240 
300 



Seq ID NO: 369 DNA sequence 

Nucleic Acid Accession #: NM_005329.1 

Coding sequence : 1 . . 1662 

1 11 21 31 41 51 

I I I I I 1 

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 
GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 
CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 
CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 
TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 
TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 
GTGGTGGATG GCAACCGCCA GGAGGACGOC TACATGCTGG ACATCTTOCA CGAGGTGCTG 
GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 
GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 
AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GOGAGGTCAT GTACACGGCC 
TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 
GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 
GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 
GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 
CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 
GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TGGGGGATGA OCGGCACCTC 
ACCAACCGAG TCCTGAGCCT TGGCTACOGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 
ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACOCGCTG GAGCAAGTCT 
TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 
TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCAOGGT TATACAGCTT 
TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 
ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGA GATG AT CTTCATGTCC 
CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 
ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 
CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 
TGCCAGGACC TGTTCAGTGA GAGAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 
GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 370 Protein sequence 
Protein Accession ft: NP 005320.1 



1 
I 

MPVQLTTALR 
LPAFLEHRRM 
WDGNRQEDA 
STFSCIMQKW 
VGGDVQILNK 
DWYHQKFXGS 
YFREWLYNSL 
I IKATYACFL 
LIPVSIWVAV 
KKPEQYSLAF 



11 
I 

WGTSLFALA 
RRAGQALKLP 
YMLDIFHEVL 
GGKRBVMYTA 
YDSWISFLSS 



WFHKHHLWMT 
RGNAEMIFMS 
LLEGLAYTAY 
ABV 



21 
I 

VLGGILAAYV 
SPRRGSVALC 
GGTEQAGPFV 
FKAI/3DSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGFPP 
LYSLLYMSSL 
CQDLFSETEL 



31 

I 

TGYQFIHTEK 
IAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSXCL 
FFLIATVIQL 
LPAKIFAIAT 
AFLVSGAILY 



41 
1 

HYLSFGLYGA 
LRKCLRSAQR 
GETEASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TETPTKYLRW 
FYRGRIWNIL 
XNKSGWGTSG 
GCYWVALLML 



Seq ID NO« 371 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 148-709S 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 



11 

! 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 



41 
I 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 



60 
120 
180 
240 
300 
360 
420. 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 



51 
I 

ILGLHLLIQS 
ISFPDUCWM 
MDRVRDWRA 
VLEEDPOVGG 
RNSLLQQFLE 
LNQQTRWSKS 
LFLLTVQLVG 
RKTIWNFIG 
YLAIIARRCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
OGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 



60 
120 
180 
240 
300 
360 
420 
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AACACATTCA TTCATAACAC TGGQAAAACA GTGGAAATTA ATCTCA CTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGQA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAQATGCAAA TCTACTGCTT TGATGCGQAC OGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
5 GGAAAA6GGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAQA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAOTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
10 TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
15 GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTQAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
20 ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
25 AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
30 AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AAOGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
35 ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCIG 2S80 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
40 AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 
45 CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 
^ ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
50 CCCAACATGT ATGATAATGT AAATAAGTTG AATGOGTCTT TACAAGAAAC CTCTGTTTCC 3420 
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
55 ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAQTGAGAA ATATGAACCA 3960 
60 GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATQATGAG 4020 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
65 GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 
70 OCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 
TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AA GTGGATC A 4800 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 
75 GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 
GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGOGAGA ACTCAGAAGT GTTCCACGTT 4980 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTO 5040 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT C AGCCC TGAC TTTTAT CTGT 5100 
rt . CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 
80 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 
ATTTCAGATG ATGTOGGAGC AATTCCAATA AAGCACTTTC CAAA QCATG T TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 
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CTTGCTGAAA AGQATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5S20 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC S580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCOGATG GGAGTGAGGA GTA CGGGA AC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAQTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GQGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAA ATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 



Seq ID NO: 372 Protein sequence: 

Protein Accession #: built from XP_031379 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEPLTDGYQD 
LIGTEEZXKB 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAPTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMPSQVEP 
SLFSGPSHIP 
PVSVAEFTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASBKYE 
EPLNTLINKL 
PHRDGSVTST 
KCMSCSSYRE 
SPGKSPSAKG 
ENETSTDFSP 
HESRIGXtAEG 
VISTPPTPIP 
TADSSNHPDN 
QGFLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVPIHDTLV 
SAALRQCNRE 
HTIKDFWRMI 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPEKVQA 
LGAILNNLLP 



GKGDVPKTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 



IHSDBILTST 
KLIiFPSKATS 
SQEKVMNDSD 



ADTNEKDADG 
LESEKKAVIP 
PISDDVQAIP 
KHKNRYINXV 
PWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSIIPV 
WDHNAQLWM 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCBVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARKA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
8SGPEPSYAL 
ASLLQPTHAL 
LSKSBIIYGN 
GSIiAHTTTKV 
LSPSTQLLFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
THENSLMDQN 
ENDIQTGSAL 
ILAAGDSEIT 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 



31 
I 

KLVEEIGWSY 
ENTFIHKTGK 
LEKQXYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
1CTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVL 
SDNEGSQHIF 
SGDGEWSGAS 
ETELQIPSFN 
PDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTF 
GLVGGGEDGD 
NPISYSLSEN 
LPLSPESKAW 



CLWLVGILI 



QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYIHA 
EFVYWPKKDE 



QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV 
SDSEFLLPDT 
EMVYPSBSTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVPDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEESG 
VTSENSEVFH 
YWRKCFQTAH 
FETLKEFYQB 
DYINANYVDG 
WPADGSBEYO 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PXNCESFKVT 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGBDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPLLLDNQI 
LFRHLHTVSQ 
EFGSESGVLY 
HDSVGVTYQG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 
SSDSQTGMDR 
SGQGTSDSLN 
VSEABASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YKRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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NEBKLI IQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKB EAANRDGPMI 2220 
VHDEHGGVTA GTFCALTTLM HQLBKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQBEN PSTSLDSNGA ALPDGNIABS LESLV 

5 Seq ID NO: 373 DMA sequence 

Nucleic Acid Accession tf : built from NH_002851 
Coding sequence: 148-4518 

1 11 21 31 41 51 

10 | i i i I I 

CACACATACG CAGGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AG GAGC CGCA 120 
CGGQGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
15 CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGOGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
20 AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840' 
25 AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
30 TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTA CCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GAGCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT COCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
35 GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTCTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCXGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
40 GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
45 GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTOCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
50 TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AG CTGAGG GG 2460 
TTGGAATCCG AGAAGAAGGC AGTTATACOC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 
TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 
TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC A CCTA TCTTT 2640 
55 CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 
TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 
GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 
AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 
CAGCTTGCTG AAAAGGAIGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 
60 TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 
TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCT0GTG 3060 
GAGAAAGGAA GGAGAAAAIG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 
AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGAOGTGTG 3240 
65 GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 
GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTQTTGTC 3360 
GTOCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 
CAGCAGATTC AACACGAAGG AACTGTCAAC AXATTTGGCT TCTTAAAACA CATCCGTTCA 3480 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 
70 GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 
GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 
AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 
AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 
AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATGA TGGGCTATTA CCAGAGCAAT 3840 
75 GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGT CACT 4020 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 
n - ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 
80 TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 
GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 
GGAACTTTCT GTGCTCTGAC AACOCTTATG CACCAACTAG AAAAAGAAAA TTCOGTQGAT 4320 
GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 
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CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AG CTGAGAG C 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 374 Protein sequence: 

Protein Accession #: built from XP_031379 

1 11 21 31 41 51 

I I I I I I 

MRILKRFIiAC IQLLCVCRLD WANGYYRQQR KLVBEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KPQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILIUL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAIIiNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVEMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AXVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTBLPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE BSLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQOYIFS SENPETITYD VLIPBSARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE 1RVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD KKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHK 960 

VEVIVMITNL VEXGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGFV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV KALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSQEGTDYIN ASYIMQYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD PILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 

Seq ID NO: 37S DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTQTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1S60 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT QACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
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GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTOGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG C TCCAG TCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
5 GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCOG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
10 CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTOGCAGAC AACCCAACCG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
GAATCOGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
15 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 
ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 
20 CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 
CAGTACTGGC CTGCGGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
25 AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA OGAAGGAACT 3420 
GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 
30 GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATOQAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 
35 AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 
CTATCTAATG AGGAAAAACT TATAATTCAO GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 
40 TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 
ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 
AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 
45 ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 
AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 
50 TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 
55 GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
r _ AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 
60 TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 376 Protein sequence ; 
Protein Accession #: EOS sequence 

65 

1 11 21 31 41 51 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

70 PKASKITFHW GKCNMSSDGS BHSLEGQKFP LEMQIYCFDA DRFSSFEBAV KGKGKLRALS 180 

ILFBVGTESH LDFKAIIDGV BSVSRFGKQA ALDFFXLLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTKQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPEMVQA DPENYTSLLV TWERPRWYD TM2EKPAVLY QQLDGEDQTK 360 

_ _ HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLPPE 420 

75 LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTK 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTABSL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDEEEKTT KSFSAGPVM3 QGPSVTDLEM PHYSTFAYFP 720 

80 TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGIiAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYliEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NFVAYDHSRV 900 

K1AQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSB EYGNFLVTQK SVQVIAYYTV RNPTLRNTKI KKGSQKGRPS 1020 



1099 



WO 03/042661 



GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PWVHCSAGV GRTGTYIVLD 1080 

SMLQQIQHBQ TVNIFGPLKH TRSQRNYLVQ TEEQYVPIHD TLVEAILSKE TEVLDSHIRA 1140 

YVNALLIPGP AGKTKLEKQP QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEPIITQH PLLRTIKDFW RMIWDHNAQL WMIPDGQNM 1260 

AEDEFVYWPN KDBPINCESP KVTLMAEEHK CLSNEEKLI I QDFILBATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTPELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 377 DNA sequence 

Nucleic Acid Accession #: BOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 

I I I I I 1 

CACACATACG CACGCACQAT CTCACTTCGA TCTATACACT 6GAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGQAGCCGCA 120 

OGGCGAGGGG COGCAGACOG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGGGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGOACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCXT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG - GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA AGCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTOCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCOG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA AOCCAACGGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTOGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATOGTGTC AGCCCTGACT TTTATCTGTC 2520 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2S80 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATOGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTOGTGGAGA 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TA CGGG AACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTOCCAOTGC 3300 

TQACCTTTGT GAGAAAGGCA GCCTATGCCA AGOGCCATGC AGTGGGGCCT GTTGTOGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
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TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CPGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGQAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGQATOTTT 4320 

5 ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 43 BO 

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4S60 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

10 ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

15 AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAG TTTTC T GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

20 AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 378 Protein sequence: 
25 Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I I I I 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRPSSPEE AVKGKGKLRA 

30 LSILPEVGTE ENLDFKAIID GVESVSRPGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 
PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 
SYTGKBEIHE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 
PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAX 

35 TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 
NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 
ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 
DITAQPDVGS GRBSFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 
FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNBASN SSHESRIGLA EGLESEKKAV 

40 IPLVIVSALT PICLWLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 
1PIKHPPKHV ADLHASSGFT EEFETLKEPY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 
TVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 
NVEVIVMITN LVEKORRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 
KGSQKGRPSG RWTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VWHCSAGVG 

45 RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 
BVLDSHIHAY VNALLIPGPA GKTKLEMQFQ IiLSQSNIQQS DYSAALKQCN REKNRTSSII 
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 
VMIPDGONMA EDEFVYWPNK DEPINCESFK VTLMAEBHKC LSNEEKLIIQ DFILEATQDD 
YVLEVRHFQC PKWPOTDSPI SKTPELISVI KEEAAMRDGP MIVHDEHGGV TAGTFCALTT 

50 LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQPLYKV ILSLVSTRQE ENPSTSXDSN 
GAALPDGNIA ESLESLV 

Seq ID NO: 379 DMA sequence 
Nucleic Acid Accession #t EOS sequence 
55 Coding sequence? 148-4632 

1 11 21 31 41 51 

| | | I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 

60 CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG' CCGCAGACCO TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

65 CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGQAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

70 GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGQ TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACA TTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

75 TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

80 GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGO AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGOGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1101 



WO 03/042661 PCT/US02/36810 



ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA QAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATQATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
5 AACTTGTOGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAQAATA TGAGGAGQAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
10 GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA OGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
15 GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTAT CTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGC ACAC TTT 2580 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTOGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
20 CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 
25 TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 
30 CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 
35 CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 
40 GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 
45 GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 
50 CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGOCTT 4800 
55 TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 
60 AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 
65 GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 380 Protein sequence ; 
„ Protein Accession 8* EOS sequence 

70 

1 11 21 31 41 51 

| 1 I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
QSPINIDKDL TQVNVNLKKL KFQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGOVSEKV 
75 PKASKITFHW GKCNMSSDG5 EHSLEGQKFP LEMQIYCFDA DRFSSFBEAV KGKGKLRALS 
ILFEVGTEBN U)PKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YUGSLTSPPC 
TDTVDWIVPK DTVS1SESQL AVPCEVLTMQ QSGYVMLMDY LQNNFREQQY KPSRQVFSSY 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 
Art HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDNPT DNPELDLFPE 
80 LIGTEBIIKE BEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSI* NSTSQPVTKL ATBKDISLTS OTVTKLPPKT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITBYEE ESLLTSPKLD TGAEDSSGSS PATSAIPPIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSB ESLKDPSMEG NVWPPSSTDI 
TAQPDVGSGR ESFLQTNYTE IRVDBSEKTT KSPSAGPVMS OGPSVTDLEM PHYSTFAYFP 
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TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWIjVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIP PISDDVGAXP 840 

IKHFPKHVAD LHASSGFTEE FETLKBPYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

AYDHSRVKLA QLABKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

5 EVIVMITNLV EKGRRKCDQY WPADGSEEYG KFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPSYSI*P VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYTVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV RAILS KETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

10 YIMGYYQSNE FIITQHPLLH TIKDFHRMIW DHNAQLWMI PDGQNMAEDB PVYMPNKDBP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHPQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

15 

Seq ID NO: 381 DMA sequence 

Nucleic Acid Accession #: HM.002851.1 

Coding sequence: 148.. 7092 

20 1 11 21 31 41 SI 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
25 CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACOGT 480 
30 GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC OGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
35 TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCOCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTOGAGAGCA ACAG TACA AG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
40 AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTOGACAT GCCTACTGAT 1380 
45 AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
50 ACAGAAAAAG AIATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAAXGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
55 GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG .AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAG ATCT GGAAATGCCA 2280 
60 CATTATTCTA C C TTT G CCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCT GCTT C AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTGOC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 
65 TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA AOGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
70 AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TGAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
75 GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAACTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATQ 3360 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC Cl<. J i ^iTrCC 3420 
or . ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAOGTTTTT 3480 
80 GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGC CAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
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CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTQACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 
GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 45 00 
GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 
ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 
GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 
CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 
CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 
CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 
ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 
TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 
GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 
TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 
GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 
TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 
TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA A GCATGT TGC AGATTTACAT 5280 
GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 
AGCTGTACTG TTGACTTAGO TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 
AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 
GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520 
AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG S580 
AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 
GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 
CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 
AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 
CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 
ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 
TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 
ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 
AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 
ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 
CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTQTTATAAA AGAAGAAGCT 6780 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 
TCITTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 
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MRILKRPLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TOALNQKNTC KKYPTCNSPK. 60 

QSPIN1DKDL TQVNVNLKKL KPQGWDKTSL ENTFIHNTGK TVECJLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EKSLEGQKFP LEMQIYCFDA DRPSSPEKAV KGKGKLRALS 180 

IIiPEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWTVPK DTVSISESQL AVFCBVMMQ QSGYVMLMDY LQNNPREQQY KFSRQVPSSY 300 

TGKBEIHEAV CSSBPENVQA DPENYTSLLV TWERPKWYD TMIEKPAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SD QLIVD MPT DNPELDLFPB 420 

LIGTEBIIKE EEEGKDIBEG AIVNPGRDSA TNQIRXKEPQ ISTTTHYKRI GTKYNBAKTN 480 
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RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 
GSKTVLRSPH MNLSGTABSL KTVSITEYBE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE BSLKDP5MEG NVWFPSSTDI 660 
TAQPDVGSGR ESFLQTNYTE IRVDBSEKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
5 TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 
LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PPSSASFSSE LFRHLHTVSQ 840 
ILPQVTSATE SDKVPLHASL FVAGGDLLLE PSLAQYSDVL STTHAASETL BPGSESGVIiY 900 
KTLMPSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIP TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEPLLPDT DGLTALNISS 1020 

10 PVSVAEPTYT TSVPGDDNKA LSKSEIIYGN ETELQIPSFH EMVYPSESTV MPNMYDNVNK 1080 

LMASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPB NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLPY ETSASFSTBV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLUCSESSH QWPSLYSND BLFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

15 EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGSDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRBS QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLKE 1560 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSBVFHV SBAEASNSSB 1620 

20 ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LWLVGILIY WRKCFQTAHF YLEDSTSPRV 1680 

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEBP ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKKRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPIJCSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYH PADGSEEYGN FLVTQKSVQV 1860 

LAYYTVRNFT LRNTKIKKGS QKGRPSGRW TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920 

25 KRHAVGPVW HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980 

YVPIHDTLVE AILSKETEVL DSH1HAYVNA LLIPGPAGKT KLEKQFQLL8 QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100 

TIKDFWRMIW DHNAQLWMI PDGQKMAEDE FVYWPNKDBP INCESFKVTL MAEEHKCLSN 2160 

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT PELISVIKEE AANRDGPMIV 2220 

30 HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280 
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

Seq ID NOi 383 DNA sequence 
Nucleic Acid Accession #: NM^OOSeBS.l 
35 Coding sequence: 126- . 4439 

1 11 21 31 41 51 

| I I i I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGQAGC 60 

40 AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTOG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

45 GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA OGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGAOGTGAAC TGCAOAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGOGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

50 TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTA CAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

55 TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCOGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTOCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAAOGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

60 AGAGTGTTCA AAAAATCOGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACOSTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

65 TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

70 ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

75 GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGOCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTG GT TG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

n _ GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

80 CCATTTTTAA TAACCTGTTG CTGGQAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGOCTTC AGCACCTGGT 2760 
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GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTOGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG AOGAGCTTTT CCGAAGGATC CTTOGAAGCC 3000 

5 CTATGAAGTT TTTTGACAOG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCOGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATOGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 • 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

10 AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG OGGGTCTOGC CATCTCTTAT GCTGTCCAGT 3S40 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

15 CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGC ACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

20 GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

25 CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTOGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

30 CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGQAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAG CTAT A 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

35 TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

40 CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGOAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

45 ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TGGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTG G TTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAG CGTTGC 5700 

50 ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 384 Protein sequence 
55 Protein Accession #: NP_005679.1 

1 11 21 31 41 51 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60' 

60 LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKKQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGPSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGP LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

65 VLTYIKFIKM YAWVKAPSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKHKKDK RASRGKKEKV RQLQRTEHOA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAHILKATLR DNILFGKEYD EERYNSVLNS 660 

70 CCLRPDIiAIL PSSDLTBIGB RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEIKSKKET SGSQKKSQDK GPKTGSVKKB KAVKPEEGQL VQLBEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SKAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

75 KFFPTT P TGR ILNRPSKDMD EVDVRIiPFQA E3TPIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRBLK RLDNITQSPP LSHIT8SIQG LATIHAYNKG QEPLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVU4 HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SBTEARFTSV ERINHYIKTL SLBAPARIKN KAPSPDHPQE GBVTFBNAEM 1200 

RYRENLPLVL KKVSFTIKPK BKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

80 IGLADLRSKL SIIPQEPVLF SGTVRSNLDP PNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGCNF SVGERQLLCI ARALLRHCKI LIU3EATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VIGSDRIKVL AQGQWKFDT PSVLLSMDSS RFYAMFAAAE NKVAVKG 
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Seq ID NO: 385 DMA sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence t 89.. 631 
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1 
I 

AGCAGGGGGC 
CTGAGAGCCG 
GACGGGCGAT 
TGGCGGCCCA 
AAG6GCCTCG 
GCTGAATGGA 
CCTCGCCATG 
GGATGCCCCA 
CATACTGACT 
CTGTCTCCAG 
GGCTCAGCCT 
GCCTCCTCCC 
GTTTGTCGCT 



11 
I 

GCTGTGTGTA 
GGCAGAGGCT 
GCTGATGGCC 
GGAGAGGCGG 
GGGCCGGGAG 
TGCTGCAGAT 
CCTTTCGCGA 
CCGCTTCCCG 
ATCCGACTGA 
CAGCTTTCCC 
CCCTCAGGGC 
CTAGGGAATG 
GGAGGAGGAC 



21 
I 

CCGAGAATAC 
COGGAGCCAT 
CAGGAGGCCC 
GTGCCAOGGG 
GAGGCGCCCC 
GCGGGGCCAG 
CACCCATGGA 
TGCCAGGGGT 
CTGCTGCAGA 
TGTTGATGTG 
AGAGGCGCTA 
GTCCCAGCAC 
GGCTTACATG 



31 
I 

GAGAATACCT 
GCAGGCOGAA 
TGGCATTCCT 
CGGCAGAGGT 
GCGGGGTCCG 
GGGGCCGGAG 
AGCAGAGCTG 
GCTTCTGAAG 
CCACCGCCAA 
GATCACGCAG 
AGCCCAGCCT 
GAGTGGCCAG 
TTTGTTTCTG 



41 
I 

CGTGGGCCCT 
GGCCGGGGCA 
GATGGCCCAG 
CCCCGGGGCG 
CATGGCGGOG 
AGCCGCCTGC 
GCCCGCAGGA 
GAGTTCACTG 
CTGCAGCTCT 
TGCTTTCTGC 
GGCGCCCCTT 
TTCATTGTGG 
TAGAAAATAA 



51 
I 

GACCTTCTCT 
CAGGGGGTTC 
GGGQCAATGC 
CAGGGGCAGC 
CGGCTTCAGG 
TTGAGTTCTA 
GCCTGGCCCA 
TGTCCGGCAA 
CCATCAGCTC 
CCGTGTTTTT 
CCTAGGTCAT 
GGGCCTGATT 
AACTGAGCTA 



Seq ID NO: 386 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

| | I 1 1 I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM BABLARRSLA QDAPPLFVPG 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 

Seq ID NO: 387 dna sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52.. 459 



CCTCGTGGGC 
GAAGGCCAGG 
CCTGATGGCC 
GGTCCCCGGG 
CCGCATGGCG 
GACAGCCGCC 
ATCAGCTCCT 
GTGTTTTTGG 
TAGGTCATGC 
GCCTGATTGT 
CTGAGCTA 



11 
I 

CCTGACCTTC 
GCACAGGGGG 
CAGGGGGCAA 
GCGCAGGGGC 
GTGCCGCTTC 
TGCTTCAGTT 
GTCTCCAGCA 
CTCAGGCTCC 
CTCCTCCCCT 
TTGTCGCTGG 



21 

1 

TCTCTGAGAG 
TTCGACGGGC 
TGCTGGOGGC 
AGCAAGGGCC 
TGCGCAGGAT 
CCGACTGACT 
GCTTTCCCTG 
CTCAGGGCAG 
AGGGAATGGT 
AGGAGGAOGG 



31 
I 

CCGGGCAGAG 
GATGCTGATG 
CCAGGAGAGG 
TOGGGGCCGA 
GGAAGGTGCC 
GCTGCAGACC 
TTGATGTGGA 
AGGCGCTAAG 
CCCAGCACGA 
CTTACATGTT 



41 

i 

GCTCCGGAGC 
GCCCAGGAGG 
CGGGTGCCAC 
GAGGAGGCGC 
CCTGCGGGGC 
ACCGCCAACT 
TCACGCAGTG 
CCCAGCCTGG 
GTGGCCAGTT 
TGTTTCTGTA 



Seq ID NO: 388 Protein sequence 
Protein Accession #i Eos sequence 



60 
120 
180 
240 
300 
360 
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540 
600 
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720 



51 
I 

CATGCAGGCC 
CCCTGGCATT 
GGGCGGCAGA 
CCCGCGGGGT 
CAGGAGGCCG 
GCAGCTCTCC 
CTTTCTGCCC 
CGCCCCTTCC 
CATTGTGGGG 
GAAAATAAAG 



51 



1 11 21 31 41 

| | } I I 1 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQIt QLSISSCLQQ LSLLMWITQC 
FLPVFLAQAP SGQRR 

Seq ID NO: 389 DNA sequence 
Nucleic Acid Accession ft: NM_005S62.1 
Coding sequence: 90.. 3671 



11 



ACAGCGGAGC 
AGACAGAGAC 
GCTTCTCGCT 
ATGGGAAGTC 
TCCGCTGCCT 
GCTTTTACCG 
CTCTTAGTGC 
CCAGATGCGA 
ACCAGAGACT 
ACGCGGGCCG 
CAGGTTACTA 
GGCATTCAGC 
TTCATCAAGA 
AATGGTCACA 
TTGTGGCTCC 
TTGACTACOG 
GTGCTGGTCT 
TCACCAAGAC 
TGAGTTACTT 
CATATGGAGA 
TCTCTGGAGC 
AATTCTGCCA 



GCAGAGTGAG 
TGAGCGGCCC 
CCTCCTGCCC 
CAGGCAGTGT 
CAACTGCAAT 
GCACAGAGAA 
TCGATGTGAC 
CCGATGTCTG 
GCTAGACTCC 
CTGTGTCTGC 
TAATCTGGAT 
CAGCTGCCGC 
TGTTGATGGC 
GCGCCATCAA 
TGCCAAATTT 
TGTGGACAGA 
ACGGATCACA 
TTACACATTC 
TGAGTATCGA 
ATACAGTACT 
CCCAGCACCC 
GGATTGTGCT 



21 
I 

AACCACCAAC 
GGCACCGCCA 
GCAGCCCGGG 
ATCTTTGATC 
GACAACACTG 
AGGGACCGCT 
AACTCTGGAC 
CCAGGCTTCC 
AAGTGTGACT 
AAGCCAGCTG 
GGGGGGAACC 
AGCTCTGCAG 
TGGAAGGCTG 
GATGTGTTTA 
CTTGGGAATC 
GGAGGCAGAC 
GCTCCCTTGA 
AGGTTAAATG 
AGGTTACTGC 
GGGTACATTG 
TGGGTTGAAC 
TCTGGCTACA 



31 
I 

CGAGGCGCCG 
TGCCTGCGCT 
CCACCTCCAG 
GGGAACTTCA 
ATGGCATTCA 
GTTTGCCCTG 
GGTGCAGCTG 
ACATGCTCAC 
GTGACCCAGC 
TTACTGGAGA 
CTGAGGGCTG 
AATACAGTGT 
TCCAACGAAA 
GCTCAGCCCA 
AACAGGTGAG 
ACCCATCTGC 
TGCCACTTGG 
AGCATCCAAG 
GGAATCTCAC 
ACAATGTQAC 
AGTGTATATG 
AGAGAGATTC 



41 

I 

GGCAGCGACC 
CTGGCTGGGC 
GAGGGAAGTC 
CAGACAAACT 
CTGCGAGAAG 
CAATTGTAAC 
TAAACCAGGT 
GGATGCGGGG 
TGGCATCGCA 
ACGCTGTGAT 
TACCCAGTGT 
CCATAAGATC 
TGGGTCTCCT 
ACGACTAGAC 
CTATGGGCAA 
CCATGATGTG 
CAAGACACTG 
CAATAATTGG 
AGCCCTCOGC 
CCTGATTTCA 
TCCTGTTGGG 
AGCGAGACTG 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 



51 
I 

CCTGCAGCGG 
TGCTGCCTCT 
TGTGATTGCA 
GGTAATGGAT 
TGCAAGAATG 
TCCAAAGGTT 
GTGACAGGAG 
TGCACCCAAG 
GGGCCCTGTG 
AGGTGTCGAT 
TTCTGCTATG 
ACCTCTACCT 
GCAAAGCTCC 
CCTGTCTATT 
AGCCTGTCCT 
ATTCTGGAAG 
CCTTGTGGGC 
AGCCCCCAGC 
ATCCGAGCTA 
GCCCGCCCTG 
TACAAGGGGC 
GGGCCTTTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC A CAGGA GATT 1380 
GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTQA CTGCCCAATT GGTTTCTACA 1440 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1S00 
CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTC OCGGG GTCACCGGTG 1560 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 
TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 
ACCAGTGCAA AGCAGGCTAC TTOGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 
GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 
GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 
CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 
AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 
AGTACCAGAA CCGAGTTOGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACSTGGGGC 2340 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2S20 
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 
AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCOGTOCC AATC TTGCTA 2880 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 
AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAA TGGG G 3120 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 
TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCOGAGCCA 3480 
AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 
AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 
AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 
GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 
TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 
TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 
ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 
ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 
ACCCAGGGTO TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 
CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 
AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 390 Protein sequence 
Protein Accession #: NP_005553.l 

1 11 21 31 41 51 

MPALWIiGCCL CPSLLLPAAR ATSRRBVCDC NGKSRQCIFD RELHRQTGNG PRCUICNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGP 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST PHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY PVAPAKFIX3N QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 

MPDGKTLPCG LTKTYTPRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPP GTCIPCNCQG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETBBW 480 

CNNCPPGVTG ARCELCADGY FGDPPGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACKCNPMGS EPVOCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYKQVKIQM DQFMQQLQRM BALISKAQGG DGWPDTELE GRMQGAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQEMS YQSRLDDLKM TVERVRAIX3S QYGNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGPKSLAQE ATRLAESHVE SASNMEQLTR 780 
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ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 
ADRSYQHSLR LUJSVSRLQG VSDQSFQVBB AKRIKQKADS LSTLVTRHMD EPKRTQKNLG 
NWKEEAQQLL QNGKSGRBKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 
MMSELBERAR QQRGHLHLLB TSIDGILADV KNLEMIRDML PPGCYNTQAL EQQ 

Seq ID NO: 391 DNA sequence 
Nucleic Acid Accession #: AF101051.1 
Coding sequence: 221.856 
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41 
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GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTOCTCC 120 

GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG OGGGGCTGCA 240 

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATOGGC GCCATCGTCA GCACTGCCCT 300 

GCCCCAGTGG AGGATTTACT CCTATGCOGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 

CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480 

CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG S40 

CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 

TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 

ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 

GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG OCTTTGCCAC AAGACCTAGC 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 

TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTAOCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC OCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA OGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGOCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACOGTGTCTT ATGAGGAATT GGACCTAATA AATTT TAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 392 Protein sequence 
Protein Accession ft: AAD16433.1 

1 11 21 31 41 51 

I I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPOW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVFDSL LNLSSTLQAT RAUWVGIU, GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYB FGQALPTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 
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Seq ID NO: 393 DMA sequence 

Nucleic Acid Accession 8: NM_006180.1 

Coding sequence: 352.. 2820 

1 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCOGGAGGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCGAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGOGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTOCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCOGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

AT CTCTT GTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GAT AAAA CCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TOGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AAOCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGOGTCT GOGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCGGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTOGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCOG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCGAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTT CTTT TTT TAAATTTTCT Wrr C iTClT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTT GTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO i 394 Protein sequence 
Protein Accession ft: NP_00617l.l 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLKGFCW LWGFWRAAP ACPTSCKCSA 6RIWCSDPSP GIVAFPRLEP 60 

NSVDPENITB IPIANQKRLE IINEDDVBAY VGLRNLTIVD SGLKPVAHKA FLKNSNIjQRI 120 

NFTRNXLTSI* SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVBB GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSI* RITNTSSDDS GKQISCVAEN LVGEDQDSVN LTVHPAPTIT PLESPTSDHH 300 

VCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYFDVIYEDY GTAANDIGDT TNRSNEIPST ' 420 

DVTDKTGREH LSVYAWVIA SWGPCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IBNPQYFGIT NSQLKPDTFV QHIKRHNTVL 540 

KRKLGEGAFG KVPLAECYNL CPEQDKILVA VKTLKDASDN ARKDPHRKAB LLTNLQHEHI 600 
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VKFYGVCVEG DPLIMVPEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VDQRPRTCPQ 780 
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASFVYLDI LG 

Seq ID NO: 395 DMA sequence 
Nucleic Acid Accession &: AP410899 
Coding sequence: 483.. 2999 

1 11 21 31 41 51 

I I I I I I 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAQAGA GTCCCGGACG 180 

CCGCGGGTCG GTGCCCGGCG OGCCGGGCCA TGCAGCGACG GCCGCGGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCOGC 300 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CAOGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACOG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACGCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTOTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGGCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGOGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGOGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2260 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGOGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2S20 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GOGAOGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GAOCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTGOGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACGAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CGAGAAGCTC TGGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTC 1 T TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC AOGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTOG GCACCTTCCC 3780 

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG OGTGCAGTAG AGAGCAAAGA TGGCTTCOGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 396 Protein sequence 
Protein Accession #; AAL67 965-1 

1 11 21 31 41 51 
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MSSHIRWHGP AMARLWGPCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAPPRLEP 60 

NSVDPENITE IPIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKPVARKA PLKNSNLQHT 120 

NFTRHKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMHIK TLQBAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPMLTVBB GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NB TS H TQGSb RITNISSDDS GKQISCVABN LVGEDQDSVN LTVHPAPTXT FLESPTSDHH 300 

WCIPPTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISARFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVWIA SWGFCLLVM LFLLKLARES KFGMKDFSWP GFGK7KSRQG 480 

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG HTKIFVIENP QYFGITNSQL 540 

KPDTFVQHIK- RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FHREAELLTN LQHBHIVKFY GVCVBGDPLI MVPEYMKHGD LKKFLRAHGP DAVLMAEGNP 660 

PTELTQSQML HIAQQIAAGM VYLASQHPVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720 

DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGWLW EIPTYGKQPW YQLSNNEVIE 780 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG 



Seq ID NO: 397 DNA sequence 
Nucleic Acid Accession §: AB052906 
Coding sequence: 74.. 814 



1 11 21 31 41 51 

I I I I I I 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60 

CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120 

GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCOGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCAOGT 1200 

TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 398 Protein sequence 
Protein Accession #: BAB61048.1 



1 11 21 31 41 51 

I i I 1 I I 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKPRPGPRWC AVQGQVDBKT 60 
PLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 120 
LQARMSCEQK AEGHSSGSWQ FSPDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 180 
SFHYPSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240 
PILPGI 



Seq ID NO: 399 DNA sequence 

Nucleic Acid Accession #x NM_001898.1 

Coding sequence: 57.. 482 



1 11 21 31 41 51 

I I I I I I 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300 

ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TOGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540 

CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600 

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 

Seq ID NO: 400 Protein sequence 
Protein Accession #: NP_001889.1 



1 11 21 31 41 51 

I I I I I I 

MAQYLSTLLL LLATLAVALA WSPKBEDRII PGGIYKADLN DEWVQRALHF AISEYNKATK 60 
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSP 120 
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BIYEVPWENR RSLVKSROQE S 

Seq ID NO: 401 DNA sequence 

Nucleic Acid Accession 8: NMJ>03976.2 

Coding sequence: 299.961 

1 11 21 31 41 SI 

I I I I I I 

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAQGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTOT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGOCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 

GGGCTCCGOG CCCCGCAGCC CTCCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCAGGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGCCGCAG CCTTCTCGGC CCGOGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG GGGGGGCGGG 660 

GGGCTGCCGC CTGOGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC GGGGGCCCTG GGACCGCCCC CGGGCTCCOG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GOGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACOG TGQACCGCCT CTCCGCCACC GCCTGOGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCOC 

Seq ID NO: 402 Protein sequence 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

I I I I 1 I 

MELGLGGLST LSHCPWPRRQ PALWPTIAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRIiRSQLV PVRALGLGHR SDBLVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 403 DNA sequence ■ 
Nucleic Acid Accession #: NMJJ57091.1 
Coding sequence: 783.. 1445 

1 11 21 31 41 SI 

till)! 

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60 

GGACCCCCAA ATCTGCACGT AOCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120 

TCGCTCCCCG CCCTCACTCA CTTTC T CCOO CCCTCGGCCC GGCCTCCCAQ CTCTCTACTT 180 

CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 

CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360 

CGGGGCAGGG GCGCTCCCAG CCOCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420 

CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTT6G GCCCGGCCCC 480 

CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCOCCAGCC CTCGCTGCCA 540 

CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GOCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGQAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGGGCCCCGC AGCCCTGCCC CCCGCQAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG OQGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGOGCTCGGC CTGGGCCACC 1200 

GCTCOQACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GOGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGAOGAAG .GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCQA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA QTTGCTTGGT TGAAAGTGCC TGTGCTGGAA I860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 404 Protein sequence 
Protein Accession 8: NP 003967.1 



1113 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

I I I I I I 

KELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 
5 PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



10 Seq ID NOi 405 DNA sequence 

Nucleic Acid Accession 8: NM_OS7160.1 
Coding sequence : 1 . . 714 

1 11 21 31 41 51 

15 | | | ill 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGOTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

20 CACCTGCOGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCOG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

25 AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CAOGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGOCCC 840 

30 TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 

35 TCACTCATGG GAGCTGGCCC C 



Seq ID NO: 406 Protein sequence 
Protein Accession «: NP_476S01.l 

40 1 11 21 31 41 51 

I I I I I I 

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVABASLGS 60 
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRPCSGS CRRARSPHDL 180 
45 SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 



50 



Seq ID NO: 407 DNA sequence 

Nucleic Acid Accession #: NM_05709Q.l 

Coding sequence: 29.. 715 



1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

55 GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

60 CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCOGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

65 CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

70 TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 408 Protein sequence 
75 Protein Accession 8: NP_476431.1 

1 11 21 31 41 51 

I I I I i I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVABASLG SAPRSPAPRE 60 
80 GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120 
SRARAAGARG CRLRSQLVFV RALGLGHRSD ELVRFRPCSG SCRRARSPHD LSLASLLGAG 180 
ALRPPPGSRP VSQPCCRPTR YEAVSPMDVN STWRTVDRLS ATACGCLG 



1114 



WO 03/042661 



PCT/US02/36810 



Seq ID NO: 409 DNA sequence 

Nucleic Acid Accession fl: Bos sequence 

Coding sequence i 1. .1746 

5 1 11 21 31 41 SI 

| | I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTO GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 
GCCTACCATG GCTGCCCTAG CGA6TGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120 
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA AOGCCATGAG CCTGCAGATC 180 
10 CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG OGCATCACGC CTGGGGCCTT CCGAAACCTG 300 
GGCTGGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 
15 CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACXGCTCTC CCCTGGTCTC 720 
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 
20 CCACCCAGCA TCTTCATGCA GCTGCCOCAG CTCAACOGTC TTACTCTCTT TGGGAATTCC 840 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTOCC CGGGTGCCTT CAAOGGGCTA 1020 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 
25 TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TC CAGC TGCA GAACAACCAG 1200 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCOGCAACTG GCTCCTGCTC 1320 
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 
30 GGCCAGTOCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGOGTCCA TGTCCCTGAG 1440 
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG OGGGCTGGCC 1620 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTOGGC 1680 
35 TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC OGTGATTGCT CTTTCTGGCC 1860 
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCOGTAG AGAAGCAGGT 1920 
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 
40 GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTOG 2160 
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 
45 TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCOGTCAAT 2340 
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 
CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 
50 AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 
CACCCCTAGA GT TTG TTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 
55 TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 
GTOCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 
TCOGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 
60 ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 
TGAACTTCAG AATCTCACTT ACAGCAGGOG ACACGGGGGT ACACOGATGG GTCACACTGG 3360 
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 
65 CTGCTATACA CATATTCACA TGGOGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CAOGACCAAT CCCGATCGGC 3660 
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCAOCA CCAATCCCGA 3720 
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 
70 CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 
75 CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4 3 80 
80 GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTT GTCT G GCTGAGGAGC 4560 
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATT CTCA 4620 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 
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PCT7US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCTCTCTGTT 
GTGTTGGAGA 
TGCACAGATA 
GGTAGGAGTG 
GGTGTTCAAT 
TTGTCTTGGG 
TTAGTCTTGG 
GGAAAAAATA 
TGGGCTGTAT 
AACTTTTCAT 
GAACTTCCAA 
AGTTGGTOGA 
GCCCCCAGAT 
GGAAGGAAGC 
CTCCTTCCGC 
CTTCATGCTQ 
GCCCCAGTGC 
AGCCCTGGTG 
GGTGTACAGA 



TACAGCTCCT 
AGAAACAACA 
CTCTTCAAGC 
CCGCCTCTAC 
AGGCTGGGAG 
CTTTCGTCAT 
TCATCAGAAC 
AACTCTTCCA 
GTATATTGTT 
GGACACAATT 
ACTCAGGAAG 
CAGATGTTAG 
CCCACAGTCA 
CATGGCTGTG 
CCCAGGTTTC 
CCTTCAAAGC 
TTGGCGATGC 
GGCAGGGTTG 
ATCAACAATA 



TGACAGTGCC 
AAAGCCAATT 
ACTGGACGTG 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAG 
CTTCCTCCTT 
TCCACAACCT 
TTTGCAGAGA 
ATGTATCCTA 
GAACTGAATC 
GTTCAGAGAG 
TTCTTCTCTT 
TAGATCATGT 
ATTTACAGAT 
GGGGGTCTGT 
AATAATATAC 



ACGCCCATCT 
AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT 
GCTTTTAGCC 
TGOGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



GGAGTGGGAG 
TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



CTGGGAGTTA 
GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 
GTTGCTCTCC 
TGTAACCACA 
TCATGGGAAT 
TTCTACTTAG 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
CCGGGGAAAA 
AACCOGCTGC 
CTGCAAATCA 
GTAGAGTGTG 
TAATCCATTT 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



Seq ID NO: 410 Protein sequence 
Protein Accession ft: BAB84587.1 



MPLKHYLLLL 
LNTHITELNE 
FQGLD SLESL 
GKNSLTHISP 
FHNNHNLQRL 
YDNHISSLPD 
FRMLANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIGIVA 



11 
I 

VGCQAWGAGL 
SPFI^ISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL 
NVFSNLRQLQ 
SLQNNRLRQL 
IIiPLRNWLLL 
PDTPSYPDTT 
IJVCSLAACVG 



21 

I 

AYHGCPSECT 
ALRIEKNELS 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFWQLPQ 
VLILSRNQIS 
PGNIFANVNG 
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT 
RITPGAFRNL 
LKBLQLHGNH 
DIPMGTFDGL 
LNRLTLFGNS 
FISPGAFNGL 
LMAIQLQNNQ 
PVCPSPANVR 
PVEDYTDLTT 
VLMQMKAPNB 



41 

I 

GARIVAVPTP 
GSLRYLSLAN 
LBYIPDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TELRELSLHT 
LENLPLGIFD 
GQSLIIINVN 
IQVTDDRSWf 
C 



51 
I 

LPWNAMSLQI 60 

NKLQVLPIGL 120 

HLVGLTKLNL 180 

NQIGLLBPGL 240 

PMPNIiRELWL 300 

NALQDLDGNV 360 

HLGKLCELRL 420 

VAVPSVHVPE 480 

GMTQAQSGIA 540 



Seq ID NO: 411 DNA sequence 
Nucleic Acid Accession ft: XM_0981S1 
Coding sequence: 1..447 



ATGATGCATT 
AGTGGGGTAC 
TCTGGAGTGG 
CTATTTTTAA 
TCTCTAGCCC 
CA GAGG CACT 
AACTTTGTAC 
AAGCCATTCC 



11 
I 

TGCTCAATTC 
AGAGTTCAGT 
GAGCTGGGAG 
AAAGTGCTTA 
TCAQCACCCC 
TGCTCTTCTG 
AAGAGCTCAT 
CCTGTTGCTC 



21 
1 

TCAGGGCTGG 
TTTCCTCTCT 
TCAGTGTTGG 
CTGTGCACAG 
TGCGGTAGGA 
CATGGTGTTC 
GGCTTGTCTT 
TCCTTAG 



31 
1 

AATGAGCCGG 
GTTTACAGCT 
AGAAGAAACA 
ATACTCTTCA 
GTGCCGCCTC 
AATAGGCTGG 
GGGCTTTCGT 



41 
I 

CTGGTCCCCC 
CGTTGACAGT 
ACAAAAGCCA 
AGCACTGGAC 
TACCCACTTG 
GAGTTTTATT 
CATTAAACCA 



51 
I 

AGAAAGCTGG 
CCCACGCCCA 
ATTAGAACCA 
GTGGATTCTC 
TGATGGGGTA 
TATCTCTTCA 
AAGGAAATGG 



Seq ID NO: 412 Protein sequence 
Protein Accession ft: XP_098151 

21 



11 21 31 41 51 

1 I I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 
LFLKSAYCAQ ILFKHWTWIL SLAIiSTPAVG VPPLPTCDGV QfiHLLFCMVF NRLGVLFISS 
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 413 DNA sequence 

Nucleic Acid Accession ft: NM_002 658.1 

Coding sequence: 77.. 1372 



1 
I 

GTCCCCGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CCGCTTTAAG 
CATCTACAGG 
CCCTTGCTGG 
CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 



11 21 31 41 51 

I I i I I 

CGCCGTOGCG CGCTCCTGCC GCAGGCCACC GAGGCOGCCG CCGTCTAGCG 
GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 
AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 
TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTQCA ACTGCCCAAA 
GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 
CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 
GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 
CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 
CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 
TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA C TCTGAG GCC 
ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 
AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 
GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 
CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 
CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 
CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCOGGACTAT 
TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 
GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 



60 
120 
180 
240 
300 
360 
420 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



1116 



WO 03/042661 



TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 
CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCftGGGAGA 
CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 
CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 
CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 
AGGOAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 
TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 
CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 
CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 
TGT CTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 
GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 
AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 
AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 
ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGT TGTG TOG 
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 
AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 
CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 
ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 
ATCCCTTOCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 
ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 414 Protein sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 41 51 

I I 1 I I I 

MRALLARUJL CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDT MG RP CLPWNSATVL QQTYHAERSD ALQLGI/3KHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRPKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCPIDYP KKEDYIVYI/3 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALliK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 

CAADPQWKTD SCQGDSGGPI* VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 

Seq ZD NO: 415 DNA sequence 

Nucleic Acid Accession #j NM_024422.1 

Coding sequence: 202.. 2907 

1 11 21 31 41 51 

I I I 1 I 1 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTGGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCOGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGOGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACAOG GCCCAAAACT ATACCATATA CTATTCCATA 720 
AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 
AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 
TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 
GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTOGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGAOGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAQTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGC ATTA CC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGOO GTQGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTPTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400* 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCQ 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 



1117 



WO 03/042661 



PCI7US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TCAGGAATCA 
TCGQAATCCT 
ACGGAGGTGG 
CTTGGTGAAA 
GTCCTGACAT 
GAACGACAAG 
CTAGCAGAAG 
TTTATQACTT 
TGGGGGTTTT 
AGACACTATA 
TCTATCCAAG 
ATGACAACAG 
TATTTGAAGC 
AATTAAGTGT 
ACTGCATTCT 
TTTCTAGCCA 



AAAACGGAGG 
GCCGGGGGGC 
ACAACTGCAG 
AAGTGTATCT 
ATAACTATGA 
AAGAAGATGG 
CATGCATGAA 
TTAAAAAAAA 
TCTCTCATTA 
AACAAGTACA 
GAGGTCTACA 
CCAATTTATA 
ACAACCTAAT 
TCATGTGGTG 
TGCTATTATT 
GGCATTGACT 



TCAGGAGACC 
TGGCCACCAT 
ATACACTTAC 
GTGTAATCAA 
AGGAAGAGGA 
GCTTGAATTT 
GAGATGAGTG 
TTACAAACCA 
TTTGGATGGA 
CAAATTTTTC 
GAGAAATTAA 
GTGCAATAAA 
GGAAAATTGT 
CTTGGAAACT 
TTATTCTTGT 
ATTACAATTT 



ATOGAAATGG 
CACACCCTGG 
TGGGAGTGGC 
GATGAAAATC 
TCGGTGGCTG 
TTGGATAATT 
TGTTCTAATA 
AGAATTTTTT 
ATCTCTTTGG 
AATTTTTACA 
AGTCTGCCTT 
ATGTAATTAA 
AGAGACCTTG 
GTTGTTTTCC 
AATGTGACCT 
CATT 



TGAAAGGAGG 
ACTCCTGCAG 
ACAGTTTTAC 
ACAAGCATGC 
GGTCTGTAGG 
TGGAGCCCAA 
AGTCTCTGAA 
AAAGCAGAAG 
TCAAATGCAC 
TATTTTTAAA 
ATTTGTTACA 
TTCAAGTCCT 
CTTTAACATT 
TGAACATCTA 
TTTCACTGTG 



ACACCAGACC 
GGGAGGACAC 
TCAGCCCCGT 
CCAAGACTAT 
TTGTTGCAGT 
ATTTAGGACA 
AGCCAGTGGC 
ATGCTATTTG 
ATTTACAGAG 
TTACTTATCT 
TTTGGGTATA 
TATTATAGAC 
ATCTCCAGTT 
AAGTGTGTAG 
CAAAGGGAGA 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3X20 
3160 
3240 
3300 
3360 
3420 



Seq ID MOt 416 Protein sequence 
Protein Accession #: NP 077740.1 



1 
I 

MEAARPSGSW 
ANLIHSSDPD 
KKRHTKEKVL 
EPRNLFYVER 
PIFTEETYTF 
TGVITTTSSQ 
VTSVEENTVD 
KPLNYEEKQQ 
KENAEVGTTS 
KNGIYNITVL 
BPIHGPPFDF 
VTSLDVTLCD 
TSKQPKVIPD 
QETIEMVKGG 
CNQDENHKHA 
R 



11 
I 

NGALCRLLLL 
PQILEDGSVY 
RRAKRRWAPI 
DTGNLYCTRP 
TIPENCRVGT 
LDRELIDKYQ 
VEILRVTVED 
MILQIGWNE 
NGYKAYDPET 
ASDQGGRTCT 
SLESSTSBVQ 
CITENDCTHR 
DLAQQNLIVS 
HQTSESCRGA 
QDYVLTYNYE 



21 
I 

TLAILIFASD 
TTNTILLSSE 
PCSMLBNSLG 
VDREQYESPE 
TVGQVCATDK 
LKIKVQDMDG 
KDLVNTANWR 
APPSREASPR 
RSSSGIRYKK 
GTLGIIIiQDV 
RMWRLKAIND 
VDPRIGGGGV 
NTEAPGDDKV 
GHHHTLDSCR 
GRGSVAGSVG 



31 
I 

ACKNVTLHVP 
KRSPTILLSN 
PFPLFLQQVQ 
IIAFATTPDG 
DEPDTMHTRL 
QYFGLQTTST 
ANYTILKGNE 
SAMSTATVTV 
LTDPTGWVTI 
NDNSPPIPKK 
TAARLSYQND 
QLGKWAILAI 
YSANGFTTQT 
GGHTEVDNCR 
CCSERQEEDG 



41 

i 

SKLDAEKLVG 
TENQEKKKIF 
SDTAQNYTIY 
YTPELPLPLI 
KYSIIGQVPP 
CIINIDDVND 
NGNFKIVTDA 



DENTGSIKVF 
TVIICKPTMS 
PPFGSYWPI 
LLGIALLFCI 
VGASAQGVCG 
YTYSEWHSFT 
LEFLDNLEPK 



51 

I 

RVNLKECFTA 
VFLEHQTKVL 
YSIRGPGVDQ 
IKIEDENDNY 
SPTLFSMHPT 
HLPTFTRTSY 
KTNEGVLCW 
CNPPIQTVRM 
RSLDREAETI 
SAEIVAVDPD 
TVRDRLGMSS 
LFTLVCGASG 
TVGSGIKNGG 
QPRLGEKVYL 
FRTLAEACMK 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 417 DNA sequence 

Nucleic Acid Accession #: NM_004949.1 

Coding sequence: 2 02.. 2745 



CGCCAAAGGA 
CTCTCCGCGC 
GCTCCGGCCG 
GACCTGCCCC 
CTCTGCCGGC 
AATGTGACAT. 
CTGAAAGAGT 
TTGGAGGATG 
TTTACCATAT 
GAGCATCAAA 
AAGAGAAGAT 
CTTTTCCTTC 
AGAGGTCCTG 
AACTTGTATT 
TTTGCAACAA 
GAGGATGAAA 
GAAAATTGCA 
GACACGATGC 
CTATTTTCTA 
GAGTTAATTG 
GGTCTACAGA 
ACATTTACTC 
TTACGAGTTA 
ACCATTTTAA 
GAAGGAGTTC 
CAAATTGGTG 
AGCACAGCAA 
CCAATACAGA 
AAAGCATATG 
CCAACAGGGT 
GATAGAGAGG 
CAAGGAGGGA 
AGCCCATTCA 
ATTGTTGCGO 
AGTTCTACTT 
CGTCTTTCCT 
GATAGACTTG 
GAAAATGACT 
AAGTGGGCCA 



11 
I 

AAAGCCCCTT 
GCCCCACCTC 
CGGCCCTCGC 
GAGCCCTCTC 
TGCTCCTGCT 
TACATGTTCC 
GCTTTACAGC 
GTTCAGTCTA 
TACTTTCCAA 
CAAAGGTCCT 
GGGCTCCAAT 
AACAGGTTCA 
GAGTTGACCA 
GTACTCGTCC 
CTCCAGATGG 
ATGATAACTA 
GAGTGGGCAC 
ACACACGCCT 
TGCATCCAAC 
ACAAGTACCA 
CAACTTCAAC 
GTACTTCTTA 
CTGTTGAGGA 
AGGGCAATGA 
TTTGTGTAGT 
TAGTTAATGA 
CAGTTACTGT 
CTGTTCGCAT 
ACCCAGAAAC 
GGGTCACCAT 
CAGAGACCAT 
GAACATGTAC 
TACCTAAAAA 
TTGATCCTGA 
CAGAAGTACA 
ATCAGAATGA 
GCATGTCTAG 
GCACACATCG 
TCCTTGCAAT 



21 
I 

GGATGAGAGG 
CTCCGCCTCG 
CCOGCGGAGC 
CATGGAGGCA 
GACCCTCGCG 
CTCCAAACTA 
TGCAAATCTA 
TACAACAAAT 
CACTGAGAAC 
AAAGAAAAGA 
TCCTTGTTCG 
ATCTGACACG 
AGAACCTCGG 
TGTAGATCGT 
GTATACTCCA 
CCCAATTTTT 
TACTGTGGGA 
GAAGTACTCC 
TACAGGCGTG 
GTTGAAAATA 
TTGTATCATT 
TGTGACATCA 
TAAGGACTTA 
AAATG GCAAT 
TAAGCCTTTG 
AGCTCCATTT 
TAATGTAGAA 
GAAAGAAAAT 
AAGAAGTAGC 
TGATGAAAAT 
CAAAAATGGC 
GGGGACACTG 
GACAGTGATC 
TGAGCCTATC 
GAGAATGTGG 
TCCTCCATTT 
TGTCACTTCA 
TGTAGATCCA 
ATTGTTGGGC 



31 
I 

CAGGCGCTTC 
CGCTCCTCCT 
CCTCCTACCC 
GCCCGCCCCT 
ATCTTAATAT 
GATGCCGAGA 
ATTCATTCAA 
ACTATTCTAT 
CAAGAAAAGA 
CATACTAAAG 
ATGCTAGAAA 
GCCCAAAACT 
AATTTATTTT 
GAGCAGTATG 
GAACTTCCAC 
ACAGAAGAAA 
CAAGTGTGTG 
ATCATTGGGC 
ATCACCACAA 
AAAGTACAAG 
AACATTGATG 
GTGGAAGAAA 
GTGAATACTG 
TTTAAAATTG 
AATTATGAAG 
TCCAGAGAGG 
GATCAGGATG 
GCAGAAGTGG 
AGTGGCATAA 
ACAGGATCAA 
ATATATAATA 
GGCATTATAC 
ATCTGCAAAC 
CATGGCCCAC 
AGACTGAAAG 
GGCTCATATG 
TTGGATGTTA 
AGGATTGGOG 
ATAGCATTGC 



41 
I 

AGAGAAGCTA 
GAGCAGCGGG 
CGGCCOGACG 
CCGGCTCCTG 
TTGCCAGTGA 
AACTTGTTGG 
GTGATCCTGA 
TGTCCTCGGA 
AGAAAATATT 
AAAAAGTTCT 
ACTCCTTGGG 
ATACCATATA 
ATGTGGAGAG 
AATCTTTTGA 
TGCCCCTAAT 
CTTATACTTT 
CTACTGACAA 
AGGTGCCACC 
CATCATCTCA 
ACATGGATGG 
ATGTAAATGA 
ATACAGTTGA 
CTAACTGGAG 
TAACAGATGC 
AAAAGCAACA 
CTAGTCCAAG 
AGGGCCCTGA 
GAACAACAAG 
GGTATAAGAA 
TCAAAGTTTT 
TTACAGTCCT 
TTCAAGACGT 
CCACCATGTC 
CCTTTGACTT 
CAATTAATGA 
TAGTACCTAT 
CACTGTGTGA 
GTGGAGGAGT 
TCTTTTGCAT 



51 



AGAAAAGCAC 
CCCAGACTGC 
CTCGGCCCGC 
GAACGGAGCC 
TGCCTGCAAA 
TAGAGTTAAC 
CTTCCAAATT 
GAAGAGAAGT 
TGTCTTTTTG 
AAGGCGCGCC 
TCCTTTTCCA 
CTATTCCATA 
AGACACTGGA 
GATAATTGCC 
AATCAAAATA 
TACAATTTTT 
AGATGAGCCT 
ATCACCCACC 
GCTAGACAGA 
TCAGTATTTT 
CCACTTGCCA 
TGTGGAAATC 
AGCTAATTAT 
CAAAACCAAT 
GATGATCTTG 
ATCAGCCATG 
GTGTAACCCT 
CAATGGATAT 
ATTAACTGAT 
CAGAAGCCTG 
TGCATCAGAC 
GAATGATAAC 
ATCTGCGGAG 
TAGTCTGGAG 
TACAGCAGCA 
AACAGTGAGA 
CTGCATTACC 
ACAACTTGGA 
CCTGTTTACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



1118 



WO 03/042661 



PCT/US02/36810 
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CTGGTCTGTG 
CAGCAGAACC 
AATGGCTTCA 
TCAGGAATCA 
TCGGAATCCT 
ACGGAGGTGG 
CTTGGTGAAG 
GTATCTGTGT 
CTATGAAGGA 
AGATGGGCTT 
CATGAAGAQA 
AAAAAATTAC 
TCATTATTTG 
AGTACACAAA 
TCTACAGAGA 
TTTATAGTGC 
CCTAATGGAA 
GTGGTGCTTG 
ATTATTTTAT 
TTGACTATTA 



GGGCTTCTGG 
TAATTGTATC 
CAACCCAAAC 
AAAACGGAGG 
GCOGGGGGGC 
ACAACTGCAG 
AATCCATTAG 
AATCAAGATG 
AGAGGATCGG 
GAATTTTTGG 
TGAGTGTGTT 
AAACCAAGAA 
GATGGAATCT 
TTTTTCAATT 
AATTAAAGTC 
AATAAAATGT 
AATTGTAGAG 
GAAACTGTTG 
TCTTGTAATG 
CAATTTCATT 



GACGTCTAAA 
AAACACAGAA 
TGTGGGGGCT 
TCAGGAGACC 
TGGCCACCAT 
ATACACTTAC 
AGGACACACT 
AAAATCACAA 

ATAATTTGGA 
CTAATAAGTC 
TTTTTTAAAG 
CTTTGGTCAA 
TTTACATATT 
TGCCTTATTT 
AATTAATTCA 
ACCTTGCTTT 
TTTTCCTGAA 
TGACCTTTTC 



CAACCAAAAG 
GCTCCTGGAG 
TCTGCTCAGG 
ATCGAAATGG 
CACACCCTGG 
TCGGAGTGGC 
CTGATTAAAA 
GCATGCCCAA 
TGTAGGTTGT 
GCCCAAATTT 
TCTGAAAGCC 
CAGAAGATGC 
ATGCACATTT 
TTTAAATTAC 
GTTACATTTG 
AGTCCTTATT 
AACATTATCT 
CATCTAAAGT 
ACTGTGCAAA 



TAATTCCTGA 
ATGACAAAGT 
GAGTTTGTGG 
TGAAAGGAGG 
ACTCCTGCAG 
ACAGTTTTAC 
ATTAAACAAT 
GACTATGTCC 
TGCAGTGAAC 
AGGACACTAG 
AGTGGCTTTA 
TATTTGTGGG 
ACAGAGAGAC 
TTATCTTCTA 
GGTATAATGA 
ATAGACTATT 
CCAGTTAATT 
GTGTAGACTG 
GGGAGATTTC 



TGATTTAGCC 
GTATTCTGCG 
CACCGTGGGA 
ACACCAGACC 
GGGAGGACAC 
TCAGCCCCGT 
GAAAGAAAGT 
TGACATATAA 
GACAAGAAGA 
CAGAAGCATG 
TGACTTTTAA 
GGTTTTTCTC 
ACTATAAACA 
TCCAAGGAGG 
CAACAGCCAA 
TGAAGCACAA 
AAGTGTTCAT 
CATTCTTGCT 
TAGCCAGGCA 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 418 Protein sequence 
Protein Accession ft: NP 004940.1 



MEAARPSGSW 
ANLIHSSDPD 
KKRHTKKKVL 
EPRNLFYVER 
PIFTEETYTF 
TGVITTTSSQ 
VTSVEENTVD 
KPLNYEEKQQ 
KENAEVGTTS 
KNGIYNITVL 
EPIHGPPFDP 
VTSLDVTLCD 
TSKQPKVIPD 
QETIEMVKGG 
GHTLIKN 



11 

I 

NGALCRLLLL 
FQILEDGSVY 
RRAKRRWAP I 
DTGNLYCTRP 
TIPENCRVGT 
LDRELIDKYQ 
VBILRVTVED 
M1LQIGWNB 
NGYKAYDPET 
ASDQGGRTCT 
SLESSTSEVQ 
CITENDCTHR 
DLAQQNLIVS 



21 



TLAILIFASD 
TTNTILLSSE 



VDREQYESFE 
TVGQVCATDK 
LKIKVQDMDG 
KDLVNTANWR 
APFSREASPR 
RSSSGIRYKK 
GTLGI IliQDV 
RMWRLKAIND 
VDPRIGGGGV 
NTEAPGDDKV 
GHHHTLDSCR 



31 
I 

ACKNVTLHVP 
KRSPTILLSN 
PFPLFLQQVQ 
I IAFATTPDG 
DKPDTKHTRL 
QYPGLQTTST 
ANYTILKGNE 
SAMSTATVTV 
LTDPTGWVT1 
NDNSPFIPKK 
TAARLSYQND 
QLGKWAILAI 
YSANGFTTQT 
GGHTEVDNCR 



41 
I 

SKLDAEKLVG 
TBNQEKKKIF 
SDTAQNYTIY 
YTPELPLPLI 
KYSIIGQVPP 
CIINIDDVND 
NGNFKIVTDA 



DENTGS1KVF 
TVIICKPTMS 
PPFGSYWPI 
LLGIALLFCI 
VGASAQGVCG 
YTYSEWHSFT 



51 
I 

RVNLKECFTA 
VFLEHQTKVL 
YSIRGPGVDQ 
IKIED ENDNY 
SPTLFSMHPT 
HLPTFTRTSY 
KTNBGVLCW 
CNPPIQTVRM 
RSLDREAETI 
SAEIVAVDPD 
TVRDRLGMSS 
LFTLVCGASG 
TVGSGIKNGG 
QPRLGEESIR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO? 419 DMA sequence 
Nucleic Acid Accession #: NM_002722.1 
Coding sequence: 14.. 301 



ACTCTGGACT 
GCGTGGCTCT 
ACCCAGGGGA 
ACATCAACAT 
TCTCGGAGTG 
AATGCCACCT 
ACCCTTGGCT 
AAGCC 



11 
I 

CCGGATGGCT 
GTTACTACAG 
CAATGCCACA 
GCTGACCAGG 
GGGGTCCCCG 
TCTGTCTCCT 
CTGGCCAAAG 



21 
I 

GCOGCACGCC 
CCACTGCTGG 
CCAGAGCAGA 
CCTAGGTATG 
CATGCTGCTG 
ACGACTCCAT 
CTTGCTCCCT 



31 
I 

TCTGCCTCTC 
GTGCCCAGGG 
TGGCCCAGTA 
GGAAAAGACA 
TCCCCAGGGA 
GAGCAGCGCC 
GCTCCCACAC 



41 

I 

CCTGCTGCTC 
AGCCCCACTG 
TGCAGCTGAT 
CAAAGAGGAC 
GCTCAGCCCG 
AGCCCAGCTC 
AGGCTCAATA 



51 

I 

CTGTCCACCT 
GAGCCAGTGT 
CTCCGTAGAT 
ACGCTGGCCT 
CTGGACTTAT 
TCCCCTCTGC 
AAGCAAGTCA 



Seq ID NO: 420 Protein sequence 
Protein Accession th NP_002713.1 

1 11 21 31 41 51 

I I I 1 I I 

MAAARLCLSIi LLLSTCVALL LQPLLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINML 
TRPRYGKRHK EDTLAFSBWG SPHAAVPREL SPLDL 

Seq ID NO: 421 DNA sequence 

Nucleic Acid Accession #: NM_032545.l 

Coding sequence: 46.. 718 



1 
I 

AAACTGATCT 
CCATGTCAGG 
CTATCAAAGA 
GCACCGACAG 
CGAGGGCTGG 
CGCGCGGCCG 
CCCGGCCCAC 
CCTGGAGCAC 
CCTGCACTGC 
CCACGCTCAC 
ACTCCTGCAC 
CGTCCTCCAG 
TTCTATGTTG 




GGGCCGAGCG 
CGCCTCCTGC 
CGGGAGOGGC 
TAAATAATAG 



21 
I 

AAGAGAAGGA 
CGGTCAGTTT 
ACGGCGGTAG 
ACTGGACCTC 
AGCCGCTCCC 
GGAAGGGCGG 
GCTACTGCGA 
CCCTCCGCGC 
AGAOGCCTGA 
COGGGGGCGC 
GCCCGGATGC 
GCCCCTGOGG 
ATGTGTTTAG 



31 
I 

GACTCTCAAA 
GGCATTACAG 
AGAGGAAGTC 
CAGTCATTTC 
CTACTCCCGG 
TACCTGOGTG 
GCATGACCAG 
CTGCCACCTC 
CCGCTGTGAC 
GCCCAGCCTG 
GCCCGCGCAC 
AAGGCOGGGA 
TTTACOQTAA 



41 
I 

CCAAAAATGA 
ATCATCAATT 
ACCAAGGTTG 
GGAGAGGTGA 
GCTTTCGGAG 
CTGGGCAGCT 
AGGCGCAGTG 
TGCAGGTGCA 
CCGAAAGACT 
CTACTCTTGC 
CCTCGGTCCC 
CTTGGGCATC 
GCTGAAGCAC 



60 
120 
180 
240 
300 
360 
420 



60 



51 

! 

CCTGGAGGCA 
TGGGAAACAG 
CCACTCAGAA 
CTGGGAGCGC 
AGGGTGOGTC 
TCTGCGTGTG 
AATGCGGCGC 
TCTTCGGGGC 
TCCTGGCCTC 
TGCCCTGCGC 
TGGTCCCTTC 
GCCTTTAATT 
TGGGTGAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



1119 



WO 03/042661 



PCT/US02/36810 



TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840 



Seq ID NO: 422 Protein sequence 
5 Protein Accession ft: NPJ115934.1 

1 11 21 31 41 SI 

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHPGE 60 

10 VTGSAEGWGP EEPLPYSRAF 6EQASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 

SBCGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPS3UI»L 180 
LLPCALLHRIi LRPDAPAHPR SXVPSVLQRB RRPCQRPGLG HRL 

Seq ID NO: 423 DNA sequence 
15 Nucleic Acid Accession #: NM_006533.1 
Coding sequence: 72.. 467 

1 u 21 31 41 SI 

| | 1 I I I 

20 AGGGAGAGAG GGAGGGGAGG AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTCTCTTGCT 60 

CACAGTCCAC GATGGCCCGG TCCCTGGTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120 

TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 180 

CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 240 

CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300 

25 AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG 360 

CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG 420 

GCAAAGTCGA TGTGAAGACA GACAAATGGG ATTTCTACTG CCAGTGAGCT CAGCCTACCG 480 
CTGGCCCTGC CGTTTCCCCT CCTTGGGTTT ATGCAAATAC AATCAGCCCA GTGCAAAC 

30 Seq ID NO: 424 Protein sequence 
Protein Accession <h NP_006S24.1 

1 11 21 31 41 51 ' 

35 MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQB CSHPISMAVA tfJDYMAPDCR 60 

FLTIHRGQW YVFSKLKGRG RLFWGGSVQG DYYGDLAARL GYPPSSIVRB DQTLKPGKVD 120 
VKTDKWDFYC Q 

Seq ID NO: 425 DNA sequence 
40 Nucleic Acid Accession ft: NM_080870.1 
Coding sequence*. 3.. 710 

1 11 21 31 41 51 

AC I I I I I I 

43 AGATGACACA AGTCACAGAA AAGTCCACAG AACACCCAGA AAAGACCACG TCAACCACAG 60 

AGAAAACCAC AAGAACCCCA GAAAAGCCTA CGCTATACTC AGAGAAGACC ATATGCACCA 120 

AAGGGAAAAA CACACCAGTC CCAGAAAAGC CTACAGAAAA CCTGGGGAAC ACCACACTGA 180 

CCACTGAGAC CATAAAAGCC CCAGTAAAGT CCACAGAAAA CCCAGAAAAA ACAGCAGCAG 240 

TCACAAAGAC TATAAAACCT TCAGTCAAGG TCACAGGAGA CAAATCTCTC ACTACTACCT 300 

50 CTTCTCATCT AAATAAAACT GAAGTTACTC ATCAGGTGCC CACTGGTTCT TTCACCCTCA 360 

TTACATCTAG AACGAAGCTO AGTTCTATCA CATCAQAAGC CACAGGAAAC GAGAGCCATC 420 

CATACCTCAA TAAAGATGGC TCACAGAAAG GTATCCACGC TGGACAGATG GGAGAGAATG 480 

ATTCATTCCC TGCATGGGCC ATAGTTATTG TGGTCCTGGT GGCTGTGATT CTCCTCCTGG 540 

TGTTCCTTGG CCTGATCTTC TTGGTCTCCT ATATGATGCG GACACGCCGC ACACTAACCC 600 

55 AGAACACCCA GTACAATGAT GCAGAGGATG AGGGTGGCCC CAATTCCTAC CCGGTCTACC 660 

TGATGGAGCA GCAGAATCTT GGCATGGGCC AGATCCCTTC CCCACGGTGA TCTTGGAGTA 720 

GGCGCCCAGC CCTGGCTCTT CCATGCTCTG CCCCTTTCCT GGATGAGGAA COGGACTCAC 780 

AATTTCTATT TCCGGGACTA CAGGAAGGGC AGAGAATACT GACGGTTACC AGTATTAACC 840 

CTTCATCTGT TCTTGAAACT GGTTGGGGAA TGAGGTGATA AGCAAGGAGG GTGTAAGTTT 900 

60 AGGGGACAAA GAAGAAAGAA TGAATAATAC GAGCAGACAT TCTCTGTAGA AGGTAATGGT 960 

CTGAGAATGA AAAGGTGTTT GATGGACATG TTGTGGGGGC ACCAATGCAG AACACTGCAC 1020 

TGAGTCCTAA AGGAAGGACA GGAGCCTTAT AGGCAATGCC CCAGACTGAC TTGTGAGTGG 1080 

GGTTTATGGG GAAAGGGAGG GACTGAGGGC AGAGTCTCTG GGTTTCAGGA CAGCATTATG 1140 

TTATTTCCAT TCACTATTAC TTAAGAGTTT GTGTGTAAAC AGGCTCATCT CTGAGTTCTC 1200 

65 AGGACCCTTG CCCCCACCCC CATTTTTTTA ATGAAAAAAA AAAACAAAAA AAACGGATCC 1260 

AAGAAGAAAA GAGAATTTAT TTCCTTCTCC ACTCTCTCCA TGCCCTGGAG AAAAAAAAGT 1320 

CCAGAAGAAA TCATAAATAT CTCTCATCTA CATGGTTGCT TCCTCTTCCT CCCAAATCCC 1380 

TTAGTTTTCC TAAATGTCTA CAGTGGACGC CCTGTTGGTT TGGCTTGCTG GGTTGTGGGT 1440 

GGACACGCAA GGAGGGGATT TTTATTTGGC CAGCAGTCTC ACCCACTGAT CTCCACCCCA 1500 

70 GACCTTCCCT GATTGGTGTC TCAGCATTTA TTTTCCTGTC TCTTCCACCA AAAGCCAGCT 1560 

GTAGCTTTAT CTCGTAAAAG TTACCCATCT TCTCTACTGT CCCCATTCTC T CTCCTC CCA 1620 

CCTTCACCCC AGATTCAAGT TTTCCTCCTT GTAGGCATTT CATCTGTGTG TGTTTTCTGG 1680 

ATTTTCTCTC TCTCTTCTTA TGGCCATTTC ACCTTATTAC TGATTGGGTA GAGGGGGAAA 1740 

AGGAGAATGA TGATGATAGT TTCCTTCTGT CTATTGACCT TTTTTATAAT AAAGTATAAC 1800 
ATGTT 



75 



Seq ID NO: 426 Protein sequence 
Protein Accession ft: NP_543146.l 



80 1 11 21 31 41 51 

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSBKTICTK GKNTPVPEKP TBNLGNTTLT 60 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTBVTH QVPTGSPTLI 120 
TSRTKLSSIT SEATGNBSHP YLNKDGSQKG IHAGQMGEND SPPAWAIVIV VLVAVILLLV 180 

1120 



WO 03/042661 



PCT/US02/36810 



FLGLIFLVSY KMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 

Seq ID NO: 427 DNA sequence 
Nucleic Acid Accession §: XM_069480.1 
S Coding sequence: 1..4383 

1 11 21 31 41 51 

ATGGACACTG TGCTGGTGCT GCTCCTGGGC CTGCAGGCCT TGGCCGGACC CAGTCCGAAG 60 
10 CCCCAGAAGG ACTCTGTCTC AGACTGGGCC ATTGTGTTGA TCACTCTCAC TTTGGTGGCA 120 
GCAATTGTCA GCCTAATGTA CGGTATCAAG AAGGCCTQCC AGTTCCGGAG GGAGATGAGT 180 
CTGGGGTGTG OCTGTGQCTC TGTGACCCCT TACAGCA6CC ACCATGAGGG GGAGGCTGCC 240 
AGOCAGCGCT ACTCTTGTCA AATGAAAGCT TCTTGGGGGG CAGGTGCTAC TACATTCCAA 300 
GAATATCAGA AAACTGGGGA ACTCTCAACA TCCGATCACA TATTTCCCCT CACTCCAGGC 360 
15 CTTGTTTATA GTATCCCTTT TGATCACATT GTTCTGCATT CAGGACAAAG ACCTCCAGAG 420 
CTCCCTAAAT CTACAGAAAT CCATGAGCAA AAACGCCACT GCAACACCAC ACGCCATTCT 480 
AAGCCAACTG ACAAGCCTAC AGGCAACTCC AAAACTATAG ACCACAAAAG CTCTACAGAT 540 
AATCATGAGG CTCCTCCCAC TTCTGAAGAA AACTCCAGCA ACCAAGGGAA AGACCCAATG 600 
ATCCGGAACC AGCGCTCTGT TGATCCTGCT GACTCCACTA CGACACATAA AGAATCCGCT 660 
20 GGAAAAAAAC ATATAACGCC AGCACCCAAG AGCAAAATAA ACT GTCGT AA GTCCACAACA 720 
GGCAAATCAA CGGTAACAAG AAAATCAGAT AAAACTGGAA GACCTTTGGA AAAGTCCATG 780 
AGTACTTTGG ATAAGACAAG TACCAGCTCA CATAAGACTA CAACTTCCTT CCACAACTCA 840 
GGCAATTCAC AGACCAAGCA AAAAAGCACA TCTTTTCCAG AAAAAATCAC AGCAGCCTCA 900 
AAAACAACAT ACAAGACCAC AGGAACCCCA GAAGAGTCAG AAAAAACTGA AGATTCCAGA 960 
25 ACAACAGTTG CCTCAGACAA GCTOCTGACA AAAACTACAA AAAACATACA AGAGACCATA 1020 
TCAGCCAATG AGCTCACACA ATCTCTAGCA GAGOCTACAG AACATGGAGG AAGGACAGCC 1080 
AATGAGAACA ACACACCATC CCCAGCAGAG CCTACAGAAA ATAGAGAAAG GAGAGCCAAT 1140 
GAGAACACCA CACTATCCCC AGCAGAGCCT ACAGAAAATA GAGAAAGGAC AGCCAATGAG 1200 
AACACCGCAC CATTCCCAGC AGGGCCTACA GAAAATAGAG AAATGACAGC CAATGAGAAT 1260 
30 ACCACACTAT TCCCAGCAGA GCCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAACACC 1320 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CAGCCAATGA GAACACTACA 1380 
CCATCCCCAG CAGAGCCTAC AGAACATGGA GAAAGGACCC CATTTGCCAA TGACAAAACC 1440 
ACATCATCCT CAGCAGAGTC TACAGAACAT GGAGAAAGGA CCCCACTGGC CAACGAGAAC 1500 
ACCACACCAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAACACC 1560 
35 ACACCATCCC CAGCAGGGCC TACAGAAAAC AGAGAAACGA CAGCCAACGA GAAGACCACA 1620 
CTATCCCCAG TAGAGCCTAC AGAAAATAGA GAAACAACAG CCAATGAGAA GACCACACCA 1680 
TCCCCAGCAG AGCCTACAGA AAATGGACAA AGGACCCCAT TTGCCAATGA GAAAACCACA 1740 
TCATCCTCAG CAGAGCCTAC AGAACACGGA GAAAGGACCC CACTGGCCAA TGAGAACACC 1800 
ACACCATCCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA I860 
40 CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CTTTGGCCAA TGAGAACACC 1920 
ACGCCATCTC TAGCAGAGCC TACAGAAAAT GGACAAAGGA CCCCATTTGC CAATGAGAAG 1980 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGAAGAAA GGACTCCACT GGCCAATGAG 2040 
AACACCACAC CATCCCCGGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAC 2100 
ACCACACCAT CCCCAGCAGG GCCTACAGAA AATAGAGAAA TGACAGCCAA CGAGAAGACC 2160 
45 ACACTATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 2220 
TCATCCCCAG CAGAGCCTAC AGAAAATGGA CAAAGGACCC CATTTGCCAA TGAGAAAACC 22 B0 
ACATCATCCC CAGCAGAGCC TACAGAACAC GGAGAAAGGA CCCCACTGGC CAATGAGAAC 2340 
ACCACACTAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2400 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAACACCACA 2460 
50 CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CATTGGCCAA TGAGAAGACC 2520 
ACACCATCTC TAGCAGAGCC TACAGAAAAT GGAAAAAGGA CCCCATTTGC CAATGAGAAG 2580 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGCAGAAA GGACTCCACT GGCCAATGAG 2640 
AACACCACAT CATCCCCAGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAG 2700 
ACCACACAAT TCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2760 
55 ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAATGGA CAGCCAATGA GAACACCACA 2820 
CTATCCCCAG CAGAGCCTAC AGAACATGAA GAAATGACCC CATTGGCCAA TGAGAAGACC 2880 
ACACTATCCC CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCATTTAC CAATGAGAAG 2940 
ACCACACCAT CCTCAGCAGA GCCTACAGAA CATGGAGAAA GGACCCCACT GGCCAATGAG 3000 
ATCACCACAC CATCCCGAGC AGAGCCTACA GAACATGGAG AAAGGATAGC CAATGAGAAG 3060 
60 GCCACACCAT CCCCAGCAAA GCCTACAGAA CATGGAGAAA CGACAGTCAA TGAGGACACC 3120 
ACACCATCCT CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCACTGGC CAATGAGAAC 3180 
ACCACAACAT CCCCAACAGA GTCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAAGACC 3240 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CACCATCAGC CAATGAGAAG 3300 
ACCATACCAT CTCCAGCAAA GCCTACAGAA CACGAAGAAA TGACCCCATC GGCCAATGAG 3360 
65 AACACCACAC CATCCCCAGT AAAGCCTACA GAACATGGAG AAAAGACTAC ATTGGCCAAT 3420 
GAGAAGATCA CACTATCCCC AGAAGGGCCT ACAGAACATG GAGCAAAAAC TACGTCGGCC 3480 
AATGAGAAGA TCACACCATC CCTAGCAAAG CCTACAGAAC ATGGAGAAAG GACCACATCA 3540 
CCCAATGACA AGATCACCTC ATCTGCAGCA GAGTCTACAG AACATAGAGA TAGGGCTACA 3600 
TCAGCCAATG TGATCACACC AGCCCCAGCA GAGCCTATAA AACATGCAAA AAGGACCACA 3660 
70 TTGGCCCATG AGAAGATGAC ACAAGTCACA GAAAAGTCCA CAGAACACCC AGAAAAGACC 3720 
ACGTCAACCA CAGAGAAAAC CACAAGAACC CCAGAAAAGC CTACGCTATA CTCAGAGAAG 3780 
ACCATATGCA CCAAAGGGAA AAACACACCA GTCCCAGAAA AGCCTACAGA AAACCTGGGG 3840 
AACACCACAC TGACCACTGA GACCATAAAA GCCCCAGTAA AGTCCACAGA AAACCCAGAA 3900 
AAAACAGCAG CAGTCACAAA GACTATAAAA CCTTCAGTCA AGGTCACAGG AGACAAATCT 3960 
75 CTCACTACTA CCTCTTCTCA TCTAAATAAA ACTGAAGTTA CTCATCAGGT GCCCACTGGT 4020 
TCTTTCACCC TCATTACATC TAGAACGAAG CTGAGTTCTA TCACATCAGA AGCCACAGGA 4080 
AACGAGAGCC ATCCATACCT CAATAAAGAT GGCTCACAGA AAGGTATCCA CGCTGGACAG 4140 
ATGGGAGAGA ATGATTCATT CCCTGCATGG GCCATAGTTA TTGTGGTCCT GGTGGCTGTG 4200 
ATTCTCCTCC TGGTGTTCCT TGGCCTGATC TTCTTGGTCT CCTATATGAT GCGGACACGC 4260 
80 CGCACACTAA CCCAGAACAC CCAGTACAAT GATGCAGAGG ATGAGGGTGG CCCCAATTCC 4320 
TACCCGGTCT AOCTGATGGA GCAGCAGAAT CTTGGCATGG GCCAGATCCC TTCCCCACGG 4380 
TGA 



1121 



WO 03/042661 



PCT/US02/36810 



Seq ID NO j 428 Protein sequence 
Protein Accession 8: XP_069480.1 

1 11 21 31 41 51 

5 | | | I I I 

MDTVLVLLLG LQALAGPSPK PQKDSVSOWA IVLITLTLVA AIVSLMYGIK KACQFRREMS 60 

LGCGCGSVTP YSSHHEGEAA SQRYSCQMKA SWGAGATTFQ EYQKTGELST SDHIFPLTPO 120 

LVYSIPPDHI VLHSGQRPPE LPKSTEIHEQ KRHCNTTRHS KPTDKPTGNS KTIDHKSSTD 180 

NHEAPPTSEE NSSNQGKDPM IRNQRSVDPA DSTTTHKESA GKKHITPAPK SKINCRKSTT 240 

10 GKSTVTRKSD KTGRPLEKSM STLDKTSTSS HKTTTSFHNS 6NSQTKQKST SFPEKITAAS 300 

KTTYKTTGTP EBSEKTEDSR TTVASDKLLT KTTKNIQETI SANELTQSLA BPTEHGGRTA 360 

NENNTPSPAE PTENRERTAN ENTTLSPAEP TENRERTANB NTAPFPAGPT ENREMTANEN 420 

TTLFPAEPTB HGERTANENT TPSPAEPTEH GERTANENTT PSPAEPTEHG ERTPFANDKT 480 

TSSSAESTEH GERTPLANEN TTPSPAEPTE NRERTANENT TPSPAGPTEN RETTANEKTT 540 

15 LSPVBPTENR ETTANEKTTP SPAEPTENGO RTPFAMEKTT SSSAEPTEHG ERTPLANENT 600 

TPSPAEPTEN RERTANEKTT PSPAEPTENG DRTPLAKEKT TPSLAEPTEN GQRTPFANEK. 660 

TTSSSAEPTE HEERTPIANE NTTPSPAEPT ENRERTANEN TTPSPAGPTE NREMTANBKT 720 

TLFPAEPTEN RERTANEKTT SSPAEPTENG QRTPFANEKT TSSPAEPTEH GERTPLANEN 780 

TTLSPAEPTE NRERTANEKT TPFPAEPTEN RBRTANENTT PSPAQPTENG DRTPLANEKT 840 

20 TPSLAEPTEN GKRTPFANEK TTSSSAEPTE HAERTPLANB NTTSSPAEPT ENRERTANEK 900 

TTQFPAEPTE NRESTANEKT TPFPAEPTEN REWTANENTT LSPAEPTEHE EMTPLANEKT 960 

TLSPAEPTEN GERTPFTNEK TTPSSAEPTE HGERTPLANE ITTPSRAEPT EHGERIANEK 1020 

ATPSPAKPTE HGETTVNEDT TPSSAEPTEN GERTPLANEN TTTSPTESTE HGBRTANEKT 1080 

TPSPAEPTEH GERTPSANEK TIPSPAKPTE HEEMTPSANE NTTPSPVKPT E HGEKT TLAN 1140 

25 EKITLSPEGP TEHGAKTTSA NEKITPSLAK PTEHGERTTS P NDKITSSA A ESTBHRDRAT 1200 

SANVITPAPA EPIKHAKRTT LAHEKMTQVT EKSTEHPBKT TSTTEKTTRT PEKPTLYSBK 1260 

TICTKGKNTP VPEKPTENLG NTTLTTETIK APVKSTENPE KTAAVTKTIK PSVKVTGDKS 1320 

LTTTSSHLNK TEVTHQVPTG SFTLITSRTK LSSITSEATG NESHPYLNKD GSQKGIHAGQ 1380 

MGENDSFPAW AIV1WLVAV XLLLVFLGLI FLVSYMMRTR RTLTQNTQYN DAEDEGGPNS 1440 
30 YPVYLMEQQN LGMGQIPSPR 



Seq ID NO: 429 DNA sequence 

Nucleic Acid Accession #: FGBNESH predicted 

Coding sequence i 1.. 10674 

1 11 21 31 |1 51 

ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGOGC TOGTTTCGGG CTGGGCGACC 60 

TTTCAGCAGA TGTCCCOGTC GOGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGOGCCC 120 

40 GGGGCCCCCG GGAGTATCCC CGOGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 180 

GTGGAGGGGC TGGGOCAGGC GTTCCGGCGA CGCGTGOGGC TGCTGCGGGA GCTCAGCGAG 240 

CGCCTGGAGC TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 300 

AGCGAGCTCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 360 

CGCGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCOGCGCGT CGATTACATC 420 

45 TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 480 

TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 540 

CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 600 

GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGOGAGATT CAGGAGTGGA GATCTTCACT 660 

TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 720 

50 GAGCACTGTT AOCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG 780 

CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 840 

TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 900 

GGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 960 

ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 1020 

55 ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 1080 

GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 1140 

CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 1200 

GCCTGTGGGG TCCGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 1260 

CTACCCAATG GTTTGTGGTC OGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 1320 

60 CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 1380 

ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AOACTAGAAG GCAGTGATAA GCTTACTTGT 1440 

CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTO TGGAGCGCCA CTGTTCCACC 1500 

TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 1560 

TTTGGGACGA TCTGCTATGT AAGTTGOCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 1620 

65 ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTOGGAG TTCAGGCAGC TGTGTGTAAA 1680 

GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG AGGCTAAGAC TCTGGAACAG 1740 

CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC TGGTGAAAAG 1800 

GTGTCAGTCC AOGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT TGGAGATGTT 1860 

GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 1920 

70 AAGGTTATTG ATGCAGAACC AOCTGTCATA GACTGGTGCA GATCTCCACC TCCCGTCCAG 1980 

GTCTCGGAGA AGGTACATGC OGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 2040 

GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAGACC TTTTCCCTCA AGGGGAGACT 2100 

ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATCCATATT 2160 

GTCATAAAAG GTTCTOCCTG TGAAATTOCA TTCACACCTG TAAATGGGGA TTTTATATGC 2220 

75 ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 2280 

GAAGGGTCTA CTGACAAGTA TTATTGTGCT TATGAAGATG GCGTCTGGAA ACCAACATAT 2340 

ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA ACCAOGGGTT CAAGTCCTTT 2400 

GAGATGTTCT ACAAAGCAGC TCGTTGTGAT GACACAGATC TGATGAAGAA GTTTTCTGAA 2460 

GCATTTGAGA CGACCGTGGG AAAAATGGTC CCATCATTTT GTAGTGATGC AGAGGACATT 2520 

80 GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTGGACAC TGTGCAAOAA ACAGCCACAA GCATOGGCAA TQCCAAGTCC 2700 

TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 
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CGACTCCTTC AGACATTGOA AACTATCACA AATAAACTOA AAAGQACTCT CAACAAAGAC 2880 
CCCATGTATT CC1TTCAOCT TGCATCAGAA ATACTTATAG CC6ACAGCAA TTCATTAGAA 2940 
ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGOG TATGTGTGTC 3000 
AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 3060 
GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 
ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 
TACTCATACA GTGGACTTGA GACTTGTGAA TOGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 
TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 
GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360 
ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 
CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT OCAGATCCAT CACAGAATGT 3480 
TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3S40 
TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 
CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 
GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720 
CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 
AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 
GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA A ACAGA AGTC 3900 
AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 
TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCOGAT GTGGAAAGAA OGTOGATGAG 4020 
TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 4080 
TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 
TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTT GTAAA 4200 
TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTAGAGG CTTTAACCTG 4260 
GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 
CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 
ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 
TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTOGGT GAATGATGGC 4S00 
AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4560 
GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 
GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 
TCTT T TQ TG G GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 
GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 
CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATOG ATTCTAAGAG CATATTTTGT 4860 
TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 
AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT OGGGAACCCT 4980 
GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT S040 
AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCOGATGA CTTCTATGCT 5100 
GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 
TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCAOCAT CCTGCCTTGA TGTOGATGAG 5220 
TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAAOGTAGA TGGATCCTAC 5280 
ATATGTTCAT GTGTCOCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 
TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 
GCGGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460 
TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520 
AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 
GTGACATATA GGTGTAATAA AGGATATACT CTGGCOGGTG ATAAAGAATC ATCCTGTCTT 5640 
GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAQTCCG 5700 
GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 
TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CAOGGCTTCT 5820 
GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 
ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 
TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGOCTGGC CGAOGGCAAG 6000 
TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 
CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 
GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 
CCCCCAGAAG GTCAAGACAT GCOCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 6240 
GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 
AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 
GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TGCCTGTGCG GTGTGGAGAG 6420 
CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6480 
GCTTACAGCT GCAACAAGGQ GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 
ACAGGGCAGT GGAGTAGTCC TATACCGACG TGCCACCOGG TATCTTGTGG TGAACCACCT 6600 
AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 
TATCAGTGTA ACCCGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 6720 
CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TOGACTGTGG AAAACCTCCC 6780 
CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840 
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 
AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCG CCCC TC 6960 
TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 
TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCOCAGCAA 7080 
TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTOC CCTAATTTCC 7140 
TTTG G T G TCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 
GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 
AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCOCC AACCTGAGGA AATCCCCAAT 7320 
GGAATCATTG ATGTGCAAGG CCTTGGCTAT CTCAGCACAG CTCTCTATAC CTGC AAGCC A 7380 
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAAtPGGTCA CTGGCTTGGA 7440 
GGAAAAGCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA OGGACCTACA CTATGGACAG ACOGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 
OGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 7620 
CCATCTTGCA ATGCCATCGA CTGTGATTCC CCACAACCCA T TGAAAA TGG TTTTGTAGAA 7680 
GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTOCCTGG GTTTCAGGTG 7740 
GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 7800 
ATGCCAATAG ACTGTGGCCT CC C TCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 7860 
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GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920 
CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GQAGTCTCCT 7980 
GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 8040 
TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 8100 
5 GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 8160 
CGTTTTACAG AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 8220 
CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 8280 
CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 8340 
AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG AOCCCGGATA TGTGCTGAAT 8400 
10 GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 8460 
ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 8520 
TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 8580 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 8640 
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 8700 
15 TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 8760 
CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 8820 
TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 8880 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 8940 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 9000 
20 ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 9060 
CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 9120 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 
ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 9240 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300 
25 GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCOCC ACCGTCTGTC 9360 
GCCAATGCAG TGGCAACTGG AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420 
CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480 
TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 9540 
CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 9600 
30 GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 9660 
CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 9720 
CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 9780 
CCTGGCTATG AACTAGAGGG GAACAGGGAA OGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 
GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900 
35 AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAAOGTGG TATATTCCTG CAACAGAGGC 9960 
TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020 
GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATG CTCTG 10080 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140 
CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCOGACG AGAOGTGGAC ACAGACAAGC 10200 
40 GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCAOG TAGAAAATGC AATTGCTCGA 10260 
GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 
GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380 
AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440 
TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 
45 CTGAAOGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCOGCCTGG CTGGACGGGG 10560 
TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 10620 
CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 
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MWPRLAFCCW GLALVSGWAT FQQMSPSRNF SFRLFPETAP GAPGSIPAPP APGDEAAGSR 
55 VERLGQAFRR RVRLLRELSE RliELVFLVDD SSSVGBVNPR SBLMPVRKLL SDFPWPTAT 
RVAIVTPSSK NYWPRVDYT STRRARQHKC ALLLQBIPAI SYRGGGTYTK GAFQQAAQIL 
LHARENSTKV VPLITDGYSN GGDPRPIAAS LRDSGVEIPT FGIWQGNIRE UJDMASTPKE 
EHCYLLHSFE EFEALARRAL HEDLPSGSFI QDDMVHCSYL CDEGKDCODR MGSCKCGTHT 
GHFECICBKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 
60 VCREGYRASG QTCELVHCPA LRPPENGYFI QNTCNNHFNA ACGVRCHPGP DIiVGSSIILC 
LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY RLEGSDKLTC 
QGNSQWDGPE PRCVERHCST PQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGFILSGVKE 
MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KPIEAKTLEQ QDSAKVTWQI PTAKDNSGEK 
VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 
65 VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDLFPOGET IVQYTATDPS GNNRTCDIHI 
VIKGSPCEIP PTPVNGDPIC TPDNTGVNCT LTCLEGYDPT EGSTDKYYCA YEDGWKPTY 
TTEWPDCAKK RPANHGPKSP EMFYXAARCD DTOLMKKPSE AFETTLGKMV PSFCSDAEDI 
DCRLEENLTK KYCLEYNYDY ENGPAIGPGG WGAANRLDYS YDDPLDTVQE TATSIGNARS 
SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWENQQ RLLQTLETIT NKLKRTLNKD 
70 PMYSFQLASE ILIADSNSLE TKKA9PPCRP GSVLRGRMCV NCPLGTYYNL EHPTCESCRI 
GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 
FGSRSCLSCP ENTSTVKRGA VNISAOGVPC PEGKFSRSGL MPCHPCPRDY YQPUAGKAFC 
LACPPYGTTP PAGSRSITEC STSVLNITIP GGFGHLELLN CPSEVFHECP FNPCHNSGTC 
QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCUINGVCKD LVGEPICECP SGYTGQRCEE 
75 NINECSSSPC LNKGICVDGV AGYRCTCVKG FVGLHCETEV NECQSNPCLN NAVCEDQVGG 
FLCKCPPGFIj GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCELNINEOQ 
SNPCRNQATC VDEWJSYSCK OQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 
HALTCTFWMK SSDDMNYGTP ISYAVDNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 
RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGEGFSPAE 
80 SFVGSISQLN LWDYVLSPQQ VKSIATSCPB ELSKGNVLAW PDPLSGIVGK VKIDSKSIFC 
SDCPRLGGSV FHLRTASEDL KPGSKVNLPC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 
SCGVPPPLEN GPHSADDFYA G3TVTYQCNN GYYLI/3DSRM FCTDNGSWNG VSPSCLDVDE 
CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGBIYTVG 
AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSOQ KPAIPENGCI EELAFTPGSK 
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vtyrcnkgyt lagdkesscl ansswshspp vcepvkcssp eninngkyil sgltylstas 
yscdtgyslq gpsiiectas giwdrappac hlvfcgeppa ikdavitgnn ftprntvtyt 
ckegytlagl dtiecladgk wsrsdqqcla vscdeppivd haspetahrl fgdiapyycs 
dgysladnsq llcnaqgkwv ppegqdmprc iahpcekpps vsysilesvs kakfaagsw 
SFKCMEGFVIj ntsakiecmr ggqwnpspms iqcipvroge ppsimngyas gsnyspgamv 
ayscnkgfyi kgekkstcea tgqwsspipt chpvscgepp kvenqpleht tgrifesevr 
yqcnpgyksv gspvfvcqan rhwhsesplm cvpldogkpp piqngfmkge nfevgskvqp 
fcnegyelvg dsswtcqksg kmnkksnpkc mpakcpbppl lenqlvlkel ttevgwtfs 
ckeghvlqgp svlkclpsqq wndsfpvcki vlctppplis fgvpipssal hfgstvkysc 
vqgpflrgns ttlcqpdgtw ssplpecvpv ecpqpeeipn gi idvqglay lstalytckp 
gfelvgnttt lcgenghwlg gkptckaiec lkpkeilngk fsytdlhygq tvtyscnrgf 
rlegpsaltc letgdwdvda pscnaihcds pqpiehgfve gadysygai i iy6cfpgfqv 
aghamqtcee sgwsssiptc mpidcglpph idfgdctklk ddqgyfeqbd dmmevpyvtp 
hppyklgava ktwentkesp athssnflyg tmvsytcnpg yellgnpvli oqedgtwngs 
apscisiecd lptapengfl rftetsmgsa vqysckpghi lagsdlriicl enrkwsgasp 
rceaisckkp npvmngsikg snytylstly yecdpgyvln gterrtcqdd knwdedepic 
ipvdcssppv sangqvrgde ytfqkeieyt qjbgfllega rfirvclangs wsgatpdcvp 
vrcatppq1a ngvtegldyg pmkevtfhch egyilhgapk ltcqsdgnwd aeiplckpvn 
cgppedlahg fpkgfsfihg ghiqyqcfpg yklhgnssrr clsngswsgs spsclpcrcs 
tpvieygtvn gtdpdcgkaa riqcpkgfkl lglsbitcea dgqwssgpph cehtscgslp 
mipnapiset sswkenvity scrsgyviqg ssdlictekg vwsqpypvce plscgsppsv 
anavatgeah tyesevklrc legytmdtdt dtptgqkdgr wfperiscsp krcplpenit 

HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWE PPFSD ESCSP VSOGKPESPE 
HGFWGSKYT PBSTIIYQCE PGYELEGNRE RVOQERRQWS GGVAICKETR CETPLEFLNG 
KADIEKRTTG PNWYSCNRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VP FV I PENAL 
LSEKEPYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 
GVHYQYGDMI TYSCYSGYML EGFLRSVCLE KGTWTSPPIC RAVCRPPOQN GGIOQRPHAC 
SCPEGWMGRL CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVCGS PCLNGGKCVR 
PNRCHCLSSW TGHNCSR 
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ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60 

TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120 

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 
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MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60 
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 
SKCERGRIC 

Seq ID NO: 433 DNA sequence 

Nucleic Acid Accession #: NMJ>07231.1 

Coding sequencet 89.. 2017 

1 11 21 31 41 51 

TAGGAACAGG GGAGAGTGCA CCTGCTACCA GTCAAGCTCA GCCAGACTGC AAGAGGAGGC 60 

GAGGCGGAGC CAGCCGAGGG AGTGAACCAT GGACAAGTTG AAATGCCCGA GTTTCTTCAA 120 

GTGCAGGGAG AAGGAGAAAG TGTCGGCTTC ATCAGAGAAT TTCCATGTTG GTGAAAATGA 180 

TGAGAATCAG GACCGTGGTA ACTGGTCCAA AAAATCGGAT TATCTTCTAT CTATGATTGG 240 

ATACGCAGTG GGATTAGGAA ATGTGTGGAG ATTTCCATAT CTGACCTACA GCAATGGTGG 300 

AGGCGCCTTC TTGATACCTT ATGCAATTAT GTTAGCATTG GCTGGTTTAC CTTTGTTCTT 360 

TCTGGAGTGT TCACTGGGAC AATTTGCTAG CTTAGGTCCA GTTTCAGTTT GGAGGATTCT 420 

TCCATTGTTT CAAGGTGTGG GAATTACAAT GGTCCTGATC TCCATTTTTG TGACAATCTA 480 

TTACAATGTC ATAATTGCCT ATAGTCTTTA CTACATGTTT GCTTCTTTTC AAAGTGAACT 540 

ACCATGGAAA AATTGTTCTT CGTGGTCAGA TAAAAACTGT AGCAGATCAC CAATAGTAAC 600 

TCACTGTAAT GTGAGTACAG TGAATAAAGG AATACAAGAG ATCATCCAAA TGAATAAAAG 660 

CTGGGTAGAC ATCAACAATT TTACCTGCAT CAACGGCAGT GAAATTTATC AGCCAGGGCA 720 

GCTTCCCAGT GAACAATATT GGAATAAAGT GGCQCTCCAA CGGTCAAGTG GAATGAATGA 780 

GACTGGAGTA ATTGTTTGGT ATTTAGCACT TTGTCTTCTT CTGGCTTGGC TCATAGTTGG 840 

AGCAGCACTA TTTAAAGGAA TCAAATCGTC TGGCAAGGTG GTATATTTTA CAGCTCTTTT 900 

CCCCTATGTG GTCCTACTCA TCCTGTTAGT AGGAGGTGCA ACTCTGGAGG GTGCTTCAAA 960 

AGGCATTTCA TACTATATTG GAGCCCAGTC AAATTTTACA AAACTTAAGG AAGCTGAGGT 1020 

ATGGAAAGAT GCTGCCACTC AGATATTTTA CTCCCTTTCA GTGGCTTGGG GTGGCTTAGT 1080 

TGCTCTATCA TCTTACAATA AGTTCAAAAA CAACTGCTTC T CTGATGC CA TTGTGGTTTG 1140 

TTTGACAAAC TGTCTCACTA GCGTGTTTGC TGGATTTGCT ATTTTTTCTA TATTGGGACA 1200 

CATGGCCCAT ATATCTGGAA AGGAAGTTTC TCAAGTTGTA AAATCAGGTT TTGATTTGGC 1260 

ATTCATTGCC TATCCAGAGG CTCTAGCCCA ACTCCCAGGT GGTC CATTTT GGTCCATATT 1320 

ATTTTTTTTC ATGCTTTTAA CTTTGGGTCT CGATTCTCAG TTTGCTTCGA TTGAAACGAT 1380 

CACAACAACA ATTCAAGATT TATTTCCCAA AGTGATGAAG AAAATGAGGG TTOCCATAAC 1440 

TTTGGGCTGC TGCTTGGTTT TGTTTCTCCT TGGTCTCGTC TGTGTGACTC AGGCTGGAAT 1500 
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TTACTGGGTT CATCTGATTG ACCACTTCTG TGCTGGATGG GGCATTTTAA TTGCAGCTAT 1560 
ACTGGAGCTA GTTGGAATCA TCTGGATTTA TGGAGGGAAC AGATTCATTG AGGATACAGA 1620 
AATGATGATT GGAGCAAAGA GGTGGATATT CTGGCTATGG TGGAGAGCTT GCTGGTTTGT 1680 
AATTAOGCCT ATCCTTTTGA TTGCAATATT TATCTGGTCA TTGGTGCAAT TTCATAGACC 1740 
5 TAATTATGGC GCAATTCCAT ACCCTGACTG GGGAGTTGCT TTAGGCTGGT GTATGATTGT 1800 
TTTCTGCATT ATTTGGATAC CAATTATGGC TATCATAAAA ATAATTCAGG CTAAAGGAAA 1860 
CATCTTTCAA CGCCTTATAA GTTGCTGCAG ACCAGCTTCT AACTGGGGTC CATACCTGGA 1920 
ACAACATCGT GGGGAAAGAT ATAAAGACAT GGTAGATCCT AAAAAAGAGG CTGACCATGA 1980 
AATACCTACT GTTAGTGGCA GCAGAAAACC GGAATGAGAT CTCATTGAAA AAAATATATG 2040 
10 ATTGTATAAT GTGATTTTTT TTAGAATAGG GGGAACCTTA TTTATTTGTG TGTTAACTGA 2100 
ATAGGAAAAT GTACATACTA TGTTCATGAT AGTGTGATTT TTTTCACATT TAAGCAGGA A 2160 
TGCAATATAA AAATGTGAAT CTCTTAATTC TCAGCCATGT GCTTATTATA TTTCTTTTTA 2220 
GATTGTCTAT CTGTATAACA CACACACACA CACCTAAGAG TCTCTATTTC ACAATTATAT 2280 
TTTTGTAAAT AGTATATGCA TTTTTAATAC ATTGGAGGCT TTATTTTGAA CTAATTTCTT 2340 
15 AGAGAATAGT TATATTTTCT ATTACACAAG TTTAAAAATA TTATTAACTT GTATTTTCTT 2400 
AATATACAAT CTATCTTTTC CACAAATATG AGTGGGAAAT AAATCAGCAC ATTTGAAAGA 2460 
AAGTGTTAAA ACTGAAGGCC TCACTTAATT AGAAACGTGA TAAATATATG GACAAATGGA 2520 
CTATACATAC TATAAGAGGA CTGTAGTTTA ATACTTTTTA OCCAAATATG TTTAAAAACA 2580 
TCGTGCATTT GTTACAGCTC ATGTTTTCTA TATGAACTTA GTCATTAATG TTCTTTATAA 2640 
20 AAAGTGAAAT AAGATGGAAA AATTAGGATC CTACAGCCAG TACGTGATAA ATCTAGAAAA 2700 
TTGAGTTTTG AGTACCTCTT TTCCCATATA CAATCTTCCT TCCTTAGGTA ATTTGGAAGA 2760 
AAACTATGAC CCATTTAATT TCTATTGTGT TTCACCAAAT TCAGTGTTGT TCATTATACC 2820 
TCTCTGAAAT ATAGGTTTAA TTTCAAATAG AATATGGACT TAAATGTTAA TG AGAAACT G 2880 
GCTTTAATCA ATT CT AG CAT TTTATTACTG TAATACAGGG CTGATAGAGT GATTTTGTCT 2940 
25 TATATGAGTC AGTTACTACT TACAGGTGAT AACTTGCATA CTATTGGAAG ATAAAGTTGT 3000 
CAAACTTGTC AAGAATGAGA AAAGCCAAAT TAGAAAATCC TATGTCCTAG TTTCCTTACC 3060 
AAGGATAATT AAATATATCA CTAAGAGCTT TATATATTGA TTATATATTG TTGACAACTG 3120 
GTTTAAGCAT CATAGCCTAT GATGATAAAC ACTGCCTATA TATGTAAATA GCTTTTCATC 3180 
AATTCTTAAA TTTCTTAACC TAGGCTTCAG GGAGCATATG AAACCAAAAT TATAT GGAAC 3240 
30 ATTTTCTGTG TGTACATGTA CATGCATTTT TCTAGGGAGA GAGTCCGTAG QTTTAT CAGA 3300 
ATATCAAGGA AAACTGTGAC CCAAAGAAGT TTAAGAATCA CATACAGTGC TGCTGGCTTT 3360 
TTGTGCTTGG CAAATGAGTG ACAATAGAAG AAATAATTTT TCTTACACAT TTTAAAACGT 3420 
TTTCTCTTCC TTGTGATTGA AGATGAAAGG AGTAAGAAAT TAAGGCATTT GTTTAATTTA 3480 
TACTGGTAAC TTATTTAGGG GGGAGGGGAC ATGAAGGTAG GTAAATAGGT AGGCCTCTAA 3540 
35 TTGAACCACC TCTCTAAGTT ATGTACGTAT ATATAAGCTG AAATTGTGTT TGACATTCTG 3600 
AGGGTTTTCT r i TV C TT T TT CCTTTTTTTT TTTTTTTGGT GGGGGGCTGG GGGTCAGAGT 3660 
CTTGTTCTGT TGOCTGGGCT GGAGTGCAGT GGCATGATCT CAGCTCACTG CAACCTCTGC 3720 
CTTCTGGATT CAAGTGATTC TCCTGGCTCA GCCTCTTGAG TAGCTGGGAC TACAGGTGCC 3780 
CGCCACCACA CCAGCTAATT TTTGTATTTT TAGTAGAGGC GAAGTTTCCC CATGTTGGCC 3840 
40 AGGCTGGTCT TGAACTCCOG ACCTCAAGTG ATCTGTCTAC CTCGGCCTCC TAAAGTGCTG 3900 
AGATTACAGG TGTGAGCCAC OGTGCCOGGC CCATTCTAAG GGTTTTCTTT GAAGACAGGT 3960 
CAAATGCTGT TAGTAAGTTT CAGGAGATTG TTAATTCCTC AGTTATACCA GATTTTATAA 4020 
AATATTTGAG AATAGATGGC TAACAAGAGG TTAGAAATAC TTTTCCTTAA TTTTAATCCA 4080 
CAGTATGTTA CATGCATTCT ACCACTACAT TTTGGTGCTA TTTAAGGTGT GCAATTTTCT 4140 
45 ATAGGTGACT TTTGCAATTC AGGGAAGATT TGGGCATATT AAATGAAAGA ATATCTAATT 4200 
GGGGGAGGTG TGAAGGGAAA GAAATTCTTT TCAAAAGCTG ACCACAAAGA GTAGTTAAAA 4260 
GTTTTTGTCA CTATCTTCAC AAGTGTGTAA AGCACAGATT TCAACAGAGT GCTTGGCATA 4320 
TTGTAGGGTG CTCAATGGTG GTTTTTATTA TTATTACTCA GATTCCACAG TGGCAAGAAA 4380 
CATCATTCTA CATAATGGAA AACATTTACA TCAAATCCCA CTTACTTTAA TGCGAACTTG 4440 
50 GAGATAATTT ATGGTATTGT ATTGTAAACC ATTAATGAAA ACTTTTTCAC AGTTGAGTGA 4500 
AATTAAAATC ACTATATCTC 

Seq ID NO? 434 Protein sequence 
^ Protein Accession #: NP_Q09162.1 

1 11 21 31 41 51 

MDKLKCPSFF KCREKEKVSA SSENFHVGEN DENQDRGNWS KKSDYLLSMI GYAVGLGNVW 
RPPYLTYSNG GGAFLIPYAI MLALAGLPLF FLBCSLGQFA SLGPVSVWRI LPLFQGVGIT 

60 MVLISIPVTI YYNVIIAYSL YYMFASPQSE LPWKNCSSWS DKNCSRSPIV THCNVSTVNK 
GIQEIIQMNK SWVDINNFTC INGSEIYQPG QLPSEQYWNK VAIXJRSSGMN ETGVIVWYLA 
LCLLLAWLIV GAALPKGIKS SGKWYFTAL FPYWLLILL VRGATLEGAS KGISYYIGAQ 
SNPTKLKBAE VWKDAATQIP YSLSVAWGGL VALSSYNKPK NNCFSDAIW CLTNCLTSVF 
AGPAIFSILG HMAHISGKEV SQWKSGFDL AFIAYPEALA QLPGGPPWSI LFFFMLLTLG 

65 LDSQFASIET ITTTIQDLFP KVMKKMRVPI TLGCCLVLFL LGLVCVTQAG IYWVHLIDHF 
CAGWGILIAA ILELVGIIWI YGGNRFIEDT EMMIGAKRWI FWLWWRACWP VITPILLIAI 
PIWSLVQFHR PNYGAIPYFD WGVALGWCMI VPCIIWIPIM AIIKIIQAKG NIFQRLISCC 
RPASNWGPYIj eqhrgerykd MVDPKKEADH EIPTVSGSRK pe 

70 Seq ID NO: 435 DNA sequence 

Nucleic Acid Accession #: M18728.1 
Coding sequence: 51.. 1085 

1 11 21 31 41 51 

| 1 I 1 I ! 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA cagagaagac agcagagaoc atgggacccc 
cctcagcccc tccctgcaga ttgcatgtcc cctggaagga ggtcctgctc acagcctcac 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 4 B0 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 
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CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 



Seq ID NO: 436 Protein sequence 
Protein Accession ft* AAA59907.1 



MGPPSAPPCR 
NRIGYSWYKG 
TLQVIKSDLV 
NGQSLPVSPR 
TISPSKANYR 
AHNSATGLNR 



11 

I 

LHVPWKEVLL 
ERVDGNSLIV 
NEEATGQFHV 
LQLSNGNMTL 
PGENLNLSCH 
TTVTMITVSG 



21 
I 

TASLLTPWNP 
GYVIGTQQAT 
YPELPKPSIS 
TLLSVKRNDA 
AASKPPAQYS 
SAPVLSAVAT 



31 
I 

PTTAKLTIES 
PGPAYSGRET 
SNNSNPVBDK 
GSYECEIQNP 
WFINGTPQQS 
VGITIGVLAR 



41 

I 

TPFNVAEGKE 
IYPNASLLIQ 
DAVAFTCEPE 
ASANRSDFVT 
TQELFIPNIT 
VALI 



Seq ID NO: 437 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 1355.. 1657 



51 
I 

VLLLAHNLPQ 
NVTQNDTGFY 
VQNTTYLWWV 
LNVLYGPDVP 
VNNSGSYMCQ 



51 



60 
120 
180 
240 
300 



1 11 21 31 41 

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 
TAGGAACTCA ACAAGCTACC CCAGGGCCCG catacagtgg tcgagagaca atatacccca 
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCGGGAGC 
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 
CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 
TGCC TC T TTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 
AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 
AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 
GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 
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CTGACTCATT 
CTCTTGGTAT 
CTCTAAAAQC 
GGCTGGAATT 
ATAAAAGCCC 
TCTCACCTAG 
GTTAAGGAAG 
ACACAGGAGA 
TTTACAAAAA 
TGTTTCCTTG 
CTATCACTGT 
TAGCTCTATA 



CTTTATTCTA 
TACCCTCCTA 
TTTAAATGTC 
ACAAAACTCA 
CAAATGGTGG 
GTGAGCGCAT 
AAGATAGATC 
TTCCAGTCTA 
AGTAACCTGA 
TTCCAATTTG 
ACTTGTAGAG 
ACT 



TTTTAGTTGG 
ATAGTCATAC 
TGCATGCAGC 
GAGAAATGTG 
TAACTGATAA 
TGAGCCAGTG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



TTTGTATCTT 
TAGTAGTCAT 
CAGCCATCAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



GCCTAAGGTG 
ACTOCCTGGT 
ATAGTGAATG 
AACATCATAA 
GCTTTAAGAT 
CTACATACTC 
CAATTTAAAA 
AAGTCCCCTC 
TGTATTTATT 
ATTGTATTGC 
AATCACAAAT 



OGTAGTCCAA 
GTAGTGTATT 
GTCTCTCTTT 
CCCATGAAGG 
TTGGTCACAC 
CAACTGAAAT 
AAAAAAAAGA 
TACTTTAACT 
TCTGTGGTTC 
CCAGGGGGAG 
AAAAGCCAAT 



Seq ID NO: 438 Protein sequence 
Protein Accession #: AAA59908.1 

1 11 21 31 41 51 

I I I I I I 

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTECQIYLV 
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDP 

Seq ID NO: 439 DNA sequence 
Nucleic Acid Accession fh M18728.1 
Coding sequence: 2370.. 2501 



GGAGCTCAAG 
CCTCAGCCCC 
TTCTAACCTT 
ATGTCGCAGA 
GTTACAGCTG 
TAGGAACTCA 
ATGCATCCCT 
TCATAAAGTC 
TGCCCAAGCC 
CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACCGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
TCCTCTCAGC 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCTCAGGC 
GCAAACCATG 
AACAAGACTC 
TGCCTCTTTC 
GGGTAACTTA 
AGATCCTTTA 
AAATGTACAG 
TTTAATTCAA 
GGAGGAGTCT 
GCTGAGACTA 
CTGACTCATT 
CTCTTGGTAT 
CTCTAAAAGC 
GGCTGGAATT 
ATAAAAGCCC 
TCTCACCTAG 
GTTAAGGAAG 
ACACAGGAGA 
TTTACAAAAA 
TGTTTCCTTG 
CTATCACTGT 
TAGCTCTATA 



11 
I 

CTCCTCTACA 
TCCCTGCAGA 
CTGGAACCCA 
GGGGAAGGAG 
GTACAAAGGC 
ACAAGCTACC 
GCTGATCCAG 
AGATCTTGTG 
CTCCATCTCC 
TGAACCTGAG 
CAGTCCCAGG 
GAACGATGCA 
CCCAGTCACC 
CAATTACCGT 
ACAGTACTCT 
CAACATCACT 
CCTCAATAGG 
TGTGGCCACC 
CTGGTGTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCATGA 
GCTTGGCAGG 
ACAGAGTGTC 
GTGCACCCAG 
TGGTCCTTTT 
CCCAGCCATG 
GTGCAGTTTC 
AGTTGTAGAA 
CTTTATTCTA 
TACCCTCCTA 
TTTAAATGTC 
ACAAAACTCA 
CAAATGGTGG 
GTGAGCGCAT 
AAGATAGATC 
TTCCAGTCTA 
AGTAACCTGA 
TTCCAATTTG 
ACTTGTAGAG 
ACT 



21 

I 

AAGAGGTGGA 
TTGCATGTCC 
CCCACCACTG 
GTTCTTCTAC 
GAAAGAGTGG 
CCAGGGCCCG 
AACGTCACCC 
AATGAAGAAG 
AGCAACAACT 
GTTCAGAACA 
CTGCAGCTGT 
GGATCCTATG 
CTGAATGTCC 
CCAGGGGAAA 
TGGTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCGGCATCA 
TTCGATATTT 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAGGCTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTGACAT 
CAGAGTTGGA 
CAATGCCAAA 
TGACACTTGT 
ATTAACAAAT 
TTTTAGTTGG 
ATAGTCATAC 
TGCATGCAGC 
GAGAAATGTG 
TAACTGATAA 
TGAGCCAGTG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



31 
I 

CAGAGAAGAC 
CCTGGAAGGA 
CCAAGCTCAC 
TCGCCCACAA 
ATGGCAACAG 
CATACAGTGG 
AGAATGACAC 
CAACCGGACA 
CCAACCCCGT 
CAACCTACCT 
CCAATGGCAA 
AATGTGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACOTT 
GOGGATCCTA 
CGATGATCAC 
CGATTGGAGT 
CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAG CAG CATC 
CTTCTAGACT 
TAATAGAATT 
TGTTGAACAT 
GTGCTGCTTG 
TTTGTATCTT 
TAGTAGTCAT 
CAGCCATCAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



41 
I 

AGCAGAGACC 
GGTOCTGCTC 
TATTGAATCC 
CCTGCCCCAG 
TCTAATTGTA 
TOGAGAGACA 
AGGATTCTAT 
GTTCCATGTA 
GGAGGACAAG 
GTGGTGGGTA 
CATGACCCTC 
ACAGAACCCA 
AGATGTCCCC 
CTCCTGCCAC 
CCAGCAATCC 
TATGTGCCAA 
AGTCTCTGGA 
GCTGGCCAGG 
GGCAGATTGG 
ACCACTAAAA 
AATGAAAATT 
TA ACTAGA GA 
AGCTTTTCCC 
AATTTGTCCT 
TCACAAGAAG 
AACGTTTTAC 
TTTAACACAG 
CACCTGTTCT 
GCTCCCTACC 
GGCTAAATAC 
GTTAAAATGG 
GCCTAAGGTG 
ACTCCCTGGT 
ATAGTGAATG 
AACATCATAA 
GCTTTAAGAT 
CTACATACTC 
CAATTTAAAA 
AAGTCCCCTC 
TGTATTTATT 
ATTGTATTGC 
AATCACAAAT 



51 
I 

ATGGGACCCC 
ACAGCCTCAC 
ACGCCATTCA 
AATCGTATTG 
GGATATGTAA 
ATATACCCCA 
ACCCTACAAG 
TACCCGGAGC 
GATGCTGTGG 
AATGGTCAGA 
ACTCTACTCA 
GCGAGTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCCCATAACT 
AGTGCTCCTG 
GTGGCTCTGA 
ACCAGACCCT 
ACAAGGTCTG 
TAAAGGGAAA 
CAGTCAAACT 
AAGATGTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
CCGTGTGTTC 
CACTCCCTGT 
AGCTGAACAG 
AATGGGTATC 
CTACACTCAT 
CGTAGTCCAA 
GTAGTGTATT 
GTCTCTCTTT 
CCCATGAAGG 
TTGGTCACAC 
CAACTGAAAT 
AAAAAAAAGA 
TACTTTAACT 
TCTGTGGTTC 
CCAGGGGGAG 
AAAAGCCAAT 



Seq ID NO: 440 Protein sequence 
Protein Accession #s AAA59909.1 

1 11 21 31 41 

I I I I I 

MLTNVFISW LPPCSNLTKP TVLVLYCPGG AITVLVEWCC PNS 



Seq ID NO: 441 DNA sequence 

Nucleic Acid Accession #: NNJJ02381.2 

Coding sequence: 64.. 1524 



51 
I 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



11 



21 



31 



41 



51 

1128 



WO 03/042661 



I I I I I I 

AAATCCGAGC CTCGCGTGGG CTCCTGGCCC CCGACGGACA CCACCAGGCC CACGGAGCCC 60 

ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT 6CTCTGGCCG 120 

CTGCTGCTGC TGCCCTCOCC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 180 

GAGACCCGAO GTCCCGGGGG CAGCCCTGGA OGCCGCCCCT CTCCTGCGGC TCCCGACGGC 240 

GCGCCOGCTT COGGGACCAG CGAGCCTGGC CGOGCCOGOG GTGCAGGTGT TTGCAAGAGC 300 

AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GOGTACGGCC CCTGGAATTC 360 

AOCAAAGTGA AAACTTTTGT CTCCCGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 

AOGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480 

TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540 

ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTCACAGT GGAGGCAGGG 600 

GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGOGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGOGTGG ACCGGGCAGA CATGGCGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 

GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCCAG 840 

GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCACGTCTGC 900 

ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAOAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140 

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTQACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTA CACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500 

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTOCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT TGGAGTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTG AAAGAAATGT ACTACTAAAG 2160 

AGCACTAGTT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG ATCCAAGTGA CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC CACCTGCCTC AGCCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 
TTAAAAAAAA AAAAAAAAA 



Seq ID NO: 442 Protein sequence 
Protein Accession #; NP_002372.1 

1 11 21 31 41 51 

I I I I I I 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60 

PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTPVSRII DTLDIGPADT 120 

RVAWNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAPTVSAGA 180 

REPSSNIPKV AIIVTDGRPQ DQVNEVAARA QASGIEliYAV GVDRADMASL KMMASEPLEE 240 

HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCS CSQGYTLNAD 300 

KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360 

HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG CQHICVSDGA ASYHCDCYPG 420 

YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVS8 YLQRLNTKLD DILEKLKINE 480 
YGQIHR 



Seq ID HO: 443 DNA sequence 

Nucleic Acid Accession gt NM_016639.1 

Coding sequence: 40.. 429 



1 11 21 31 41 51 

I I I I I I 

GCGGOGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTOGGGG CTCGCTGOGC 60 

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGOGCTC CGTGGCCGGG 120 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC GGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 

AAGTGCATGG ACTGCGCGTC TTGCAGGGCG OGACCGCACA GOGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300 

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420 

ATCCAGTGAC AATGTGCCCC CTGCCAGCOG GGGCTCGCCC ACTCATCATT CATTCATCCA 460 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG S40 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCAOCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGOGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 



1129 



WO 03/042661 



AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 444 Protein sequence 
Protein Accession 8: NP__057723.1 

1 11 21 31 41 51 

I 1 I I I I 

MARGSLRRLL RLLVLGLWLA LLRSVAGBQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTP VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 

Seq ID NO: 445 DNA sequence 

Nucleic Acid Accession #* AF322916.1 

Coding sequence: 50.. 4300 

1 11 21 31 41 SI 

I I I I I I 

GCACTCCGCA GCCTTTAAGG TTCGCGOGGG GGCCAGGCAA GAGTTAGCCA TGAAGAGCCT 60 
CAAGTCCCGC CTGAGGAGGC AGGACGTGCC CGGCCCOGCG TCGTCTGGCG CCGCCGCCGC 120 
CAGCGG6CAT GCAGCAGATT GGAATAAATA TGATGAOCGA TTGATGAAAG CAGCAGAAAG 180 
GGGGGATGTA GAAAAAGTGA CCTCAATCCT TGCTAAAAAG GGGGTCAATC CAGGCAAACT 240 
AGATGTGGAA GGCAGATCTG TCTTCCATGT TGTGACCTCA AAGGGGAATC TTGAGTGTTT 300 
GAATGCCATC CTTATACATG GAGTTGATAT TACAACCAGT GACACTGCAG GGAGAAATGC 360 
TCTTCACCTG GCTGCTAAGT ATGGACATGC ATTGTGCCTA CAAAAACTTC TACAGTACAA 420 
TTGTCCCACT GAGCATGCAG ACCTGCAGGG AAGAACTGCA CTTCAAAAAA AAGCAATGGC 480 
AGATTGTCCT TCTAGCATAC AGCTGCTTTG TGACCATGGG GCCTCTGTGA ATGCCAAAGA 540 
TGTAAAAGGG CGGACACCAC TTGTTCTGGC TACTCAGATG AGTAGGOCAA CAATATGTCA 600 
ACTGCTGATA GATAGAGGAG GGGATGTTAA TTCCAGAGAC AAACAAAACA GAACTGCCCT 660 
CATGCTAGGT TGCGAATATG GTTGCAGAGA TGCAGTAGAA GTCTTAATTA AAAATGGTGC 720 
TGATATAAGC TTGCTGGATG CGCTTGGCCA TGATAGTTCT TACTATGCAA GAATTGGTGA 780 
CAATCTGGAC ATTCTAACCT TGTTGAAGAC TGCATCGGAA AATACCAACA AAGGGAGAGA 840 
ACTTTGGAAG AAAGGACCAT CTTTGCAACA GCGAAATTTG ACACACATGC AAGATGAAGT 900 
AAATGTGAAG TCACATCAGA GGGAGCATCA AAATATTCAG GATTTGGAGA TTGAAAATGA 960 
AGATTTGAAA GAGAGGTTGA GAAAAATTCA GCAAGAACAA AGAATACTTT TGGATAAAGT 1020 
CAATGGTTTA CAGTTACAGC TGAATGAGGA AGTTATGGTT GCTGATGATC TGGAAAGCGA 1080 
GAGAGAAAAG CTGAAGTCCC TTTTGGCAGC TAAAGAAAAG CAACATGAAG AAAGCTTAAG 1140 
GACTATTGAG GCTCTGAAAA ATAGATTTAA ATATTTTGAG AGTGATCATT TAGGATCAGG 1200 
AAGTCATTTC AGTAACOGAA AAGAAGATAT GCTTCTTAAA CAAGGTCAGA TGTATATGGC 1260 
AGACTCACAG TGTACTTCCC CAGGTATACC AGCCCATATG CAAAGCAGAT CTATGTTAAG 1320 
ACCTCTGGAA CTATCTTTAC CCAGTCAAAC GTCATACTCT GAAAATGAAA TTTTAAAGAA 1380 
AGAGTTAGAA GCAATGCGAA CTTTCTGTGA GTCAGCAAAA CAAGAOCGAC TGAAGCTCCA 1440 
AAATGAACTG GCACACAAAG TGGCAGAATG CAAAGCTTTA GCATTAGAAT GTGAAAGGGT 1500 
CAAGGAGGAT TCAGATGAAC AGATAAAGCA ATTAGAAGAT GCATTAAAAG ATGTGCAGAA 1560 
GAGGATGTAT GAGTCAGAAG GTAAAGTTAA ACAAATGCAG ACCCATTTTC TTGCCCTTAA 1620 
AGAACACTTA ACAAGTGAAG CAGCCTCAGG GAATCACAGA CTAACCGAGG AACTGAAGGA 1680 
TCAGTTGAAA GACTTGAAAG TAAAATATGA AGGTGCTTCA GCAGAAGTGG GGAAATTAAG 1740 
AAACCAAATC AAACAAAATG AGATGATAGT AGAAGAGTTT AAGAGGGATG AAGGCAAGCT 1800 
GATAGAAGAA AATAAGCGAT TACAGAAGGA ACTTAGTATG TGTGAAATGG AGCGAGAGAA 1860 
GAAAGGAAGA AAGGTCACAG AGATGGAAGG CCAGGCAAAA GAATTGTCAG OGAAGTTGGC 1920 
CCTTTCCATT CCAGCTGAAA AATTTGAAAA CATGAAGAGC TCATTATCAA ATGAAGTGAA 1980 
TGAGAAAGCA AAAAAATTAG TAGAAATGGA AAGAGAACAT GAAAAATCAC TTAGTGAAAT 2040 
TAGACAGTTA AAGAGAGAAC TTGAGAATGT TAAGGCCAAG CTTGCTCAGC AGGTCAAACC 2100 
AGAGGAACAT GAACAGGTTA AGAGCAGATT AGAACAGAAA TCAGGAGAAC TTGGGAAGAA 2160 
GATCACTGAG TTAACATTGA AAAATCAGAC ACTACAAAAG GAAATTGAAA AAGTTTATTT 2220 
GGATAATAAG CTCCTCAAGG AGCAAGCACA TAACTTAACA ATTGAAATGA AAAATCATTA 2280 
TGTTCCTTTA AAAGTAAGTG AAGACATGAA AAAGTCACAT GATGCAATTA TTGATGATCT 2340 
TAATAGAAAG CTTTTAGATG TAACACAAAA ATATACAGAA AAGAAGTTGG AAATGGAGAA 2400 
ATTGCTACTG . GAAAATGACA GCTTAAGTAA GGATGTAAGC CGCCXAGAAA CTGTGTTTGT 2460 
ACCTCCTGAG AAACATGAAA AAGAGATAAT AGCTCTGAAA TCCAATATTG TTGAACTTAA 2520 
GAAACAGCTG TCTGAACTTA AGAAAAAATG TGGTGAAGAC CAGGAGAAAA TACACGCTCT 2580 
CACATCTGAA AACACTAACT TGAAGAAGAT GATGAGTAAT CAGTATGTGC CAGTTAAAAC 2640 
CCATGAAGAG GTTAAAATGA CACTGAATGA CACGTTAGCC AAAACTAACA GAGAATTATT 2700 
AGATGTGAAG AAAAAATTTG AAGATATAAA TCAGGAATTT GTAAAAATAA AAGATAAGAA 2760 
TGAAATATTA AAAAGAAACC TGGAAAACAC TCAGAACCAA ATAAAAGCTG AGTACATCAG 2820 
CCTGGCAGAG CACGAGGCAA AGATGAGCTC GCTAAGTCAG AGCATGAGAA AGGTGCAGGA 2680 
TAGTAATGCT GAAATCTTGG CCAACTACAG AAAAGGCCAA GAAGAGATTG TGACACTGCA 2940 
TGCCGAAA7T AAAGCCCAGA AGAAGGAGCT CGACACAATA CAAGAATGCA TTAAGGTAAA 3000 
ATATGCCCCA ATTGTCAGCT TTGAGGAGTG GGAGAGAAAA TTTAAAGCAA CAGAGAAAGA 3060 
ACTAAAAGAC CAGTTATCAG AGCAGACACA AAAGTATAGT GTCAGTGAAG AAGAAGTCAA 3120 
GAAAAACAAG CAAGAGAATG ACAAGTTAAA GAAGGAGATT TTTACCCTTC AGAAAGATTT 3180 
GAGAGATAAG ACAGTTCTCA TTGAGAAGTC TCATQAAATG GAAAGAGCAT TAAGCAGAAA 3240 
AACAGACGAG CTAAACAAAC AGTTAAAAGA CTTGTCACAG AAATACACGG AAGTAAAGAA 3300 
TGTGAAAGAG AAGCTAGTAG AAGAAAATGC CAAACAGACT TCTGAGATAC TTGCAGTGCA 3360 
AAATCTTTTG CAAAAACAAC ATGTTCCATT GGAACAGGTT GAGGCTCTGA AAAAATCTCT 3420 
TAATGGCACA ATTGAAAATC TAAAGGAAGA ACTGAAGAGT ATGCAAAGGT GTTAOGAGAA 3480 
AGAGCAGCAG ACAGTGACCA AACTGGATCA ATTGTTGGAG AATCAAAAGA ACTCTTCTGT 3540 
ACCCCTGGCA GAGCATTTGC AGATTAAAGA AGCATTTGAG AAAGAAGTTG GAATCATAAA 3600 
AGCCAGCTTG AGAGAAAAGG AAGAAGAAAG CCAAAACAAA ATGGAAGAAG TCTCCAAACT 3660 
TCAGT03GAG GTTCAGAATA CTAAACAAGC ATTAAAAAAA TTAGAGACTA GAGAGGTAGT 3720 
TGACTTGTCT AAATATAAAG CAACAAAAAG TGATTTGGAG ACACAGATTT CTAGCTTAAA 3780 
TGAAAAATTG GCCAATCTGA ATAGAAAGTA TGAGGAAGTA TGTGAGGAAG TTTTGCATGC 3840 
CAAAAAGAAG GAAATATCTG CAAAAGATGA GAAGGAATTA CTGCATTTCA GCATTGAGCA 3900 
AGAAA7TAAG GATCAGAAGG AACGATGTGA TAAGTCCTTA ACAACAATCA CAGAGTTACA 3960 
AAGAAGAATA CAAGAATCTG CTAAACAAAT AGAAGCAAAA GATAATAAGA TAACTGAACT 4020 



1130 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GCTTAATGAT 
AAGTGGGAAC 
ATCTCTGGAG 
TCGGACACAC 
GCTCCAGATC 
TATCTGTTTT 
CTTACTGTGC 



GTGGAAAGAT 
CCCACCAAGA 
CAACAGCTGG 
CTTCTTAGTG 
ATACAAATGC 
ATCTTGCTGG 
TGGTATAATT 



TAAAACAGGC 
GGCAGAGCCA 
CCGATGCTGA 
CTGCACAGGG 
GGCAGGGGCT 
TGCTGAACAT 
AAAATAAAAT 



ACTCAATGGC 
QCTGATTQAC 
CAGACAGCAC 
TCACATGGAT 
TGTGTGCTAG 
TCTTTGTGCA 
ATATTTTGTT 



Seq ID NO: 446 Protein sequence 
Protein Accession ft: AAG49S77.1 



1 
I 

MKSLKSRLRR 
PGKLDVEGRS 
LQYNCPTEHA 
TICQLLIDRG 
RIGDNLDILT 
IENEDLKERI* 
ESLRTIEALK 
SKLRPLELSL 
CERVKEDSDE 
ELKDQLKDLK 
EREKKGRKVT 
LSEIRQLKRE 
KVYLDNKLLK 
EME KI JJ iEMD 
IHALTSENTN 
KDXNEILKRN 
VTLHAEIKAQ 
EEVKKNKQEN 
EVKNVKEKLV 
CYEKEQQTVT 
VSKLQSEVQN 
VLHAKKKEIS 
ITELLNDVER 
IAIYRTHLLS 



11 

I 

QDVPGPASSG 
VFHWTSKGN 
DLQGRTALQK 
ADVNSRDKQN 
LLKTASENTN 
RKIQQEQR1L 
NRPKYFESDH 
PSQTSYSENE 
QIKQLEDALK 
VKYEGASAEV 
EMEGQAKELS 
LENVKAKLAQ 
EQAHNLTIEM 
SLSKDVSRLE 
LKKMMSNQYV 
LENTQNQIKA 
KKELDTIQEC 
DKLKKEIPTL 
BENAKQTSEI 
KLHQLLENQK 
TKQALKKLET 
AKDEKELLHP 
LKQALNGLSQ 
AAQGHMDEDV 



21 

I 

AAAASAHAAD 
LECLNAILIH 
KAMADCPSSI 
RTALMLGCEY 
KGRELWKKGP 
LDKVNGLQLQ 
LGSGSHFSNR 
ILKKELEAMR 
DVQKRMYBSE 
GKLRNQIKQK 
AKLALSIPAE 
HVKPEEHEQV 
KNHYVPLKVS 
TVFVPPEKHE 
PVKTHEEVKM 
EYISLAEREA 
IKVKYAPIVS 
QKDLRDKTVL 
LAVQNLLQKQ 
NS9VPLAEHI* 
REWDLSKYK 
SIEQEIKDQK 
LTYTSGNPTK 
QEALLQIIQM 



31 
I 

WNKYDDRLMK 
GVDITTSDTA 
QLLCDHGASV 
GCRDAVEVLI 
SLQQRNLTHM 
LNEEVMVADD 
KEDMLLKQGQ 
TFCESAKQDR 
GKVKQMQTHF 
EMZVEEFKRD 
KFENWKSSLS 
KSRLEQKSGE 
EDMKKSHDAI 
KEIIALKSN1 
TLNDTLAKTN 
KMSSLSQSMR 



IEKSHEMERA 
HVPLEQVEAL 
QIKEAFEKEV 
ATKSDLETQI 
ERCDKSLTTI 
RQSQLIDTLQ 
RQGLVC 



CTTTCCCAAC 
ACTCTGCAGC 
CAAGAAGTAA 
GAAGATGTTC 
COGTTAGCAC 
ACTCCATGGT 
CTGGGTGT 



41 
I 

AAERGDVEKV 
GRNALHLAAK 
NAKDVKGRTP 
KNGADISLLD 
QDEVNVKSHQ 
LESEREKLKS 
MYMADSQCTS 
LKLQNELAHK 
LAXiKEHLTSE 
EGKLIEENKR 
NEVNEKAKKL 
LGKKITELTL 
IDDLNRKLLD 
VELKKQLSEL 
RELLDVKKKP 
KVQDSKAEIL 
TEKELKDQLS 
LSRKTDELNK 
KKSLMGTIEN 
GIIKASLREK 
SSI^JEKLANL 
TELQRRIQES 
HQVKSLEQQL 



TCACCTACAC 4080 

ACCAAGTGAA 4140 

TTGCAATTTA 4200 

AGGAGGCTCT 4260 

TGACTGCCAG 4320 

CTTTCTGGGC 4380 



51 

I 

TSILAKKGVN 
YGHALCLQKL 
LVLATQMSRP 
ALGHDSSYYA 
REHQNIQDLE 
LLAAKEKQHE 
PGIPAHMQSR 
VAECKALALE 
AASGUHRLTE 
LQKELSMCEM 
VEMEREHEKS 
KNQTLQKEIE 
VTQKYTEKKL 
KKKCGEDQEK 
EDINQEPVKI 
ANYRKGQEEI 
EOTQKYSVSE 
QLKDLSQKYT 
LKEELKSMQR 



NRKYEEVCEE 
AKQIEAKDNK 
ADADRQHQEV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 447 DNA sequence 
Nucleic Acid Accession #t NMJJ03020.1 
Coding sequence i 29 . . 664 



1 
I 

CGCTCCTCGG 
TGGCCTACTG 
CCCTGACCGG 
GGGCATTGCC 
CCAGAGCATT 
CCCCAACATC 
GGGGTACCCA 
AAACACCCCT 
TCOGGAACAT 
GAAGGGAGGA 
GGATAATGTT 
GTAAAGAGAA 
TGCACGTGTA 
TACAAAGCAG 
AAATTAGAAT 
TAAAAATTAA 
TGCAGTTTAA 
TTTGATTTTG 
GCTGTACTCA 
GCAAGCATTG 



11 
I 

GCTGCCCCTC 
TTTTGGCTGG 
GTCTCAGAAG 
AGGCCCCGAG 
GAAGGTGGAG 
GTGGCAGAGT 
GACCCTCCAA 
GACACTGCAG 
GACTATCCAG 
GAGAGACGAA 
GTTGCAAAGA 
GATGCTAGAC 
AATGGAGTCC 
CTGTATGTAG 
AAGAGCTTTT 
AATGTGAATG 
AATGGTGTCT 
ATTATGTAGT 
AGAGGAGGAG 
GC 



21 
I 

GGTTGACAAT 
CATCTGGATG 
CAGATATCCA 
TGGAATATCC 
CTCATGAAGG 
TGACTGGAGA 
ATCCCTGTCC 
AGTTCAGTCG 
GCTTGGGCAA 
AGCGGAGGAG 
AGTCTGTCCC 
GAAAACCCAC 
CTGTGAATGA 
ATAGTGTATT 
TTGTTTCTTG 
TCAACAATAA 
GAGGTTGTAC 
TCATCCAGCC 
CTGACACATT 



31 
I 

GGTCTCCAGG 
GACTCCAGCA 
GAGGCTGCTT 
AGCTCACCAG 
ACTTCAGCAT 
CAACATTCCT 
TGTTGGAAAA 
AGAGTTCCAG 
GTGGAACAAG 
TGTCAATCCA 
CCATTTTTCA 
ATTACCTGTT 
CAGCATGTTT 
GTCTTCACAC 
GGTTTTTAAA 
AAAGCAAGAC 
TATTTTGGCC 
CTTGGGCATT 
TCACTTGGCT 



41 
I 

ATGGTCTCTA 
TTTGCTTACA 
CATGGTGTTA 
GCCATGAATC 
TTGGGTCCTT 
AAGGACTTTA 
ACAGATGATG 
TTGCACCAGC 
AAACTCCTTT 
TATCTACAAG 
GATGAGGATA 
AGGCCTCAGC 
CTTACATAGA 
CGATGATTCT 
ATGTGAATCT 
TATGAAAGGC 
AAGTCTGTAG 
GTTATACACC 
GCGTCTTAAT 



51 
I 

CCATGCTATC 
GCCCCCGGAC 
TGGAGCAATT 
TTGTGGGCCC 
TTGGCAACAT 
GTGAGGATCA 
GATGTCTAGA 
ATCTCTTTGA 
ACGAGAAGAT 
GACAGAGACT 
AGGATCCAGA 
ATGGCTTATG 
TAATTATGGA 
GCTTTTTG CT 
GCAATGATCA 
TCAGATTTCT 
AAAGCTGTCA 
AGTAAAGAAG 
AAACATGAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 448 Protein sequence 
Protein Accession #: NP_003 011.1 

1 11 21 31 41 51 

I I I I I I 

MVSRMVSTML SGLLFWLASG KTPAPAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 
PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDPSED QGYPDPPNPC 
PVGKTDDGGj ENTPDTABPS RBPQLHQHLP DPEHDYPGLG kwnkkllyek mkggerrkrr 
SVNPYLQGQR LDNWAKKSV PHPSDEDKDP B 

Seq ID NO: 449 DNA sequence 

Nucleic Acid Accession ft: NM_003816.1 

Coding sequence: 79. .2536 

1 11 21 31 41 51 

i I I I I I 

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 
CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 
OGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 

1131 



60 
120 
180 



60 
120 
180 



WO 03/042661 



PCT/US02/36810 



CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 
AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GOCTQAAGGA 300 
AAAGAGCATA TTATTCACTT GQAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 
TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 
5 CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480 
GGACTCAGAG GATTGCTGCA TTTAGAGAAT GGGAGTTATG GGATTGAACC CCTGCAGAAC 540 
AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 
TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 
CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC COGGTATGTG 720 
10 GAGCTGTTCA TTGTOGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 
GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 
ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 
GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

15 ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGT G TTT GG A 1080 

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCAOG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200 

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

20 CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380 

GAATGTGAAT TGGACCCTTG CTGOGAAGGA AGTACCTGTA AOCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT OGGTTCCTTC CAGGAGGTAC TTTATGCOGA 1500 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620 

25 GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680 

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTOGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

30 GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TQGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

35 TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCOGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

COGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 

40 TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG OCAGGGAATT TACAATAACA TTTCGGTTTC 2760 

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

45 TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

AGTGTTTAAG TGTTATTCTG AA1TTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

A CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

50 TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTOGTT CACAATAGCA CXATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

55 CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720 

AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780 

TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 
AAAAAAAAAA AAAAAAAAAA AAAAA 



60 
65 



80 



Seq ID NO: 450 Protein sequence 
Protein Accession #: NP 003807.1 



1 11 21 31 41 51 

I I I I I I 

MGSGARFPSG TLRVRWLLLL GLVGPVLGAA RPGPQQTSHL SSYEIITPWR LTRERREAPR 60 

70 PYSKQVSYVI QABGKEHIIH LERNKDLLPE DFWYTYNKE GTL1TDHPNI QNHCHYRGYV 120 

EGVHNSSIAL SDCFGLRGLL HLENASYGIE PLQNSSHFEH IIYRMDDVYK BPLKCGVSNK 180 

DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELPIW DKERYDMMGR NQTAVREEMI 240 

LLANYLDSMY IMLMIRIVLV GLEIWTNGNL IKIVGGAGDV LGNFVQWREK FLlTRRRHDS 300 

AQLVLKKGPG GTAGMAPVGT VCSRSHAGGI NVFGQITVET PASIVAHELG HNLGMNHDDG 360 

75 RDCSCGAKSC IMNSGASGSR NFSSCSAEDP EKLTLNKGGN CLLNIPKPDE AYSAPSCGNK 420 

LVDAGEECDC GTPKECELDP CCEGSTCKL.K SFAECAYGDC CKDCRFLFGG TLCRGKTSEC 480 

DVPEYCNGSS QFCQPDVFIQ NGYPOQNNKA YCYNGMCQYY DAQCQVIFGS KAKAAPKDCF 540 

IEWSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IOTPSRGTKC 600 

HGVDFQLGSD VPDPGMVNEG TKCGAGKICR NPQCVDASVL HYDCDVQKKC HGHGVQJSNK 660 

NCHCENGWAP PKCETKGYGG SVDSGPTYNE MNTALRDGLL VPFFLIVPLI VCAIPIPIKR 720 

DQIWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRPAVPTYAA 780 
KQPQQPPSRP PPPQPKVSSQ GNLIPARPAP APPLYSSLT 
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Seq ID NO: 451 QUA sequence 

Nucleic Acid Accession ft: NM_0166S0.1 

Coding sequence: 196..7B9 ~ 

5 1 11 21 31 41 51 

I I I I I I 

GGTTTCAATA TATGCAGATG TCTCGATATA GGAATQAAAT TACGTCTTTG GAACAACTTA 60 

AATAAGTCAA ATATACTTQG AGCTTTAAAA ATTAAAAGGA GAGAGATTCG AGCACCTTTT 120 

CTGCTGCCAT GACAACCATG CAAGGAATGG AACAGGCCAT GCCAGGGTTG GCCTGGTGTG 180 

CCCCAGCTGG GAAACATGGC TGTCATACAT TCACATCTGT GGAAAGGATT GCAAGAGAAG 240 

TTCTTGAAGG GAGAACCCAA A6TCCTTGGG GTTGTGCAGA TTCTGACTGC CCTGATGAGC 300 

CTTAGCATGG GAATAACAAT GATGTGTATG GCATCTAATA CTTATGGAAG TAACCCTATT 360 

TCCGTGCATA TCGGGTACAC AATTTGGGGG TCAGTAATGT TTA3TATTTC AGGATCCTTG 420 

TCAATTGCAG CAGGAATTAG AACTACAAAA GGCCTGGTCC GAGGTAGTCT AGGAATGAAT 4 BO 

ATCACCAGCT CTGTACTGGC TGCATCAGGG ATCTTAATCA ACACATTTAG CTTGGCGTTT 540 

TATTCATTCC ATCACCCTTA CTGTAACTAC TATGGCAACT CAAATAATTG TCATGGGACT 600 

ATGTCCATCT TAATGGGTCT GGATGGGATG GTGCTCCTCT TAAGTGTGCT GGAATTCTGC 660 

ATTGCTGTGT CCCTCTCTGC CTTTGGATGT AAAGTGCTCT GTTGTACCCC TGGTGGGGTT 720 

GTG TTAAT TC TGCCATCACA TTCTCACATG GCAGAAACAG CATCTCCCAC ACCACTTAAT 780 

GAGGTTTGAG GCCAACAAAA GATCAACAGA CAAATGCTCC AGAAATCTAT GCTGACTGTG 840 

ACACAAGAGC CTCACATGAG AAATTACCAG TATCCAACTT CGATACTGAT AGACGTGTTG 900 

ATATTATTAT TATATGTAAT CCAATTATGA ACTGTGTGTG TATAGAGAGA TAATAAATTC 960 

AAAATTATGT TCTCATTTTT TTCCCTGGAA CTCAATAACT CACTTCACTG GCTCTTTATC 1020 
GAGAGTACTA GGAGTTAAAT TAATAAATAA TGCATTTAAT GAGGCCACAG GAAAAA 



35 



65 
70 
75 
80 



Seq ID NO: 452 Protein sequence 
Protein Accession #: NP 057734.1 



OA 1 U 21 31 41 51 

30 i I I I l l 

MAVIHSHLWK GLQEKFLKGS PKVLGWQIL TALMSLSMGI TMMCMASNTY GSNPISVHIG 60 

YTIWGSVMFI ISGSLSIAAG IRTTKQLVRG SLGMNITSSV LAASGILINT FSLAFYSFHH 120 

PYCNYYGNSN NCHGTMSIUM GLDGMVLLLS VLEFCIAVSL SAPGCKVLCC TPGGWLILP 180 
SHSHMABTAS PTPLNEV 



Seq ID NO: 453 dna sequence 

Nucleic Acid Accession fl: NMJ)02091.1 

Coding sequence: 56.. 503 



40 1 11 21 31 41 SI 

I I I I I I 

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60 

CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 120 

. AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 180 

4j CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240 

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420 

r(\ AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480 

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAG AGACT G AGTTCTGCAA GCATCAGTTC TAOGGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720 

TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 

JJ TAAAAGCTTA AACACAT 

Seq ID NO: 454 Protein sequence 
Protein Accession #» NP_0020B2.1 

60 i 11 21 31 41 51 

I I I I I I 

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGBSSS 60 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 



Seq ID NO: 455 DNA sequence 

Nucleic Acid Accession #: NM_016522.1 

Coding sequence: 265.. 1299 

1 U 21 31 41 51 

I I I I I I 

GCGGAAGCAG CGAGGAGGGA GCCCCCTTTG GCCGTCCTCC GTGGAACCGG TTTTCOGAGG 60 

CTGG CAAAAG CCGAGGCTGG ATTTGGGGGA GGAATATTAG ACTCGGAGGA GTCTGCGCGC 120 

TTTTCTCCTC CCCGCGCCTC CCGGTCGCCG CGGGTTCACC GCTCAGTCCC CGCGCTCGCT 180 

CCGCACCCCA CCCACTTCCT GTGCTCGCCC GGGGGGCGTG TGCCGTGOGG CTGCCGOAGT 240 

TCGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGOAAG 300 

TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCGTGOSC 360 

AGCGGAGATG CCACCTTCCC CAAAGCTATG GACAACGTGA CGGTCCGGCA GGGGGAGAGC 420 

GCCACCCTCA GGTGCACTAT TGACAACCGG GTCACCCGGG TGGCCTGGCT AAAOCGCAGC 480 

ACCATCCTCT ATGCTGGGAA TGACAAGTGG TGCCTGGATC CTCGCGTGGT CCTTCTGAGC 540 

AACACCCAAA CGCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAGGGCCCT 600 

TACACCTGCT CGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTGTG 660 

CAAGTATCTC CCAAAATTGT AGAGATTTCT TCAGATATCT CCATTAATGA AGGGAACAAT 720 

ATTAGCCTCA CCTGCATAGC AACTGGTAGA CCAGAGCCTA CGGTTACTTG GAGACACATC 780 
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TCTCCCAAAG CGGTTGGCTT TGTGAGTGAA GACGAATACT TGGAAATTCA GGGCATCACC 840 

CGGGAACAGT CAGGGGACTA CGAGTGCAGT GOCTCCAATG ACGTGGCCGC GCCCGTGGTA 900 

CGGAGAGTAA AGGTCACCGT GAACTATCCA CCATACATTT CAGAAGCCAA GGGTACAGGT 960 

GTCCCCGTGG GACAAAAGGG GACACTGCAG TGTGAAGCCT CAGCAGTCCC CTCAGCAGAA 1020 

TTCCAGTGGT ACAAGGATGA CAAAAGACTG ATTGAAGGAA AGAAAGGGGT GAAAGTGGAA 1080 

AACAGACCTT TCCTCTCAAA ACTCATCTTC TTCAATGTCT CTGAACATGA CTATGGGAAC 1140 

TACACTTGCG TGGCCTCCAA CAAGCTGGGC CACACCAATG CCAGCATCAT GCTATTTGGT 1200 

CCAGGCGCCG TCAGCGAGGT GAGCAAOGGC ACGTCGAGGA GGGCAGGCTG GGTCTGGCTG 1260 

CTGCCTCTTC TGGTCTTGCA CCTGCTTCTC AAATTTTGAT GTGAGTGCCA CTTCCCCACC 1320 

CGGGAAAGGC TGCCGCCACC ACCACCACCA ACACAACAGC AATGGCAACA CCGACAGCAA 1380 

CCAATCAGAT ATATACAAAT GAAATTAGAA GAAACACAGC CTCATGGGAC AGAAATTTGA 1440 

GGGAGGGGAA CAAAGAATAC TTTGGGGGGA AAAGAGTTTT AAAAAAGAAA TTGAAAATTG 1500 

CCTTGCAGAT ATTTAGGTAC AATGGAGTTT TCTTTTCCCA AACGGGAAGA ACACAGCACA 1560 

CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAQ TGTGGGCAAG 1620 

GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 1680 

AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 1740 

CCGGCCCAAG CGTGGQGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 1800 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 



Seq ID NO: 456 Protein sequence 
Protein Accession #i NP_057606.1 



1 11 21 31 41 51 

I I I I I I 

MGVOGYLFLP WKCLVWSLR LIiFLVPTGVP VRSGDATFPK AMDNVTVRQG ESATLRCTID 60 
NRVTRVAWLN RSTILYAGND KWCLDPRWL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 120 
NHPKTSRVHL IVQVSPKIVE ISSDISINEG NNISLTCIAT GRPEPTVTWR HISPKAVGFV 180 
SEDEYLEIQG ITREQSGDYE CSASNDVAAP WRRVKVTVN YPPYISEAKG TGVPVGQKGT 240 
LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IFFNVSEHDY GNYTCVASNK 300 
LGHTNASIML FGPGAVSEVS NGTSRRAGCV WLLPLLVLHL LLKP 



Seq ID NO: 457 DNA sequence 

Nucleic Acid Accession #: NM_012261.1 

Coding sequence: 203.. 1045 



1 11 21 31 41 51 

I I I I I I 

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC OGAGTCCTAG CTAGGCGCTC 60 

ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120 

CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180 

CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240 

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300 

GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360 

TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420 

GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480 

TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540 

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600 

GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660 

CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720 

CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780 

TGATCOGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840 

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960 

CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020 

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCOCTAT TCCTGCTCCC 1080 

CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140 

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200 

AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260 

ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320 

TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380 

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440 

TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500 

TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560 

GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 1620 

AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680 

GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740 
TTCTCTGGC 



Seq ID NO: 4S8 Protein sequence 
Protein Accession 8: NP_036393.1 



1 11 21 31 41 51 

I I I I I I 

MDLQGRGVPS IDRLRVLLML PHTMAQIMAE QEVENLSGLS TNPEKDIFW RENGTTCLMA 60 
EPAAKPIVPY DVWASNYVDL ITBQADIALT RGABVKGRCG HSQSELQVFW VDRAYAUCML 120 
PVKESHNMSK GPEATWRLSK VQFVYDSSBK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180 
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240 
LGLIDGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 



Seq ID NO: 459 DNA sequence 

Nucleic Acid Accession 0: NM_001169.1 

Coding sequence: 85.. 870 



1 11 21 31 41 51 
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I I I I I I 

TAGGAGATAA GAGTATCTTG CACAGCAGGT GCAGGTTTCC CAGCAGCTCA GGCAAGAGTC 60 

CGATGTTTGT GCCATCTGAT CCTGATGTCT GGAGAGATAG OCATGTGTGA GCCTQAATTT 120 

GGCAATGACA AGGCCAGGGA GCCGAGCGTG GGTGGCAGGT GOCGAGTGTC CTGGTACGAA 180 

5 OGGTTTGTGC AGCCATGTCT GGTCGAACTG CTGGGCTCTG CTCTCTTCAT CTTCATCGGG 240 

TGCCTGTCGG TCATTGAGAA TGGGACGGAC ACTGGGCTGC TGCAGCCGGC CCTGGCCCAC 300 

GGGCTGGCTT TGGGGCTCGT GATTGCCACG CTGGGGAATA TCAGTGGTGG ACACTTCAAC 360 

CCTGCGGTGT CCCTGGCAGC CATGCTGATC GGAGGCCTCA ACCTGGTGAT GCTCCTCCOG 420 

TACTGGGTCT CACAGCTGCT CGGGGGGATG CTCGGGGCTG CCTTGGCCAA GGTGGTGAGT 480 

10 CCTGAGGAGA GGTTCTGGAA TGCATCTGGG GCGGCCTTTG TGACAGTCCA GGAGCAGGGG 540 

CAGGTGGCAG GGGCGTTGGT GGCAGAGATC ATCCTGACGA CGCTGCTGGC CCTGGCTGTA 600 

TGCATGGGTG CCATCAATGA GAAGACAAAG GGCCCTCTGG CCCCGTTCTC CATCGGCTTT 660 

GCCGTCACCG TGGATATCCT GGCTGGGGGC CCTGTGTCTG GAGGCTGCAT GAATCCCGCC 720 

CGTGCTTTTG GACCTGCGGT GGTGGCCAAC CACTGGAACT TOCACTGGAT CTACTGGCTG 780 

15 GGCCCACTCC TGGCTGGCCT GCTTGTTGGA CTGCTCATTA GGTGCTTCAT TGGAGATGGG 840 

AAGACCCGCC TCATCCTGAA GGCTCGGTGA GCAGAGCTCG TGGGATTCCT GCTGCTCCAG 900 

GTGTCCTCAG CTCACCTGTC CCAGACTGAG GACAGGGGAG TTCCTGCATT TCCTGCCAGG 960 

GCAGAGGCCC AGAGGAGCGA CCCCCTGCTT CCACTGCTTG GGCCTGCTTT CTCAGATAGA 1020 

CTGACTGCTG AGGAGGCTCT AGGTTCTTGG AATTCCTTTG TGCTCATCAG AGACCCCAGC 1080 

20 CTGGGGAACA CGCTGCCCGC ACTGCCCAGA GAGCAGTGCA AACACCACAA CACGAGCGTG 1140 

TTTCTTGAGA GGAATGTCCC CGAGTTGGAC AAGGAGGCTG TTTCTGCACA TCAGCTCATT 1200 

TCCCGCACCC CATTTCTTGC TTGATTGCTT TGTTGGGGGC CTGGCCACTT CCTTGCTTCT 1260 
CAAGCTGACA ATTCTCACTT TGCAATAAAT AGTCCAGTGT TTCCTTCAT 

25 

Seq ID NO: 460 Protein sequence 
Protein Accession #: NP_00ll60.1 



1 11 21 31 41 SI 

30 | | | | | | 

MSGEIAMCEP EFGNDKAREP SVGGRWRVSW YERFVQPCLV ELLGSALFIF IGCLSVIENG 60 
TDTGLLQPAL AHGLALGLVI ATLGNISGGH PNPAVSIAAM LIGGLNLVML LPYWVSQLLG 120 
GMLGAALAKV VSPEERFWNA SGAAPVTVQE QGQVAGALVA EI ILTTLLAL AVCMGAINEK 180 
TKGPIAPFSI GFAVTVDILA GGPVSGGCMN PARAFGPAW ANHWNFHWIY WLGPLLAGLL 240 
35 VGLIilRCFIG DGKTRLIliKA R 



Seq ID NO » 461 DNA sequence 
Nucleic Acid Accession fti NMJ)03226.1 
40 Coding sequence: 2.. 22 6 

1 11 21 31 41 SI 

I I I I I I 

. _ GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT AGGTGGGCCT 60 

45 GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120 

CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 

TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 

CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 

A CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGC TT TCT G C TGAAAGTTCA TATCTGGAGC 360 
50 CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCOG 



Seq ID NO: 462 Protein sequence 
Protein Accession #: NPJ)03217.l 

55 l 11 21 31 41 51 

I I I I I I 

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 60 
WCFKPIjTRKT ECTP 

60 Seq ID NO: 463 DNA sequence 

Nucleic Acid Accession #: NMJ>02993.1 
Coding sequence: 64.. 40 8 



1 11 21 31 41 51 

65 i | | | | | 

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60 

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAt 240 

70 CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300 

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAO 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480 

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540 

75 GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600 

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTOA TAAGTTATTG AACCCTTTGG 660 

CAATTGACCA TATTGTGAGC AAAQAATCAC TGOTTATTAG TCTTTCAATG AATATTGAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTAOCGAA AAGGCTGTGG 780 

ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840 

80 ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 
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ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 
TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 
AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTQAAT ATAAATTCAT 
CATTTAGTCC TCAAAATATA TACAGCATTQ CTAAGATTTT CAGATATCTA TTGTGGATCT 
TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 
AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTO TATTGTCATT 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 464 Protein sequence 
Protein Accession ft: NPJ) 02 984.1 

1 11 21 31 41 51 

I I I I I I 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 60 
KTIGKLQVPP AGPQCSKVEV VASLKNGKQV CLDPBAPPLK KVIQKILDSG NKKN 

Seq ID NO: 465 DNA sequence 
Nucleic Acid Accession ft: NM_002038.2 
Coding sequence: 10 8,. 500 

1 11 21 31 41 51 

I I I I I I 

GAACCGTTTA CTCGCTGCTG TGCCCATCTA TCAGCAGGCT CCGGGCTGAA GATTGCTTCT 60 

CTTCTCTCCT CCAAGGTCTA GTGACGGAGC CCGCGCGOGG CGCCACCATG CGGCAGAAGG 120 

CGGTATCGCT TTTCTTGTGC TACCTGCTGC TCTTCACTTG CAGTGGGGTG GAGGCAGGTA 180 

AGAAAAAGTG CTCGGAGAGC TCGGACAGCG GCTCCGGGTT CTGGAAGGCC CTGACCTTCA 240 

TGGCCGTCGG AGGAGGACTC GCAGTCGCCG GGCTGCCCGC GCTGGGCTTC ACCGGCGCCG 300 

GCATCGCGGC CAACTCGGTG GCTGCCTCGC TGATGAGCTG GTCTGCGATC CTGAATGGGG 360 

GOGGCGTGCC CGCCGGGGGG CTAGTGGCCA CGCTGCAGAG CCTCGGGGCT GGTGGCAGCA 420 

GCGTCGTCAT AGGTAATATT GGTGCCCTGA TGGGCTACGC CACCCACAAG TATCTCGATA 480 

GTGAGGAGGA TGAGGAGTAG CCAGCAGCTC CCAGAACCTC TTCTTCCTTC TTGGCCTAAC 540 

TCTTCCAGTT AGGATCTAGA ACTTTGCCTT TTTTTTTTTT TTTTTTTTTT TTTGAGATGG 600 

GTTCTCACTA TATTGTCCAG GCTAGAGTGC AGTGGCTATT CACAGATGCG AACATAGTAC 660 

ACTGCAGCCT CCAACTCCTA GCCTCAAGTG ATCCTCCTGT CTCAACCTCC CAAGTAGGAT 720 

TACAAGCATG CGCCGACGAT GCCCAGAATC CAGAACTTTG TCTATCACTC TCCCCAACAA 780 
CCTAGATGTG AAAACAGAAT AAACTTCACC CAGAAAA 

Seq ID NO: 466 Protein sequence 
Protein Accession #: NP_002029.3 

1 11 21 31 41 51 

1 I 1 I I I 

MRQKAVSLFL CYLLLFTCSG VEAGKKKCSE SSDSGSGFWK ALTFMAVGGG LAVAGLPALG 60 

PTGAGIAANS VAASLMSWSA ILNGGGVPAG GLVATLQSLG AGGSSWIGN IGALMGYATH 120 
KYLDSEEDBE 

Seq ID NO: 467 DNA sequence 

Nucleic Acid Accession ft: NMJ503469.2 

Coding sequence : 92 . . 1945 

1 11 21 31 41 51 

I I I I I I 

GAAACGGCCC GAGAAGCTOG CCCGGAGAAC GGGGAGGAAT ATGCTGTGGA GCTCCTCTGC 60 

CATATAAACA AAAAGAGGAA ATCTTTCAAA CATGGCTGAA GCAAAGACCC ACTGGCTTGG 120 

AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTTCATT 180 

TCAGAGAAAC CAGCTGCTTC AGAAAGAACC AGACCTCAGG TTGGAAAATG TCCAAAAGTT 240 

TCCCAOTCCT GAAATGATCA GGGCTTTGGA GTACATAGAA AACCTCCGAC AACAAGCTCA 300 

TAAGGAAGAA AGCAGCCCAG ATTATAATCC CTACCAAGGT GTCTCTGTCC CCCTTCAGCA 360 

AAAAGAAAAT GGCGATGAAA GCCACTTGCC CGAGAGGGAT TCACTGAGTG AAGAAGACTG 420 

GATGAGAATA ATACTCGAAG CTTTGAGACA GGCTGAAAAT GAGCCTCAGT CTGCACCAAA 480 

AGAAAATAAG CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 540 

TTATGAGACA CAGCAGTGGC CAGAAAGAAA GCTTAAGCAC ATGCAATTCC CTCCTATGTA 600 

TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT GAAATAGTGG AGGAACAATA 660 

TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA AACTGACAGG 720 

ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA AAACTTTATA CGGATGATGA 780 

AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT GTGGTCGGGG GAGAAGACTG 840 

GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG ACAGCAAAGA 900 

GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 960 

CCAGGAAGAA GATCTTCGGA AAGAGAGTAA AGACCAACTC TCAGATGATG TCTCCAAAGT 1020 

AATTGCCTAT TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC AGAATGGGCA 1080 

AAATGGGGAA AGGGCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT CTATTTATCA 1140 

GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 1200 

AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 1260 

CCTAGATGAC ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320 

CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380 

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 1440 

GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500 

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1S60 

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620 

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680 

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740 

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800 

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860 

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920 
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TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040 

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAQGTC ACTGGGGAGT CTGCTTCATT 2100 

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTQTAAATG TTATGACTCC TTGATAAAAA 2160 

5 ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220 

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280 

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GTTTTGTGGA GTACTGGACC 2340 
AAAAAAATAA AGCATTATAA ATATA 

10 Seq ID NO: 468 Protein sequence 
Protein Accession #: NP_003460.l 

1 11 21 31 41 51 

«c I I I I I I 

I J MAEAKTHWLG AALSLIPLIF LISGAEAASF QRNQLLQKEP DLRLENVQKF PSPEMIRALE 60 

YIENLRQQAH KEESSPDYNP YQGVSVPLQQ KENGDESHLP ERDSLSEEDW MRIILEALRQ 120 

AENEPQSAPK ENKPYALNSE KNFPMDMSDD YETQQWPERK LKHMQFPPMY EENSRDNPFK 180 

RTNBIVEEQY TPQSLATLES VFQELGKLTG PNNQKRERMD EEQKLYTDDE DDIYKANNIA 240 

YEDWGGEDW NPVEEKIESQ TQEEVRDSKE NIGKNEQXND EMKRSGQLGI QEEDLRKESK 300 

20 DQLSDDVSKV IAYLKRLVNA AGSGRLQNGQ NGERATRLFE KPLDSQSIYQ LIEISRNLQI 360 

PPEDLIEMLK TGEKPNGSVE PERELDLPVD LDDISEADLD HPDLFQNRML SKSGYPKTPG 420 

RAGTEALPDG LSVEDILNLL GMESAANQKT SYFPNPYNQE KVLPRLPYGA GRSRSNQLPK 480 

AAMIPHVENR QMAYENLNDK DQELGEYLAR MLVKYPEI1N SNQVKRVPGQ GSSEDDLQEE 540 

EQIBQAIKEH LNQGSSQETD KLAPVSKRFP VGPPKNDDTP NRQYWDEDLL MKVLEYLNQE 600 

KAEKGREHXA KHAMENM 



25 
30 
35 



45 
50 
55 
60 



80 



Seq ZD NO: 469 DNA sequence 

Nucleic Acid Accession #: NMJJ06398.1 

Coding sequence: 19.. 516 



1 11 21 31 41 51 

I I I I I I 

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 60 

GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 120 

GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCAOGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 240 

CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 360 

GTGAAAGCAA TGATCGAGAC TAAGAOGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC 420 

40 AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 480 

TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATQ GGGTGTTGGC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT GGGTGGGATG GGTAGGATGA 660 

AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 720 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



Seq ID NO: 470 Protein sequence 
Protein Accession #: NP_006389.1 

1 11 21 31 41 51 

I I I I I I 

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 60 
RSLSSYGIDK EKTIHLTLKV VKPSDEBLPL PLVESGDBAK RHLLQVRRSS SVAQVKAMIE 120 
TKTGIIPETQ IVTCNGKRLE DGKMMADYGI RKGNLLFLAS YCIGG 

Seq ID NO: 471 DNA sequence 

Nucleic Acid Accession #: XMJJ94741.1 

Coding sequence: 1..948 



1 11 21 31 41 51 

I I I I I I 

ATGAAGGCCA ACTACAGCGC AGAGGAGCGC TTTCTCCTGC TGGGTTTCTC CGACTGGCCT 60 

TCCCTGCAGC CGGTCCTCTT CGCCCTTGTC CTCCTGTGCT ACCTCCTGAC CTTGACGGGC 120 

AACTCGGCGC TGGTGCTGCT GGCGGTGCGC GACCCGCGCC TGCACACGCC CATGTACTAC 180 

03 TTCCTCTGCC ACCTGGCCTT GGTAGACGCG GGCTTCACTA CTAGCGTGGT GCCGCOGCTG 240 

CTGGCCAACC TGCGCGGACC AGCGCTCTGG CTGCCGCGCA GCCACTGCAC GGCCCAGCTG 300 

TGCGCATCGC TGGCTCTGGG TTOGGCCGAA TGOGTCCTCC TGGCGGTGAT GGCTCTGGAC 360 

CGCGCGGCCG CAGTGTGCCG CCCGCTGCGC TATGCGGGGC TCGTCTCCCC GCGCCTATGT 420 

CGCACGCTGG CCAGCGCCTC CTGGCTAAGC GGCCTCACCA ACTCGGTTGC GCAAACCGCG 480 

70 CTCCTGGCTG AGCGGCCGCT GTGCGCGCCC CGCCTGCTGG ACCACTTCAT CTGTGAGCTG 540 

CCGGCGTTGC TCAAGCTGGC CTGOGGAGGC GACGGAGACA CTACCGAGAA CCAGATGTTC 600 

GCCGCCCGCG TGGTCATCCT GCTGCTGCCG TTTGCCGTCA TCCTGGCCTC CTACGGTGCC 660 

GTGGCCCGAG CTGTCTGTTG CATGCGGTTC AGCGGAGGCC GGAGGAGGGC GGTGGGCACG 720 

TGTGGGTCCC ACCTGACAGC CGTCTGCCTG TTCTACGGCT CGGCCATCTA CACCTACCTG 780 

75 CAGCCCGCGC AGCGCTACAA CCAGGCACGG GGCAAGTTCG TATCGCTCTT CTACACCGTG 840 

GTCACACCTG CTCTCAACCC GCTCATCTAC ACCCTCAGGA ATAAGAAAGT GAAGGGGGCA 900 
GCGAGGAGGC TGCTGCGGAG TCTGGGGAGA GGCCAGGCTG GGCAGTGA 



Seq ID NO: 472 Protein sequence 
Protein Accession #: XP_094741.1 

1 -11 21 31 41 51 

I I I I I I 

MKANYSABER FLLLGFSDWP SLQPVLPALV LLCYLLTLTG NSALVLLAVR DPRLHTPMYY 
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10 



FLCHLALVDA GPTTSWPPL LANLRGPALW LPRSHCTAQL CASLALGSAE CVLLAVMALO 120 

RAAAVCRPLR YAGLVSPRLC RTLASASWLS GLTNSVAQTA LLABRPLCAP RLLDHFICEL 1B0 

PALLKLAOGG DGDTTENQMF AARWILLLP PAVILASYGA VARAVOCMRP SGGRRRAVGT 240 

CGSHLTAVCL PYGSAIYTYIi QPAQRYNQAR GKFVSLFYTV VTPALNPLIY TLRNKKVKGA 300 
ARRLLRSLGR GQAGQ 

Seq ID NO: 473 DNA sequence 

Nucleic Acid Accession #: NM_O01062.1 

Coding sequence: 76.. 1380 



1 11 21 31 41 SI 

I I I I I I 

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG CCAATTCATC AACATTCTGG 60 

TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 120 

15 TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 180 

CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 240 

AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 300 

ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 360 

GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 420 

20 TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 480 

CACAATGGCA CTCCCCTGAC TAACTACTAC CAQCTCAGCC TGGACGTTTT GGCCTTGTGT 54 0 

CTGTTCAATG GGAACTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 600 

AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 660 

ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 720 

25 AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TGTCTGAGAA AAAAGAAAAT 780 

GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 840 

GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 900 

ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 960 

GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 1020 

30 AACATCTCOG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC ATATATCTCC 1080 

GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 1140 

TCTGTCTTCC TCAGTGTGAT GGAGAAAGCC CAGAAAATGA ATGATACTAT ATTTGGTTTC 1200 

ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 1260 

AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 1320 

35 GGTAGTTACG TTGTCOGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 1380 

GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 1440 

TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCCTTCTC 1500 
TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 

40 Seq ID NO: 474 Protein sequence 
Protein Accession #: NP_0010S3.l 

1 11 21 31 41 51 

AC I I I I I I 

4j MRQSHQLPLV GLLLFSPIPS QLCEICEVSE ENYIRLKPLL NTMIQSNYNR GTSAVNWLS 60 

LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSGELALIIL ALGVCRNAEE NLIYDYHLTD 120 

KLENKPQAEI ENMEAHNGTP LTNYYQLSLD VLALCLPNGN YSTABWNHF TPENKNYYFG 180 

SQPSVDTGAM AVLALTCVKR SLINGQIKAD EGSLKNISIY TKSLVEKILS EKKENGLIGN 240 

TPSTGEAMQA LPVSSDYYNE NDWNCQQTLN TVLTEISQGA FSNPNAAAQV LPALMGKTFL 300 

50 DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTNV TVLNGSVFLS 360 

VMEKAQKMND TIFGFTMEER SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYW 420 
RNGENLEVRW SKY 

Seq ID NO: 475 DNA sequence 
55 Nucleic Acid Accession #: NM_004BS2.1 
Coding sequence: 89.. 1546 

1 11 21 31 41 SI 

I I I I 1 I 

OU GCCCCCGOCC GCCCCGGGCC CTGATGGACT GAATGAAGGC TGCCTACACC GCCTATCGAT 60 

GCCTCACCAA AGACCTAGAA CGTGCGCCAT GAACCOGGAG CTGACAATGG AAAGTCTGGG 120 

CACTTTGCAC GGCGCGCGCG GCGGCGGCAG TGGCGGGGGC GGCGGCGGGG GCGGCGGGGG 180 

CGGGGGCGGG GGCCCGGGCC ATGAGCAGGA GCTGCTGGCC AGCCCCAGCC CCCACCACGC 240 

GCGCCGCGGC CCGCGTGGCT CGCTGCGGGG CCCTCCGCCG CCTCCAACCG CGCACCAGGA 300 

65 GCTGGGCAOG GCGGCAGCGG CGGCAGCGGC GGOGTCGCGC TCGGCCATGG TCACCAGCAT 360 

GGCCTCGATC CTGGACGGCG GCGACTACOG GCCCGAGCTC TCCATCCCGC TGCACCACGC 420 

CATGAGCATG TCCTGCGACT CGTCTCCGCC TGGCATGGGC ATGAGCAACA CCTACACCAC 480 

GCTGACACCG CTCCAGCCGC TGCCACCCAT CTCCACCGTG TCTGACAAGT TCCACCACCC 540 

A TCACCCGCAC CACCATCCGC ACCACCACCA CCACCACCAC CACCAGCGCC TGTCCGGCAA 600 

70 CGTCAGCGGC AGCTTCACCC TCATGCGCGA CGAGCGCGGG CTCCCGGCCA TGAACAACCT 660 

CTACAGTCCC TACAAGGAGA TGCCCGGCAT GAGCCAGAGC CTGTCCCCGC TGGCCGCCAC 720 

GCCGCTGGGC AACGGGCTAG GCGGCCTCCA CAACGCGCAG CAGAGTCTGC CCAACTACGG 780 

TCCGCCGGGC CACGACAAAA TGCTCAGCCC CAACTTCGAC GCGCACCACA CTGCCATGCT 840 

nc GACCCGCGOT GAGCAACACC TGTCCCGCGG CCTGGGCACC CCACCTGCGG CCATGATGTC 900 

75 GCACCTGAAC GGCCTGCACC ACCCGGGCCA CACTCAGTCT CACGGGCCGG TOCTGGCACC 960 

CAGTCGCGAG CGGCCACCCT CGTCCTCATC GGGCTCGCAG GTGGCCACGT OGGGCCAGCT 1020 

GGAAGAAATC AACACCAAAG AGGTGGCCCA GCGCATCACA GCGGAGCTGA AGCGCTACAG 1080 

TATCCCCCAG GCGATCTTTG CGCAGAGGGT GCTGTGCCGG TCTCAGGGGA CTCTCTCCGA 1140 

CCTGCTCCGG AATCCAAAAC CGTGGAGTAA ACTCAAATCT GGCAGGGAGA CCTTCCGCAG 1200 

80 GATGTGGAAG TGGCTTCAGG AGCCCGAGTT CCAGCGCATG TCCGCCTTAC GCCTGGCAGC 1260 

GTGCAAACGC AAAGAGCAAG AACCAAACAA AGACAGGAAC AATTCCCAGA AGAAGTCCCG 1320 

CCTGGTGTTC ACTGACCTCC AACGCCGAAC ACTCTTCGCC ATCTTCAAGG AGAACAAACG 1380 

CCCGTCAAAG GAGATGCAGA TCACCATTTC CCAGCAGCTG GGCCTGGAGC TCACAACCGT 1440 

CAGCAACTTC TTCATGAACG CCCGGCGCCG CAGCCTGGAG AAGTGGCAAG ACGATCTGAG 1500 
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CACAGGGGGC TCCTCGTCCA CCTCCAGCAC GTGTACCAAA GCATGATGGA AGGACTCTCA IS 60 
CTTGGGCACA AGTCACCTCC AAATQAGGAC AACAGATACC AAAAGAAAAC AAAGGAAAAA 1620 
GACACCGGAT TCCTAGCTGQ GGCCCTTCAC TGGTG 

5 Seq ID NO: 476 Protein sequence 
Protein Accession NP_004843.1 

1 11 21 31 41 51 

10 1 1 1 1 1 1 

1U MNPELTMESL GTLHGARGGG SGGGGGGGGQ GGGGGPGHEQ ELLAS PSPHH ARRGPRGSLR 60 

GPPPPPTAHQ ELGTAAAAAA AASRSAMVTS MASILDGGDY RPELSIPLHH AMSMSCDSSP 120 
PGMGMSNTYT TLTPLQPLPP ISTVSDKFHH PHPHHHPHHH HHHHHQRLSG NVSGSFTLMR 180 
DERGLPAMNN LYSPYKEMPG MSQSLSPLAA TPLGKGLGGL HNAQQSLPNY GPPGHDKMLS 240 
PNFDAHHTAM LTRGEQHLSR GLGTPPAAMM SHLNGLHRPG HTQSHGPVLA PSRERPPSSS 300 
I J SGSQVATSGQ LEBINTKEVA QRITAELKRY SIPQAIPAQR VLCRSQGTLS DLLRNPKPWS 360 
KLKSGRETFR RMWKWLQEPE FQRMSALRLA ACKRKEQEPN KDRNNSQKKS RLVFTDLQRR 420 
TLPAIFKENK RPSKEKQITI SQQLGLELTT VSNPPMNARR RSLEKWQDDL. STGGSSSTSS 480 
TCTKA 

20 Seq ID NO: 477 DNA sequence 

Nucleic Acid Accession #: NM.013271.1 
Coding sequence: 27.. 809 



1 11 21 31 41 51 

25 | 1 | | | | 

TCCGGAGCCA GGCTOGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGG GGGCCGCGGG 60 

CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120 

TCTGCGCGCG GCGGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180 

AGACTGGCGC TCCTCGOCGC TTCCGGCGGT CAGTGCOCCG AGGTGAGGCG GCGGGGGCGG 240 

3U TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300 

GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGACGACGAC CCCGAOGCGC 420 

CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GCGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 

CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCOG ACCCCGGCCC CCGGTCTACG 540 

ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600 

CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660 

TGGCAGCCOC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720 

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780 

TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840 

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTOCCC GCCAGCACGT CCAGAGCAAC 900 

TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960 
GATCTGAGC 



35 
40 



Seq ID NO: 478 Protein sequence 
4 J Protein Accession #: NP 037403.1 



50 



1 11 21 31 41 51 

1 I I I I I 

MAGSPLLWGP RAGGVGLLVL LLLGI.FRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 60 

RSVPRGEAAG AVQBLARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 120 

PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 180 

EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 240 
RVKRLETPAP QVPARRLLPP 

55 Seq ID NO: 479 DNA sequence 

Nucleic Acid Accession #: NM_002214 
Coding sequence: 681.. 2990 

1 11 21 31 41 51 

60 | | | | | | 

CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 60 

CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT 120 

GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTQAT TGATGCGCCA CAGACTTTTT 180 

TCCCCTCGAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 240 

TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300 

CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG CTCTTTTCTT 360 

TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420 

TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 480 

GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT 540 

CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCG 600 

GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GOGTCCGQAA GGCAGCCAGG CGGCGGGCGC 660 

GGGGCGGGCT GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720 

TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 

CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTQ ATATTOTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 
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AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG AOGGAAACTG TCATCTGAAA AACAACGTCT 1620 

AOGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA I860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AAOGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTOGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATOGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCOGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGOC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



Seq ID NO: 480 Protein sequence 
Protein Accession #: NF_002205 



1 
I 

HCGSALAFFT 
LGPECGWCVQ 
GEVSIQLRPG 
SRDPRLGFGS 
VHRQKISGNI 
VPNDGNCHLK 
TIAGEIESKA 
NVTSNDEVLF 
CVDETPLDSK 
VYGKYCEKDD 
CSGRGTCVCG 
CALMEQQHYV 
DYRVSASKKD 



11 
I 

AAFVCLQNDR 
EDPISGGSRS 
AEANFMLKVH 
YVDKTVSPYI 
DTPEGGFDAM 
NNVYVKSTTM 
ANLNNLWBA 
NVTVTMKKCD 
CFQCDENKCH 
FSCPYHHGNLi 
RCECTDPRSI 
DQTSECPSSP 
KLILQSVCTR 



21 
I 

RGPASFLWAA 
ERCDIVSNLI 
PLKKYPVDLY 
SIHPERIHNQ 
LQAAVCESHI 
EHPSLGQLSE 
YQKLISEVKV 
VTGGKNYAII 
FDBDQFSSBS 



GRPCEHCPTC 
SYLRIPPIIP 
AVTYRREKPE 



31 
I 

WVFSLVLGLG 
SKGCSVDSIE 
YLVDVSASMH 
CSDVNLDCMP 
GWRKEAKRLL 
KLIDNNINVI 
QVENQVQGIY 
KPIGFNETAK 
CKSHXDQPVC 
RCQCFSGWEG 
YTACKENWNC 
IVTPLIGLLK 
EIKMDISKLN 



41 
I 

QGEDNRCASS 
YPSVHVIIPT 
NNIEKLNSVG 
PHGYIHVLSL 
LVMTDQTSHL 
PAVQGKQFKW 
FNITAICPDG 
IHIHRNCSCQ 
SGRGVCVCGK 
DRCQCPSAAA 
KQCLHPRKLS 
VLIIRQVILQ 
AHETFRCNP 



51 
I 

NAASCARCLA 
ENEINTQVTP 
NDLSRKMAPF 
TENITEFEKA 
ALDSKLAGIV 
YKDLLPLLPG 
SRKPGMEGCR 
CEDNRGPKGK 
CSCHKIKLGK 
QHCVNSKGQV 
QAILDQCKTS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 481 DNA sequence 

Nucleic Acid Accession ft: NMJJ03318.1 

Coding s equence : 1 . . 2 57 4 



1 

I 

ATGGAATCCG 
AGAGACATTA 
ATTTCTGCTG 
AACCCAGAGG 
GATGCTCTTT 
GATAAATATG 
GCTATTCAAG 
AAATTTGCTT 
AAAAGTAAAC 
GAAATTGCCC 
AAGAATTTAT 
CATTTACAGA 
TTATATGGAG 
CAAACTAACA 
AGCCCAGATT 
ACCTCTAGAT 
TCCTGTGAAT 
TCAGATGAAA 



11 
I 

AGGATTTAAG 
AAAATAAGTT 
ATACTACAGA 
ACTGGTTGAG 
TAAATAAATT 
GCCAAAATGA 
AGCCAGATGA 
TTGTTCATAT 
AACTTCTTCA 
TGCGGAATTT 
CAGCATCTAC 
ATAGGAACAA 
AGAACATGCC 
AAACTAAACA 
GTGATGTGAA 
CAGAATGCCG 
TAAGAAATTT 
AGAGTTCTGA 



21 
I 

TGGCAGAGAA 
TAAAAATGAA 
TAACTCGGGA 
TTTGTTGCTC 
GATTGGTCGT 
GAGTTTTGCT 
TGCACGTGAC 
ATCTTTTGCA 
AAAAGCTGTA 
AAACCTCCAA 
GGTATTAACT 
CAGTTGTGAT 
ACCACAAGAT 
GTCATGCCCA 
GACAGATGAT 
AGATTTGGTT 
AAAGTCTGTT 
ACTTATTATT 



31 
I 

TTGACAATTG 
GACCTTACTG 
ACTGTTAACC 
AAACTAGAGA 
TACAGTCAAG 
AGAATTCAAG 
TACTTTCAAA 
CAATTTGAAC 
GAACGTGGAG 
AAAAAGCAGC 
GCCCAAGAAT 
TCCAGAGGAC 
GCAGAAATAG 
TTTGGAAGAG 
TCAGTTGTAC 
GTGCCTGGAT 
CAAAATAGTC 
ACTGATTCAA 



41 
I 

ATTCCATAAT 
ATGAACTAAG 
AAATTATGAT 
AAAACAGTGT 
CAATTGAAGC 
TGAGATTTGC 
TGGCCAGAGC 
TGTCACAAGG 
CAGTACCACT 
TGCTTTCAGA 
CATTTTCCGG 
AGACTACTAA 
GTTACCGGAA 
TCCCAGTTAA 
CTTGTTTTAT 
CTAAACCAAG 
ATTTCAAGGA 
TAACCCTGAA 



51 
1 

GAACAAAGTG 
CTTGAATAAA 
GATGGCAAAC 
TCCGCTAAGT 
GCTTCCCCCA 
TGAATTAAAA 
AAACTGCAAG 
TAATGTCAAA 
AGAAATGCTG 
GGAGGAAAAG 
TTCACTTGGG 
AGCCAGGTTT 
TTCATTGAGA 
CCTTCTAAAT 
GAAAAGACAA 
TGGAAATGAT 
ACCTCTGGTG 
GAATAAAACG 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



1140 



WO 03/042661 



PCT/US02/36810 



GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCOG GAGTTAGCCC GAAAAGTTAA TACAOAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

5 ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 

GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1S00 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT IS 60 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

10 CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACOGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

15 CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

20 ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 

25 AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 



Seq ID NO: 482 Protein sequence 
Protein Accession #: NP_003309.l 

30 1 11 21 31 41 SI 

I I I I I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KPAFVHISFA QFELSQGKVK KSKQLLQKAV ERGAVPLEML 180 

35 EIALRNLNLQ KKQLLSEBEK KNLSASTVLT AQESPSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVXTDD SWPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPBVP ESNQKQWQSK RKSECINQNP AASSNHWQIP BLARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSXWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480 

40 LSTPYGQPAC FQQQQHQ1LA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNliEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVBGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVNYM PPEA1KDMSS SRENGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIBFPDIP EKDLQDVLKC CLKRDPKQRI 780 

45 SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 



Seq ID NO: 483 DNA sequence 
Nucleic Acid Accession #: NM_003667.1 
50 Coding sequence: 1..2651 

1 11 21 31 41 51 

I I I I I I 

_. _ ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

55 GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCOGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300 

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

60 CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

65 AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

_ A ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

70 CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

_ _ ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

75 TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTT GATAT C ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

Qf . TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

uO CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAO AATTGGAGTQ 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCAOGGGAG TCTCCAGTGC OGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC I860 

1141 
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ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAAOG AAAGCTCCAT TTTCTAQCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGOTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG Q6GA6CCCAG CACCATGGGC 2100 

5 TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

10 CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 
CCATGTCTTA A 



15 



35 
40 



Seq ID KO: 484 Protein sequence 
Protein Accession #: NP 003658.1 



1 11 21 31 41 51 

20 | | | | | | 

MDTSRLGVLL SLPVLLQLAT GGS9PRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLFNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAPGNLS SLWLHLBNN RIHSLGKKCF DGLHSLETLD 240 

25 LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

EIYEIKVDTF QQLLSLRSLN LAWNKIAIIH PNAPSTLPSL IKLDLSSNLL SSFPITGLHG 420 

LTHLKLTGNH ALQSLISSEN FPBLKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480 

DLHKKDAGMF QAQDERDLED PLLDPEEDLK ALHSVQCSPS PGPPKPCEHL LDGVJLIRIGV 540 

30 WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAPTPGSF 600 

ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGF SVKYSAKFET KAPFSSLKVI 660 

ILLCALLALT MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 780 

LLVWPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 840 
TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAFV PCL 



Seq ID NO: 485 DNA sequence 

Nucleic Acid Accession 8: NM_00S7S6.l 

Coding sequence; 73.. 3117 



1 11 21 31 41 51 

I I I I I I 

AGCCAGCCGG AGGACGOGAG GGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 

CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 

45 GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180 

TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 240 

GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300 

AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 

CA GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 

50 GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 460 

CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 

GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600 

GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720 

55 TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780 

CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840 

TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 

CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020 

60 CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080 

CCTGTGAAAG OCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 

AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 

CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTOGCAGGA 1260 

GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 

65 GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 

ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 

TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 

CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 

_ A CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 

/0 TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATOGAGT 1680 

GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC I860 

- c ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 

/ J GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

_ _ TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

o0 TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTQ GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2S20 
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CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATOGC TGGCCTTACA 25S0 

TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTQCCT GGGGACCAGT TAACGTGACC 2640 

TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 

5 CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 

GTAAACCAAG QA6TGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 

TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940 

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000 

TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCGGT 3060 

10 ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 

CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 

15 ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 

20 AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 

TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

25 CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 

AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 

30 TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 

GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

35 CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



40 



Seq ID NO: 486 Protein sequence 
Protein Accession fl: RP 005747.1 



1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PAKLSWSFA 60 

PSSNEVETTS LNDVTLSLLP SNETGVKPQR NICNLSSIOJ DSAFPRGEIM PQYDKESTVP 120 

45 QNQHITNGTL TGVLSLSELK RSELNKTLQT LSETYFIMCA TABAQSTLNC TFTIKLNNTM 180 

NACAVIAALE RVKIRPMEHC CCSVRIFCPS SPEELEKLQC DLQDPIVCLA DHPRGPPFSS 240 

SQSIPWPRA TVLSQVPKAT SFAEPPDYSP VTHNVPSPIG EIQPLSPQPS APIASSPAID 300 

MPPQSBTISS PMPQTHVSGT PPPVKASFSS PTVSAPANVN TTSAPPVQTD IVNTSSISDL 360 

ENQVLQMEKA LSLGSLEPNL AGBMINQVSR LLHSPPDMLA PLAQRLLKW DDIGLQLNFS 420 

50 NTTISLTSPS LALAVIRVNA SSFNTTTPVA QDPANLQVSL ETQAPENSIG TITLPSSLKN 480 

KLPAHDMELA SRVQFNFFET PALFQDPSLB NLSLISYVIS SSVANLTVRN LTRNVTVTLK 540 

HINPSQDBLT VRCVFWDLGR NGGRGGWSBN GCSVKDRRLN BTICTCSHLT SFGVLLDLSR 600 

TSVLPAQNMA LTFITYIGCG LSSIFLSVTL VTYTAFEKIR RDYPSKILIQ LCAALLLLNL 660 

VFLLDSW1AL YKMQGLCISV AVFLHYFLLV SFTWMGLEAF HMYLALVKVF NTYIRKYILK 720 

55 FCIVGWGVPA VWTIILTIS PDNYGLGSYG KFPNGSPDDF CWINNNAVFY ITWGYFCVI 780 

FLLNVSMFIV VLVQLCRIKK KKQLGAQRKT SIQDLRSIAG LTFLLGITWG FAFFAWGPVN 840 

VTFMYLFAIF NTLQGFFIFI FYCVAKENVR KQWRRYLCCG KLRLAENSDW SKTATNGLKK 900 

QTVNQGVSSS SNSLQSSSNS TNSTTLLVUN DCSVHASGNG NASTERNGVS FSVQNGDVCL 960 
HDFTGKQHMP NEKEDSCNGK GRMALRRTSK RGSLHFIEQM 
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Seq ID NO: 487 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2904 



65 1 11 21 31 41 51 

I I ! I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 60 

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACCACCA CCTGATGTTA CTTTAAGCTT ACTCCCTTCA 180 

70 AACGAAACAG AAAAAACTAA AATCACTATA GTAAAAACCT TCAATGCTTC AGGCG TCAAA 240 

CCCCAGAGAA ATATCTGCAA TTTGTCATCT ATTTGCAATG ACTCAGCAXT TTTTAGAGGT 300 

GAGATCATGT TTCAATATGA TAAAGAAAGC ACTGTTCCCC AGAATCAACA TATAACGAAT 360 

GGCACCTTAA CTGGAGTCCT GTCTCTAAGT GAATTAAACA CATTAAATTG TACATTCACA 420 

ATAAAACTGA ATAATACAAT GAATGCATGT GCTGTAATAG CTGCTTTGGA AAGAGTAAAG 480 

75 ATTCGACCAA TGGAACACTG CTGCTGTTCT GTCAGGATAC OCTGCCCTTC CTCCCCAGAA 540 

GAGTTGGAAA AGCTTCAGTG TGACCTGCAG GATCCCATTG TCTGTCTTGC TGACCATCCA 600 

CGTGGCCCAC CATTTTCTTC CAGCCAATCC ATCCCAGTGG TGCCTCGGGC CACTGTGCTT 660 

TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT GAGCCTCCAG ATTATTCACC TGTGACCCAC 720 

A AATGTTCCCT CTCCAATAGG GGAGATTCAA CCCCTTTCAC CXX»GCCTTC AGCTCCCATA 780 

80 GCTTCCAGCC CTGCCATTGA CATGCCCCCA CAGTCTGAAA CGATCTCTTC CCCTATGCCC 840 

CAAACCCATG TCTCOGGCAC CCCACCTCCT GTGAAAGCCT CATTTTCCTC TCCCACCGTG 900 

TCTGCCCCTG OGAATGTCAA CACTACCAGC GCACCTCCTG TCCAGACAGA CATCGTCAAC 960 

ACCAGCAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGGAGAAGGC TCTGTCCTTG 1020 

GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC AAGTCAGCAG ACTCCTTCAT 1080 
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10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



TCCCCGCCTG 
GGCCTACAGC 
GCTGTGATCA 
GCAAATCTTC 
CTTCCTTCAT 
CAGTTCAATT 
CTGATCAGCT 
AACGTGACAG 
GTATTTTGGG 
GTCAAAGACA 
GTTCTGCTGG 
ATTACATATA 
ATAGCTTTTQ 
GCTCTGCTTC 
CAAGGCCTCT 
TGGATGGGCC 
ATCCGAAAAT 
ACCATCATCC 
AATGGTTCAC 
GTGGGATATT 
CAGCTCTGTC 
GACCTCAGGA 
TTTGCCTGGG 
CAAGGATTTT 
AGGCGGTATC 
GCTACTAATG 
TTACA6TCAA 
GTACACGCAA 
CAGAATGGAG 
GAAGATTCCT 
TTACACTTTA 



ACATGCTGGC 
TGAACTTTTC 
GAGTGAATGC 
AGGTTTCTCT 
CGCTGATGAA 
TTTTTGAAAC 
ACGTCATATC 
TCACATTAAA 
ACTTGGGCAG 
GGAGATTGAA 
ACCTATCTAG 
TTGGTTGTGG 
AAAAGATCCG 
TGCTGAACCT 
GCATCTCAGT 
TAGAAGCATT 
ACATCCTTAA 
TGACTATATC 
CGGATGACTT 
TCTQTGTGAT 
GAATTAAAAA 
GTATCGCTGG 
GACCAGTTAA 
TCATATTCAT 
TTTGTTGTGG 
GTTTAAAGAA 
GCAGTAACTC 
GCGGGAATGG 
ATGTGTGCCT 
GCAATGGGAA 
TTGAGCAAAT 



CCCTCTGGCT 
AAACA CGAC T 
CAGTAGTTTC 
GGAAACCCAA 
TAATTTACGA 
ACCTGCTTTG 
ATOGAGTGTT 
GCACATCAAC 
AAATGGTGGC 
TGAAACCATC 
GACATCTGTG 
GCTTTCATCA 
GAGGGATTAC 
GGTCTTCCTC 
GGCTGTATTT 
CCATATGTAC 
ATTCTGCATT 
CCCAGATAAC 
CTGCTGGATC 
ATTTTTGCTG 
GAAGAAGCAA 
CCTTACATTT 
CGTGACCTTC 
CTTTTACTGT 
AAAGTTAGGG 
GCAGACTGTA 
CACTAACTCC 
AAATGCTTCT 
TCACGATTTC 
AGGCCGTATG 
GTGA 



CAAAGATTGC 
ATAAGTCTAA 
AACACAACTA 
GCTCCTGAGA 
GCTCATGACA 
TTTCAGGATC 
GCAAACCTGA 
CCGAGCCAGG 
AGAGGAGGCT 
TGTACCTGTA 
CTGCCTGCTC 
ATTTTTCTGT 
CCTTCCAAAA 
CTGGACTCGT 
CTTCATTATT 
CTGGCCCTTG 
GTOGGTTGGG 
TATGGGCTTG 
AACAACAATG 
AACGTCAGCA 
CTGGGAGCCC 
TTACTGGGAA 
ATGTATCTGT 
GTGGCCAAAG 
CTGGCTGAAA 
AACCAAGGAG 
ACCACACTGC 
ACAGAGAGGA 
ACTGGAAAAC 
GCTCTCAGAA 



TGAAAGTAGT 
CCTCCCCTTC 
CCTTTGTGGC 
ACAGTATTGG 
TGGAGCTAGC 
CTTCCCTGGA 
CCGTCAGGAA 
ATGAGTTAAC 
GGTCAGACAA 
GCCATCTAAC 
AAATGATGGC 
CAGTGACTCT 
TCCTCATCCA 
GGATTGCTCT 
TTCTCTTGGT 
TCAAAGTATT 
GGGTACCAGC 
GATCCTATGG 
CAGTATTCTA 
TGTTCATTGT 
AGCGAAAAAC 
TAACTTGGGG 
TTGCCATCTT 
AAAATGTCAG 
ATTCTGACTG 
TGTCCAGCTC 
TAGTGAATAA 
ATGGGGTCTC 
AGCACATGTT 
GGACTTCAAA 



GGATGACATT 
TTTGGCTCTG 
CCAAGACCCT 
CACAATTACT 
TTCCAGGGTT 
GAACCTCTCT 
CTTGACAAGA 
AGTGAGATGT 
TGGCTGCTCT 
AAGCTTOGGC 
TCTGACGTTC 
TGTAACCTAC 
GCTGTGTGCT 
GTATAAGATG 
CTCATTCACA 
TAATACTTAC 
TGTGGTTGTG 
GAAATTCCCC 
CATTACGGTG 
GGTCCTGGTT 
CAGTATTCAA 
CTTTGCCTTC 
TAATACCTTA 
GAAGCAATGG 
GAGTAAAACT 
TTCAAATTCC 
TGATTGCTCA 
TTTTAGTGTT 
TAACGAGAAG 
GCGGGGAAGC 



Seq ID NOs 4B8 Protein sequence 
Protein Accession #: Bos sequence 



MVPSVRQCGH 
NETEKXKITX 
GTLTGVLSLS 
ELEKLQCDLQ 
NVPSPIGBIQ 
SAPANVNTTS 
SPPDMLAPLA 
ANLQVSLETQ 
LISYVISSSV 
VKDRRLNETI 
IAFEKIRRDY 
WMGLEAPHMY 
NGSPDDFCWI 
DLRSIAGLTP 
RRYLCCGKLR 
VHASGNGNAS 
LHFIEQM 



11 

I 

VGRTEEVLLT 
VKTFNASGVK 
ELNTLNCTFT 
DPIVCLADHP 
PLSPQPSAPI 
APPVQTDIVN 
QRLLKWDDI 
APENSIGTIT 
ANLTVRNLTR 
CTCSHLTSFG 
PSKILIQLCA 
LALVKVFNTY 
NNNAVFYITV 
LLGITWGPAP 
LAENSDWSKT 
TERNGVSFSV 



21 
I 

FKIFLVIICL 
PQRNICNLSS 
IKLNNTMNAC 
RGPPFSSSQS 
ASSPAIDMPP 
TSSISDLENQ 
GLQLNFSNTT 
LPSSLMNNLP 
NVTVTLKHIN 
VLLDLSRTSV 
ALLLLNLVFL 
IRKYILKFCI 
VGYFCVIFLL 
FAWGPVNVTF 
ATNGLKKQTV 
QNGDVCLHDF 



31 
I 

HWLVTSLEB 
ICNDSAFFRG 
AVIAALERVK 
IPWPRATVL 
QSETISSPMP 
VLQMBKALSL 
ZSLTSPSIiAIi 
AHDMELASRV 
PSQDELTVRC 
LPAQMMALTF 
LDSWIALYKM 
VGWGVPAVW 
NVSMFIWLV 
MYLFAIFNTL 



41 
I 

DTDNSSLSPP 
EIHFQYDKES 
IRPMEHCCCS 
SQVPKATSFA 
QTHVSGTPPP 
GSLEPNLAGE 
AVIRVNASSF 
QFNFFETFAL 
VFWDLGRNGG 
ITYIGCGLSS 
QGLC1SVAVF 
TIIIiTISPDK 
QLCRIKKKKQ 
QGFFIFIFYC 



TGKQHMFNEK EDSCNGKGRM 



Seq XD NO i 489 DMA sequence 

Nucleic Acid Accession di Eos sequence 

Coding sequence: 1..2611 

l 11 21 31 

I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA 
TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC 
GATACTGATA ATTCCAGTTT GTCACCACCA CCTGAGGTTG 
GTTACTTTAA GCTTACTOCC TTCAAACGAA ACAGGCGTCA 
AATTTGTCAT CTATTTGCAA TGACTCAGCA TTTTTTAGAG 
GATAAAGAAA GCACTGTTCC CCAGAATCAA CATATAACGA 
CTGTCTCTAA GTGAATTAAA ACGCTCAGAG CTCAACAAAA 
ACTTACTTTA TAATGTGTGC TACAGCAGAG GCCCAAAGCA 
ATAAAACTGA ATAATACAAT GAATGCATGT GCTGTAATAG 
ATTCGACCAA TGGAACACTG CTGCTGTTCT GTCAGGATAC 
GAGTTGGAAA AGCTTCAGTG TGACCTGCAG GATCCCATTG 
CGTGGCCCAC CATTTTCTTC CAGCCAATCC ATCCCAGTGG 
TCCCAGGTCC CGAAAGCTAC CTCTTTTGCT GAGCCTCCAG 
AATGTTCCCT CTCCAATAGG GGAGATTCAA OCCCTTTCAC 
GCTTCCAGCC CTGCCATTGA CATGCCOCCA CAGTCTGAAA 
CAAACCCATG TCTCCGGCAC CCCACCTCCT GTGAAAGCCT 
TCTGCCCCTG CGAATGTCAA CACTACCAGC GCACCTCCTO 
ACCAGGAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA 
GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC 
TCCCCGCCTG ACATGCTGGC CCCTCTGGCT CAAAGATTGC 
GGCCTACAGC TGAACTTTTC AAACACGACT ATAAGTCTAA 
GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA 
GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



51 
I 

PDVTLSLLPS 
TVPQNQHITN 
VR1PCPSSPE 
EPPDYSPVTH 
VKASFSSPTV 
MINQVSRLLH 
NTTTFVAQDP 
FQDPSLENLS 
RGGWSDNGCS 
IFLSVTLVTY 
LHYFLLVSFT 
YGLGSYGKFP 
LGAQRKTSIQ 
VAKENVRKQW 
TTLLVNNDCS 
ALRRTSKRGS 



41 
I 

CTGAAGAAGT 
TGGTAACATC 
AAACAACAAG 
AACCCCAGAG 
GTGAGATCAT 
ATGGCACCTT 
CCCTGCAAAC 
CATTAAATTG 
CTGCTTTGGA 



TCTGTCTTGC 
TGCCTCGGGC 
ATTATTCACC 
CCCAGCCTTC 
CGATCTCTTC 
CATTTTCCTC 
TCCAGACAGA 
TGGAGAAGGC 
AAGTCAGCAG 
TGAAAGTAGT 
CCTCCCCTTC 
CCTTTGTGGC 
ACAGTATTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



51 
I 

TTTACTGACG 
CCTGGAAGAA 
CCTCAATGAT 
AAATATCTGC 
GTTTCAATAT 
AACTGGAGTC 
CCTAAGTGAG 
TACATTCACA 
AAGAGTAAAG 
CTCCCCAGAA 
TGACCATCCA 
CACTGTGCTT 
TGTGACCCAC 
AGCTCCCATA 
CCCTATGCCC 
TCCCACCGTG 
CATCGTGAAC 
TCTGTCCTTG 
ACTCCTTCAT 
GGATGACATT 
TTTGGCTCTG 
CCAAGACCCT 
CACAATTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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CTTCCTTCAT 
CAGTTCAATT 
CTGATCAGCT 
AACGTGACAG 
GTATTTTGGG 
GTCAAAGACA 
GTTCTGCTGG 
ATTACATATA 
ATAGCTTTTG 
GCTCTGCTTC 
CAAGQCCTCT 
TGGATGGGCC 
ATCCGAAAAT 
ACCATCATCC 
AATGGTTCAC 
GTGGGATATT 
CAGCTCTGTC 
GACCTCAGGA 
TTTGCCTGGG 
CAAGGATTTT 
AGGCGGTATC 
GAGAGGAATG 
GGAAAACAGC 
CTCAGAAGGA 



CGCTG ATGAA 
TTTTTGAAAC 
ACGTCATATC 
TCACATTAAA 
ACTTGGGCAG 
GGAGATTGAA 
ACCTATCTAG 
TTGGTTGTGG 
AAAAGATCCG 
TGCTGAACCT 
GCATCTCAGT 
TAGAAGCATT 
ACATCCTTAA 
TGACTATATC 
CGGATGACTT 
TCTGTGTGAT 
GAATTAAAAA 
GTATCGCTGG 
GACCAGTTAA 
TCATATTCAT 
TTTGTTGTGG 
GGGTCTCTTT 
ACATGTTTAA 
CTTCAAAGCG 



TAATTTACCA 
ACCTGCTTTG 
ATOGAGTGTT 
GCACATCAAC 
AAATGGTGGC 
TGAAACCATC 
GACATCTGTG 
GCTTTCATCA 
GAGGGATTAC 
GGTCTTCCTC 
GGCTGTATTT 
CCATATGTAC 
ATTCTGCATT 
CCCAGATAAC 
CTGCTGGATC 
ATTTTTGCTG 
GAAGAAGCAA 
CCTTACATTT 
OGTGACCTTC 
CTTTTACTGT 
AAAGTTAOGG 
TAGTGTTCAG 
CGAGAAGGAA 
GGGAAGCTTA 



GCTCATGACA 
TTTCAGGATC 
GCAAACCTGA 
CCGAGCCAGG 
AGAGGAGGCT 
TGTACCTGTA 
CTGCCTGCTC 
ATTTTTCTGT 
CCTTCCAAAA 
CTGGACTCGT 
CTTCATTATT 
CTGGCCCTTG 
GTOGGTTGGG 
TATGGGCTTG 
AACAACAATG 
AAOGTCAGCA 
CTGGGAGCCC 
TTACTGGGAA 
ATGTATCTGT 
GTGGCCAAAG 
CTGGCTGAAA 
AATGGAGATG 
GATTCCTGCA 
CACTTTATTG 



TGGAGCTAGC 
CTTCCCTGGA 
CCGTCAGGAA 
ATGAGTTAAC 
GGTCAGACAA 
GCCATCTAAC 
AAATGATGGC 
CAGTGACTCT 
TCCTCATCCA 
GGATTGCTCT 
TTCTCTTGGT 
TCAAAGTATT 
GGGTACCAGC 
GATCCTATGG 
CAGTATTCTA 
TGTTCATTGT 
AGCGAAAAAC 
TAACTTGGGG 
TTGCCATCTT 
AAAATGTCAG 
ATTCTGGAAA 
TGTGCCTTCA 
ATGGGAAAGG 
AGCAAATGTG 



TTCCAGGGTT 
GAACCTCTCT 
CTTGACAAGA 
AGTGAGATGT 
TGGCTGCTCT 
AAGCTTCGGC 
TCTGACGTTC 
TGTAACCTAC 
GCTGTGTGCT 
GTATAAGATG 
CTCATTCACA 
TAATACTTAC 
TGTGGTTGTG 
GAAATTCCCC 
CATTACGGTG 
GGTCCTGGTT 
CAGTATTCAA 
CTTTGCCTTC 
TAATACCTTA 
GAAGCAATGG 
TGCTTCTACA 
CGATTTCACT 
CCGTATGGCT 
A 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID NO; 490 Protein sequence 
Protein Accession 8: Eos sequence 



MVFSVRQCGH 
VTLSLLPSNE 
LSLSELKRSE 
IRPMEHCCCS 
SQVPKATSFA 
QTHVSGTPPP 
GSLEPNLAGE 
AVIRVNASSF 
QFNFPETPAL 
VFWDLGRNGG 
ITYIGCGLSS 
QGLCISVAVF 
T1ILTISPDN 
QLCRIKKKKQ 
QGFFIFIFYC 



11 
\ 

VGRTEEVLLT 
TGVKPQRNIC 
LNKTLQTLSE 
VRIPCPSSPE 
EPPDYSPVTH 
VKASFSSPTV 
MIKQVSRLLH 
NTTTFVAQDP 
FQDPSLENLS 
RGGWSDNGCS 
IFLSVTLVTY 
LHYFLLVSFT 
YGLGSYGKFP 
LGAQRKTSZQ 
VAKENVRKQW 
DSCNGKGRMA 



21 
I 

FKIFLVIXCL 
NLSSICNDSA 
TYFIMCATAE 
ELBKLQCDLQ 
NVPSPIGEXQ 
SAPANVNTTS 
SPPDMLAPLA 
ANLQVSLBTQ 
LISYVISSSV 
VKDRRLNETI 
IAFEKIRRDY 
WMGLEAFHMY 
NGSPDDFCWI 
DLRSIAGLTF 
RRYLCCGKLR 
LRRTSKRGSL 



31 
I 

HWLVTSIiEE 
FFRGEIMFQY 
AQSTLNCTFT 
DPIVCLADHP 
PLSPQPSAPI 
APPVQTDIVN 
QRLLKWDDI 
APENSIGTIT 
ANLTVRNLTR 
CTCSHLTSFG 
PSKILIQLCA 
IiALVKVFNTY 
NNNAVFYITV 
LLGITWGFAF 
LABNSGNAST 
HFIEQM 



41 
I 

DTDNSSLSFP 
DKESTVPQNQ 
IKLNNTMNAC 
RGPPFSSSQS 
ASSPAIDMPP 
TSSISDLENQ 
GLQLNFSNTT 
LPSSLMNNLP 
NVTVTLKHIN 
VLLDLSRTSV 
ALLLLNLVFL 
IRKYILKFCI 
VGYFCVIFLL 
FAWGPVNVTF 
ERNGVSFSVQ 



51 

I 

PEVETTSIiND 
HITNGTLTGV 
AVIAALERVK 
IPWPRATVL 
QSETISSPMP 
VLQMEKALSL 
ISLTSPSLAL 
AHDMELASRV 
PSQDELTVRC 
LPAQMMALTF 
LDSWIALYKM 
VGWGVPAVW 
NVSMFIWLV 
MYLFAIFNTL 
NGDVCLHDFT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 491 dna sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3045 



1 11 21 31 41 51 

I.I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 60 

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACCAOCA CCTGCTAAAT TATCTGTTGT CAGTTTTGCC 180 

CCCTCCTCCA ATGAGGTTGA AACAACAAGC CTCAATGATG TTACTTTAAG CTTACTCCCT 240 

TCAAACGAAA CAGAAAAAAC TAAAATCACT A7AGTAAAAA CCTTCAATGC TTCAGGCGTC 300 

AAACCCCAGA GAAATATCTG CAATTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 360 

GGTGAGATCA TGTTTCAATA TGATAAAGAA AGCACTGTTC CCCAGAATCA ACATATAACG 420 

AATGGCACCT TAACTGGAGT CCTGTCTCTA AGTGAATTAA AAOGCTCAGA GCTCAACAAA 480 

ACCCTGCAAA CCCTAAGTGA GACTTACTTT ATAATGTGTG CTACAGCAGA GGCCCAAAGC 540 

ACATTAAATT GTACATTCAC AATAAAACTG AATAATACAA TGAATGCATG TGCTGCAATA 600 

GCCGCTTTGG AAAGAGTAAA GATTCGACCA ATGGAACACT GCTGCTGTTC TGTCAGGATA 660 

CCCTGCCCTT CCTCCCCAGA AGAGTTGGGA AAGCTTCAGT GTGACCTGCA GGATCCCATT 720 

GTCTGTCTTG CTGACCATCC AOGTGGCCCA CCATTTTCTT CCAGCCAATC CATCCCAGTG 780 

GTGCCTCGGG CCACTGTGCT TTCCCAGGTC OCCAAAGCTA CCTCTTTTGC TGAGCCTCCA 840 

GATTATTCAC CTGTGACCCA CAATGTTCCC TCTCCAATAG GGGAGATTCA ACCCCTTTCA 900 

COCCAGCCTT CAGCTCCCAT AGCTTCCAGC CCTGCCATTG ACATGCJCCCC ACAGTCTGAA 960 

ACGATCTCTT CCCCTATGCC CCAAACCCAT GTCTCCGGCA CCCCACXTTCC TGTGAAAGCC 1020 

TCATTTTCCT CTCCCACCGT GTCTGCCOCT G06AATGTCA ACACTACCAG OGCACCTCCT 1080 

GTCCAGACAG ACATOGTCAA CACCAGCAGT ATTTCTGATC TTGAGAACCA AGTGTTGCAG 1140 

ATGGAGAAGG CTCTGTCCTT GGGCAGCCTG GAGCCTAACC TCGCAGGAGA AATGATCAAC 1200 

CAAGTCAGCA GACTOCTTCA TTCCCOGCCT GACATGCTGG CCCCTCTGGC TCAAAGATTG 1260 

CTGAAAGTAG TGGATGACAT TGGOCTACAG CTGAACTTTT CAAACAOQAC TATAAGTCTA 1320 

ACCTCCCCTT CTTTGGCTCT GGCTGTGATC AGAGTGAATG CCAGTAGTTT CAACACAACT 1380 

ACCTTTGTGG CCCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AGCTCCTGAG 1440 

AACAGTATTG GCACAATTAC TCTTCCTTCA TOGCTGATGA ATAATTTACC AGCTCATGAC 1500 

ATGGAGCTAG CTTCCAGGGT TCAGTTCAAT TTTTTTGAAA CACCTGCTTT GTTTCAGGAT 1560 

CCTTCCCTGG AGAACCTCTC TCTGATCAGC TAOGTCATAT CATCGAGTGT TGCAAACCTG 1620 

ACCGTCAGGA ACTTGACAAG AAAOGTGACA GTCACATTAA AGCACATCAA CCCGAGCCAG 1680 

GATGAGTTAA CAGTGAGATG TGTATTTTGG GACTTGGGCA GAAATGGTGG CAGAGGAGGC 1740 

TGGTCAGACA ATGGCTGCTC TGTCAAAGAC AGGAGATTGA ATGAAACCAT CTGTACCTGT 1800 

AGOCATCTAA CAAGCTTOGG CGTTCTGCTG GACCTATCTA GGACATCTGT GCTGCCTGCT 1860 
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5 

10 

15 

20 

25 

30 

35 

40 

45 
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55 
60 
65 
70 
75 
80 



CAAATGATGG 
TCAGTGACTC 
ATCCTCATCC 
TGGATTGCTC 
TTTCTCTTGG 
GTCAAAGTAT 
GGGGTACCAG 
GGATCCTATG 
GCAGTATTCT 
ATGTTCATTG 
CAGCGAAAAA 
ATAACTTGGG 
TTTGCCATCT 
GAAAATGTCA 
AATTCTQACT 
GTGTCCAGCT 
CTAGTGAATA 
AATGGGGTCT 
CAGCACATGT 
AGGACTTCAA 



CTCTGACGTT 
TTGTAACCTA 
AGCTGTGTGC 
TGTATAAGAT 
TCTCATTCAC 
TTAATACTTA 
CTGTGGTTGT 
GGAAATTCCC 
ACATTACGGT 
TGGTCCTGGT 
CCAGTATTCA 
GCTTTGCCTT 
TTAATACCTT 
GGAAGCAATG 
GGAGTAAAAC 
CTTCAAATTC 
ATGATTGCTC 
CTTTTAGTGT 
TTAACGAGAA 
AGCGGGGAAG 



CATTACATAT 
CATAGCTTTT 
TGCTCTGCTT 
GCAAGGCCTC 
ATGGATGGGC 
CATCCGAAAA 
GACCATCATC 
CAATGGTTCA 
GGTGGGATAT 
TCAGCTCTGT 
AGACCTCAGG 
CTTTGCCTGG 
ACAAGGATTT 
GAGGCGGTAT 
TGCTACTAAT 
CTTACAGTCA 
AGTACACGCA 
TCAGAATGGA 
GGAAGATTCC 
CTTACACTTT 



ATTGGTTGTG 
GAAAAGATCC 
CTGCTGAACC 
TGCATCTCAG 
CTAGAAGCAT 
TACATCCTTA 
CTGACTATAT 
CCGGATGACT 
TTCTGTGTGA 
CGAATTAAAA 
AGTATCGCTG 
GGACCAGTTA 
TTCATATTCA 
CTTTGTTGTG 
GGTTTAAAGA 
AGCAGTAACT 
AGCGGGAATG 
GATGTGTGCC 
TGCAATGGGA 
ATTGAGCAAA 



GGCTTTCATC 
GGAGGGATTA 
TGGTCTTCCT 
TGGCTGTATT 
TCCATATGTA 
AATTCTGCAT 
CCCCAGATAA 
TCTGCTGGAT 
TATTTTTGCT 
AGAAGAAGCA 
GCCTTACATT 
ACGTGACCTT 
TCTTTTACTG 
GAAAGTTACG 
AGCAGACTGT 
CCACTAACTC 
GAAATGCTTC 
TTCAOGATTT 
AAGGCOGTAT 
TGTGA 



AATTTTTCTG 
CCCTTCCAAA 
CCTGGACTCG 
TCTTCATTAT 
CCTGGCCCTT 
TGTCGGTTGG 
CTATGGGCTT 
CAACAACAAT 
GAACGTCAGC 
ACTGGGAGCC 
TTTACTGGGA 
CATGTATCTG 
TGTGGCCAAA 
GCTGGCTGAA 
AAACCAAGGA 
CACCACACTG 
TACAGAGAGG 
CACTGGAAAA 
GGCTCTCAGA 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



Seq ID MO: 492 Protein sequence 
Protein Accession #: Eos sequence 



1 
I 

MVFSVRQCGH 
PSSNEVETTS 
GEIMFQYDKE 
TUJCTFTIKL 
VCLADHPRGP 
PQPSAPIASS 
VQTDIVNTSS 
LKWDDIGLQ 
NSIGTITLPS 
TVRNLTRNVT 
SHLTSFGVLL 
ILIQLCAAU, 
VKVFNTYIRK 
AVFYITWGY 
ITWGFAFFAW 
NSDWSKTATN 
NGVSFSVQNG 



11 
I 

VGRTEEVLLT 
LNDVTLSLLP 
STVP0NQHIT 
NNTMNACAAI 
PFSSSQSIPV 
PAIDMPPQSE 
ISDLENQVLQ 
LNFSNTTISL 
SLMNNLPAHD 
VTLKHINPSQ 
DLSRTSVLPA 
LLNLVFLLDS 
YILKFCIVGW 
FCVIFLLNVS 
GPVNVTFMYL 
GLKKQTVNQG 
DVCLHDFTGK 



21 
I 

FKIFLVIICL 
SNETBKTKIT 
NGTLTGVLSL 
AALERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
MEKALSLGSL 
TSPSLALAVI 
MELASRVQFN 
DELTVRCVFW 
QMMALTFITY 
WIALYKMQGL 
GVPAVWTII 
MFIWLVQLC 
FAIFNTLQGF 
VSSSSNSLQS 



31 
I 

HWLVTSIiEE 
IVKTFNASGV 
SELKRSELNK 
MEHCCCSVRI 
PKATSFAEPP 
VSGTPPPVKA 
EPNLAGEMIN 
RVNASSFNTT 
FFETPALFQD 
DLGRNGGRGG 
IGCGLSSIFL 
CISVAVFLHY 
LTISPDNYGL 
RIKKKKQLGA 
FIFIFYCVAK 
SSNSTNSTTL 
CNGKGRMALR 



KPQRN1CNLS 
TLQTLSETYF 
PCPSSPEELG 
DYSPVTHNVP 
SFSSPTVSAP 
QVSRLLHSPP 
TFVAQDPANL 
PSLENLSLIS 
WSDNGCSVKD 
SVTLVTYIAF 
FLLVSFTWMG 
GSYGKFPMGS 
QRKTSIQDLR 
ENVRKQWRRY 
LVNNDCSVHA 



51 

! 

PAKLSWSFA 
SICNDSAFFR 
IMCATAEAQS 
KLQCDDQDPI 
SPIGEIQPLS 
ANVNTTSAPP 
DMLAPLAQRL 
QVSLETQAPE 
YVISSSVANL 
RRLNETICTC 
EKIRRDYPSK 
LEAFHMYLAL 
PDDFCWINNN 
SIAGLTFLLG 
LCCGKLRLAE 
SGNGN ASTER 
IEQM 



Seq ID NO: 493 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 



11 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



21 31 41 

I I I I 

CCGCAGAGGA GCCTOGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 
CGCCCCTGCC GCGGTGGCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTAOG 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTG CGAGA 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 
GGGAAOGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 
CTCAGTGGCC. ACATGCTCAT GCCAGATGCT ACGT G TGTGA ACTCTAGGAC ATGTGCCATG 
ATAAACXGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 
GGTAAAGTCA TCTGTOCCTA CAATOGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 
AAATGTCACA TT G GTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAA6AG 
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TAT CTCGG TT 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATG AAAAG 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 
GAAGCAOAAC GTGGCAAGGG CAAAACOGGC GAAATOGCAG TGGATGGCGT CTTGCTTGTT 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7 B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 



1146 
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TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TQAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TOTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



10 



Seq ID NO: 494 Protein sequence 
Protein Accession ft; NP 056322 



1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP LLLSWVAGGP GNAASARHRG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 

15 CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCPC 120 

LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EBGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGPELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSPKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKUiAHKKSM KKKAKIKNVT 300 

PEPTRTPTPK. VNLQPPNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 

20 SLRGDVFFPK VNEAGEPGLI LVQRKALTSK LEHKDLKISV DCSFKHGICD WKQDREDDPD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLPDYRL AGDKVGKLRV 480 

FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

25 Seq ID NO: 495 DNA sequence 

Nucleic Acid Accession ff: NM_003506.1 
Coding sequence: 259.. 2379 

1 11 21 31 41 51 

30 1 | | 1 | i 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG COGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACOGCCC CCCGAGTAAT TGACCC AGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

35 ATGAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

40 TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

45 CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGOTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

50 GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGGCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

55 CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

60 GGAAGGAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

65 ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTOGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

70 AGAAAAGAGC AGGGAGGTGG TTGTGATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TAGGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGA6T 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

75 CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATOCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

80 GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATAOCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 
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AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTQT AG 

Seq ID NO: 496 Protein sequence 
Protein Accession #: NP_003497.l 

1 11 21 31 41 51 

1 I I I I I 

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNI/4GHYD QSIAAVEMKS 60 

FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCB KVYSDCKKLI DTFGIRWPEE 120 

LBCDRLQYCD ETVPVTFDPH TEFL6PQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLOXDQ 180 

CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY PFTMAGTVWW 300 

VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 

LYDLDASRYP VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHBGRNQEKL KKFMIRIGVP 420 

SGLYLVPLVT LLGCYVYBQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 

LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQBSCEFF LKHNSKVKHK 540 

KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 

KADGASTPRI* REQDOGEPAS PAASISRLSG EOVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: 497 DNA sequence 
Nucleic Acid Accession 8: NM_005046 
Coding sequence: 16.. 777 

1 11 21 31 41 51 

I I I I I I 

GGATTTCCGG GCTCCATGGC AAGATCCCTT CTCCTGCCCC TGCAGATCCT ACTGCTATCC 60 

TTAGCCTTGG AAACTGCAGG AGAAGAAGCC CAGGGTGACA AGATTATTGA TGGCGCCCCA 120 

TGTGGAAGAG GCTCCCACCC ATGGCAGGTG GCCCTGCTCA GTGGCAATCA GCTCCACTGC 180 

GGAGGCGTCC TGGTCAATGA GOGCTGGGTG CTCACTGCCG CCCACTGCAA GATGAATGAG 240 

TACACCGTGC ACCTGGGCAG TGATACGCTG GGCGACAGGA GAGCTCAGAG GATCAAGGCC 300 

TCGAAGTCAT TCOGCCACCC CGGCTACTCC ACACAGACCC ATGTTAATGA CCTCATGCTC 360 

GTGAAGCTCA ATAGCCAGGC CAGGCTGTCA TCCATGGTGA AGAAAGTCAG GCTGCOCTCC 420 

CGCTGCGAAC CCCCTGGAAC CACCTGTACT GTCTCCGGCT GGGGCACTAC CACGAGCCCA 480 

GATGTGACCT TTCCCTCTGA CCTCATGTGC GTGGATGTCA AGCTCATCTC CCCCCAGGAC 540 

TGCACGAAGG TTTACAAGGA CTTACTGGAA AATTCCATGC TGTGCGCTGG CATCCCCGAC 600 

TCCAAGAAAA ACGCCTGCAA TGGTGACTCA GGGGGACCGT TGGTGTGCAG AGGTACCCTG 660 

CAAGGTCTGG TGTCCTGGGG AACTTTCCCT TGGGGCCAAC CCAATGACCC AGGAGTCTAC 720 

ACTCAAGTGT GCAAGTTCAC CAAGTGGATA AATGACACCA TGAAAAAGCA TCGCTAACGC 780 

CACACTGAGT TAATTAACTG TGTGCTTCCA ACAGAAAATG CACAGGAGTG AGGACGCCGA 840 

TGACCTATGA AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA ACCTCATGCC 900 

CTGTTGATAA ACCAATCAAA TTGGTAAAGA CCTAAAACCA AAACAAATAA AGAAACACAA 960 
AACCCTCAA 

Seq ID NO: 49B Protein sequence 
Protein Accession #: NP_005037 

1 11 21 31 41 51 

I I I I I I 

MARSLLLPLQ ILLLSLALET AGEEAQGDKI IDGAPCARGS HPWQVALLSG NQLHCGGVLV 60 

NERWVLTAAH CKMNEYTVHL GSDTLGDRRA QRIKASKSFR HPGYSTQTHV NDLMLVKLNS 120 

QARLSSMVKK VRLPSRCEPP GTTCTVSGWG TTTSPDVTFP SDLMCVDVKL ISPQDCTKVY 180 

KDLLENSMLC AGIPDSKKNA CNGDSGGPLV CRGTLQGLVS WGTFPCGQPN DPGVYTQVCK 240 
FTKWINDTMK KHR 

Seq ID NO: 499 DNA sequence 
Nucleic Acid Accession # : NM_007196 
Coding sequence : 182 . . 962 

1 11 21 31 41 51 

GTTCCCAGAA GCTCCCCAGG CTCTAGTGCA GGAGGAGAAG GAGGAGGAGC AGGAGGTGGA 60 

GATTCCCAGT TAAAAGGCTC CAGAATCGTG TACCAGGCAG AGAACTGAAG TACTGGGGCC 120 

TCCTCCACTG GGTCCGAATC AGTAGGTGAC CCCGCCCCTG GATTCTGGAA GACCTCACCA 180 

TGGGACGCCC CCGACCTCGT GCGGCCAAGA CGTGGATGTT CCTGCTCTTG CTGGGGGGAG 240 

CCTGGGCAGG ACACTCCAGG GCACAGGAGG ACAAGGTGCT GGGGGGTCAT GAGTGCCAAC 300 

CCCATTCGCA GCCTTGGCAG GCGGCCTTGT TCCAGGGCCA GCAACTACTC TGTGGCGGTG 360 

TCCTTGTAGG TGGCAACTGG GTCCTTACAG CTGCCCACTG TAAAAAACCG AAATACACAG 420 

TACGCCTGGG AGACCACAGC CTACAGAATA AAGATGGCCC AGAGCAAGAA ATACCTGTGG 480 

TTCAGTOCAT CCCACACCCC TGCTACAACA GCAGCGATGT GGAGGACCAC AACCATGATC 540 

TGATGCTTCT TCAACTGCGT GACCAGGCAT CCCTGGGGTC CAAAGTGAAG CCCATCAGCC 600 

TGGCAGATCA TTGCACCCAG CCTGGCCAGA AGTGCACCGT CTCAGGCTGG GGCACTGTCA 660 

CCAGTCCCCG AGAGAATTTT CCTGACACTC TCAACTGTGC AGAAGTAAAA ATCTTTCCCC 720 

AGAAGAAGTG TGAGGATGCT TACCCGGGGC AGATCACAGA TGGCATGGTC TGTGCAGGCA 780 

GCAGCAAAGG GGCTGACACG TGCCAGGGCG ATTCTGGAGG CCCCCTGGTG TGTGATGGTG 840 

CACTCCAGGG CATCACATCC TGGGGCTCAG ACCCCTGTGG GAGGTCCGAC AAACCTGGCG 900 

TCTATACCAA CATCTGCCGC TACCTGGACT GGATCAAGAA GATCATAGGC AGCAAGGGCT 960 
GATTCTAGGA TAAGCACTAG ATCTCCCTTA ATAAACTCAC AACTCTC 

Seq ID NO: S0O Protein sequence 
Protein Accession #: NP_009127 

1 11 21 31 41 51 
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I I I I 

MGRPRPRAAK TWMFLLLLGG AWAGHSRAQE DKVLGGHECQ 
VLVGGNWVLT AAHCXKPKYT VW/3DHSLQN KDGPEQEIPV 
LKLLQLRDQA SLGSKVKPIS LADHCTQPGQ KCTVSGWGTV 
QKKCEDAYPG QITDQ4VCAG SSKGADTCQG DSGGPLVCDG 
VYTNICRYLD WIKKIIGSKG 

Seq ID NO: 501 DMA sequence 
Nucleic Acid Accession 8: NM_006103 
Coding sequence: 29. .406 

1 11 21 31 41 51 

I I I I I I 

CACCTGCACC CCGCCCGGGC ATAOCACCAT GCCTGCTTGT CGCCTAGGCC CGCTAGCCGC 60 

CGCCCTCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA OTCTCAGGCA CAGGAGCAGA 120 

GAAGACTGGC GTGTGCCCCG AGCTCCAGGC TGACCAGAAC TGCACGCAAG AGTGCGTCTC 180 

GGACAGCGAA TGCGCCGACA ACCTCAAGTG CTGCAGCGCG GGCTGTGCCA CCTTCTGCCT 240 

TCTCTGCCCA AATGATAAGG AGGGTTCCTG CCOCCAGGTG AACATTAACT TTCCCCAGCT 300 

CGGCCTCTGT CGGGACCAGT GCCAGGTGGA CAGCCAGTGT CCTGGCCAGA TGAAATGCTG 360 

CCGCAATGGC TGTGGGAAGG TGTCCTGTGT CACTCCCAAT TTCTGAGGTC CAGCCACCAC 420 

CAGGCTGAGC AGTGAGGAGA GAAAGTTTCT GCCTGGCCCT GCATCTGGTT CCAGCCCACC 480 

TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGGC TGACCACAGC TTCTCCCTTT 540 
CCCAACCAAT AAAGTAACCA CTTTCAGCAA AAAAAAAAAA AAAA 

Seq ID NO: 502 Protein sequence 
Protein Accession #: NP_006094 

1 11 21 31 41 51 

I 1 I 1 I I 

MPACRLGPLA AALLLSLLLF GFTLVSGTGA BKTGVCPBLQ ADQNCTQECV SDSECADNLK 60 
CCSAGCATFC LLCPNDKEGS CPQVNINFPQ LGIiCRDQOQV DSQCPGQMKC CRNGCGKVSC 120 
VTPNF 

Seq ID NO: 503 DMA sequence 
Nucleic Acid Accession #: NM_002407 
Coding sequence: 65.. 352 

1 11 21 31 41 51 

I I I 1 I I 

CCTCCACAGC AACTTCCTTG ATCCCTGCCA CGCACGACTG AACACAGACA GCAGCCGCCT 60 
CGCCATGAAG CTGCTQATGG TCCTCATGCT GGCGGCCCTC CTCCTGCACT GCTATGCAGA 120 
TTCTGGCTGC AAACTCCTGG AGGACATGGT TGAAAAGACC ATCAATTCCG ACATATCTAT 180 
ACCTGAATAC AAAGAGCTTC TTCAAGAGTT CATAGACAGT GATGCCGCTG CAGAGGCTAT 240 
GGGGAAATTC AAGCAGTGTT TCCTCAACCA GTCACATAGA ACTCTGAAAA ACTTTGGACT 300 
GATGATGCAT ACAGTGTACG ACAGCATTTG GTGTAATATG AAGAGTAATT AACTTTACCC 360 
AAGGCGTTTG GCTCAGAGGG CTACAGACTA TGGCCAGAAC TCATCTOTTG ATTGCTAGAA 420 
ACCACTTTTC nT CT T GTOT TGTCTTTTTA TGTGGAAACT GCTAGACAAC TGTTGAAACC 480 
TCAAATTCAT TTCCATTTCA ATAACTAACT GCAAATC 

Seq ID NO: 504 Protein sequence 
Protein Accession #: NPJ>02398 

1 11 21 31 41 51 

111,11 

MKLLMVLMLA ALLLHCYADS GCKLLBDMVE KTINSDISIP EYKELLQEPI DSDAAAEAMG 60 
KFKQCFLNQS HRTLKNFGLM MKTVYDSIWC NMKSN 

Seq ID NO: 505 SNA sequence 

Nucleic Acid Accession #t NM_0 1479 1.1 

Coding sequence: 171.. 2126 

1 11 21 31 41 51 

I I I I I I 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGAA CGCCGTACCA 60 

GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 120 

TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGGACT AT GAAAG ATT 180 

ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 240 

AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 300 

AAAACACACT AGGGAGTGAT TTGCCCOGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 360 

TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 420 

TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 480 

TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 540 

ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 600 

ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 660 

ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 

CATATCTTGG ATCAGAGGCA GAT CTTTG GA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780 

GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 840 

GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900 

TGCAGGTGGA CCCAAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCAT CCCTGGATCA 960 

TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020 

ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 

TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 

AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200 

CTACCCCArr CACAGACATC AAGTCAAATA ATTGOAGTCT GGAAGATGTG ACCGCAAGTG 1260 



PHSQPWQAAL FQGQQLLCGG 60 

VQSIPKPCYN SSDVEDHNHD 120 

TSPRENFPDT LNCAEVKIPP 180 

ALQGITSWGS DPCGRSDKPG 240 
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ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT TTATCAACAG 1320 

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG TTTCCTGAGC 1500 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1S60 

ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 

GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 

GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860 

CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040 

TGGGTATCAG GAGGCAGCGG CTTAAGGGOG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 

ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400 

AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT 

Seq ID NO: 506 Protein sequence 
Protein Accession ft: NP_055606.1 

1 11 21 31 41 51 

I I I I I I • 

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTBIEA 60 

LKNLRHQH1C QLYHVLBTAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR WPRQIVSAV 120 

AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI 180 

QGKSYLGSEA DVWSMGILLY VLMCGFLPPD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240 

QQMLQVDPKK RISMKNLLNH PKIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT 300 

MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSQG QASATPFTDI KSNNWSLEDV 360 

TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTBS NGVESKSLTP ALCRTPANKL 420 

KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI 480 

KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540 

LTRSKRKGSA RDGPRRLKLH YNVTTTRLVN PDQLLNEIMS ILPKKHVDPV QKGYTLiKCQT 600 
QSDFGKVTMQ FELEVGQLQK PDWGIRRQR LKGDAWVYKR LVEDILSSCK V 

Seq ID NO: 507 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88. .990 

1 11 21 31 41 51 

I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 

GGCATCAOCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTQACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CO0GTTGC0C AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 508 Protein sequence 
Protein Accession §: NP_000573 

1 11 21 31 41 51 

I I I I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VPTPWPTVD TYDGRGDSW YGLRSKSXKF RRPDIQYPDA TDEDITSHME 180 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEPHSHEDM LWDPKSKBE DKHLKFRISH ELDSASSEVN 
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PCT/US02/36810 



Seq ID NO: 509 DNA sequence 
Nucleic Acid Accession tt: AB051390.1 
Coding sequence: 34.. 2457 

5 1 11 21 31 41 51 

I I I I i I 

AGCGGCCGCG GCACAAAGTT GGGGGCCGCG AAGATGAGGC TGTCCCCGGC GCCCCTGAAG 60 

CTGAGCCGGA CTCCGGCACT GCTGGCCCTG GCGCTGCCCC TGGCCGGGGC GCTGGOCTTC 120 

A TCCGACGAGA CCCTGGACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG TATCCTGCGC 180 

10 GCCCAGGGCA CGCGGCGOGA GGGCTACACC GAGTTCAGCC TCCGCGTGGA GGGCGACCCC 240 

GACTTCTACA AGCCGGGAAC CAGCTACCGC GTAACACTTT CAGCTGCTCC TCCCTCCTAC 300 

TTCAGAGGAT TCACATTAAT TGCCCTCAGA GAGAACAGAG AGGGTGATAA GGAAGAAGAC 360 

CATGCTGGGA CCTTCCAGAT CATAGACGAA GAAGAAACTC AGTTTATQAG CAATTGCCCT 420 

GTTGCAGTCA CTGAAAGCAC TCCAOGGAGG AGGACCCGGA TCCAGGTGTT TTGGATAGCA 480 

15 CCACCAGCGG GAACAGGCTG CGTGATTCTG AAGGCCAGCA TCGTACAAAA ACGCATTATT 540 

TATTTTCAAG ATGAGGGCTC TCTGACCAAG AAACTTTGTG AACAAGATTC CACATTTGAT 600 

GGGGTGACTG ACAAACCCAT CTTAGACTGC TGTGCCTGCG GAACTGCCAA GTACAGACTC 660 

ACATTTTATG GGAATTGGTC CGAGAAGACA CACCCAAAGG ATTACCCTCG TCGGGCCAAC 720 

CACTGGTCTG CGATCATCGG AGGATCCCAC TCCAAGAATT ATGTACTGTG GGAATATGGA 780 

20 GGATATGCCA GCGAAGGCGT CAAACAAGTT GCAGAATTGG GCTCACCCGT GAAAATGGAG 840 

GAAGAAATTC GACAACAGAG TGATGAGGTC CTCACCGTCA TCAAAGCCAA AGCCCAATGG 900 
CCAGCCTGGC AGCCTCTCAA CGTGAGAGCA GCACCTTCAG CTGAATTTTC CGTGGACAGA 960 

ACGCGCCATT TAATGTCCTT CCTGACCATG ATGGGCCCTA GTCCCGACTG GAACGTAGGC 1020 

TTATCTGCAG AAGATCTGTG CACCAAGGAA TGTGGCTGGG TCCAGAAGGT GGTGCAAGAC 1080 

25 CTGATTCCCT GGGACGCTGG CACCGACAGC GGGGTGACCT ATGAGTCACC CAACAAACCC 1140 

ACCATTCCCC AGGAGAAAAT CCGGCCCCTG ACCAGCCTGG ACCATCCTCA GAGTCCTTTC 1200 

TATGACCCAG AGGGTGGGTC CATCACTCAA GTAGCCAGAG TTGTCATCGA GAGAATCGCA 1260 

CGGAAGGGTG AACAATGCAA TATTGTACCT GACAATGTCG ATGATATTGT AGCTGACCTG 1320 

A GCTCCAGAAG AGAAAGATGA AGATGACACC CCTGAAACCT GCATCTACTC CAACTGGTCC 1380 

30 CCATGGTCCG CCTGCAGCTC CTCCACCTGT GACAAAGGCA AGAGGATGCG ACAGCGCATG 1440 

CTGAAAGCAC AGCTGGACCT CAGCGTCCCC TGCCCTGACA CCCAGGACTT CCAGCCCTGC 1500 

ATGGGCCCTG GCTGCAGTGA CGAAGACGGC TCCACCTGCA CCATGTCCGA GTGGATCACC 1560 

TGGTCGCCCT GCAGCATCTC CTGCGGCATG GGCATGAGGT CCCGGGAGAG GTATGTGAAG 1620 

CAGTTCCCGG AGGACGGCTC CGTGTGCACG CTGCCCACTG AGGAAACGGA GAAGTGCACG 1680 

35 GTCAACGAGG AGTGCTCTCC CAGCAGCTGC CTGATGACCG AGTGGGGCGA GTGGGACGAG 1740 

TGCAGCGCCA CCTGCGGCAT GGGCATGAAG AAGCGGCACC GCATGATCAA GATGAACCCC 1800 

GCAGATGGCT CCATGTGCAA AGCCGAGACA TCACAGGCAG AGAAGTGCAT GATGCCAGAG 1860 

TGCCACACCA TCCCATGCTT GCTGTCCCCA TGGTCOGAGT GGAGTGACTG CAGCGTGACC 1920 

TGCGGGAAGG GCATGCGAAC CCGACAGCGG ATGCTCAAGT CTCTGGCAGA ACTTGGAGAC 1980 

40 TGCAATGAGG ATCTGGAGCA GGTGGAGAAG TGCATGCTCC CTGAATGCCC CATTGACTGT 2040 

GAGCTCACCG AGTGGTCCCA GTGGTCGGAA TGTAACAAGT CATGTGGGAA AGGCCACGTG 2100 

ATTCGAACCC GGATGATCCA AATGGAGCCT CAGTTTGGAG GTGCACCCTG CCCAGAGACT 2160 

GTGCAGCGAA AAAAGTGCCG CATCCGAAAA TGCCTTCGAA ATCCATCCAT CCAAAAGCTA 2220 

CGCTGGAGGG AGGCCCGAGA GAGCCGGCGG AGTGAGCAGC TGAAGGAAGA GTCTGAAGGG 2280 

45 GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACGGCCT GGTCAGAATG CACCAAACTG 2340 

TGCGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGCTCCCAG 2400 

TTTACCAGCT GCAAAGACAA GAAGGAGATC AGAGCATGCA ATGTTCATCC TTGTTAGCAA 2460 

GGGTACGAGT TCCCCAGGGC TGCACTCTAG ATTCCAGAGT CACCAATGGC TGGATTATTT 2520 

_ A GCTTGTTTAA GACAATTTAA ATTGTGTACG CTAGTTTTCA TTTTTGCAGT GTGGTTCGCC 2580 

50 CAGTAGTCTT GTGGATGCCA GAGACATCCT TTCTGAATAC TTCTTGATGG GTACAGGCTG 2640 

AGTGGGGCGC CCTCACCTCC AGCCAGCCTC TTCCTGCAGA GGAGTAGTGT CAGCCACCTT 2700 

GTACTAAGCT GAAACATGTC CCTCTGGAGC TTCCACCTGG CCAGGGAGGA CGGAGACTTT 2760 

GACCTACTCC ACATGGAGAG GCAACCATGT CTGGAAGTGA CTATGCCTGA GTCCCAGGGT 2820 

GCGGCAGGTA GGAAACATTC ACAGATGAAG ACAGCAGATT CCCCACATTC TCATCTTTGG 2880 

55 CCTGTTCAAT GAAACCATTG TTTGCCCATC TCTTCTTAGT GGAACTTTAG GTCTCTTTTC 2940 

AAGTCTCCTC AGTCATCAAT AGTTCCTGGG GAAAAACAGA GCTGGTAGAC TTGAAGAGGA 3000 

GCATTGATGT TGGGTGGCTT TTGTTCTTTC ACTGAGAAAT TCGGAATACA TTTGTCTCAC 3060 

CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120 

TATTTAAATA TAAGGTAGCT AGTTTTTACA CCTGAGATAA ATAATAAGCT TAGAGTGTAT 3180 

60 TTTTCCC2TG CTTTTGGGGG TTCAGAGGAG TATGTACAAT TCTTCTGGGA AGCCAGCCTT 3240 

CTGAACTTTT TGGTACTAAA TCCTTATTGG AACCAAGACA AAGGAAGCAA AATTGGTCTC 3300 

TTTAGAGACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTAGA CGTTCAAGTT 3360 

TGCAAATCAG TTTTTAGCAA GAAAACATTT TTGCTATACA AACATTTTGC TAAGTCTGCC 3420 

CAAAGCCCCC CCAATGCATT CCTTCAACAA AATACAATCT CTGTACTTTA AAGTTATTTT 3480 

65 AGTCATGAAA TTTTATATGC AGAGAGAAAA AGTTACCGAG ACAGAAAACA AATCTAAGGG 3S40 

AAAGGAATAT TATGGGATTA AGCTGAGCAA GCAATTCTGG TGGAAAGTCA AACCTGTCAG 3600 

TGCTCCACAC CAGGGCTGTG GTCCTCCCAG ACATGCATAG GAATGGCCAC AGGTTTACAC 3660 

TGCCTTCCCA GCAATTATAA GCACACCAGA TTCAGGGAGA CTGACCACCA AGGGATAGTG 3720 
TAAAAGGACA TTTTCTCAGT TGGGTCCATC AGCAGTTTTT CTTCCTGCAT TTATTGTTGA 3780 
70 AAACTATTGT TTCATTTCTT CTTTTATAGG CCTTATTACT GCTTAATCCA AATGTGTACC 3840 
ATTGGTGAGA CACATACAAT GCTCTGAATA CACTACGAAT TTGTATTAAA CACATCAGAA 3900 
TATTTCCAAA TACAACATAG TATAGTCCTG AATATGTACT TTTAACACAA GAGAGACTAT 3960 
TCAATAAAAA CTCACTGGGT CTTTCATGTC TTTAAGCTAA GTAAGTGTTC AGAAGGTTCT 4020 
TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAG ATAGGGCTTT TGCTCCCTTG 4080 
75 TTCTTGGAGG GACCATTATT ACATCTCTGA ACTACCTTTG TATCCAACAT GTTTTAAATC 4140 
CTTAAATGAA TTGCTTTCTC CCAAAAAAAG CACAATATAA AGAAACACAA GATTTAATTA 4200 
TTTTTCTACT TGGGGGGAAA AAAGTCCTCA TGTAGAAGCA CCCACTTTTG CAATGTTGTT 4260 
CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAGTTCC TTAAGCTGGT GATTCCTAAT 4320 
CAAGGACAAG CCACCCTAGT GTCTCATGTT TGTATTTGGT CCCAGTTGGG TACATTTTAA 4380 
oO AATCCTGATT TTGGAGACTT AAAACCAGGT TAATGGCTAA GAATGGGTAA CATGACTCTT 4440 
GTTOGATTGT TATTTTTTGT TTGCAATGGG GAATTTATAA GAAGCATCAA GTCTCTTTCT 4500 
TACCAAAGTC TTGTTAGGTG GTTTATAGTT CTTTTGGCTA ACAAATCATT TTGGAAATAA 4560 
AGATTTTTTA CTACAAAAAT G 
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Seq ID NO: 510 Protein sequence 
Protein Accession #: BAB1B461.1 
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i 
I 

MRLSPAPLKL 
FSLRVEGDPD 
BTQFMSNCPV 
LCEQDSTFDG 
KNYVLWEYGG 
PSAEFSVDRT 
VTYESPNKPT 
NVDDIVADLA 
PDTQDFQPCM 
PTEETEKCTV 
QAEKCMMPEC 
MLPECPIDCE 
LRNPSIQKLR 
VKKRPKSSQF 



11 
I 

SRTPALLALA 
FYKPGTSYRV 
AVTESTPRRR 
VTDKPILDCC 
YASBGVKQVA 
HHLMSFLTMM 
IPQEKIRPLT 
PEEKDEDDTP 
GPGCSDEDGS 
NEECSPSSCL 
HTIPCLLSPW 
LTEWSQWSBC 



21 

I 

LPLAAALAFS 
TLSAAPPSYF 
TRIQVFWIAP 
ACGTAKYRLT 
ELGSPVKMEE 
GPSPDWNVGL 
SLDHPQSPFY 
ETCIYSNWSP 
TCTMSEWITW 
MTENGEWDEC 



31 
i 

DETLDKVPKS 
RGFTLIALRE 
PAGTGCVILK 
FYGNViSEKTH 
EIRQQSDEVL 
SAEDLCTKEC 
DPEGGSITQV 
WSACSSSTCD 
SPCSISCGMG 
SATCGMGMKK 



TSCKDKKEIR 



NKSCGKGHVI 
EQLKEESEGE 
ACNVHPC 



RTRMIQMEPQ 
QFPGCRMRPW 



41 

I 

EGYCSRILRA 
NREGDKEEDH 
ASIVQKRIIY 
PKDYPRRANH 
TVIKAKAQWP 
GWVQKWQDL 
ARWIERIAR 
KGKRMRQRML 
MRSRERYVKQ 
RKRMIKMNPA 
LKSLAELGDC 
FGGAPCPETV 
TAWSECTKLC 



51 
I 

QGTRREGYTE 
AGTFQIIDEE 
FQDEGSLTKK 
WSAIIGGSHS 
AWQPLNVRAA 
IPWDAGTDSG 
KGEOCNIVPD 
KAQLDLSVPC 
FPEDGSVCTL 
DGSMCKAETS 
NEDI*BQVEXC 
QRKKCRIRKC 
GGGIQERYMT 



Seq ID NO: Sll DMA sequence 

Nucleic Acid Accession #i NM_003 108.1 

Coding sequence: 76.. 1401 



41 



51 



I 

MVQQAESLEA 
KIERRKIMEQ 
RKKPKMDPSA 
GAGKAAQSGD 



ATSGAGGGSR 
LMFDLSLNFS 
LSEMIAGDWL 



11 
I 

ESNLPREALD 
SPDMHNABIS 
KPSASQSPEK 
YGGAGDDYVL 
PHQQLLQPPG 
LYYSFKNITK 
QSAHSASEQQ 
EANFSDLVFT 



21 
I 

TEEGEFMACS 
KRLGKRWKML 
SAAGGGGGSA 
GSLRVSGSGG 
QQPSQLLRRY 
QHPPPLAQPA 
LGGGAAAGNL 
Y 



31 



41 



PVALDESDPD WCKTASGHIK 
KDSERIPFIR EAER.LRLKHM 
GGGAGGAKTS KGSSKKCGKL 
GGAGKTVKCV FLDEDDDDDD 
NVAKVPASPT LSSSAESPEG 
LSPASSRSVS TSSSSSSGSS 
SLSLVDKDLD SFSEGSLGSH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



1 11 21 31 

I I I I I i 

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 
GCCCIGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 
CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCOGGT GGCCCTGGAC 
GAGAGCGACC CAGACTGGTG CAAGACGGCG TCGGGCCACA TCAAGCGGCC GATGAACGCG 
TTCATGGTAT GGTCCAAGAT CGAACGCAGG AAGATCATGG AGCAGTCTCC GGACATGCAC 
AACGCCGAGA TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 
ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 
TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 
CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGGGGGCGG AGGCGCGGGC 
GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCG0GGCC 
GCGGGCGCCA AGGCGGGOGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGGGGGC 
GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGGGGCGG CGCGGGCAAG 
ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 
CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 
CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 
CCCGCCAGCC CTACGCTGAG CAGCTOGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 
GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 
AAGAACATCA CCAAGCAGCA CCCGCOGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 
TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 
GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 
GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCOCT GTCGCTGGTG 
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTOGCT CTTTCTCTCG GAGGGTGCAG 
AGCTGGGTTC CTTGGGAGGA AGTTOTAGTG GTGATGATGA TGATGATGAT AATGATGATG 
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 
ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 
GAGGGGGOGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 
GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 
TTTTTAACAA AAAAAGGG 

Seq ID NO: 512 Protein sequence 
Protein Accession #» NP_003099.1 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



51 
I 

RPMNAFMVWS 
ADYPDYKYRP 
KAPAAAGAKA 
DDDELQLQIK 
ASLYDEVRAG 
SGSSGEDADD 
FEFPDYCTPE 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 513 DMA sequence 

Nucleic Acid Accession #s CAT duster 

1 11 21 31 41 51 

111)11 

GGTCGACCTA AATCTGATAA CTGGCTTATT ATGTAATTTA TTGGTGTTAT TATAGTAGAG 60 
ATTGGTAATC TACAGTAAGA TTTTCAGTTA GGATTTGAGA TTATGATAAT AACTAATAGA 120 
ATATTTCTAA ATTGGAATTA GAAGATTGTT GTATGACAGA GAGTCAGGAC TTGCCATTTG 180 
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GCAAACATCA AAGTCATTGT TTGGTGTGTA ATAGTACAAA ATCATCTTGC TTAACAGAGA 240 

AAGGATATCT GTTGCTCCOG AATQAAACAA TTTTTCTGAA ATAGAGGGCC CAGAATTGGT 300 

CTCTGACAAT TAATAAAGAC ATCAAAGATA GCAAAATGAT TTTTATATCT TA6GGCCAAT 360 

ACTACCAATT TAATAATTAA AACAAGTTCT GGTGAGCTCT GAACTTGGCA GAATTGGTGG 420 

CAACATAGAC TTTGGATTTT CCAAATTCCC CACATAAAAC AAAGGGGATC AACTAQATAG 480 
AAAAACCAGA AACCTTTGGA AATATCTGTT TAAAAAAAAA AAAAAGTCGA CGOGGGCC 

Seq ID NO: 514 DNA sequence 

Nucleic Acid Accession fl: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

GGAGCCACAG TGAAAGTCAA GAATOTCAGT GATTCCACAT TTAATATCTA CATTTTTGCA 60 

GGGCAGTTAC TCTTTTGTAG TATAACATTG AGCTGATAGC ACATAGTGTA GACAAGTGAA 120 

TACAGGATTC TCTGGGTTGT ATTCCCAGAA GTCTGGAGGT CATTTGGATA TTTGTGGGCC 180 

CTTGGCTTCA CTCTGACTTG TGTGACACAT AAAAATTGTG ATGAAATGTC CTATAGATGT 240 

CCTGCAGGTC TTAAAAGAAC CTTTCCAAAC TATGAAACAG CCCAGCAGCA CTGAGTTAGA 300 

GGTAAATTCT GAACCCTTGA ACACTAAAAC TATTCTAACT GCACATAGAA TTGGCAAGTA 360 

GCATTCTATG TCTATGAACA GTATGTCTTT TCTATATAAC AGAGAAAATC TTTTTAAGCA 420 

AACTACTCAG TTTAAAACCT AATTCTTCTC ATAATCTCAG TACTTTTGAA TGAAGACATA 480 

TCAATGCAAC AGTACACTCT TATTCAGGCA TTTGAAAGAA AGAATTCGAG ATCTAGTTTG 540 

TATCAGATAT TATAAATTAG TATGGTTTAG TCTTTGTCAT GAAATTCTAC TTAATTTTTG 600 

GACTATAGGT TTAAGAATGT AAGCAGAAGT TCTGCACCAA TCAGAATAAG CTACATTATG 660 

CTTGAGTGAC AACTACTGTA ATGACAAAAT ATCAGTGGCT TAATACAATG GTTTTTCTCT 720 

CATACTTGTT CATAAAGAGT CAGCAAGGAC CCTGCTCATT ATGGTCCCTC AGGGACCCAG 780 

GGTTGTTGGA AGCTCCACCA TTTTAGATAG CTCCCTTCAA AGTCAGCCAT CTTTGGCAGT 840 

CCATGTCCCC CAACAGGCTG GCAAAATTTG GCTCTGGATG GCTTCAAGGA TTGAGCATOG 900 

GGCAGTTTAA ATGCTTTCAA CATGGAAAGT GGACACOGGC CACTCCCACT CACATCCCTT 960 

GGGCCAGAAC TAGGTCACTG GGCCCGGACC TAACTTCGGA GGGTTGGGGA ATTGTAATTC 1020 

CTCCATGTAC CCAAAGTGGA GAGAAGCCAG ATACTGAGAA ACATCAATAA TGGCTAACAG 1080 

AAATCCATTC TACCATTCCC TTTGCCTAAA GTGAAAAGAT GAGTACTTTC ATCAATTTGT 1140 

AAACTGTACT TTTGAAGTAA ATCCTGGTAG CTTGCATGGG GGCTGGATTT CCAGAAAGCC 1200 

ATATGTAATT TGGGAATGAC ATTCACTTAA GCTCATAGAA TATCATTATT TGATGTAAAA 1260 

TGCCCTCATT TGCAATACAG GACCAAAATG CACTAACCAC AAAACCCCCC TCCCCACGGG 1320 

GCCCGGGGTC CCTATTCCCC TCCATCCCTT TAAATGAGGC ATTCTATGAT TTGGAATGGA 1380 

AGCCCAGTTG TAGTCGTAAG AATTTTACTT AATTCAAGAA TTATTCTCAC TGAATATGTG 1440 

CCAGTTCTGA AAGGAATGCA AAGTCAAATT TTGCATCTTC TTTGCTCAAG GGCCTTTAGA 1500 

TGTAACAACA CAGACATGAT ACAAGGCTGA CAATGACATT ATGATTTAAA TATGTTAAAC 1560 

AACTTATTAA ATTGTGAATC AACAAAAAAT TATGTTCTTT ATTTTATGGT TTTGCATAGT 1620 

CCTGACTCAC TGCCTACATA CCCCTCTTGT TCCTCAGTTC TTATCCCTGA TTTCTTACAG 1680 

GATGGCCTAA GACAGCTGTA GATGTTTTTA TTTAGCAAAA AAAAAAAAAA AAAAGTCGAC 1740 
GCGGCCGCGA ATTTAGTAG 

Seq ID NO: 515 DNA sequence 
Nucleic Acid Accession #: NM_012427 
Coding sequence : 43 . . 924 

1 11 21 31 41 51 

I - I I i I I 

CTTGTGGTTC CTCTCTACTT GGGGAAATCA GGTGCAGCGG CCATGGCTAC AGCAAGACCC 60 

CCCTGGATGT GGGTGCTCTG TGCTCTGATC ACAGCCTTGC TTCTGGGGGT CACAGAGCAT 120 

GTTCTCGCCA ACAATGATGT TTCCTGTGAC CACCCCTCTA ACACCGTGCC CTCTGGGAGC 180 

AACCAGGACC TGGGAGCTGG GGCCGGGGAA GACGCCCGGT CGGATGACAG CAGCAGCCGC 240 

ATCATCAATG GATCCGACTG CGATATGCAC ACCCAGCCGT GGCAGGCCGC GCTGTTGCTA 300 

AGGCCCAACC AGCTCTACTG CGGGGCGGTG TTGGTGCATC CACAGTGGCT GCTCACGGCC 360 

GCCCACTGCA GGAAGAAAGT TTTCAGAGTC CGTCXCGGCC ACTACTCCCT GTCACCAGTT 420 

TATGAATCTG GGCAGCAGAT GTTCCAGGGG GTCAAATCCA TCCCCCACCC TGGCTACTCC 480 

CACCCTGGCC ACTCTAACGA CCTCATGCTC ATCAAACTGA ACAGAAGAAT TCGTCCCACT 540 

AAAGATGTCA GACCCATCAA OGTCTCCTCT CATTGTCCCT CTGCTGGGAC AAAGTGCTTG 600 

GTGTCTGGCT GGGGGACAAC CAAGAGCCCC CAAGTGCACT TCCCTAAGGT CCTCCAGTGC 660 

TTGAATATCA GCGTGCTAAG TCAGAAAAGG TGCGAGGATG CTTACCCGAG ACAGATAGAT 720 

GACACCATGT TCTGCGCCGG TGACAAAGCA GGTAGAGACT CCTGCCAGGG TGATTCTGGG 780 

GGGCCTGTGG TCTGCAATGG CTCCCTGCAG GGACTCGTGT CCTGGGGAGA TTACCCTTGT 840 

GCCCGGCCCA ACAGACCGGG TGTCTACACG AACCTCTGCA AGTTCACCAA GTGGATCCAG 900 

GAAACCATCC AGGCCAACTC CTGAGTCATC CCAGGACTCA GCACACCGGC ATCCCCACCT 960 

GCTGCAGGGA CAGCCCTGAC ACTCCTTTCA GACCCTCATT CCTTCCCAGA GATGTTGAGA 1020 

ATGTTCATCT CTCCAGCCCC TGACCCCATG TCTCCTGGAC TCAGGGTCTG CTTCCCCCAC 1080 

ATTGGGCTGA CCGTGTCTCT CTAGTTGAAC CCTGGGAACA ATTTCCAAAA CTGTCCAGGG 1140 

CGGGGGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAGCTCAGG GCCCATCCCT 1200 
TCTCTGCAGC TCTGACCCAA ATTTAGTCCC AGAAATAAAC TGAGAAGTGG AAAAAAAAAA 

Seq ID NO: 516 Protein sequence 
Protein Accession th NP_036S59 

1 11 21 31 41 51 

I I I I I I 

MATARPPWMW VLCALITALL LGVTEHVLAN NDVSCDHPSN TVPSGSNQDL GAGAGEDARS 60 

DDSSSRIING SDCDMHTQPW QAALLLRPNQ LYCGAVLVHP QMLLTAAHCR KKVFRVRI/3H 120 

YSLSPVYESG QQMFQGVKSI PHPGYSHPGH SNDLMLIKLN RRIRPTKDVR PINVSSHCPS 180 

AGTKCLVSGW GTTKSPQVHP PKVLQCLNIS VLSQKRCEDA YPRQIDDTMF CAGDKAGRDS 240 
CQGDSGGPW CNGSLQGLVS WGDYPCARPN RPGVYTNLCK FTKWIQETIQ ANS 



1153 
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Seq ID NO: 517 DBA sequence 
Nucleic Acid Accession 8: NNJ>01719 
Coding sequence: 123.. 1418 

5 1 11 21 31 41 51 

I I I I I I 

GGGCGCAGCQ GGGCCOGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

10 CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC OGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

15 GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660 

ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

20 AGGAGGGCTG GCTGGTGTTT GACATCACAQ CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CAOGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

25 AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAAOGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGQTGCCCAA GCCCTGCTGT GOGCCCACGC 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

30 ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGT1T CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCGGGA AGTTCCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 



35 
40 



55 
60 



Seq ID NO i 518 Protein sequence 
Protein Accession ft: NP 001710 



1 11 21 31 41 51 

ac I I II i I 

40 MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQB RREMQREILS 60 

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120 

LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180 

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWWNPR 240 

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF PKATEVHFRS IRSTGSKQRS 300 

50 QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360 

GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420 
RNMWRACGC H 



Seq ID NO; 519 DNA sequence 

Nucleic Acid Accession 0: Eos sequence 

Coding sequence t 2 6 4.. 7 82 



1 11 21 31 41 51 

I I I I I 1 

CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA CAGCTGAAGC ATGAGGAAAC 60 

TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AGTAAGGACT TCAACTGACC 120 

TTCCTCAGAC TGAGAACTGT TTCCAGTATA TACATCAAGT CACTGAGATC TCCAGCACCC 180 

TGCCGGTGGC ACTACTGAGA GACGAGGTGC CAGGGTGGTT CCTGAAAGTG CCTGAGCCCC 240 

AACTTATCAG CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 300 

65 TGATCGCGGT GGTGTCCCTC TTCCTGCAGG CCTGCTTCCT CACCGCCATC AACTACCTGC 360 

. TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 420 

CCAGGCCCAQ CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 480 

CAGAGAGAGA CATCCCAATG TCTGATTCCC TTTACAGGCA TGACAGCGAC ACACCCTCAG 540 

ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCACAGAG GATGTGGATT 600 

70 ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCCG CTGGACTATG 660 

AGAACATAAA GGAAATCACA GATTATGTCA ATGTCAATCC AGAAAGACAC AAGCCCAGTT 720 

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAA GTGGCCATGT 780 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 840 

GGAAATGCCA TTTTTCCCCC TTAAACAAGG CATGGGGCTC ACAAGTCTAT GGAGACAGGC 900 

75 CAAAAAGAAT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA COCATGGACT 960 

CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG CTTGGCTGGG 1020 

AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG 1080 

TAAAGTATGA GAACTATGGA GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA 1140 

AAATTTTAAA AAGGTTGAAT CAGCTGTTGT AGAGTTCTAT TTGGCAATCT CATGGTTAAA 1200 

OO TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 
AAAATAGCAA CCACCACCA 



Seq ID NO: 520 Protein sequence 
Protein Accession ft: Eos sequence 
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1 11 21 31 41 51 

I I I I I I 

MLTEVMEVWH GLVIAWSLF LQACFLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 60 

HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQWFSD 120 
PGBLKNDSPL DYENIKEITD YVNVNPERHK PSFWYFVNPA LSEPABYDQV AM 

Seq ID NO: 521 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence.- 107.. 328 

1 11 21 31 41 51 

I I 1 I 1 I 

CTGCTCTGTC TGAGCCAGOG GAATATGATC AAGTGGCCAT GT6AATTCCA AATATTTTTA 60 

ATGGGGTCCA GTTCTCTATO GATTCTTACA TTTAATTTGT AGGGAAATGC CATTTTTCCC 120 

CCTTAAACAA GGCATGGGGC TCACAAGTCT ATGGAGACAG GCCAAAAAGA ATGTGGAGAA 180 

GAAAACTGAT AAATACACAG AGGTCCTCAA GACCCATGGA CTCCTGGTCT GTACCCAAAA 240 

AAGCTGTTGG TTCCTCAAAA ACAAAAACAA GGCTTGGCTG GGAAAACAGG CCAATGCCCC 300 

GGCAAGAAAG GTTGAGATCA GATGTTAGGA AGAACTTTCA GGTAAAGTAT GAGAACTATG 360 

GAGTCCATCA GCAGAGATAG TAGTGAAGTC TCTCCCCAGG GAAAATTTTA AAAAGGTTGA 420 

ATCAGCTGTT GTAGAGTTCT ATTTGGCAAT CTCATGGTTA AATGACTTCC CTTTGAGCTC 480 

TTTAATTATT GGCAATAAAC AACTTCTTTA AAAGTTTTAA ATAAAATAGC AACCACCACC 540 
A 

Seq ID NO: 522 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I ! I I 

MPFFPLKQGM GLTSLWRQAK KNVEKKTDKY TEVLKTHGLL VCTQKSCSFL KNKNKAWLGK 60 
QANAPARKVE IRC 

Seq ID NO: 523 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence* 211.. 1895 

1 11 21 31 41 51 

I I i I ) I 

GGATCTGAGG GGCGCCCAOT CACTTCCTCC ACGTTCTCGT GCTGGGCGGG AGGAGCGGAT 60 

GGGGCTTGGG AGGCAGCCTG CTCTCCAGTC CCTATCCACC CACAGGTTTT TTGGGTCGGA 120 

GAGGAATTAT CTGATAAAAT TCCTGGGTTA ATATTTTTAA AAACGGAGAG TTTTTAAAAA 180 

TGATTTTTTT CCCTCGAAAA TGACCTTTTT ATGCTTCGAA GCAGTTTGTC AACCAGCATA 240 

GTGCTTTTTC TTTTCTCTTC TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 300 

CACAGGTTCC TTGAACAGCT GGATTCTGAT GGCACCATTA CTATAGAGGA GCAGATTGTC 360 

CTTGTGCTGA AAGCGAAAGT ACAATGTGAA CTCAACATCA CAGCTCAACT CCAGGAGGGA 420 

GAAGGTAATT GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGAGG AACAGTGGGG 480 

AAAATATOGG CTGTTCCATG CCCTCCTTAT ATTTATGACT TCAACCATAA AGGAGTTGCT 540 

TTCCGACACT GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA TAAAACATGG 600 

GCCAATTATT CAGACTGCCT TCGCTTTCTG CAGCCAGATA TCAGCATAGG AAAGCAAGAA 660 

TTCTTTGAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT TGGTTCCTTG 720 

GCTGTGGCTA TTCTCATCAT TGGTTACTTC AGACGATTGC ATTGCACTAG GAACTATATC 780 

CACATGCACT TATTTGTGTC TTTCATGCTG AGAGCTACAA GCATCTTTGT CAAAGACAGA 840 

GTAGTCCATG CTCACATAGG AGTAAAGGAG CTGGAGTCCC TAATAATGCA GGATGACCCA 900 

CAAAATTCCA TTGAGGCAAC TTCTGTGGAC AAATCACAAT ATATCGGGTG CAAGATTGCT 960 

GTTGTGATGT TTATTTACTT CCTGGCTACA AATTATTATT GGATCCTGGT GGAAGGTCTC 1020 

TACCTGCATA ATCTCATCTT TGTGGCTTTC TTTTCGGACA CCAAATACCT GTGGGGCTTC 1080 

ATCTTGATAG GCTGGGGGTT TCCAGCAGCA TTTGTTGCAG CATGGGCTGT GGCACGAGCA 1140 

ACTCTGGCTG ATGCGAGGTG CTGGGAACTT AGTGCTGGAG ACATCAAGTG GATTTATCAA 1200 

GCACCGATCT TAGCAGCTAT TGGGCTGAAT TTTATTCTGT TTCTGAATAC GGTTAGAGTT 1260 

CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATG ACACAAGGAA GCAATACAGG 1320 

AAACTGGCCA AATCGACACT GGTCCTGGTC CTAGTCTTTG GAGTGCATTA CATCGTGTTC 1380 

GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620 

AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT 1860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 524 Protein sequence 
Protein Accession #: Bos sequence 

1 11 21 31 41 51 

I I I I I I 

MLRSSI*STSI VLFLFSSFST INESISSRKR HRPLEQLDSD GTITIEEQIV LVLKAKVQCE 60 

LNITAQLQBG EGNCPPBWDG LICHPRGTVG KISAVPCPPY IYDFNHKGVA PRHCNPNGTW 120 

DFMHSLNKTW ANYSDCLRFL QPDISIGKQB FFERLYVMYT VGYSISFGSL AVAILIIGYF 160 

RRLHCTRNYI HMHLFVSFML RATSIPVKDR WHAHIGVKE LBSLIMQDDP QNSIEATSVD 240 

KSQYIGCKIA WMFIYFLAT NYWILVEGL YLHNLIFVAF PSDTKYLWGF ILIGKGFPAA 300 

FVAAWAVARA TLADARCWBL SAGDIKWIYQ APILAAIGLN FILPLNTVRV LATKIWBTNA 360 

VGHDTRKQYR KLAKSTLVLV LVFGVHYIVP VCLPHSFTGL GWEIRMHCEL PFNSFQGFFV 420 

SIIYCYCNGE VQAEVKKMWS RMNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480 
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WO 03/042661 



RKVLISGKAA KIASRQPDSH ITLP6YVHSN SBQDCLPHSP HEETKEDSGR QGDDZLMBKP 540 
SRPMESNPDT EGCQGETEDV L 

Seq ID NO: S2S DNA sequence 
Nucleic Acid Accession #; NM_005048 
Coding sequence: 143.. 1795 

1 11 21 31 41 51 

I I I I I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGOGTCGT TACTGGCCAC AAGTTTGCTC 60 

TGG6CCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGG03 TCGCTCCAGG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACOG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TSAGGCAACT TCTGTGGACA AATCACAATA 840 

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200 

CACAAGGAAG CAATACAGGA AACTGGCCAA ATOGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTGG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATGGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 

GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280 

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640 
T 

Seq ID NO: 526 Protein sequence 
Protein Accession NP_005039 

1 11 21 31 41 51 

1 1 I 1 I I 

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGB 60 

GNCPPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAP RHCNPNGTWD PMHSUHCTMA 120 

NYSDCLRPLQ PDISIGKQEP FERLYVMYTV GYSISPGSIA VAILIIGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIFVKDRV VHAHIGVKBI* BSLIKQDDPQ NSIEATSVDK SQYIGCK1AV 240 

VMPIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGPI LIGWGFPAAF VAAWAVARAT 300 

LADARCWELS AGDIKWIYQA PILAAIGLNP ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360 

LAKSTLVLVL VPGVHYTVFV CLPHSPTGLG WEIRMHCELF FNSFQGFPVS IIYCYCNGEV 420 

QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480 

IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE S40 
GCQGETEDVL 

Seq ID NO: 527 DNA sequence 
Nucleic Acid Accession #x XM_036683 
Coding sequence: 38.. 3655 

1 11 21 31 41 51 

I I I I I I 

GCTTTGCCCA GTAGTTGGAA AGTGAACTCG ACTCGTQATG GTTCTCCTGT CACTTTGGTT 60 

GATAGCAGCC GCTCTGGTAG AGGTTAGGAC TTCAGCTGAT GGACAAOCTG GTAATGAAGA 120 

AATGGTGCAA ATAGATTTAC CAATAAAGAG ATATAGAGAG TATGAGCTGG TGACTCCAGT 180 

CAGCACAAAT CTAGAAGGAC GCTATCTCTC CCATACTCTT TCTGCGAGTC ACAAAAAGAG 240 

GTCAGCGAGG GACGTGTCTT CCAACCCTGA GCA GT TGT TC TTTAACATCA CGGCATTTGG 300 

AAAAGATTTT CATCTGCGAC TAAAGCCCAA CACTCAACIA GTAGCTCCTG GGGCTGTTGT 360 

GGAGTGGCAT GAGACATCTC TGGTGCCTGG GAATATAACC GATCCCATTA ACAACCATCA 420 
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ACCAGGAAGT GCTACGTATA GAATCCGGAA AACAGAGCCT TTGCAGACTA ACTGTGCTTA 480 
TGTTGGTGAC ATCGTGGACA TTCCAQGAAC CTCTGTTGCC ATCAGCAACT GTGATGGTCT 540 
GGCTGGAATG ATAAAAAGTG ATAATGAAGA GTATTTCATT GAACCCTTG6 AAAGAGGTAA 600 
ACAGA7GGAG GAAGAAAAAG GAAGGATTCA TGTTGTCTAC AAGAGATCAG CTGTAGAACA 660 
5 GGCTCCCATA GACATGTCCA AAGACTTOCA CTACAGAGAG TOGGACCTGG AAGGCCTTGA 720 
TGATCTAGGT ACTGTTTATG GCAACATCCA CCAGCAGCTG AATGAAACAA TGAGACGCCG 780 
CAGACACGCG GGAGAAAACG ATTACAATAT CGAGGTACTG CTGGGAGTGG ATGACTCTGT 840 
GGTCOGTTTC CATGGCAAAG AGCACGTCCA AAACTACCTC CTGACCCTAA TGAACATTGT 900 
GAATGAAATT TACCATGATG AGTCCCTCGG AGTGCATATA AATGTGGTCC TGGTGCGCAT 960 
10 GATAATGCTG GGATATGCAA AGTCCATCAG CCTCATAGAA AGGGGAAACC CATCCAGAAG 1020 
CTTGGAGAAT GTGTGTOGCT GGGCGTCCCA ACAGCAAAGA TCTGATCTCA ACCACTCTGA 1080 
ACACCATGAC CATGCAATTT TTTTAACCAG GCAAGACTTT GGACCTGCTG GAATGCAAGG 1140 
ATATGCTCCA GTCACCGGCA TGTGTCATCC AGTGAGAAGT TGTACCCTGA ATCATGAGGA 1200 
TGGTTTTTCA TCTGCTTTTG TAGTAGCCCA TGAAACGGGC CATGTGTT GG GAATGGAGCA 1260 
15 TGATGGACAA GGCAACAGGT GTGGTGATGA GACTGCTATG GGAAGTGTCA TGGCTCCCTT 1320 
GGTACAAGCA GCATTCCATC GTTACGACTG GTCCCGATGC AGTGGTCAAG AACTGAAAAG 1380 
ATATATCCAT TCCTATGACT GTCTCCTTGA TGACCCTTTT GATCATGATT GGCCTAAACT 1440 
CCCAGAACTT CCTGGAATCA ATTATTCTAT GGATGAGCAA TGTCGTTTTG ATTTTGGTGT 1500 
TGGCTATAAA ATGTGCACCG CGTTCCGAAC CTTTGACCCA TGTAAACAGC TGTGGTGTAG 1560 
20 CCATCCTGAT AATCCCTACT TTTGTAAGAC TAAAAAGGGA CCTCCACTTG ATGGGACTGA 1620 
ATGTGCTGCT GGAAAATGGT GCTATAAGGG TCATTGCATG TGGAAGAATG CTAATCAGCA 1680 
AAAACAAGAT GGCAATTGGG GGTCATGGAC TAAATTTGGC TCCTGTTCTC GGACATGTGG 1740 
AACTGGTGTT CGTTTCAGAA CACGCCAGTG CAATAATCCC ATGCCCATCA ATGGTGGTCA 1800 
GGATTGTCCT GGTGTTAATT TTGAGTACCA GCTTTGTAAC ACAGAAGAAT GCCAAAAACA 1860 
25 CTTTGAGGAC TTCAGAGCAC AGCAGTGTCA GCAGCGAAAC TCCCACTTTG AATACCAGAA 1920 
TACCAAACAC CACTGGTTGC CATATGAACA TCCTGACCCC AAGAAAAGAT GCCACCTTTA 1980 
CTGTCAGTCC AAGGAGACTG GAGATGTTGC TTACATGAAA CAACTGGTGC ATGATGGAAC 2040 
GCACTGTTCT TACAAAGATC CATATAGCAT ATGTGTGCGA GGAGAGTGTG TGAAAGTGGG 2100 
CTGTGATAAA GAAATTGGTT CTAATAAGGT TGAGGATAAG TGTGGTGTCT GTGGAGGAGA 2160 
30 TAATTCCCAC TGCCGAACCG TGAAGGGGAC ATTTACCAGA ACTCCCAGGA AGCTTGGGTA 2220 
CCTTAAGATG TTTGATATAC CCCCTGGGGC TAGACATGTG TTAATCCAAG AAGAOGAGGC 2280 
TTCTCCTCAT ATTCTTGCTA TTAAGAACCA GGCTACAGGC CATTATATTT TAAATGGCAA 2340 
AGGGGAGGAA GCCAAGTCGC GGACCTTCAT AGATCTTGGT GTGGAGTGGG ATTATAACAT 2400 
TGAAGATGAC ATTGAAAGTC TTCACACOGA TGGACCTTTA CATGATCCTG TTATTGTTTT 2460 
JJ GATTATACCT CAAGAAAATG ATACCCGCTC TAGCCTGACA TATAAGTACA TCATCCATGA 2S20 
AGACTCTGTA CCTACAATCA ACAGCAACAA TGTCATCCAG GAAGAATTAG ATACTTTTGA 2580 
GTGGGCTTTG AAGAGCTGGT CTCAGTGTTC CAAACCCTGT GGTGGAGGTT TCCAGTACAC 2640 
TAAATATGGA TGCCGTAGGA AAAGTGATAA TAAAATGGTC CATCGCAGCT TCTGTGAGGC 2700 
A CAACAAAAAG CCGAAACCTA TTAGACGAAT GTGCAATATT CAAGAGTGTA CACATCCACT 2760 

40 CTGGGTAGCA GAAGAATGGG AACACTGCAC CAAAACCTGT GGAAGTTCTG GCTATCAGCT 2820 
TCGCACTGTA CGCTGCCTTC AGCCACTCCT TGATGGCACC AACCGCTCTG TGCACAGCAA 2880 
ATACTGCATG GGTGACCGTC CCGAGAGCCG CCGGCCCTGT AACAGAGTGC CCTGCCCTGC 2940 
ACAGTGGAAA ACAGGACCCT GGAGTGAGTG TTCAGTGACC TGCGGTGAAG GAACGGAGGT 3000 
GAGGCAGGTC CTCTGCAGGG CTGGGGACCA CTGTGATGGT GAAAAGCCTG AGTOGGTCAG 3060 
45 AGCCTGTCAA CTGCCTCCTT GTAATGATGA ACCATGTTTG GGAGACAAGT CCATATTCTG 3120 
TCAAATGGAA GTGTTGGCAC GATACTGCTC CATACCAGGT TATAACAAGT TATGTTGTGA 3180 
GTCCTGCAGC AAGGGCAGTA GCACCCTGCC ACCACCATAC CTTCTAGAAG CTGCTGAAAC 3240 
TCATGATGAT GTCATCTCTA ACCCTAGTGA CCTCCCTAGA TCTCTAGTGA TGCCTACATC 3300 
TTTGGTTCCT TATCATTCAG AGACCCCTGC AAAGAAGATG TCTTTGAGTA GCATCTCTTC 3360 
50 AGTGGGAGGT CCAAATGCAT ATGCTGCTTT CAGGCCAAAC AGTAAACCTG ATGGTGCTAA 3420 
TTTACGCCAG AGGAGTGCTC AGCAAGCAGG AAGTAAGACT GTGAGACTGG TCACCGTACC 3480 
ATCCTCCCCA CCCACCAAGA GGGTCCACCT CAGTTCAGCT TCACAAATGG CTGCTGCTTC 3540 
CTTCTTTGCA GCCAGTGATT CAATAGGTGC TTCTTCTCAG GCAAGAACCT CAAAGAAAGA 3600 
TGGAAAGATC ATTGACAACA OACGTCCGAC AAGATCATCC ACCTTAGAAA GATGAGAAAG 3660 
55 TGAACCAAAA AGGCTAGAAA CCAGAGGAAA ACCTGGACAA CCTCTCTCTT CCCATGGTGC 3720 
ATATGCTTGT TTAAAGTGGA AATCTCTATA GATCGTCAGC TCATTTTATC TGTAATTGGA 3780 
AGAACAGAAA GTGCTGGCTC ACTTTCTAGT TGCTTTCATC CTCCTTTTGT TCTGCATTGA 3840 
CTCATTTACC AGAATTCATT GGAAGAAATC ACCAAAGATT ATTACAAAAG AAAAATATGT 3900 
TGCTAAGATT GTGTTGGTCG CTCTCTGAAG CAGAAAAGGG ACTGGAACCA ATTGTGCATA 3960 
60 TCAGCTGACT TTTTGTTTGT TTTAGAAAAG TTACAGTAAA AATTAAAAAG AGATACCAAT 4020 
GGTTTACACT TTAACAAGAA ATTTTGGATA TGGAACAAAG AATTCTTAGA CTTGTATTCC 4080 
TATTTATCTA TATTAGAAAT ATTGTATGAG CAAATTTGCA GCTGTTGTGT AAATACTGTA 4140 
TATTGCAAAA ATCAGTATTA TTTTAAGAQA TGTGTTCTCA AATGATTGTT TACTATATTA 4200 
CATTTCTGGA TGTTCTAGGT GCCTGTCGTT GAGTATTGCC TTGTTTGACA TTCTATAGGT 4260 
65 TAATTTTCAA AGCAGAGTAT TACAAAAGAG AAGTTAGAAT TACAGCTACT GACAATATAA 4320 
AGGGTTTTGT TGAATCAACA ATGTGATACG TAAATTATAG AAAAAGAAAA GAAACACAAA 4380 
AGCTATAGAT ATACAGATAT CAGCTTACCT ATTGCCTTCT ATACTTATAA TTTAAAGGAT 4440 
TG G TGTCTTA GTACACTTGT GGTCACAGGG ATCAACGAAT AGTAAATAAT GAACTCGTGC 4500 
AAGACAAAAC TGAAACCCTC TTTCCAGGAC CTCAGTAGGC ACOGTTGAGG TGTCCTTTGT 4560 
70 TT T T G T GT GT GTGTGTTCTT TTTTAATTTT CGCATTGTTG ACAGATACAA ACAGTTATAC 4620 
TCAATGTACT GTAATAATCG CAAAGGAAAA AGTTTTGGGA TAACTTATTT GTATG TTGGT 4680 
AGCTGAGAAA AATATCATCA GTCTAGAATT GATATTTGAG TATAGTAGAG CTTTGGGGCT 4740 
TTGAAGGCAG GTTCAAGAAA GCATATGTCG ATGGTTGAGA TATTTATTTT CCATATGGTT 4800 
CATGTTCAAA TGTTCACAAC CACAATGCAT CTGACTGCAA TAATGTGCTA ATAATTTATG 4860 
75 TCAGTAGTCA CCTTGCTCAC AGCAAAGCCA GAAATGCTCT CTOCAGGGAG TAGATGTAAA 4920 
GTACTTGTAC ATAGAATTCA GAACTGAAGA TATTTATTAA AAGTTGATTT TTTTTTCTTG 4980 
ATAGTATTTT TATGTACTAA ATATTTACAC TAATATCAAT TACATATTTT GGTAAACTAG 5040 
AGAGACATAA TTAGAGATGC ATGCTTTGTT CTGTGCATAQ AGACCTTTAA GCAAACTACT 5100 
OA ACAGCCAACT CAAAAGCTAA AACTGAACAA ATTTGATGTT ATGCAAACAT CTTGCATTTT 5160 
o0 TAGTAGTTGA TATTAAGTTG ATGACTTGTT TCCCTTCAAG GAAACATTAA ATTGTATGGA 5220 
CTCAGCTAGC TGTTCAATGA AATTGTGAAT TAGAAACATT TTTAAAAGTT TTTGAAAGAG 5280 
ATAAGTGCAT CATGAATTAC ATGTACATGA GAGGAGATAG TGATATCAGC ATAATGATTT 5340 
TGAGGTCAGT ACCTGAGCTG TCTAAAAATA TATTATACAA ACTAAAATGT AGATGAATTA 5400 
ACCTCTCAAA GCACAGAATG TGCAAGAACT TTTGCATTTT AATCGTTGTA AACTAACAGC 5460 
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TTAAACTATT GACTCTATAC CTCTAAAGAA TTGCTGCTAC TTTGTGCAAG AACTTTGAAG 
GTCAAATTAG GCAAATTCCA GATAGTAAAA CAATCCCTAA GCCTTAAGTC TTTTTTTTTT 
CCTAAAAATT CCCATAGAAT AAAATTCTCT CTAGTTTACT TGTGTGTGCA TACATCTCAT 
CCACAGGGGA AGATAAAGAT GGTGACACAA ACAGTTTCCA TAAAGATGTA CATATTCATT 
ATACTTCTGA CCTTTGGGCT TTCTTTTCTA CTAAGCTAAA AATTCCTTTT TATCAAAGTG 
TACACTACTG ATGCTGTTTG TTGTACTGAG AGCACGTACC AATAAAAATG TTAACAAAAT 
AT 

Seq ID NO: 528 Protein sequence 
Protein Accession ft: XP 036663 



1 

i 

MVLLSLWLIA 
LSASHKKRSA 
TDPINNHQPG 
I EPLERGKQM 
LNETMRRRRH 
INWLVRMIM 
FGPAGMQGYA 
KGSVMAPLVQ 
QCRFDFGVGY 
MWKNANQQKQ 
NTEBCQKHPE 
KQLVHDOTHC 
RTPRKLGYLK 
GVEWDYNIED 
QEELDTPEWA 
IQECTHPLWV 
CNRVPCPAQW 
LGDKSIPCQM 
RSLVMPTSLV 
TVRLVTVPSS 
STLER 



11 

! 

AALVEVRTSA 
RDVSSNPEQL 
SATYRIRKTE 
EEEXGRIHW 
AGENDYNIBV 
LGYAXSISIiI 
PVTGMCHPVR 
AAPHRYHWSR 
KMCTAPRTFD 
DGNWGSWTKP 
DPRAQQCQQR 
SYKDPYSICV 
MFDIPPGARH 
DISSLHTDGP 
UCSWSQCSKP 
AEEWEHCTKT 
KTGPWSECSV 
EVLARYCSIP 
PYHSETPAKK 
PPTKRVHLSS 



21 
I 

DGQAGNEEMV 
FFNITAFGKD 
PLQTNCAYVG 
YKRSAVEQAP 
LLGVDDSWR 
ERGNPSRSLE 
SCTLNHEDGF 
CSGQELKRYI 
PCKQLWCSHP 
GSCSRTCGTG 
NSHFEYQNTK 
RGECVKVGCD 
VLIQEDEA8P 
LHDPVIVLII 
CGGGFQYTKY 
CGSSGYQLRT 
TOGEGTEVRQ 
GYNKLCCESC 
MSLSSISSVG 
ASQMAAASPP 



31 
I 

QIDLPIKRYR 
FHLRLKPNTQ 
DIVDIPGTSV 
IDMSKDPHYR 
FHGKEHVQNY 
NVCRWASQQQ 
SSAFWAHET 
HSYDCLLDDP 
DNPYPCKTKK 
VRFRTRQCNN 
HHWLPYEHPD 
KEIGSNKVED 
RILAIKKQAT 
PQENDTRSSL 
GCRRKSDNKM 
VRCLQPLLDG 
VLCRAGDHCD 
SKRSSTLPPP 
GPNAYAAFRP 
AASDSIGASS 



41 

I 

EYELVTPVST 
LVAPGAWEW 
AISNCDGLAG 
ESDLEGliDDL 
LLTLMNIVNE 



51 
I 

NLEGRYLSHT 
HETSLVPGNI 
MIKSDNEEYF 
GTVYGNIHQQ 
IYHDESLGVH 
DHAIFLTRQD 



GHVLGMEHDG 
FDHDWPKLPE 
GPPLDGTECA 
PMPINGGQDC 
PKKRCHLYCQ 
KCGVCGGDNS 
GHYILNQKGE 
TYKYIIHEDS 
VHRSFCEANK 
TNRSVHSKYC 
GEKPESVRAC 
YLLEAAETHD 
KSKPDGANLR 
QARTSKKDGK 



LPGINYSMDE 
AGKWCYKGHC 
PGVNPBYQLC 
SKETGDVAYM 
HCRTVKGTFT 
EAKSRTFIDL 
VPTINSNNVI 
KPKPIRRMCN 
MGDRPESRRP 
QLPPCNDEPC 
DVISNPSDLP 
QRSAQQAGSK 
IIDNRRPTRS 



Seq ID NO: 529 DNA sequence 
Nucleic Acid Accession #: NM_002774 
Coding sequence : 24 6 .. 9 8 0 



AGGCGGACAA 
AGCCCGGGGC 
GCTGGCTCGC 
GGCAGCTGTT 
OGGCCATGAA 
AGAATAAGTT 
TCTACACCTC 
CAGCTGCCCA 
AAAGGGAGAG 
ATGCCGCCAG 
CTGAACTCAT 
ACATCCTGGG 
ACATCCACCT 
ACATGTTGTG 
GTCOGCTGGT 
GATCAAAGGA 
AAACCATTCA 
TGGCTGGTTC 
TGACCCTGAT 
CAGCTCCATC 
TACCCCCACC 
TTCCACACGT 
CTGCTTACTG 
AAGATGAAGA 
TTTCACACCT 
TATTTT 



11 

I 

AGCCCGATTG 
AGGGGOGGGG 
TCTCTCCTGG 
CCTTCCCCCG 
GAAGCTGATG 
GGTGCATGGC 
GGGCCACTTG 
CTGCAAAAAA 
TTCCCAGGAG 
CCATGACCAG 
CCAGCCCCTT 
CTGGGGCAAG 
GGTGTCCCGT 
TGCTGGGGAT 
ATGTGGAGAC 
GAAGCCAGGA 
GGCCAAGTGA 
CAGAACGTCT 
GCTTAATAAA 
CTTGCATCAC 
ACTAAGAGAA 
TTGATTTCTT 
TOGGTGTCCC 
TAAGGATGAT 
ATGACATACA 



21 
I 

TTCCTGGGCC 
GCCAGTGTGG 
GGACACAGAG 
ACTCAAGAAT 
GTGGTGCTGA 
GGACCCTGCG 
CTCTGTGGTG 
CCGAATCTTC 
CAGAGTTCTG 
GACATCATGC 
CCCCTGGAGA 
ACAGCAGATG 
GAGGAGTGTG 
GAGAAGTACG 
CACCTCCGAG 
GTCTACACCA 
CCCTGACATG 
CTCACCTAGA 
CGCAGCGACG 
TGGGGAGGAC 
TACAGGAAAA 
CCTGCAGAGG 
CTTGGGATGT 
ACAGTCTCCA 
TGGGATACCA 




GGGACTGCTC 
GTGATTTCCC 
AGCATGCCTA 
GGAAGGATTC 
GCCTTGTGTC 
ACGTCTGCAG 
TGACATCTAC 
CCTTGCCTCC 
TGAGGGTCCT 
GTGATGAGTG 
TCCCTTCTAG 
CCCAGCCACG 
ACCTTTCTTC 
TCAGGCAGTG 
CCTGGGCCGC 



41 

I 

OGCGCCTGGG 
TGTAGCTGTC 
AGCACACAGA 
COGGAGGCCT 
TGCAGCCTGG 
TCAOCCCTAC 
CCATCCACTG 
GGGGAAGCAT 
TGTGATCCAC 
GGCACGCCCA 
AGCCAACACC 
TGACACCATC 
CCCTGGCCAG 
CTGCCAGGGT 
ATGGGGTAAC 
ATACAOQAAC 
CTCCGGACCT 
CCTCCTCTCC 
GATTCTCCCT 
AGGACTTGGG 
GCATCTCCTC 
TGTCTGGAAT 
ACTGCAGATT 
GCTGTTGGAA 
CATGCACTCA 



5520 
5580 
5640 
5700 
S760 
5820 



Seq ID NO: 530. Protein sequence 
Protein Accession #: NP_002765 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



51 
1 

CCTGCTCCCC 
TCCCCGGCTG 
GGGACCTACG 
GCAGCAGGAG 
GCAGAGGAGC 
CAAGCTGCCC 
TGGGTCCTCA 
AACCTTCGGC 
CCTGACTATG 
GCCAAACTCT 
ACCAGCTGCC 
CAGTGTGCAT 
ATCACCCAGA 
GATTCTGGGG 
ATCCCCTGTG 
TGGATCCAAA 
ACCACCCCAC 
TGCCCAGCTC 
GGTTTTACCC 
TCCTCGGTCT 
TCCCCAACCG 
CCCAGCTCCG 
TCTCACCTGT 
AGATTTAAGA 
ATAAAGAATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



1 11 21 31 41 51 

I I I i I I 

MKKLMWLSL IAAAWAEEQN KLVEGGPCDK TSHPYQAALY TSGHLLCGGV LIHPLWVLTA 
AHCKKPNLQV FLGKHNLRQR ESSQEQSSW RAVIHPDYDA ASKDQDIMLL RLARPAKLSE 
LIQPLPLERD CSANTTSCHI LGWGKTADGD PPDTIQCAYI HLVSREECEH AYPGQITQNM 
LCAGDEKYGK DSCQGDSGGP LVOGDHLRGL VSWGNIPCGS KEKPGVYTNV CRYTNWIQKT 
IQAK 

Seq ID NO: 531 DNA sequence 
Nucleic Acid Accession #: NMJ>12152 
Coding sequence: 43.. 1104 



60 
120 
180 
240 



11 



21 



31 



41 



51 



1158 



WO 03/042661 



PCT/US02/36810 



111)11 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCACA TGQACTTTTT TTATAATAGG AQCAAC&CTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTQTT GGGAOGTTTT TCTGCCTGTT TATTTTTTTT 180 

5 TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCOGAT TTCTTOGCTG GAATTGCCTA TGTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACT6CTTCC CTCACCAACT TGCTGGTTAT OGCCGTGGAG 420 

AGGCACATGT CAATCATQAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

10 CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATT6CC TCTGCAACAT CTCTGCCTGC TCTTC0CT60 CCCCCATTTA CA6CAGGAGT 600 

TAOC T T GTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAAOGTCT TGTCTC06CA TACAAGTGGG 720 

TCCATCA6CC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

15 GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TOCTOGAOGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



20 
25 



45 
50 



60 



75 



Seq ID NO: 532 Protein sequence 
Protein Accession #; NP 036284 



1 11 21 31 41 51 

I I 1 1 I 1 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 

FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNIi 120 

30 LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWA1AIFMG AVPTLGWNCL CNISACSSLA 180 

PIYSRSYLVF WTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPHTSGSISR HRTPMKLMKT 240 

VMTVLGAFW CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 300 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVOJK STS 

35 Seq ID HO: 533 DMA sequence 

Nucleic Acid Accession #: NM_002821 
Coding sequence : 15 0 .. 33 62 

Ark 1 11 21 31 41 51 

40 | | I | | I 

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGOGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAG CCCGCCGCGA TGGGAGCTGC GOGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240 

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCOCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACGGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTOGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CC TGAOGC TC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

55 CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGOCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCAGG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC A6CTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGOGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

65 TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTOGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

__. AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

70 AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

OA CCGCTGTGGC CTACATCATT GC CGTG CTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

o0 AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

1159 



WO 03/042661 



PCT/US02/36810 



GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCO GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGOGGC TCCTGGGGCT GTOOOQGGAG GCTGAGOCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

-> GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCOCQAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

1U CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT OGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

ID TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

2U AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTG GG GGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC QCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 
TGTTTTTGTT TTTACACTOG CTGCTCTCAA TAAATAAGCC TTTTTTA 
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Seq ID NO? 534 Protein sequence 
Protein Accession #: NP 002812 



1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVPIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120 

3D IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180 

KERNLTLRPA GPEHSGLYSC CAHSAPGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 240 

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSBER VTCLPPKGLP EPSVWWEHAG 360 

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FXNGTLRZNS VEVYDGTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRB KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR KHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYI IAVL 720 

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSBFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

XjDFRRELEMF gklnhanwr LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1020 
AGKARLPQPB GCPSKLYRLM QRCWALSPKD RPSFSE1ASA LGDSTVDSKP 



Seq ID NO: 535 DNA sequence 
Nucleic Acid Accession ft: NM_013952 
Coding sequence: 161.. 1357 

1 11 21 31 41 51 

I I I i I I 

TTCAGAAGGA GGAGAGACAC CGGGCCCAGG GCACCCTCGC GGGCGGGCGG ACCCAAGCAG 60 

TGAGGGCCTG CAGCCGGCCG GCCAGGGCAG CGGCAGGCGC GGCCCGGACC TACGGGAGGA 120 

AGCCCCGAGC CCTCGGCGGG CTGCGAGCGA CTCCCCGGCG ATGCCTCACA ACTCCATCAG 180 

ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG AGGGGCCTTT GTGAATGGCA GACCTCTGCC 240 

GGAAGTGGTC CGCCAGCGCA TCGTAOACCT GGCCCACCAG GGTGTAAGGC CCTGCGACAT 300 

CTCTCGCCAG CTCCGCGTCA GCCATGGCTG OGTCAGCAAG ATCCTTGGCA GGTACTACGA 360 

GACTGGCAGC ATCCGGCCTG GAGTGATAGG GGGCTCCAAG CCCAAGGTGG CCACCCCCAA 420 

GGTGGTGGAG AAGATTGGGG ACTACAAACG CCAGAACCCT ACCATGTTTG CCTGGGAGAT 480 

CCGAGACCGG CTCCTGGCTG AGGGCGTCTG TGACAATGAC ACTGTGCCCA GTGTCAGCTC 540 

CATTAATAGA ATCATCCGGA CCAAAGTGCA GCAACCATTC AACCTCCCTA TGGACAGCTG 600 

CGTGGCCACC AAGTCCCTGA GTCCCGGACA CACGCTGATC CCCAGCTCAG CTGTAACTCC 660 

CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG CTCCACCTAC TCCATCAATG GGCTCCTGGG 720 

CATCGCTCAG CCTGGCAGOG ACAAGAGGAA AATGGATGAC AGTGATCAGG ATAGCTGCCG 780 

ACTAAGCATT GACTCACAGA GCAGCAGCAG CGGACCCCGA AAGCACCTTC GCACGGATGC 840 

CTTCAGCCAG CACCACCTCG AGOOGCTCGA GTGCCCATTT GAGCGGCAGC ACTACCCAGA 900 

GGCCTATGCC TCCCCCAGCC ACACCAAAGG CGAGCAGGGC CTCTACCCGC TGCCCTTGCT 960 

CAACAGCACC CTGGACGACG GGAAGGCCAC CCTGACCCCT TCCAACACGC CACTGGGGCG 1020 

CAACCTCTCG ACTCACCAGA CCTACCCGGT GGTGGCAGCT CCGCCCTTTT GGATCTGCAG 1080 

CAAGTCGGCT CCGGGGTCCC GCCCTTCAAT GCCTTTCCCC ATGCTGCCTC CGTGTAOGGG 1140 

CAGTTCACGG GCCAGGCCCT CCTCTCAGGG CGAGAGATGG TGGGGOCCAC GCTGCCCGGA 1200 

TACCCACCCC ACATCCCCAC CAGCGGACAG GGCAGCTATG CCTCCTCTGC GATCGCAGGC 1260 

ATGGTGGCAG GAAGTGAATA CTCTGGCAAT GCCTATGGCC ACACCCCCTA CTCCTCCTAC 1320 

AGCGAGGCCT GGGGCTTCCC CAACTCCAGC TTGCTGAGTT CCCCATATTA TTACAGTTCC 1380 

ACATCAAGGC CGAGTGCACC GCCCACCACT GCCACGGCCT TTGACCATCT GTAGTTGCCA 1440 
TGGGGACAGT G 



1160 
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Seq ID NO « 536 Protein sequence 
Protein Accession *: NP_039246 
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MPHNSIRSGH 
ILGRYYETGS 
TVPSVSSINR 
SINGLLGIAQ 
ERQHYPEAYA 
PPPWICSKSA 
PPLPSQAWWQ 



11 

I 

GGLNQLGGAF 
IRPGVIGGSK 
IIRTKVQQPF 
PGSDKRKMDD 



PGSRPSMPPP 
EVNTLAMPMA 



21 
I 

VNGRPLPEW 
PKVATPKWE 
NLPMDSCVAT 
SDQDSCRLSI 
LYPLPLUJST 
MLPPCTGSSR 
TPPTPPTARP 



31 
I 

RQRIVDLAHQ 
KIGDYKRQNP 
KSLSPGHTLI 
DSQSSSSGPR 
LDDGKATLTP 
ARPSSQGBRW 
GASPTPAC 



41 
I 

GVRPCDISRQ 
TMFAWEIRDR 
PSSAVTPPES 
KHLRTDAFSQ 
SNTPLGRNLS 
WGPRCPDTHP 



51 
I 

LRVSHGCVSK 
LLAEGVCDND 
PQSDSLGSTY 
HHLEPLECPF 
THQTYPWAA 
TSPPADRAAM 



Seq ID NO: 537 DNA sequence 

Nucleic Acid Accession #i NMJ103466.1 

Coding sequence: 11.. 1363 



l 

I 

GAATTCGGCG 
AGGGGCCTTT 
GGCCCACCAG 
CGTCAGCAAG 
GGGCTCCAAG 
CCAGAACCCT 
TGACAATGAC 
GCAACCATTC 
CACGCTGATC 
CTCCACCTAC 
AATGGATGAC 
CGGACCCCGA 
GTGCCCATTT 
OGAGCAGGGC 
CCTGACCCCT 
GGTGGCAGAT 
TAGCTCCACC 
CGGGGTCCCG 
CCAGGCCCTC 
CATCCCCACC 
AAGTGAATAC 
GCGCTTCCCC 
GAGTGCACOG 



11 

1 

ATGCCTCACA 
GTGAATGGCA 
GGTGTAAGGC 
ATCCTTGGCA 
CCCAAGGTGG 
ACCATGTTTG 
ACTGTGCCCA 
AACCTCCCTA 
CCCAGCTCAG 
TCCATCAATG 
AGTGATCAGQ 
AAGCACCTTC 
GAGCGGCAGC 
CTCTACCCGC 
TCCAACACGC 
CCTCACTCAC 
CCTTCCTCTT 
CCCTTCAATG 
CTCTCAGGGC 
AGCGGACAGG 
TCTGGCAATG 
AACTOCAGCT 
CCCACCACTG 



21 
I 

ACTCCATCAG 
GACCTCTGCC 
CCTGCGACAT 
GGTACTACGA 
CCACCCCCAA 
CCTGGGAGAT 
GTGTCAGCTC 
TGGACAGCTG 
CTGTAACTCC 
GGCTCCTGGG 
ATAGCTGCCG 
GCACGGATGC 
ACTACCCAGA 
TGCCCTTGCT 
CACTGGGGCG 
CCTTCGCCAT 
TATCTAGCTC 
CCTTTCCCCA 
GAGAGATGGT 
GCAGCTATGC 
CCTATGGCCA 
TGCTGAGTTC 
CCACGGCCTT 



31 
I 

ATCTGGCCAT 
GGAAGTGGTC 
CTCTCGCCAG 
GACTGGCAGC 
GGTGGTGGAG 
CCGAGACCGG 
CATTAATAGA 
OGTQGCCACC 
CCCGGAGTCA 
CATCGCTCAG 
ACTAAGCATT 
CTTCAGCCAG 
GGCCTATGCC 
CAACAGCACC 
CAACCTCTOG 
AAAGCAGGAA 
CGCCTTTTTG 
TGCTGCCTCC 
GGGGCCCACG 
CTCCTCTGCC 
CACCCCCTAC 
CCCATATTAT 
TGACCATCTG 



41 
I 

GGAGGGCTGA 
CGCCAGCGCA 
CTCCGCGTCA 
ATCCGGCCTG 
AAGATTGGGG 
CTCCTGGCTG 
ATCATCCGGA 
AAGTCCCTGA 
CCCCAGTCGG 
CCTGGCAGOG 
GACTCACAGA 
CACCACCTCO 
TCCCCCAGCC 
CTGGAOGACG 
ACTCAOCAGA 
ACCCCCGAGG 
GATCTGCAGC 
GTGTACGGGC 
CTGCCCGGAT 
ATCGCAGGCA 
TCCTCCTACA 
TACAGTTCCA 
TAGTTGAAGC 



51 
! 

ACCAGCTGGG 
TCGTAGACCT 
GCCATGGTTG 
GAGTGATAGG 
ACTACAAACG 
AGGGCGTCTG 
CCAAAGTGCA 
GTCCCGGACA 
ATTCCCTGGG 
ACAAGAGGAA 
GCAGCAGCAG 
AGCCGCTCGA 
ACACCAAAGG 
GGAAGGCCAC 
CCTACCCCGT 
TGTCCAGTTC 
AAGTOGGCTC 
AGTTCACGGG 
ACCCACCCCA 
TGGTGGCAGG 
GCGAGGCCTG 
CATCAAGGOC 
TT 



Seq ID NO: 538 Protein sequende 
Protein Accession #: NP 0034S7 



ILGRYYETGS 
TVPSVSSINR 
SINGLLGIAQ 
ERQHYPEAYA 
PHSPPAIKQE 
LSGREMVGPT 
NSSLLSSPYY 



11 

i 

GGLNQLGGAF 
IRPGVIGGSK 
IIRTKVQQPF 
PGSDKRKMDD 
SPSHTKGEQG 
TPEVSSSSST 
LPGYPPHIPT 
YSSTSRPSAP 



21 
I 

VNGRPLPEW 
PKVATPKWE 
NLPMDSCVAT 
SDQDSCRLSI 
LYPLPLLNST 
PSSLSSSAFL 
SGQGSYASSA 
PTTATAFDHL 



31 

I 

RQRIVDLAHQ 
KIGDYKRQNP 
KSLSPGHTLI 



LDDGKATLTP 
DLQQVGSGVP 
IAGMVAGSEY 



41 

i 

GVRPCDISRQ 
TMFAWEIRDR 
PSSAVTPPES 
KHLRTDAFSQ 
SNTPLGRNLS 
PFNAFPHAAS 
SGNAYGHTFY 



51 
I 

LRVSHGCVSK 
LLAEGVCDND 
PQSDSLGSTY 
HHLEPLECPF 
THQTYPWAD 
VYGQFTGQAL 
SSYSEAWRFP 



Seq ID NO: 539 DNA sequence 
Nucleic Acid Accession ft : NM_006799 
Coding sequence: 19.. 963 



GCCGCGGGAG 
CGGGCTGGAC 
CGACGGGTCA 
TGGCAGGGGA 
CGCTGGGCAC 
GGGTGGATGG 
TACTACACCC 
CCCTATGACA 
CCCATCTGTC 
GGCTGGGGGT 
CAGGTCGCCA 
AAGGACATCT 
TTCGGTGACT 
GTCGTGAGCT 
AGCCACCACT 
CCCTCCTGGC 
TGAGCCTACC 
IGTCTTOTTT 



11 
I 

AGGAGGCCAT 
TCAGGAAGCC 
TCACGTCGCG 
GCCTGCGCCT 
TCACGGOGGC 
TCCAGTTTGG 
GTTACTTCGT 
TTGCCTTGGT 
TCCAGGCCTC 
ACATCAAAGA 
TCATAAACAA 
TTGGAGACAT 
CAGGTGGACC 
GGGGAGTGGG 
TTGAGTGGAT 
CGCTACTCTT 
TGAGCCCATG 
GGTAATAAAC 



21 
I 

GGGCGCGCGC 
GGAGTOGGAG 
CATCGTGGGT 
GTGGGATTCC 
GCACTGCTTT 
CCAGCTQACT 
ATCGAATATC 
GAAGCTGTCT 
CACATTTGAG 
GGATGAGGCA 
CTCTATGTGC 
GGTTTGTGCT 
CTTGGCCTGT 
CTGTGGTCGG 
CCAGAAGCTG 
TTTCCCTCTT 
CAGCCTGGGG 
ACATTCCAGT 



31 



GAGGGGGCGC 
GGAGAGGACG 
CACGTATGCG 
GAAACCTATA 
TCCATGCCAT 
TATCTGAGCC 
GCACCTGTCA 
TTTGAGAACC 
CTGCCATCTC 
AACCACCTCT 
GGCAATGCCC 
AACAAGAATG 
CCCAATCGGC 
ATGGCCCAGA 
CTCTGGGCTC 
CCACTGCCAA 
TGATGCCTTG 



41 

I 

TGCTGGCGCT 
CGTTATCAGG 
CCGAACTCGG 
GAGTGAGCCT 
GTGACCTTAG 
CCTTCTGGAG 
CTCGCTACCT 
CCTACACTAA 
GGACAGACTG 
CCCACACCCT 
TCCTCAAGTA 
AAGGCGGGAA 
GACTGTGGTA 
COGGTGTCTA 
GTGGCATGTC 
TCCCACTCCT 
GTCAGGCCCT 
CAGGGCATTC 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 
360 
420 



SI 
I 

GCTGCTGGCT 
ACCATGCGGC 
GCGTTGGCCG 
GCTCAGCCAC 
TGATCCCTCC 
CCTGCAGGCC 
GGGGAATTCA 
ACACATCCAG 
CTGGGTGACT 
CCAGGAAGTT 
CAGTTTCCGC 
GGATGCCTGC 
TCAGATTGGA 
CACCAATATC 
CCAGCCAGAC 
GGGGCCGGTC 
GGTTCTCTTC 
TTCAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 540 Protein sequence 
Protein Accession #: NP_006790 



1161 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

I I I I I I 

MGARGALLIA LLLARAGLRK PESQEAAPLS GPCGRRVITS RIVGGEDAEL GRWPWQGSLR 60 
5 LWDSHVCGVS LLSHRWALTA AHCFBTYSDL SDPSGWMVQF GQLTSMPSFW SLQAYYTRYF 120 
VSNIYLSPRY LGNSPYDIAL VKLSAPVTYT KHIQPICLQA STPEPENRTD CWVTGWGYIK 180 
EDEALPSPHT LQEVQVAIIN NSMCNHLFLK YSFRKDIFGD MVCAGNAQGG KDACPGDSOG 240 
PLAOJKNGLH YQIGWSWGV GCGRPNRPGV YTNISHHFEW IQKLMAQSGM SQPDPSWPLL 300 
PFPLLWALPL LGPV 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Seq ID NO: 541 DNA sequence 
Nucleic Acid Accession #: NMJ514344 
Coding sequence: 131.. 1444 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCGA TGGGGCGGAA GCGCCCGAAG CCCCGGAGCC CACAAACTGC CGGGCGCGCC 60 

TCGCCGCCGG GACCCGGGTG CCTGGGCTCG GCTTGAAGCG GCGGCGGCGC ACCGGCACAG 120 

CCGCGGGAGC ATGGGCAGGA GGATGCGGGG CGCCGCCGCC ACCGCGGGGC TCTGGCTGCT 180 

GGCGCTGGGC TOGCTGCTGG OGCTGTGGGG AGGGCTCCTG CCGCCGCGGA CCGAGCTGCC 240 

CGCCTCCCGG CCGCCCGAAG ACCGACTCCC ACGGCGCCCG GCCCGGAGCG GCGGCCCCGC 300 

GCCCGCGCCT CGCTTCCCTC TGCCCCCGCC CCTGGOGTGG GACGCCCGCG GCGGCTCCCT 360 

GAAAACTTTC CGGGCGCTGC TCACCCTGGC GGCCGGCGCG GACGGCCCGC CCCGGCAGTC 420 

CCGGAGCGAG CCCAGGTGGC ACGTGTCAGC CAGGCAGCCC CGGCCGGAGG AGAGCGCCGC 480 

GGTGCAOGGG GGCGTCTTCT GGAGCCGCGG CCTGGAGGAG CAGGTGCCCC CGGGCTTTTC 540 

GGAGGCCCAG GCGGCGGCGT GGCTGGAGGC GGCTCGCGGC GCCOGGATGG TGGCCCTGGA 600 

GCGCGGGGGT TGCGGGCGCA GCTCCAACCG ACTGGCCCGT TTTGCCGACG GCACCCGCGC 660 

CTGCGTGCGC TACGGCATCA ACCCGGAGCA GATTCAGGGC GAGGCCCTGT CTTACTATCT 720 

GGCGCGOCTG CTGGGCCTCC AGCGCCAOGT GCCGCCGCTG GCACTGGCTC GGGTGGAGGC 780 

TCGGGGCGCG GAGTGGGOGC AGGTGCAGGA GGAGCTGCGC GCTGCGCACT GGACCGAGGG 840 

CAGCGTGGTG AGCCTGACAC GCTGGCTGCC CAACCTCACG GACGTGGTGG TGCCOGCGCC 900 

CTGGCGCTCG GAGGACGGCC GTCTGCGCCC CCTCCGGGAT GCCGGGGGTG AGCTGGCCAA 960 

CCTCAGCCAG GCGGAGCTGG TGGACCTAGT ACAATGGACC GACTTAATCC TTTTCGACTA 1020 

CCTGACGGCC AACTTCGACC GGCTCGTAAG CAACCTCTTC AGCCTGCAGT GGGACCCGCG 1080 

CGTCATGCAG CGTOCCACCA GCAACCTGCA CCGCGGTCCG GGCGGGGCGC TGGTCTTTCT 1140 

GGACAATGAG GCGGGCTTGG TGCACGGCTA COGGGTAGCA GGCATGTGGG ACAAGTATAA 1200 

CGAGCCGCTG TTGCAGTCAG TGTGCGTGTT CCGCGAGCGG ACCGCGCGGC GCGTCCTGGA 1260 

GCTGCACCGC GGACAGGACG CCGCGGCCCG GCTGCTGCGC CTCTACCGGC GCCACGAGCC 1320 

TCGCTTCCCC GAGCTGGCCG CCCTTGCAGA CCCCCACGCT CAGCTGCTAC AGCGCCGCCT 1380 

CGACTTCCTC GCCAAGCACA TTTTGCACTG TAAGGCCAAG TACGGCCGCC GGTCTGGGAC 1440 

TTAGTGTCAC CGGGAGGAAA AGAGAGAGAT CTGGGGCTGG GGTATGGATG ATGGGGGGAA 1500 

GGGCGGTCGC CTCTGCCACT GTCAGGGACC AGCCGGCCAA CGCCCACCCG CAAAGGTGTC 1560 

TAAAAACTTC AGCTTTTCAC CCACCTGCCC CTTTCTTTCA ATCCCACGCT GTTTCCTTTC 1620 

AAAGTTCTGG GAGGACGAAC TCACCGAGGC GAGAAGTGTA ACATTCTCTC CACCCAGCTT 1680 

ATAAAAGGAT TCTTTACTGT GCCAGCACGG GGATTGGATC CGAAGAAACT GGCTACTGGG 1740 

GTTTGGCCCC CGAGTGGCCG TCCCTGTGGG AGATGCACCC CATTCTTGGG CCCCCCTCAT 1800 

TCCCTTTCCG AAAAAGGAAA ACTTGCGTTT GAGCCGTTGA GCTAATTCTG CAATTTTCTA 1860 

CCAAACAGAG CGCTGGTGGC CCCGGAGCAG GGCTGTGACA TTGGCTGGTG GAGCCCCTTC 1920 

CTGTGTTCTC CCTTTGTTCC AGCGCCGOGA TGGTGAGATC ACTGTTCCAA GCAGGGGGAC 1980 

GGCTCGCGAT AGGACAAAGA GAGCAGGACC TCCAGACTCT GGGGAGCCCT GCAGACCTTG 2040 

ACAATTTGCC TGACTCATTC CTGACCTCTT GTCATTTTGG CCTGAAGGCT ACAAATTCAG 2100 

GGTCAGCTGT ATGCACTAAG TCAAATAATG AATTTCTTCC TCCCTCTCGC AACCGACCAA 2160 

AATTTTGACA ACGATGATGT TCACCAGAAG GAAAAAAAAA TCAGTTTTAT GCACTTTATT 2220 

TTGTTTTGAT TTTCATTTTT TATTAAGAAA AAATTTTATT TTACAGAATT TACCTTCTCT 2280 

GTATATATGT GCATAAAGTG TGGTGTAAAT ATACTAAACA AACTTATATT TCAATAAAAG 2340 
GGAGTTTAAA ATTTAAAAAA AAAAAAA 

Seq ID NO: 542 Protein sequence 
Protein Accession ft: NP_055159 

1 11 21 31 41 51 

I I I I I I 

MGRRMRGAAA TAGLWLLALG SLLALWGGLL PPRTELPASR PPEDRLPRRP ARSGGPAPAP 60 

RFPLPPPLAW DARGGSLKTP RALLTLAAGA DGPPRQSRSE PRWHVSARQP RPBESAAVHG 120 

GVFWSRGIiEE OVPPGFSEAQ AAAWLBAARG ARMVALERGG CGRSSNRLAR FADGTRACVR 180 

YGINPBQIQG EALSYYLARL LGLQRHVPPL ALARVEARGA QWAQVQEELR AABWTEGSW 240 

SLTRWLPNLT DVWPAPWRS EDGRLRPLRD AGGELANLSQ AELVDLVQWT DLILFDYLTA 300 

NFDRLVSNLF SLQWDPRVMQ RATSNLHRGP GGALVFLDNE AGLVHGYRVA GMWDKYNBPL 360 

LQSVCVFRBR TARRVLELHR GQDAAARIiLR LYRRHEPRFP BLAAIADPHA QLLQRRLDFL 420 
AKHILHCKAK YGRRSGT 

Seq ID NO: 543 DNA sequence 

Nucleic Acid Accession ft: XM_0076S2.4 

Coding sequence: 1..1290 



1 11 21 31 41 51 

1 I I I I I 

ATGGCCGGCT CTGGCGCGTG GAAGOGCCTC AAATCTATGC TAAGGAAGGA TGATGCGCCG 60 

QA CTGTTTTTAA ATGACACCAG CGCCTTTGAC TTCTCGGATG AGGCGGGGGA CGAGGGGCTT 120 

oU TCTCGGTTCA ACAAACTTOG AGTTGTGGTG GCCGATGAOG GTTCCGAAGC CCOGGAAAGG 180 

CCTGTTAACG GGGCGCACCC GACCCTCCAG GCCGACGATG ATTCCTTACT GGACCAAGAC 240 

TTACCTTTGA CCAACAGTCA GCTGAGTTTG AAGGTGGACT CCTGTGACAA CTGCAGCAAA 300 

CAGAGAGAGA TACTGAAGCA GAQAAAGGTG AAAGCCAGGT TGACCATTGC TGCCGTTCTG 360 

TACTTGCTTT TCATGATTGG AGAACTTGTA GGTGGATACA TTGCAAATAG CCTAGCAATC 420 

1162 



WO 03/042661 PCT/US02/36810 



5 

10 
15 
20 



30 
35 
40 
45 
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70 
75 
80 



ATGACAGATG CACTTCATAT GTTAACTGAC CTAAGOGCCA TCATACTCAC CCTGCTTGCT 480 

TTGTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TOGCTTAGAG 540 

GTTTTGTCAG CTATGATTAG TGTGCTGTTG GTGTATATAC TTATGGGATT CCTCTTATAT 600 

GAAGCTGTGC AAAGAACTAT CCATATGAAC TATGAAATAA ATGGAGATAT AATGCTCATC 660 

ACCGCAGCTG TTGGAGTTGC AGTTAATGTA ATAATGGGGT TTCTGTTGAA CCAGTCTGGT 720 

CACOGTCACT CCCATTCCCA CTCCCTGCCT TCAAATTCCC CTACCAGAGG TTCTGGGTGT 780 

GAACGTAACC ATGGGCAGGA TAGCCTGGCA GTGAGAGCTG CATTTGTACA TGCTTTGGGA 840 

GATTTGGTAC AGAGTGTTGG TGTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 90 0 

TACAAGATTG CTGATCCCAT CTGTACATAC GTATTTTCAT TACTTGTGGC TTTTACAACA 960 

TTTCGAATCA TATGGGATAC AGTAGTTATA ATACTAGAAG GTGTGCCAAG CCATTTGAAT 1020 

GTAGACTATA TCAAAGAAGC CTTGATGAAA ATAGAAGATG TATATTCAGT CGAAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTGCCATAG TTCACATACA GCTAATTCCT 1140 

GGAAGTTCAT CTAAATGGGA GGAAGTACAG TCCAAAGCAA ACCATTTATT ATTGAACACA 12 00 

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAGTTACA GGCAAGAAGT GGACAGAACT 1260 
TGTGCAAATT GTGAGAGTTC TAGTCCCTGA 

Seq ID NO: 544 Protein sequence 
Protein Accession #: XP 007652.1 



1 11 21 31 41 51 

I I I I I I 

MAGSGAWKRL KSMLRKDDAP LFLNDTSAFD FSDEAGDEGL SRFNKLRVW ADDGSEAPER 60 

PVNGAHPTLQ ADDDSLLDQD LPLTNSQLSb KVDSCDNCSK QREILKQRKV KARLTIAAVL 120 

YLLFMIGELV GGYIANSLAI MTDALHMLTD LSAIILTLLA LWLSSKSPTK RFTFGFHRLE 180 

25 VLSAMISVLL VYILMGFLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IMGFLLNQSG 240 

HRHSHSHSLP SNSPTRGSGC ERNHGQDSIA VRAAFVHALG DLVQSVGVLI AAYIIRFKPE 300 

YKIADPICTY VPSLLVAFTT FRIIWDTWI ILEGVPSHLN VDYIKEALMK IEDVYSVEDL 360 

NIWSLTSGKS TAIVHIQLIP GSSSKWEEVQ SKANHLLLNT FGMYRCTIQL QSYRQEVDRT 420 
CANCQSSSP 



Seq ID NO: 545 DNA sequence 
Nucleic Acid Accession #; AB037765.1 
Coding sequence : 1 . . 2478 

1 11 21 31 41 51 

I I I I I I 

ATGTTTTCCG GCTTCAATGT CTTTAGAGTT GGGATCTCTT TTGTCATAAT GTGCATTTTT 60 

TACATGCCAA CAGTAAACTC TTTACCAGAA CTGAGTCCTC AGAAATATTT TAGTACATTG 120 

CAACCAGGAA AAGCCTCTTT AGCTTATTTT TGTCAAGCTG ATTCCCCAAG AACATCTGTA 180 

TTTCTTGAAG AACTGAATGA GGCTGTTAGA CCTCTGCAGG ACTATGGAAT TTCAGTTGCC 240 

AAGGTTAATT GTGTCAAAGA AGAAATATCA AGATACTGTG GAAAAGAAAA GGATTTGATG 300 

AAAGCATATT TATTCAAGGG CAACATATTG CTCAGAGAAT TCCCTACTGA CACCTTGTTT 360 

GATGTGAATQ CCATTGTCGC CCATGTTCTC TTTGCTCTTC TTTTTAGTGA AGTGAAATAT 420 

ATTACCAACC TGGAAQACCT TCAGAACATA GAAAATGCTC TGAAAGGAAA AGCAAATATT 480 

ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA GAGCAGTCAT GGAAGCCGCT 540 

TTTGTGTATG GGACTACATA CCAATTTGTC TTAACCACAG AAATTGCCCT TTTGQAAAGT 600 

ATTGGCTCXG AGGATGTGGA ATATGCACAT CTCTACTTTT TTCATTGTAA ACTAGTCTTG 660 

GACTTGACCC AGCAATGTAG AAGAACACTA ATGGAACAGC CATTGACTAC ACTGAACATT 720 

CACCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG AAGTTGCTGA AGATCCTCAA 780 

CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGG TTTTTATTGT TAGCCAACAG 840 

GCTACTTATG AAGCTGATAG AAGAACTGCA GAATGGGTTG CTTGGCGTCT TCTGGGAAAA 900 

GCAGGAGTTC TACTCTTGTT AAGGGACTCT TTGGAAGTGA ACATTCCTCA AGATGCTAAT 960 

GTGGTCTTCA AAAGAGCAGA AGAGGGAGTT CCAGTGGAAT TTTTGGTATT ACATGATGTT 1020 

GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG AGGAAATACA AGAAGATOAA 1080 

GACAATGACA TGGAAGGTCC AGATATAGAT GTTCAGGATG ATGAAGTGGC AGAAACTGTT 1140 

TTCAGAGATA GGAAGAGAAA ATTACCTTTG GAACTTACAG TGGAACTAAC AGAAGAAACA 1200 

TTTAATGCAA CAGTGATGGC TTCTGACAGC ATAGTACTCT TCTATGCTGO TTGGCAAGCA 1260 

GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAG TTAAACTGAA AGGCACATCT 1320 

ACTATGCTTC TTACTAGAAT AAACTGTGCA GATTGGTCTG ATGTATGTAC TAAGCAAAAT 1380 

GTTACTGAAT TTCCTATCAT AAAGATGTAC AAGAAAGGCG AGAACCCAGT ATCTTATGCT 1440 

GGAATGTTAG GAACCGAAGA TCTCCTAAAA TTTATCCAGC TCAACAGGAT TTCATATCCA 1500 

GTGAATATAA CATCGATCCA AGAAGCAGAA GAATATTTAA GTGGGGAATT ATATAAAGAC 1560 

CTCATCTTGT ATTCTAGTGT GTCAGTATTG GGACTATTTA GTCCAACCAT GAAAACAGCA 1620 

AAAGAAGATT TTAGTGAAGC AGGAAACTAC CTAAAAGGAT ATGTTATCAC TGGAATTTAT 1680 

TCTGAAGAAG ATGTTTTGCT ACTGTCAACC AAATATGCTG CAAGTCTTCC AGCCCTGCTG 1740 

CTTGCCAGAC ACACAGAAGG CAAAATAGAG AGCATCCCAC TAGCTAGCAC ACATGCACAA 1800 

GACATAGTTC AAATAATAAC AGATGCACTA CTGGAAATGT TTCOSGAAAT CACTGTGGAA 1860 

AATCTTCCCA GTTATTTCAG ACTTCAGAAA CCATTATTGA TTTTGTTCAG TGATGGCACT 1920 

GTAAATCCTC AGTATAAAAA AGCAATATTG ACACTGGTAA AGCAGAAATA CTTGGATTCA 1980 

TTTACTCCAT GCTGGTTAAA TCTAAAGAAT ACTCCAGTGG GGAGAGGAAT CTTGAGGGCA 2040 

TATTTTGATC CTCTGCCTCC CCTTCCTCTT CTTGTTTTGG TGAATCTGCA TTCAGGTGGC 2100 

CAAGTATTTG CATTTCCTTC AGACCAGGCT ATAATTGAAG AAAACCTTGT ATTGTGGCTG 2160 

AAGAAATTAG AAGCAGGACT AGAAAATCAT ATCACAATTT TACCTGCTCA AGAATGGAAA 2220 

CCTCCTCTTC CAGCTTATQA TTTTCTAAGT ATGATAGATG CCGCAACATC TCAACGTGGC 2280 

ACTAGGAAAG TTCCCAAGTG TATGAAAGAA ACAGATGTGC AGGAGAATGA TAAGGAACAA 2340 

CATGAAGATA AATCGGCAGT CAQAAAAGAA CCGATTGAAA CTCTGAGAAT AAAGCATTGG 2400 

AATAGAAGTA ATTGGTTTAA AGAAGCAGAA AAATCATTTA GACGTGATAA AGAGTTAGGA 2460 
TGCTCAAAAG TGAACTAA 

Seq ID NO: 546 Protein sequence 
Protein Accession fl: BAA92SB2.1 

1 11 21 31 41 51 

I I I I I I 

1163 



WO 03/042661 



PC17US02/36810 



MFSGFNVFRV GISPVIMCIP YMPTVNSLPB LSPQKYFSTL QPGKASLAYF CQADSPRTSV 60 

FLEELNEAVR PLQDYGISVA KVNCVKEEIS RYCGKBKDLM KAYLFKGNIL LRBFPTDTLF 120 

DVNAIVAHVL FALLFSEVKY ITNLEDLQNI ENAIiKGKANI IFSYVRAIGI PEHRAVMEAA 180 

FVYGTTYQFV LTTEIAIAES IGSEDVEYAH LYFFHCKLVL DLTQQCRRTL MEQPLTTLNI 240 

J HLFIKTMKAP LLTEVAEDPQ QVSTVHLQLG LPLVFIVSQQ ATYEADRRTA EWVAWRLLGK 300 

AGVLLLLRDS LEVNIPQDAN WFKRAEEGV PVEFLVLHDV DLIISHVENN MHIEEIQEDE 360 

DNDMEGPDID VQDDEVAETV FRDRKRKLPL ELTVELTEET FNATVMASDS IVLFYAGWQA 420 

VSMAFLQSYI DVAVKLKGTS TMLLTRINCA DWSDVCTKQN VTEFPIIKMY KKGENPVSYA 480 

GMLGTEDLLK PIQLNRISYP VNITSIQEAE EYLSGELYKD LILYSSVSVL GLFSPTMKTA 540 

KEDFSEAGNY LKGYVITGIY SEEDVLLLST KYAASLPALL LARHTEGKIE SIPLASTHAQ 600 

DIVQIITDAL LEMFPEITVE NLPSYFRLQK PLLILFSDGT VNPQYKKAIL TLVKQKYLDS 660 

FTPCWLNLKN TPVGRGILRA YPDPLPPLPL LVLVNLHSGG QVFAFPSDQA IIEENLVLWL 720 

KKLEAGLENH ITILPAQEWK PPLPAYDFLS MIDAATSQRG TRKVPKCMKE TDVQENDKEQ 780 
HEDKSAVRKE PIETLRIKHW NRSNWFKEAE KSFRRDKELG CSKVN 
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Seq ID NO: 547 DNA sequence 
Nucleic Acid Accession #: NM_033102.l 
Coding sequence : 1 . . 1662 

1 11 21 31 41 51 

I 1 I I I I 

ATGGTCCAGA GGCTGTGGGT GAGCCGCCTG CTGCGGCACC GGAAAGCCCA GCTCTTGCTG 60 

GTCAAOCTGC TAACCTTTGG CCTGGAGGTG TGTTTGGCCG CAGGCATCAC CTATGTGCCG 120 

CCTCTGCTGC TGGAAGTGGG GGTAGAGGAG AAGTTCATGA CCATGGTGCT GGGCATTGGT 180 

CCAGTGCTGG GCCTGGTCTG TGTCCCGCTC CTAGGCTCAG CCAGTGACCA CTGGGGTGGA 240 

CGCTATGGCC GCCGCCGGCC CTTCATCTGG GCACTGTCCT TGGGCATCCT GCTGAGCCTC 300 

TTTCTCATCC CAAGGGCCGG CTGGCTAGCA GGGCTGCTGT GCCCGGATCC CAGGCCCCTG 360 

GAGCTGGCAC TGCTCATCCT GGGCGTGGGG CTGCTGGACT TCTGTGGCCA GGTGTGCTTC 420 

ACTCCACTGG AGGCCCTGCT CTCTGACCTC TTCCGGGACC CGGACCACTG TCGCCAGGCC 480 

TACTCTGTCT ATGCCTTCAT GATCAGTCTT GGGGGCTGCC TGGGCTACCT CX^GCCTGCC 540 

ATTGACTGGG ACACCAOTGC CCTGGCCCCC TACCTGGGCA CCCAGGAGGA GTGCCTCTTT 600 

GGCCTGCTCA CCCTCATCTT CCTCACCTGC GTAGCAGCCA CACTGCTGGT GGCTGAGGAG 660 

GCAGOGCTGG GCCCCACCGA GCCAGCAGAA GGGCTGTCGG CCCCCTCCTT GTCGCCCCAC 720 

TGCTGTCCAT GCCGGGCCCG CTTGGCTTTC CGGAACCTGG GCGCCCTGCT TCCCCGGCTG 780 

CACCAGCTGT GCTGCCGCAT GCCCCGCACC CTGCGCCGGC TCTTCGTGGC TGAGCTGTGC 840 

AGCTGGATGG CACTCATGAC CTTCACGCTG TTTTACACGG ATTTCGTGGG CGAGGGGCTG 900 

TACCAGGGCG TGCCCAGAGC TGAGCCGGGC ACCGAGGCCC GGAGACACTA TGATGAAGGC 960 

GTTCGGATGG GCAGCCTGGG GCTGTTCCTG CAGTGOGCCA TCTCCCTGGT CTTCTCTCTG 1020 

GTCATGGACC GGCTGGTGCA GCGATTOGGC ACTCGAGCAG TCTATTTGGC CAGTGTGGCA 1080 

GCTTTCCCTG TGGCTGCOGG TGCCACATGC CTGTCCCACA GTGTGGCCGT GGTGAGAGCT 1140 

TCAGCCGCCC TCACCGGGTT CACCTTCTCA GCCCTGCAGA TCCTGCCCTA CACACTGGCC 1200 

TCCCTCTACC ACCGGGAGAA GCAGGTGTTC CTGCCCAAAT ACCGAGGGGA CACTGGAGGT 1260 

GCTAGCAGTG AGGACAGCCT GATGACCAGC TTCCTGCCAG GCCCTAAGCC TGGAGCTCCC 1320 

TTCCCTAATG GACACGTGGG TGCTGGAGGC AGTGGCCTGC TCCCACCTCC ACCCGCGCTC 1380 

TGCGGGGCCT CTGCCTGTGA TGTCTCCGTA CGTGTGGTGG TGGGTGAGCC CACCGAGGCC 1440 

AGGGTGGTTC CGGGCGGGGG CATCTGOCTG GACCTCGCCA TCCTGGATAG TGCCTTCCTG 1500 

CTGTCCCAGG TGGCCCCATC CCTGTTTATG GGCTCCATTG TCCAGCTCAG CCAGTCTGTC 1560 

ACTGCCTATA TGGTGTCTGC CGCAGGCCTG GGTCTGGTOG CCATTTACTT TGCTACACAG 1620 
GTAGTATTTG ACAAGAGCGA CTTGGCCAAA TACTCAGCGT GA 

Seq ID NO: 548 Protein sequence 
Protein Accession #: NP_149093.1 



1 11 21 31 41 51 

55 | | | | i | 

MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 60 

PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 120 

BLALLILGVG LLDFCGQVCF TPLEALLSDL PRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180 

IDWDTSALAP YLGTQEECLF GLLTLIPLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 240 

CCPCRARLAF RNLGALLPRL EQLCCRMPRT LRRLFVAELC SWMALMTFTL FYTDFVGEGL 300 

YQGVPRABPG TBARRHYDEG VRMGSLGIiFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360 

AFPVAAGATC LSHSVAWTA SAALTGPTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RVWGEPTEA 480 

RWPGRGICL DLAILDSAFL LSQVAPSLPM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540 
WFDKSDLAK YSA 



Seq ID NO: 549 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389 

1 U 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180 

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACGGA Crm -CCCTT 240 

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 

AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360 

ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420 

ATCCCAGGAG TTGATCCTGA AAAGGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTOC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720 
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TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780 

GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840 

TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900 

AGATTTTGTT ATGGTGTCAC TGTCATTTTG AGATACCCTA TGGAATGCTT TGTGACAAGA 960 

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080 

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200 

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260 

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320 

AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380 
TTTCAATGA 



Seq ID NO : 550 Protein sequence 
I J Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I i 

MGYQRQEPVI PPQRDLDDRE TLVSEHEYKE KTCQSAALFN WNSIIGSGI IGLPYSMKQA 60 

GFPLGILLLP WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPP 120 

IAMISYNIIA GDTLSKVFQR IPGVDPENVP IGRHFIIGLS TVTFTLPLSL YRNIAKLGKV 180 

SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA PICHHNSFLV 240 

YSSLEEPTVA KWSRLIHMSI VISVFICIPP ATCGYLTFTG ETQGDLPENY CRNDDLVTFG 300 

RFCYGVTVIL TYPMECPVTR EVIANVFFGG NLSSVFHIW TVMVITVATL VSLLIDCLGI 360 

VLELNGVLCA TPLIFIIPSA CYLKLSBEPR THSDKIMSCV MLPIGAWMV FGPVMAITNT 420 
QDCTHGQEMP YCFPDNFSLT NTSESHVQQT TQLSTLNISI FQ 



Seq ID NO: 551 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
30 Coding sequence: 1..1284 



1 11 21 31 41 51 

I I I I 1 I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

AAGCAAGCTG GGTTTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 240 

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 420 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 

TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 660 

ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 720 

TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 

ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 

GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTGCAC 900 

ATTGTTGTAA CAGTGATGGT CATCACTGTA GCGACGCTTG TGTCATTGCT GATTGATTGC 960 

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 1080 

TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTOGT CATGGCTATT 1140 

ACAAATACTC AAGACTGCAC CCATGGGCAQ GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGAGAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTCGA GTAA 

Seq ID NO: 5S2 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS 60 

LVNKTFGFPG YLLLSVLQFL YPPIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120 

GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAHVFAK 180 

PNAIQAVGVM SFAFICHHNS PLVYSSLEEP TVAKKSRLIH MSIVISVPIC IFFATCGYLT 240 

PTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECP VTREVIANVF FGGNLSSVFH 300 

IWTVMVITV ATLVSLLIDC LGIVLELNGV LCATPLIPII PSACYLKLSE BPRTHSDKIM 360 

SCVMLPIGAV VMVFGPVMAI TNTQDCTHGQ EMPYCPPDNP SLTNTSESHV QQTTQLSTLN 420 
ISIFQLE 

Seq ID NO: 553 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 

1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTCCCTTGT TTTATTGATA 60 

AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 

AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 

GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 300 

ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 360 

ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 420 

CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 
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GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCOGC CTTATCCATA TGTCCATOGT GATTTCTGTA 600 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660 

GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

J GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCAOGCTTGT GTCATTGCTG ATTGATTGCC TOGGGATAGT TCTAGAACTC 900 

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

10 GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCAGC 1080 

CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 
TAA 

15 Seq ID NO i 554 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

on I I 1 I I I 

ZU MGYQRQBPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLDSVL QFLYPFIAMI 60 

SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTP TLPLSLYRNI AKLQKVSLIS 120 

TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL 180 

EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240 

GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIWTVMV ITVATLVSLL IDCLGIVLEL 300 

25 NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAWMVFGFV MAITNTQDCT 360 
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 

Seq ID NO: 555 DNA sequence 
Nucleic Acid Accession £: Eos sequence 
30 Coding sequence: 1..1140 

1 11 21 31 41 51 

I I I 1 I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAAOG TGTTTATTGG TOGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 

GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATOGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 

Seq ID NO: 556 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MGYQRQBPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 60 

PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGXV MARAISLGPH 120 

IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKHSRL IHMSIVISVF 180 

ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240 

VFFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL 300 

SEEPRTHSDK IMSCVMLPIG AWMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360 
HVQQTTQLST LNISIPQLE 

Seq ID NO: SS7 DNA sequence 
Nucleic Acid Accession 8: XMJJ57188.1 
Coding sequence: 769.. 4269 

1 11 21 31 41 51 

I I I 1 I I 

ATGGGATTGC CTCTCCCTCT CACTCTGGGC TTCTGTCCCA CTCTTATCTT AGTGTCAGTC 60 

CTCCCCCAAG TCTGTGTCCC TCTCTCTCCC CTAAATCTCT GGCCCCTCCT TTCTGAGTTC 120 

CTGCCCTTCC CCCAATTCTT TGGTTTTTGC ATCCCCCTCT GCCCCTTGCC TCAGTCAAGT 180 

CTCCCCTGTG TGTCTCTCTC CCCCCGGCCC GGACCTCTGC ACCCCCCAGG TCGCTGTCCC 240 

TCTGTCCCCT TATCGCGGCC TGGGACCCGC CCTCTCCCCG CCTCCCGCTT TGGCGTCTCC 300 

AAGACTCCCC GCCCCCCAGA CCTOGCCCCG CCCCAGGCTA GGCTGGAAAG TGGAGGATCC 360 

GGTTTGCTCT GGGGGGGTCT GGAAGCAGAG CCGGCGGAGG GAGCGCCGGG GCCCTGGGCT 420 

GCAGGAGGTT GCGGCGGCCG CGGCAGCATG GTGGTGCCGG AGAAGGAGCA GAGCTGGATC 480 

CCCAAGATCT TCAAGAAGAA GACCTGCACG ACGTTCATAG TTGACTCCAC AGATCCGGGG 540 

GATGGGGTCT CGCTCTATTG CCCAGGCTGT TCTCAAACTC CTGGGCTCAA GCAGTCCTTC 600 

TGCCTTGACC TCCCAAAGTG CTTGGATTGT GCCCAGCCGT CCTTGAAGTT TTGCTCAGAA 660 

GAGCAAACTT TCTGGGAAGT AGCTGCAGGT GTTGQAAGTA GCTGCAGGGG AACTAGGGGA 720 
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TTCAAGGATG GAGCTGAAAT GGGTGAACGG ACAAAGTCGG TAAACTGAAT GGAGGATGCC 780 

TTCGGGGCAG CCGTGGTGAC CGTGTGGGAC AGCGATGCAC ACACCACGGA GAAGCCCACC 840 

GATGCCTACG GAGAGCTGGA CTTCACGGGG GCCGGCCGCA AGCACAGCAA TTTCCTCCGG 900 

CTCTCTGACC GAACGGATCC AGCTGCAGTT TATAGTCTGG TCACACGCAC ATGGGGCTTC 960 

CGTGCCCCGA ACCTGGTGGT GTCAGTGCTG GGGGGATCGG GGGGCCCCGT CCTCCAGACC 1020 

TGGCTGCAGG ACCTGCTGCG TCGTGGGCTG GTGCGGGCTG CCCAGAGCAC AGGAGCCTGG 1080 

ATTGTCACTG GGGGTCTGCA CACGGGCATC GGCCGGCATG TTGGTGTGGC TGTACGGGAC 1140 

CATCAGATGG CCAGCACTGG GGGCACCAAG GTGGTGGCCA TGGGTGTGGC CCCCTGGGGT 1200 

GTGGTCCGGA ATAGAGACAC CCTCATCAAC CCCAAGGGCT CGTTCCCTGC GAGGTACCGG 1260 

TGGCGCGGTG ACCCGGAGGA CGGGGTCCAG TTTCCCCTGG ACTACAACTA CTCGGCCTTC 1320 

TTCCTGGTGG AOGACGGCAC ACACGGCTGC CTGGGGGGCG AGAACCGCTT CCGCTTGCGC 1380 

CTGGAGTCCT ACATCTCACA GCAGAAGACG GGCGTGGGAG GGACTGGAAT TGACATCCCT 1440 

GTCCTGCTCC TCCTGATTGA TGGTGATGAG AAGATGTTGA CGCGAATAGA GAACGCCACC 1500 

CAGGCTCAGC TCCCATGTCT CCTCGTGGCT GGCTCAGGGG GAGCTGCGGA CTGCCTGGCG 1560 

GAGACCCTGG AAGACACTCT GGCCCCAGGG AGTGGGGGAG CCAGGCAAGG CGAAGCCCGA 1620 

GATCGAATCA GGC G TTTCT T TCCCAAAGGG GACCTTGAGG TCCTGCAGGC CCAGGTGGAG 1680 

AGGATTATGA CCCGGAAGGA GCTCCTGACA GTCTATTCTT CTGAGGATGG GTCTGAGGAA 1740 

TTCGAGACCA TAGTTTTGAA GGCCCTTGTG AAGGCCTGTG GGAGCTCGGA GGCCTCAGCC 1800 

TACCTGGATG AGCTGCGTTT GGCTGTGGCT TGGAACCGCG TGGACATTGC CCAGAGTGAA 1860 

CTCTTTCGGG GGGACATCCA ATGGOGGTCC TTCCATCTCG AAGCTTCCCT CATGGACGCC 1920 

CTGCTGAATG ACCGGCCTGA GTTCGTGCGC TTGCTCATTT CCCAOGGCCT CAGCCTGGGC 1980 

CACTTCCTGA CCCCGATGCG CCTGGCCCAA CTCTACAGCG CGGCGCCCTC CAACTCGCTC 2040 

ATCCGCAACC TTTTGGACCA GGCGTCCCAC AGCGCAGGCA CCAAAGCCCC AGCCCJAAAA 2100 

GGGGGAGCTG CGGAGCTCCG GCCCCCTGAC GTGGGGCATG TGCTGAGGAT GCTGCTGGGG 2160 

AAGATGTGCG OGCCGAGGTA CCCCTCCGGG GGCGCCTGGG ACCCTCACCC AGGCCAGGGC 2220 

TTCGGGGAGA GCATGTATCT GCTCTCGGAC AAGGCCACCT CGCCGCTCTC GCTGGATGCT 2280 

GGCCTCGGGC AGGCCCCCTG GAGCGACCTG CTTCTTTGGG CACTGTTGCT GAACAGGGCA 2340 

CAGATGGCCA TGTACTTCTG GGAGATGGGT TCCAATGCAG TTTCCTCAGC TCTTGGGGCC 2400 

TGTTTGCTGC TCCGGGTGAT GGCACGCCTG GAGCCTGACG CTGAGGAGGC AGCACGGAGG 2460 

AAAGACCTGG CGTTCAAGTT TGAGGGGATG GGCGTTGACC TCTTTGGCGA GTGCTATCGC 2520 

AGCAGTGAGG TGAGGGCTGC CCGCCTCCTC CTCCGTCGCT GCCCGCTCTG GGGGGATGCC 2580 

ACTTGCCTCC AGCTGGCCAT GCAAGCTGAC GCCCGTGCCT TCTTTGCCCA GGATGGGGTA 2640 

CAGTCTCTGC TGACACAGAA GTGGTGGGGA GATATGGCCA GCACTACACC CATCTGGGCC 2700 

CTGGTTCTOG CCTT CTTTTG CCCTCCACTC ATCTACACCC GCCTCATCAC CTTCAGGAAA 2760 

TCAGAAGAGG AGCCCACACG GGAGGAGCTA GAGTTTGACA TGGATAGTGT CATTAATGGG 2820 

GAAGGGCCTG TCGGGACGGC GGACCCAGCC GAGAAGACGC CGCTGGGGGT CCCGCGCCAG 2880 

TCGGGCCGTC CGGGTTGCTG CGGGGGCCGC TGCGGGGGGC GCCGGTGCCT ACGCCGCTGG 2940 

TTCCACTTCT GGGGCGCGCC GGTGACCATC TTCATGGGCA ACGTGGTCAG CTACCTGCTG 3000 

TTCCTGCTGC TTTTCTCGCG GGTGCTGCTC GTGGATTTCC AGCCGGCGCC GCCCGGCTCC 3060 

CTGGAGCTGC TGCTCTATTT CTGGGCTTTC ACGCTGCTGT GCGAGGAACT GCGCCAGGGC 3120 

CTGAGCGGAG GCGGGGGCAG CCTCGCCAGC GGGGGCCCCG GGCCTGGCCA TGCCTCACTG 3180 

AGCCAGCGCC TGCGCCTCTA CCTCGCCGAC AGCTGGAACC AGTGCGACCT AGTGGCTCTC 3240 

ACCTGCTTCC TCCTGGGCGT GGGCTGCCGG CTGACCCCGG GTTTGTACCA CCTGGGCCGC 3300 

ACTGTCCTCT GCATCGACTT CATGGTTTTC ACGGTGCGGC TGCTTCACAT CTTCACGGTC 3360 

AACAAACAGC TGGGGCCCAA GATGGTCATC GTGAGCAAGA TGATGAAGGA CGTGTTCTTC 3420 

TTCCTCTTCT TCCTGGGCGT GTGGCTGGTA GCCTATGGCG TGGCCACGGA GGGGCTCCTG 3480 

AGGCCACGGG ACAGTGACTT CCCAAGTATC CTGCGCCGCG TCTTCTACCG TCCCTACCTG 3540 

CAGATCTTCG GGCAGATTCC CCAGGAGGAC ATGGACGTGG CCCTCATGGA GCACAGCAAC 3600 

TGCTCGTCGG AGCCCGGCTT CTGGGCACAC CCTCCTGGGG CCCAGGCGGG CACCTGCGTC 3660 

TCCCAGTATG CCAACTGGCT GGTGGTGCTG CTCCTCGTCA TCTTCCTGCT CGTGGCCAAC 3720 

ATCCTGCTGG TCAACTTGCT CATTGCCATG TTCAGTTACA CATTCGGCAA AGTACAGGGC 3780 

AACAGCGATC TCTACTGGAA GGCGCAGCGT TACCGCCTCA TCCGGGAATT CCACTCTCGG 3840 

CCCGCGCTGG CCCCGCCCTT TATCGTCATC TCCCACTTGC GCCTCCTGCT CAGGCAATTG 3900 

TGCAGGCGAC CCCGGAGCCC CCAGCCGTCC TCCCCGGCCC TCGAGCATTT CCGGGTTTAC 3960 

CTTTCTAAGG AAGCCGAGCG GAAGCTGCTA ACGTGGGAAT CGGTGCATAA GGAGAACTTT 4020 

CTGCTGGCAC GCGCTAGGGA CAAGCGGGAG AGCGACTCCG AGCGTCTGAA GCGCACGTCC 4080 

CAGAAGGTGG ACTTGGCACT GAAACAGCTG GGACACATCC GCGAGTACGA ACAGCGCCTG 4140 

AAAGTGCTGG AGCGGGAGGT CCAGCAGTGT AGCCGCGTCC TGGGGTGGGT GGCCGAGGCC 4200 

CTGAGCCGCT CTGCCTTGCT GCCCCCAGGT GGGCCGCCAC CCCCTGACCT GCCTGGGTCC 4260 

AAAGACTGAG CCCTGCTGGC GGACTTCAAG GAQAAGCCCC CACAGGGGAT TTTGCTCCTA 4320 

GAGTAAGGCT CATCTGGGCC TCGGCCCCOG CACCTGGTGG CCTTGTCCTT GAGGTGAGCC 4380 

CCATGTCCAT CTGGGCCACT GTCAGGACCA CCTTTGGGAG TGTCATCCTT ACAAACCACA 4440 

GCATGCCCGG CTCCTCCCAG AACCAGTCCC AGCCTGGGAG GATCAAGGCC TGGATCCCGG 4500 

GCCGTTATCC ATCTGGAGGC TGCAGGGTCC TTGGGGTAAC AGGGACCACA GACCCCTCAC 4560 

CACTCACAGA TTCCTCACAC TGGGGAAATA AAGCCATTTC AGAGGAAAAA AAAAAAAAAA 4620 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NOs 558 Protein sequence 
Protein Accession fl: XP_057188.1 

1 11 21 31 41 SI 

I I I I I I 

MEDAFGAAW TVWDSDAHTT EKPTDAYGEL DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60 

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 120 

AVRDKQMAST GGTKWAMGV APWGWRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 180 

YSAPFLVDDG THGCLGGENR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240 

ENATQAQLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 300 

AQVERIMTRX ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 360 

AQSELFRGDI QWRSFHLEAS LMDALLNDRP EFVRLLISHG LSLGHFLTPM RLAQLYSAAP 420 

SNSLIRNLLD QASHSAGTKA PALKGGAAEL RPPDVGHVLR KLLGKKCAPR YPSGGAWDPH 480 

PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLNALL LNRAQMAKYF WEMGSNAVSS 540 

ALGACLLLRV MARLEPDAEE AARRKDLAFK FEGMGVDLFG ECYRSSEVRA ARLLLRRCPL 600 

WGDATCLQLA MQADARAPPA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRLZ 660 

TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 720 
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LRRWFHFWGA PVTIPMGNW SYLLFLLLFS RVLLVDFQPA PPGSLELLLY FWAPTLLCEE 780 

LRQGLSGGGG SLASGGPQPG KASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840 

HLGRTVLCID FMVPTVRLLH IFTVNKQLOP KIVIVSKMMK DVPFFLFPLG VWLVAYGVAT 900 

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPO FWAHPPGAQA 960 

J GTCVSQYANW LWLLLVIFL LVANILLVNL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020 

FHSRPA1APP PIV1SHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKBAE RKLLTWBSVH 1080 

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY BQRLKVLBRE VQQCSRVLGW 1140 
VAEALSRSAL LPPGGPPPPD LPGSKD 

10 Seq ID NO: 5S9 DNA sequence 

Nucleic Acid Accession #: NMJ>06853.1 
Coding sequences 26.. 874 

, _ 1 11 21 31 41 51 

15 | | | | | | 

AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAGG TGGCTGCGGG ACTGGAAGTC 60 

ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC CCCTCCAGGC 120 

CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGATCATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

CGAGAAGAOG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATOGCCAGT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA OCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC AOGAAAGTCT GCAAATATGT 840 

GGACTGGATC CAGGAGAOGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACOCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 
TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG 
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Seq ID NO: 560 Protein sequence 
Protein Accession #: NP_006844.1 

1 11 21 31 41 51 

I I I I I I 

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRHLLTA 60 

AKCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 120 

SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 180 

NITDTMVCAS VQEGGXDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 240 

DWIQETMKNN 

Seq ID NO: 561 DNA sequence 
Nucleic Acid Accession #: AY046419.1 
Coding sequence : 1 . . 1743 

1 11 21 31 41 51 

I I -I I I I 

ATGTTTACCT TCCTGTCATC TGTCACTGCT GCTGTCAGTG GCCTCCTGGT GGGTTATGAA 60 

CTTGGGATCA TCTCTGGGGC TCTTCTTCAG ATCAAAACCT TATTAGCCCT GAGCTGCCAT 120 

GAGCAGGAAA TGGTTGTGAG CTCCCTCGTC ATTGGAGCCC TCCTTGCCTC ACTCACCGGA 180 

GGGGTCCTGA TAGACAGATA TGGAAGAAGG ACAGCAATCA TCTTGTCATC CTGCCTGCTT 240 

GGACTCGGAA GCTTAGTCTT GATCCTCAGT TTATCCTACA CGGTTCTTAT AGTGGGACGC 300 

ATTGCCATAG GGGTTTCCAT CTCCCTCTCT TCCATTGCCA CTTGTGTTTA CATCGCAGAG 360 

ATTGCTCCTC AACACAGAAG AGGCCTTCTT GTGTCACTGA ATGAGCTGAT GATTGTCATC 420 

GGCATTCTTT CTGCCTATAT TTCAAATTAC GCATTTGCCA ATGTTTTCCA TGGCTGGAAG 480 

TACATGTTTG GTCTTGTGAT TCCCTTGGGA GTTTTGCAAG CAATTGCAAT GTATTTTCTT 540 

CCTCCAAGCC CTCGGTTTCT GGTGATGAAA QGACAAGAGG GAGCTGCTAQ CAAGGTTCTT 600 

GGAAGGTTAA GAGCACTCTC AGATACAACT GAGGAACTCA CTGTGATCAA ATCCTCCCTG 660 

AAAGATGAAT ATCAGTACAG TTTTTGGGAT CTGTTTCGTT CAAAAGACAA CATGCGGACC 720 

CGAATAATGA TAGGACTAAC ACTAGTATTT TTTGTACAAA TCACTGGCCA ACCAAACATA 780 

TTGTTCTATG CATCAACTGT TTTGAAGTCA GTTGGATTTC AAAGCAATGA GGCAGCTAGC 840 

CTCGCCTCCA CTGGGGTTGG AGTCGTCAAG GTCATTAGCA CCATCCCTGC CACTCTTCTT 900 

GTAGACCATG TCGGCAGCAA AACATTCCTC TGCATTGGCT CCTCTGTGAT GGCAGCTTCG 960 

TTGGTGACCA TGGGCATCGT AAATCTCAAC ATCCACATGA ACTTCACCCA TATCTGCAGA 1020 

AGCCACAATT CTATCAACCA GTCCTTGGAT GAGTCTGTGA TTTATGGACC AGGAAACCTG 1080 

TCAACCAACA ACAATACTCT CAGAGACCAC TTCAAAGGGA TTTCTTCCCA TAGCAGAAGC 1140 

TCACTCATGC CCCTQAGAAA TGATGTGGAT AAGAGAGGGG AGACGACCTC AGCATCCTTG 1200 

CTAAATGCTG GATTAAGCCA CACTGAATAC CAGATAGTCA CAGACCCTGG GGACGTCCCA 1260 

GCTTTTTTGA AATGGCTGTC CTTAGCCAGC TTGCTTSTTT ATGTTGCTGC TTTTTCAATT 1320 

GGTCTAGGAC CAATGCCCTG GCTGGTGCTC AGCGAGATCT TTCCTGGTGG GATCAGAGGA 1380 

CGAGCCATGG CTTTAACTTC TAGCATGAAC TGGGGCATCA ATCTCCTCAT CTCGCTGACA 1440 

TTTTTGACTG TAACTGATCT TATTGGCCTG CCATGGGTGT GCTTTATATA TACAATCATG 1500 

AOTCTAGCAT OC C TOCT T TT TGTTGTTATG TTTATACCTG AGACAAAGGG ATGCTCTTTG 1560 

GAACAAATAT CAATGGAGCT AGCAAAAGTG AACTATGTGA AAAACAACAT TTGTTTTATG 1620 

AGTCATCACC AAGAAGAATT AGTGCCAAAA CAGCCTCAAA AAAGAAAACC CCAGGAGCAG 1680 

CTCTTGGAGT GTAACAAGCT GTGTGGTAGG GGCCAATCCA GGCAGCTTTC TCCAGAGACC 1740 
TAA 
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Seq ID NO: 562 Protein sequence 
Protein Accession fl: AAL02327.1 

1 11 21 31 41 51 

5 I | | | I I 

MFTPLSSVTA AVSQLLVGYE LGIISGALLQ IKTUALSCH BQEMWSSLV IGALLASLTG 60 
GVLIDRYGRR TAIILSSCLL OiGSLVLILS LSYTVLIVGR IAIGVSISLS SIATCVYIAB 120 
IAPQHRRGLL VSLNELMIVI GILSAYISNY AFANVPHGWK YMFGLVIPLG VLQAIAMYFL 180 
PPSPRFLVMK GQEGAASKVL GRLRALSDTT EELTVIKSSL KDEYQYSFWD LFR9KDNMRT 240 

10 RIMIGLTLVF FVQITGQPNI LPYASTVLKS VGFQSNEAAS LASTGVGWK VISTIPATLL 300 
VDHVGSKTFL CIGSSVMAAS LVTMGIVNLN IHMNFTHICR SHNSINQSLD ESVIYGPGNL 360 
STNNNTLRDH FKGISSHSRS SLMPLRNDVD KRGETTSASL LNAGLSHTEY QIVTDPGDVP 420 
AFLKWLSLAS LLVYVAAFSI GLGPMPWLVL SEIPPGGIRG RAMALTSSMN WGINLLISLT 480 
FLTVTDLIGL PWVCPIYTIM SLASLLFWM FIPETKGCSI* EQISMELAKV NYVKNNICFM 540 

15 SHHQEELVPK QPQKRKPQEQ LLECNKLCGR GQSRQLSPET 

Seq ID NO: 563 DMA sequence 
Nucleic Acid Accession 8: XM_059466.1 
Coding sequence: 1..894 
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1 11 21 31 41 51 

I I I I I I 

ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG 120 

GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 180 

CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC 240 

GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTOGCTCC TGGGGCTCGG CGGGCTGGAC 300 

GCCGAGTGCG GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 360 

CTGGGCATOG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 420 

GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 540 

CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 660 

TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 780 

GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 840 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 



Seq ID NO: 564 Protein sequence 
40 Protein Accession #: XPJ>59466.1 

1 11 21 31 41 51 

I I I I I I 

MEPRALVTAL SLGLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 60 

45 PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 120 

LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDBWHL LHLRRITAGF 180 

LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 240 
KLIYSIiPADV EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV 

50 Seq ID NO.- 565 DMA sequence 

Nucleic Acid Accession # : Eos sequence 
Coding sequence: 1. .3315 

1 11 21 31 41 51 

55 | | | | | | 

ATGTCCTTTC GGGCAGCCAG GCTCAGCATG AGGAACAGAA GGAATGACAC TCTGGACAGC 60 

ACCCGGACCC TGTACTCCAG CGCGTCTCGG AGCACAGACT TGTCTTACAG TGAAAGCGAC 120 

TTGGTGAATT TTATTCAAGC AAATTTTAAG AAACGAGAAT GltSlCTfCrf TACCAAAGAT 180 

TCCAAGGCCA CGGAGAATGT GTGCAAGTGT GGCTATGCCC AGAGCCAGCA CATGGAAGGC 240 

60 ACCCAGATCA ACCAAAGTGA GAAATGGAAC TACAAGAAAC ACACCAAGGA ATTTCCTACC 300 

GACGCCTTTG GGGATATTGA GTTTGAGACA CTGGGGAAGA AAGGGAAGTA TATACGTCTG 360 

TCCTGCGACA CGGACGCGGA AATCCTTTAC GAGCTGCTGA CCCAGCACTG GCACCTGAAA 420 

ACACCCAACC TGGTCATTTC TGTGACCGGG GGGGCCAAGA ACTTCGCCCT GAAGCCGCGC 480 

ATGCGCAAGA TCTTCAGCCG GCTCATCTAC ATCGCGCAGT CCAAAGGTGC TTGGATTCTC 540 

65 ACGGGAGGCA CCCATTATGG CCTGATGAAG TACATCGGGG AGGTGGTGAG AGATAACACC 600 

ATCAGCAGGA GTTCAGAGGA GAATATTGTG GCCATTGGCA TAGCAGCTTG GGGCATGGTC 660 

TCCAACCGGG ACACCCTCAT CAGGAATTGC GATGCTGAGG GCTATTTTTT AGCCCAGTAC 720 

CTTATGGATG ACTTCACAAG AGATCCACTG TATATCCTGG ACAACAACCA CACACATTTG 780 

CTGCTCGTGG ACAATGGCTG TCATGGACAT CCCACTGTCG AAGCAAAGCT CCGGAATCAG 840 

70 CTAGAGAAGT ATATCTCTGA GCGCACTATT CAAGATTCCA ACTATGGTGG CAAGATCCCC 900 

ATTGTGTGTT TTGCCCAAGG AGGTGGAAAA GAGACTTTGA AAGCCATCAA TACCTCCATC 960 

AAAAATAAAA TTCCTTGTGT GGTGGTGGAA GGCTCGGGCC AGATCGCTGA TGTGATCGCT 1020 

AGCCTGGTGG AGGTGGAGGA TGCCCTGACA TCTTCTGCCG TCAAGGAGAA GCTGGTGCGC 1080 

TTTTTACCCC GCACGGTGTC CCGGCTGCCT GAGGAGGAGA CTGAGAGTTG GATCAAATGG 1140 

75 CTCAAAGAAA TTCTCGAATG TTCTCACCTA TTAACAGTTA TTAAAATGGA AGAAGCTGGG 1200 

GATGAAATTG TGAGCAATGC CATCTCCTAC GCTCTATACA AAGCCTTCAG CACCAGTGAG 1260 

CAAGACAAGG ATAACTGGAA TGGGCAGCTG AAGCTTCTGC TGGAGTGGAA CCAGCTGGAC 1320 

TTAGCCAATG ATGAGATTTT CACCAATGAC CGCCGATGGG AGTCTGCTGA CCTTCAAGAA 1380 

GTCATGTTTA CGGCTCTCAT AAAGGACAGA CCCAAGTTTG TCCGCCTCTT TCTGGAGAAT 1440 

oO GGCTTGAACC TACGGAAGTT TCTCACCCAT GATGTCCTCA CTGAACTCTT CTCCAACCAC 1500 

TTCAGCACGC TTGTGTACCG GAATCTGCAG ATCGCCAAGA ATTCCTATAA TGATGCCCTC 1560 

CTCACGTTTG TCTGGAAACT GGTTGCGAAC TTCCGAAGAG GCTTCCGGAA GGAAGACAGA 1620 

AATGGCCGGG ACGAGATGGA CATAGAACTC CACGACGTGT CTCCTATTAC TCGGCACCCC 1680 

CTGCAAGCTC TCTTCATCTG GGCCATTCTT CAGAATAAGA AGGAACTCTC CAAAGTCATT 1740 
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TGGGAGCAGA CCAGGGGCTG CACTCTGGCA GCCCTGGGAG CCAGCAAGCT TCTGAAGACT 1800 

CTGGCCAAAG TGAAGAAGGA CATCAATGCT GCTGGGGAGT CCGAGGAGCT GGCTAATGAG 1860 

TACGAGACCC GGGCTGTTGA GCTGTTCACT GAGTGTTACA GCAGCGATGA AGACTTGGCA 1920 

GAACAGCTGC TGGTCTATTC CTGTGAAGCT TGGGGTGGAA GCAACTGTCT G GAGCT GGCG 1980 

5 GTGGAGGCCA CAGACCAGCA TTTCATCGCC CAGCCTGGGG TCCAGAATTT TCTTTCTAAG 2040 

CAATGGTATG GAGAGATTTC CCGAGACACC AAGAACTGGA AGATTATCCT GTGTCTGTTT 2100 

ATTATACCCT TGGTGGGCTG TGGCTTTGTA TCATTTAGGA AGAAACCTGT CGACAAGCAC 2160 

AAGAAGCTGC TTTGGTACTA TGTGGCGTTC TTCACCTCCC CCTTCGTGGT CTTCTCCTGG 2220 

AATGTGGTCT TCTACATCGC CTTCCTCCTG CTGTTTGCCT ACGTGCTGCT CATGGATTTC 2280 

10 CATTCGGTGC CACACCCCCC OGAGCTGGTC CTGTACTCGC TGGTCTTTGT CCTCTTCTGT 2340 

GATGAAGTGA GACAGTGGTA CGTAAATGGG GTGAATTATT TTACTGACCT GTGGAATGTG 2400 

ATGGACACGC TGGGGCTTTT TTACTTCATA GCAGGAATTG TATTTCGGCT CCACTCTTCT 2460 

AATAAAAGCT CTTTGTATTC TGGACGAGTC ATTTTCTGTC TGGACTACAT TATTTTCACT 2520 

CTAAGATTGA TCCACATTTT TACTGTAAGC AGAAACTTAG GACCCAAGAT TATAATGCTG 2580 

15 CAGAGGATGC TGATCGATGT GTTCTTCTTC CTGTTCCTCT TTGCGGTGTG GATGGTGGCC 2640 

TTTGGCGTGG CCAGGCAAGG GATCCTTAGG CAGAATGAGC AGCGCTGGAG GTGGATATTC 2700 

OGTTCGGTCA TCTACGAGCC CTACCTGGCC ATGTTCGGCC AGGTGCCCAG TGACGTGGAT 2760 

GGTACCACGT ATGACTTTGC CCACTGCACC TTCACTGGGA ATGAGTCCAA GCCACTGTGT 2820 

GTGGAGCTGG ATGAGCACAA CCTGCCCCGG TTCCCCGAGT GGATCACCAT CCCCCTGGTG 2880 

20 TGCATCTACA TGTTATCCAC CAACATCCTG CTGGTCAACC TGCTGGTCGC CATGTTTGGC 2940 

TACACGGTGG GCACOGTCCA GGAGAACAAT GACCAGGTCT GGAAGTTCCA GAGGTACTTC 3000 

CTGGTGCAGG AGTACTGCAG CCGCCTCAAT ATCCCCTTCC CCTTCATCGT CTTCGCTTAC 3060 

TTCTACATGG TGGTGAAGAA GTGCTTCAAG TGTTGCTGCA AGGAGAAAAA CATGGAGTCT 3120 

TCTGTCTGCT GTTTCAAAAA TGAAGACAAT GAGACTCTGG CATGGGAGGG TGTCATGAAG 3180 

25 GAAAACTACC TTGTCAAGAT CAACACARAA GCCAACGACA CCTCAGAGGA AATGAGGCAT 3240 

CGATTTAGAC AACTGGATAC AAAGCTTAAT GATCTCAAGG GTCTTCTGAA AGAGATTGCT 3300 
AATAAAATCA AATGA 

Seq ID NO: 566 Protein sequence 
30 Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSFRAARLSM RNRRNDTLDS TRTLYSSASR STDLSYSESD LVNFIQANFK KRECVFFTKD 

35 SKATENVCKC GYAQSQHMEG TQINQSEKWN YKKHTKEFPT DAFGDIQFET LGKKGKYIRL 
SCDTDAEILY ELLTQHWHLK TPNLVISVTG GAKNFALKPR MRKIFSRLIY IAQSKGAWII* 
TGGTHYGLMX YIGEWRBNT ISRSSEENIV AIGIAAWGMV SNRDTLIRNC DAEGYFLAQY 
LMDDFTRDPL YILDNNHTHL LLVDNGCHGH PTVEAKLRNQ LEKYISERTI QDSNYGGKIP 
IVCFAQGGGK ETLKAINTSI KNKIPCWVE GSGQIADVIA SLVEVEDALT SSAVKEKLVR 

40 FLPRTVSRLP EEETESWIKM LKEILECSHL LTVIKMEEAG DEXVSNAISY ALYKAFSTSE 
QDKDNWNGQL KLLLEWNQLD LANDEIFTND RRWESADLQE VMFTALIKDR PKFVRLFLEN 
GLNLRKFLTH DVLTELFSNH FSTLVYRNLQ IAKNSYKDAL LTFVWKLVAN FRRGFRKEDR 
NGRDEMDIEL HDVSPITRHP LQALFIWAIL QNKKELSKVI WEQTRGCTLA ALGASKLLKT 
LAKVKNDINA AGESEELANE YETRAVELFT BCYSSDEDLA EQLLVYSCEA WGGSNCLELA 

45 VEATDQHFIA QPGVQNFLSK QWYGEISRDT KNWKIILCLF IIPLVGCGFV SFRKKPVDKH 
KKLLWYYVAF FTSPFWFSW NWFYIAFLL LFAYVLLMDF HSVPHPPELV LYSLVFVLFC 
DEVRQWYVNG VNYFTDLWNV MDTLGLFYFI AGIVFRLHSS KKSSLYSGRV IFCLDYIIFT 
LRLIHIFTVS RNLGPKIIML QRMLIDVFFF LPLFAVWMVA FGVARQGILR QNEQRWRWIF 
RSVIYEPYLA MFGQVPSDVD GTTYDFAHCT FTGNESKPLC VELDEHNLPR FPEWITIPLV 

50 CIYMLSTNIL ZiVNLLVAMFG YTVGTVQENN DQVWKFQRYF LVQEYCSRLN IPFPFIVFAY 
FYMWKKCFK CCCKEKNMES SVCCFKNEDN ETLAWEGVMK ENYLVKINTK ANDTSEEMRH 
RFRQLDTKLN DLKGLLKEIA NKIK 

Seq ID NO: 567 DNA sequence 
55 Nucleic Acid Accession #: NM 006911.1 
Coding sequence : 1 . . 558 
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1 11 21 31 41 51 

60 | | | | | | 

ATGCCTCGCC TGTTCTTGTT CCACCTGCTA GAATTCTGTT TACTACTGAA CCAATTTTCC 60 

AGAGCAGTCG CGGCCAAATG GAAGGACGAT GTTATTAAAT TATGCGGCCG CGAATTAGTT 120 

OGCGOGCAGA TTGCCATTTG CGGCATGAGC ACCTGGAGCA AAAGGTCTCT GAGCCAGGAA 180 

GATGCTCCTC AGACACCTAG ACCAGTGGCA GAAATTGTAC CATCCTTCAT CAACAAAGAT 240 

65 ACAGAAACTA TAATTATCAT GTTGGAATTC ATTGCTAATT TGCCACCGGA GCTGAAGGCA 300 

GCCCTATCTG AGAGGCAACC ATCATTACCA GAGCTACAGC AGTATGTACC TGCATTAAAG 360 

GATTCCAATC TTAGCTTTGA AGAATTTAAG AAACTTATTC GCAATAGGCA AAGTGAAGCC 420 

GCAGACAGCA ATCCTTCAGA ATTAAAATAC TTAGGCTTGG ATACTCATTC TCAAAAAAAG 480 

AGACGACCCT AOGTGGCACT GTTTGAGAAA TGTTGGCTAA TTGGTTGTAC CAAAAGGTCT 540 
70 CTTGCTAAAT ATTGCTGA 

Seq ID NO i 568 Protein sequence 
Protein Accession 8: NP_008842.1 

75 1 11 21 31 41 51 

I I 1 I.I I 

MPRLFLFHLL EFCLLLNQFS RAVAAKWKDD VIKLCGRELV RAQIAICGMS TWSKRSLSQE 60 
DAPQTPRPVA EIVPSFINKD TETIIIMLBF IANLPPELKA ALSERQPSLP ELQQYVPALK 120 
DSNLSFEEFK KLIRNRQSEA ADSNPSBLKY LGLDTHSQKK RRPYVALFBK CCLIGCTKRS 180 
80 LAKYC 

Seq ID NO: 569 DNA sequence 

Nucleic Acid Accession 8: XM_036453.1 
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Coding sequence: 1..3978 

1 11 21 31 41 51 

I I 1 1 I I 

ATGCTGCCCG TGTACCAGGA GGTGAAGCCC AACCCGCTGC AGGACGCGAA CCTCTGCTCA 60 

OGCGTGTTCT TCTGGTGGCT CAATCCCTTQ TTTAAAATTG GCCATAAACG GAGATTAGAG 120 

GAAGATGATA TGTATTCAGT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGGAGTTG 180 

CAAGGGTTCT GGGATAAAGA AGTTTTAAGA GCTGAGAATG ACGCACAGAA GCCTTCTTTA 240 

ACAAGAGCAA TCATAAAGTG TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTACGTTA 300 

ATTGAGGAAA GTGCCAAAGT AATCCAGCCC ATATTTTTGG GAAAAATTAT TAATTATTTT 360 

GAAAATTATG ATCCCATGGA TTCTGTGGCT TTGAACACAG CGTACGCCTA TGCCACGGTG 420 

CTGACTTTTT GCACGCTCAT TTTGGCTATA CTGCATCACT TATATTTTTA TCACGTTCAG 480 

TGTGCTGGGA TGAGGTTACG AGTAGCCATG TGCCATATGA TTTATCGGAA GGCACTTCGT 540 

CTTAGTAACA TGGCCATGGG GAAGACAACC ACAGGCCAGA TAGTCAATCT GCTGTCCAAT 600 

GATGTGAACA AGTTTGATCA GGTGACAGTG TTCTTACACT TCCTGTGGGC AGGACCACTG 660 

CAGGCGATCG CAGTGACTGC CCTACTCTGG ATGGAGATAG GAATATCGTG CCTTGCTGGG 720 

ATGGCAGTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 780 

CTGAGGAGTA AAACTGCAAC TTTCACGGAT GCCAGGATCA GGACCATGAA TGAAGTTATA 840 

ACTGGTATAA GGATAATAAA AATGTACGCC TGGGAAAAGT CATTTTCAAA TCTTATTACC 900 

AATTTGAGAA AGAAGGAGAT TTCCAAGATT CTGAGAAGTT CCTGCCTCAG GGGGATGAAT 960 

TTGGCTTCGT TTTTCAGTGC AAGCAAAATC ATCGTGTTTG TGACCTTCAC CACCTACGTG 1020 

CTCCTCGGCA GTGTGATCAC AGCCAGCCGC GTGTTCGTGG CAGTGACGCT GTATGGGGCT 1080 

GTGCGGCTGA CGGTTACCCT CTTCTTCCCC TCAGCCATTG AGAGGGTGTC AGAGGCAATC 1140 

GTCAGCATCC GAAGAATCCA GACCTTTTTG CTACTTGATG AGATATCACA GGGCAACCGT 1200 

CAGCTGCCGT CAGATGGTAA AAAGATGGTG CATGTGCAGG ATTTTACTGC TTTTTGGGAT 1260 

AAGGCATCAG AGACCCCAAC TCTACAAGGC CTTTCCTTTA CTGTCAGACC TGGCGAATTG 1320 

TTAGCTGTGG TCGGCCCCGT GGGAGCAGGG AAGTCATCAC TGTTAAGTGC CGTGCTCGGG 1380 

GAATTGGCCC CAAGTCACGG GCTGGTCAGC GTGCATGGAA GAATTGCCTA TGTGTCTCAG 1440 

CAGCCCTGGG TGTTCTCGGG AACTCTGAGG AGTAATATTT TATTTGGGAA GAAATACGAA 1500 

AAGGAACGAT ATGAAAAAGT CATAAAGGCT TGTGCTCTGA AAAAGGATTT ACAGCTGTTG 1560 

GAGGATGGTG ATCTGACTGT GATAGGAGAT CGGGGAACCA CGCTGAGTGG AGGGCAGAAA 1620 

GCACGGGTAA ACCTTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGACGAT 1680 

CCTCTCAGTG CAGTAGATGC GGAAGTTAGC AGACACTTGT TCGAACTGTG TATTTGTCAA 1740 

ATTTTGCATG AGAAGATCAC AATTTTAGTG ACTCATCAGT TGCAGTACCT CAAAGCTGCA 1800 

AGTCAGATTC TGATATTGAA AGATGGTAAA ATGGTGCAGA AGGGGACTTA CACTGAGTTC 1860 

CTAAAATCTG GTATAGATTT TGGCTCCCTT TTAAAGAAGG ATAATGAGGA AAGTGAACAA 1920 

CCTCCAGTTC CAGGAACTCC CACACTAAGG AATCGTACCT TCTCAGAGTC TTCGGTTTGG 1980 

TCTCAACAAT CTTCTAGACC CTCCTTGAAA GATGGTGCTC TGGAGAGCCA AGATACAGAG 2040 

AATGTCCCAG TTACACTATC AGAGGAGAAC CGTTCTGAAG GAAAAGTTGG TTTTCAGGCC 2100 

TATAAGAATT ACTTCAGAGC TGGTGCTCAC TGGATTGTCT TCATTTTCCT TATTCTCCTA 2160 

AACACTGCAG CTCAGGTTGC CTATGTGCTT CAAGATTGGT GGCTTTCATA CTGGGCAAAC 2220 

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA GAAGCTAGAT 2280 

CTTAACTGGT ACTTAGGAAT TTATTCAGGT TTAACTGTAG CTACCGTTCT TTTTGGCATA 2340 

GCAAGATCTC TATTGGTATT CTACGTCCTT GTTAACTCTT CACAAACTTT GCACAACAAA 2400 

ATGTTTGAGT CAATTCTGAA AGCTCCGGTA TTATTCTTTG ATAGAAATCC AATAGGAAGA 2460 

ATTTTAAATC GTTTCTCCAA AGACATTGGA CACTTGGATG ATTTGCTGCC GCTGACGTTT 2520 

TTAGATTTCA TCCAGACATT GCTACAAGTG GTTGGTGTGG TCTCTGTGGC TGTGGCCGTG 2580 

ATTCCTTGGA TCGCAATACC CTTGGTTCCC CTTGGAATCA TTTTCATTTT TCTTCGGCGA 2640 

TATTTTTTGG AAACGTCAAG AGATGTGAAG CGCCTGGAAT CTACAACTCG GAGTCCAGTG 2700 

TTTTCCCACT TGTCATCTTC TCTCCAGGGG CTCTGGACCA TCCGGGCATA CAAAGCAGAA 2760 

GAGAG GTGTC AGGAACTGTT TGATGCACAC CAGGATTTAC ATTCAGAGGC TTGGTTCTTG 2820 

TTTTTGACAA CGTCCCGCTG GTTCGCCGTC CGTCTGGATG CCATCTGTGC CATGTTTGTC 2880 

ATCATCGTTG CCTTTGGGTC CCTGATTCTG GCAAAAACTC TGGATGCOGG GCAGGTTGGT 2940 

TTGGCACTGT CCTATGCCCT CACGCTCATG GGGATGTTTC AGTGGTGTGT TCGACAAAGT 3000 

GCTGAAGTTG AGAATATGAT GATCTCAGTA GAAAGGGTCA TTGAATACAC AGACCTTGAA 3060 

AAAGAAGCAC CTTGGGAATA TCAGAAACGC CCACCACCAG CCTGGCCCCA TGAAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATGTAC AGTCCAGGTG GGCCTCTGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAG GTTGGCATTG TGGGAAGAAC CGGAGCTGGA 3240 

AAAAGTTCCC TCATCTCAGC CCTTTTTAGA TTGTCAGAAC CCGAAGGTAA AATTTGGATT 3300 

GATAAGATCT TGACAACTGA AATTGGACTT CACGATTTAA GGAAGAAAAT GTCAATCATA 3360 

CCTCAGGAAC CTGTTTTGTT CACTGGAACA ATGAGGAAAA ACCTGGATCC CTTTAATGAG 3420 

CACACGGATG AGGAACTGTG GAATGCCTTA CAAGAGGTAC AACTTAAAGA AACCATTGAA 3480 

GATCTTCCTG GTAAAATGGA TACTGAATTA GCAGAATCAG GATCCAATTT TAGTGTTGGA 3S40 

CAAAGACAAC TGGTGTGCCT TGCCAGGGCA ATTCTCAGGA AAAATCAGAT ATTGATTATT 3600 

GATGAAGOGA OGGCAAATGT GGATCCAAGA ACTGATGAGT TAATACAAAA AAAAATCCGG 3660 

GAGAAATTTG CCCACTGCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTGAC 3720 

AGCGACAAGA TAATGGTTTT AGATTCAGGA AGACTGAAAG AATATGATGA GCCGTATGTT 3780 

TTGCTGCAAA ATAAAGAGAG CCTATTTTAC AAGATGGTGC AACAACTGGG CAAGGCAGAA 3840 

GCCGCTGCCC TCACTGAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900 

GGTCACACTG ACCACATGGT TACAAACACT TCCAATGGAC AGCCCTCGAC CTTAACTATT 3960 
TTCGAGACAG CACTGTGA 



Seq ID NO: 570 Protein sequence 
Protein Accession #: XP_036453.l 

1 11 21 31 41 51 

I I I I I I 

MLPVYQEVKP NPLQDANLCS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL 60 
QGFWDKBVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 120 
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGKRLRVAM CHMIYRKALR 180 
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCLAG 240 
MAVLI ILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI TGIRIIKMYA WEKSFSNLIT 300 
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVTLYGA 360 
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VRLTVTLFFP 
KASBTPTLQG 
QPWVFSGTLR 
ARVNLARAVY 
SQILILKDGK 
SQQSSRPSLK 
NTAAQVAYVL 
ARSLLVFYVL 
LDFIQTLLQV 
FSHLSSSLQG 
IIVAFGSLIL 
KEAPKEYQKR 
KSSLISALFR 
HTDEELWNAL 
DEATANVDPR 
LLQNKESLFY 
FETAL 



SAIBRVSEAI 
LSPTVRPGEL 
SNILFGKKYE 
QDADIYLLDD 
MVQKGTYTEF 
DGALESQDTE 
QDWVflrSYWAN 
VNSSQTLHNK 
VGWSVAVAV 
LWTIRAYKAB 
AKTLDAGQVG 
PPPAWPHEGV 
LSEPEGKIWI 
QEVQLKETIE 
TDELIQKKIR 
KMVQQLGKAE 



VSIRRIQTFL 
LAWGPVGAG 
KBRYEKVIKA 
PLSAVDAEVS 
LKSGIDFGSL 
NVPVTLSEEN 
KQSMLNVTVN 
MFBSZLKAPV 
IPWIAIPLVP 
ERCQELPDAH 
LALSYALTLM 
IIFDNVNFMY 
DKILTTEIGL 
DLPGKMDTEL 
EKFAHCTVLT 
AAALTETAKQ 



LLDEISQRNR 
KSSLLSAVLG 
CALKKDLQItL 
RHLFELCICQ 
LKKDNEESEQ 
RSEGKVGFQA 
GGGNVTBKLD 
LFFDRNPIGR 
LGIIFIFLRR 
QDLHSEAWFL 
GMFQWCVRQS 
SPGGPLVLKH 
HDLRKKMSII 
AESGSNFSVG 
IAHRLNTIID 
VYPKRNYPHI 



QLPSDGKKMV 
ELAPSHGLVS 
EDGDLTV IGD 
ILHEKITILV 
PPVPGTPTLR 
YKNYFRAGAH 
IjNWYLGIYSG 
ILNRFSKDIG 
YFLETSRDVK 
FLTTSRWFAV 
AEVENMHISV 
LTALIKSQEK 
PQEPVLFTGT 
QRQLVCLARA 
SDKIMVLDSG 
GHTDHMVTNT 



HVQDFTAFWD 
VHGRIAYVSQ 
RGTTLSGGQK 
THQLQYLKAA 
NRTFSBSSVW 
WIVFIPLILL 
LTVATVLFGI 
HLDDLLPLTF 
RLESTTRSPV 
RLDAICAMFV 
ERVIEYTDLE 
VGIVGRTGAG 
MRKMLDPFNE 
ILRKNQILII 
RLKEYDEPYV 
SNGQPSTLTI 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 571 DNA sequence 
Nucleic Acid Accession *h AF07120 
Coding sequence i 116 . . 4093 



11 



51 



21 31 41 

1 ■ I I I I I 

GGACAGGCGT GGCGGCCGGA GCCCCAGCAT CCCTGCTTGA GGTCCAGGAG CGGAGCCCGC 60 

GGCCACCGCC GCCTGATCAG CGCGACCCCG GCCCGCGCCC GCCCCGCCCG GCAAGATGCT 120 

GCCCGTGTAC CAGGAGGTGA AGCCCAACCC GCTGCAGGAC GCGAACATCT GCTCACGCGT 180 

GTTCTTCTGG TGGCTCAATC CCTTGTTTAA AATTGGCCAT AAACGGAGAT TAGAGGAAGA 240 

TGATATGTAT TCAGTGCTGC CAGAAGACCG CTCACAGCAC CTTGGAGAGG AGTTGCAAGG 300 

GTTCTGGGAT AAAGAAGTTT TAAGAGCTGA GAATGACGCA CAGAAGCCTT CTTTAACAAG 360 

AGCAATCATA AAGTGTTACT GGAAATCTTA TTTAGTTTTG GGAATTTTTA CGTTAATTGA 420 

GGAAAGTGCC AAAGTAATCC AGCCCATATT TTTGGGAAAA ATTATTAATT ATTTTGAAAA 480 

TTATGATCCC ATGGATTCTG TGGCTTTGAA CACAGCGTAC GCCTATGCCA CGGTGCTGAC 540 

TTTTTGCACG CTCATTTTGG CTATACTGCA TCACTTATAT TTTTATCACG TTCAGTGTGC 600 

TGGGATGAGG TTACGAGTAG CCATGTGCCA TATGATTTAT CGGAAGGCAC TTCGTCTTAG 660 

TAACATGGCC ATGGGGAAGA CAACCACAGG CCAGATAGTC AATCTGCTGT CCAATGATGT 720 

GAACAAGTTT GATCAGGTGA CAGTGTTCTT ACACTTCCTG TGGGCAGGAC CACTGCAGGC 7 BO 

GATCGCAGTG ACTGCCCTAC TCTGGATGGA GATAGGAATA TCGTGCCTTG CTGGGATGGC 840 

AGTTCTAATC ATTCTCCTGC CCTTGCAAAG CTGTTTTGGG AAGTTGTTCT CATCACTGAG 900 

GAGTAAAACT GCAACTTTCA CGGATGCCAG GATCAGGACC ATGAATGAAG TTATAACTGG 960 

TATAAGGATA ATAAAAATGT AOGCCTGGGA AAAGTCATTT TCAAATCTTA TTACCAATTT 1020 

GAGAAAGAAG GAGATTTCCA AGATTCTGAG AAGTTCCTGC CTCAGGGGGA TGAATTTGGC 1080 

TTCGTTTTTC AGTGCAAGCA AAATCATCGT GTTTGTGACC TTCACCACCT ACGTGCTCCT 1140 

CGGCAGTGTG ATCACAGCCA GCCGCGTGTT CGTGGCAGTG ACGCTGTATG GGGCTGTGCG 1200 

GCTGACGGTT ACCCTCTTCT TCCCCTCAGC CATTGAGAGG GTGTCAGAGG CAATCGTCAG 1260 

CATCCGAAGA ATCCAGACCT TTTTGCTACT TGATGAGATA TCACAGCGCA ACCGTCAGCT 1320 

GCCGTCAGAT GGTAAAAAGA TGGTGCATGT GCAGGATTTT ACTGCTTTTT GGGATAAGGC 1380 

ATCAGAGACC CCAACTCTAC AAGGCCTTTC CTTTACTGTC AGACCTGGCG AATTGTTAGC 1440 

TGTGGTCGGC CCCGTGGGAG CAGGGAAGTC ATCACTGTTA AGTGCCGTGC TCGGGGAATT 1500 

GGCCCCAAGT CACGGGCTGG TCAGCGTGCA TGGAAGAATT GCCTATGTGT CTCAGCAGCC 1560 

CTGGGTGTTC TCGGGAACTC TGAGGAGTAA TATTTTATTT GGGAAGAAAT ATGAAAAGGA 1620 

ACGATATGAA AAAGTCATAA AGGCTTGTGC TCTGAAAAAG GATTTACAGC TGTTGGAGGA 1680 

TGGTGATCTG ACTGTGATAG GAGATCGGGG AACCACGCTG AGTGGAGGGC AGAAAGCACG 1740 

GGTAAACCTT GCAAGAGCAG TGTATCAAGA TGCTGACATC TATCTCCTGG ACGATCCTCT 1800 

CAGTGCAGTA GATGCGGAAG TTAGCAGACA CTTGTTCGAA CTGTGTATTT GTCAAATTTT 1860 

GCATGAGAAG ATCACAATTT TAGTGACTCA TCAGTTGCAG TACCTCAAAG CTGCAAGTCA 1920 

GATTCTGATA TTGAAAGATG GTAAAATGGT GCAGAAGGGG ACTTACACTG AGTTCCTAAA 1980 

ATCTGGTATA GATTTTGGCT CCCTTTTAAA GAAGGATAAT GAGGAAAGTG AACAACCTCC 2040 

AGTTCCAGGA ACTCCCACAC TAAGGAATCG TACCTTCTCA GAGTCTTCGG TTTGGTCTCA 2100 

ACAATCTTCT AGACCCTCCT TGAAAGATGG TGCTCTGGAG AGCCAAGATA CAGAGAATGT 2160 

CCCAGTTACA CTATCAGAGG AGAACCGTTC TGAAGGAAAA GTTGGTTTTC AGGCCTATAA 2220 

GAATTACTTC AGAGCTGGTG CTCACTGGAT TGTCTTCATT TTCCTTATTC TCCTAAACAC 2280 

TGCAGCTCAG GTTGCCTATG TGCTTCAAGA TTGGTGGCTT TCATACTGGG CAAACAAACA 2340 

AAGTATGCTA AATGTCACTG TAAATGGAGG AGGAAATGTA ACOGAGAAGC TAGATCTTAA 2400 

CTGGTACTTA GGAATTTATT CAGGTTTAAC TGTAGCTACC GTTCTTTTTG GCATAGCAAG 2460 

ATCTCTATTG GTATTCTACG TCCTTGTTAA CTCTTCACAA ACTTTGCACA ACAAAATGTT 2520 

TGAGTCAATT CTGAAAGCTC CGGTATTATT CTTTGATAGA AATCCAATAG GAAGAATTTT 2580 

AAATCGTTTC TCCAAAGACA TTGGACACTT GGATGATTTG CTGCCGCTGA CGTTTTTAGA 2640 

TTTCATCCAG ACATTGCTAC AAGTGGTTGG TGTGGTCTCT GTGGCTGTGG CCGTGATTOC 2700 

TTGGATCGCA ATACCCTTGQ TTCCCCTTGG AATCATTTTC ATTTTTCTTC GGCGATATTT 2760 

TTTGGAAACG TCAAGAGATG TGAAGOGCCT GGAATCTACA ACTCGGAGTC CAGTGTTTTC 2820 

CCACTTGTCA TCTTCTCTCC AGGGGCTCTG GACCATCCGG GCATACAAAG CAGAAGAGAG 2880 

GTGTCAGGAA CTGTTTGATG CACACCAGGA TTTACATTCA GAGGCTTGGT TCTTGTTTTT 2940 

GACAACGTCC CGCTGGTTCG CCGTCCGTCT GGATGCCATC TGTGCCATGT TTGTCATCAT 3000 

CGTTGCCTTT GGGTCCCTGA TTCTGGCAAA AACTCTGGAT GCOGGGCAGG TTGGTTTGGC 3060 

ACTGTCCTAT GCCCTCACGC TCATGGGGAT GTTTCAGTGG TGTGTTCGAC AAAGTGCTGA 3120 

AGTTGAGAAT ATGATGATCT CAGTAGAAAG GGTCATTGAA TACACAGACC TTGAAAAAGA 3180 

AGCACCTTGG GAATATCAGA AACGCCCACC ACCAGCCTGG CCCCATGAAG GAGTGATAAT 3240 

CTTTGACAAT GTGAACTTCA TGTACAGTCC AGGTGGGCCT CTGGTACTGA AGCATCTGAC 3300 

AGCACTCATT AAATCACAAG AAAAGGTTGG CATTGTGGGA AGAACCGGAG CTGGAAAAAG 3360 

TTCCCTCATC TCAGCCCTTT TTAGATTGTC AGAACCOGAA GGTAAAATTT GGATTGATAA 3420 

GATCTTGACA ACTGAAATTG GACTTCACGA TTTAAGGAAG AAAATGTCAA TCATACCTCA 3480 

GGAACCTGTT TTGTTCACTG GAACAATGAG GAAAAACCTG GATCCCTTTA AGGAGCACAC 3540 

GGATGAGGAA CTGTGGAATG CCTTACAAGA GGTACAACTT AAAGAAACCA TTGAAGATCT 3600 
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TCCTGGTAAA 
ACAACTGGTG 
AGCGACGGCA 
ATTTGCCCAC 
CAAGATAATQ 
GCAAAATAAA 
T6CCCTCACT 
CACTGACCAC 
GACAGCACTG 
TTTGGACTAT 
CAAGATGCTA 



ATGGATACTG 
TGCCTTGCCA 
AATGTGGATC 
TGCACOGTGC 
GTTTTAGATT 
GAGAGCCTAT 
GAAACAGCAA 
ATGGTTACAA 
TGAATCCAAC 
GTAAACCACA 
GTTCATTTGA 



AATTAGCAGA 
GGGCAATTCT 
CAAGAACTGA 
TAAOCATTGC 
CAGGAAGACT 
TTTACAAGAT 
AACAGGTATA 
ACACTTCCAA 
CAAAATGTCA 
TTGTACTTTT 
ATATTTCTCC 



ATCAGGATCC 
CAGGAAAAAT 
TGAGTTAATA 
ACACAGATTQ 
GAAAGAATAT 
GGTGCAACAA 
CTTCAAAAGA 
TGGACAGCCC 
AGTCOGTTCC 
TTTTACTTTG 
C 



AATTTTAGTG 
CAGATATTGA 
CAAAAAAAAA 
AACACCATTA 
GATGAGCCGT 
CTGGGCAAGG 
AATTATCCAC 
TCGACCTTAA 
GAAGGCATTT 
GCAACAAATA 



TTGGACAAAG 
TTATTGATGA 
TCOGGGAGAA 
TTGACAGOGA 
ATGTTTTGCT 
CAGAAGCCGC 
ATATTGGTCA 
CTATTTTCGA 
TCCACTAGTT 
TTTATACATA 



3660 
3720 
37B0 
3840 
3900 
3960 
4020 
40 80 
4140 
4200 



Seq ID NO: S72 Protein sequence 
Protein Accession #: AAC27076.1 



1 
I 

MLPVYQEVKP 
QGFWDKEVLR 
ENYDPMDSVA 
LSNMAMGKTT 
MAVLIILLPL 
NLRKKEISKI 
VRLTVTLPFP 
KASSTPTLQG 
QPWVFSGTLR 
ARVNLARAVY 
SQILXLKDGK 
SQQSSRPSLK 
NTAAQVAYVL 
ARSLLVFYVL 
LDFIQTLLQV 
FSHLSSSLQG 
IIVAFGSLIL 
KEAPWEYQKR 
KSSLISALFR 
HTDEELWNAL 
DEATANVDPR 
LLQNKBSLFY 
PETAL 



11 
I 

NPLQDANICS 
AENDAQKPSL 
LNTAYAYATV 
TGQIVNLLSN 
QSCFGKLFSS 
LRSSCLRGMN 
SA1ERVSEAI 
LSPTVRPGEL 
SNILFGKKYE 
QDADIYLLDD 
MVQKGTYTEF 
DGALESQDTE 
QDWWLSYWAN 
VNSSQTLHNK 
VGWSVAVAV 
LWTIRAYKAE 
AKTLDAGQVG 
PPPAWPHEGV 
LSEPEGKIWI 
QEVQLKETIE 
TDBLIQKKIR 
KMVQQLGKAE 



21 
I 

RVFFWWLNPL 
TRAIIKCYWK 
LTFCTLILAI 
DVNKFDQVTV 
LRSKTATFTD 
LASFFSASKI 
VSIRRIQTFL 
LAWGPVGAG 
KERYEKVIKA 
PLSAVDAEVS 
LKSGIDFGSL 
NVPVTLSEEN 
KQSMLNVTVN 
MFESILKAPV 
IPWIAIPLVP 
EROQELFDAH 
LALSYALTLM 
IIFDNVNFMY 
DKILTTBIGL 
DLPGKMDTEL 
EKPAHCTVLT 
AAALTETAKQ 



31 
t 

FKIGHKRRLE 
SYLVLGIFTL. 
LHHLYFYHVQ 
FLHFLWAGPL 
ARIRTMNEVI 
IVFVTFTTYV 
LLDEISQRNR 
KSSLLSAVLG 
CALKKDLQIiL 
RHLFELCIOQ 
LKKDNEESEQ 
RSEGKVGFQA 
GGGNVTEKLD 
LFFDRNPIGR 
LGIIFIFLRR 
QDLHSEAWFL 
GMFQWCVRQS 
SPGGPLVLKH 
HDLRKKMSII 
AESGSNFSVG 
IAHRLNTIID 
VYFKRNYPHI 



41 
I 

EDDMYSVIiPE 
IEESAKVIQP 
CAGMRLRVAM 
QAIAVTALLW 
TGIRIIKMYA 
LLGSVITASR 
QLPSDGKKMV 
ELAPSHGLVS 
EDGDLTVIGD 
ILHEKITILV 
PPVPGTPTLR 
YKNYFRAGAH 
LNWYLGIYSG 
ILNRFSKDIG 
YFLETSRDVK 
PLTTSRWFAV 
AEVENMMISV 
LTALIKSQEK 
PQEPVLPTGT 
QRQLVCLARA 
SDKIMVLDSG 
GHTDHMVTNT 



51 
I 

DRSQHLGEEL 
IFLGKIINYF 
CHMIYRKALR 
MEIGISCLAG 
WEKSFSNL1T 
VFVAVTLYGA 
HVQDFTAFWD 
VHGRIAYVSQ 
RGTTLSGGQK 
THQLQYLKAA 
NRTFSBSSVW 
WIVFIPLIL1. 
LTVATVLFGI 
HLDDLLPLTF 
RLESTTRSPV 
RLDAICAMFV 
ERVIEYTDLE 
VGIVGRTGAG 
MRKNLDPFKE 
ILRKNQILII 
RLKEYDEPYV 
SNGQPSTLTI 



Seq ID NO: 573 DNA sequence 

Nucleic Acid Accession fti Eos sequence 

Coding sequence: 1..1365 



1 
I 

ATGGAATCAA 
GGCATAAATG 
TTTGCCAAAT 
AGAAATCCTA 
GATGCTCTCA 
CTGTGGGACC 
AGGATAAACC 
TTGATTGTCA 
GCCAGCCGGC 
CTTGCCCGCC 
ATTGAAAATT 
AGCTTGGCCA 
AGAAACCAAC 
ATAGTTGCCA 
CAACTTTATT 
TGTAGAAAAC 
CTCTGCTTAC 
GTTCATGCAA 
ATCTCCTTTG 
TCAGTGAGCA 
GTCGCTCTGC 
GAAGAGTACT 
ATTGTAATTC 



11 
I 

TCTCTATGAT 
GTATCAAAGA 
CCTTGACCAT 
AGTTTGCTTC 
CAAAAACAAA 
TGAGACATCT 
AGTACCCAGA 
AAGGATTTAA 
AGGTTTATAT 
AGTTGAATTT 
TACCCCTACG 
CATTTTTTTT 
AGAGTGACTT 
TTACTTTGCT 
ACGGCACCAA 
AGCTTGGATT 
CGATGAGAAG 
ATATTGAAAA 
GCATAATGAG 
ATGCTTTAAA 
TCATAAGTAC 
ACAGATTTTA 
TGGATCTTTT 



21 
I 

GGGAAGCCCT 
TGCAAGGAAG 
TCGACTTATT 
TGAATTTTTT 
TATAATATTT 
GCTTGTGGGT 
ATCCAATGCT 
TGTTGTCTCA 
ATGCAGCAAC 
CATTCCCATT 
ACTCTTTACT 
CCTTTATTCC 
TTACAAAATT 
CTCCCTAGTA 
GTATAGGAGA 
ACTAAGTTTT 
GTCAGAGAGA 
CTCTTGGAAT 
CCTTGGCTTA 
CTGGAGAGAA 
TTTCCATGTT 
TACACCACCA 
GCAGCTTTGC 



31 
I 

AAGAGCCTTA 
GTCACTGTAG 
AGATGCGGCT 
CCTCATGTGG 
GTTGCTATAC 
AAAATCCTGA 
GAATATTTGG 
GCTTGGGCAC 
AATATTCAAG 
GACTTGGGAT 
CTCTGGAGAG 
TTTGTCAGAG 
CCTATAGAGA 
TACCTCGCAG 
TTTCCACCTT 
TTCTTCGCTA 
TATTTGTTTC 
GAGGAAGAAG 
CTTTCCCTCC 
TTCAGTTTTA 
TTAATTTATG 
AACTTTGTTC 
AGATACCCAG 



41 

1 

GTGAAACTTG 
GTGTGATTGG 
ATCATGTGGT 
TAGATGTCAC 
ACAGAGAACA 
TTGATGTGAG 
CTTCATTATT 
TTCAGTTAGG 
CGCGACAACA 
CCTTATCATC 
GGCCAGTGGT 
ATGTGATTCA 
TTGTGAATAA 
GTCTTCTGGC 
GGTTGGAAAC 
TGGTCCATGT 
TCAACATGGC 
TTTGGAGAAT 
TGGCAGTCAC 
TTCAGTCTAC 
GATGGAAACG 
TTGCTCTTGT 
ACTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



51 

i 

TTTACCTAAT 
AAGTGGAGAT 
CATAGGAAGT 
TCATCATGAA 
TTATACCTCC 
CAATAACATG 
CCCAGATTCT 
ACCTAAGGAT 
GGTTATTGAA 
AGCCAGAGAG 
GGTAGCTATA 
TCCATATGCT 
AACCTTACCT 
AG CTGCTT AT 
CTGGTTACAG 
TGCCTACAGC 
TTATCAGCAG 
TGAAATGTAT 
TTCTATCCCT 
ACTTGGATAT 
AGCTTTTGAG 
TTTGCCCTCA 



Seq ID NO: 574 Protein sequence 
Protein Accession #: Eos sequence 



1 
I 

MESISMMGSP 
RNPKFASEFF 
RINQYPESNA 
LARQLNFIPI 
RNQQSDFYKI 
CRKQLGLLSF 
ISFGIMSLGL 
EEYYRFYTPP 



11 
I 

KSLSETCLPN 
PHWDVTHHE 
EYLASLFPDS 
DLGSLSSARE 
PIEIVNKTLP 
FFAMVHV AY S 
LSLLAVTSIP 
NFVLALVLPS 



21 
I 

GINGIKDARK 
DALTKTNIIF 
LIVKGFNVVS 
IENLPLRLFT 
IVAITLLSLV 
LCLPMRRSER 
SVSNAUWRE 
IVILDLLQLC 



31 

1 

VTVGVIGSGD 
VAIHREHYTS 
AWALQLGPKD 
LHRGPWVAI 
YLAGIiLAAAY 
YLFLNMAYQQ 
FSFIQSTLGY 
RYPD 



41 
I 

FAKSLTIRLI 
LWDItRHLLVG 
ASRQVYICSN 
SLATFFFLYS 
QLYYGTKYRR 
VHANIENSWN 
VALLISTFHV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



51 

I 

RCGYHWIGS 
KILIDVSNNM 
NXQARQQVIB 
FVRDVTHPYA 
FPPWLBTWLQ 
EEEVWRIEMY 
LIYGWKRAFE 



60 
120 
180 
240 
300 
360 
420 
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Seq ID NO: 575 DNA sequence 

Nucleic Acid Accession ft: NMJ)01873.1 

Coding sequence: 3.. 1721 

1 11 21 31 41 51 

I I I I I I 

AAATGGOGTG CCCGTCTCTC OGCCGGCCCC CTGCCTCGCA GTGGTTTCTC CTGCAGCTCC 60 

CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCCGCG GCTTTGOCCG TCTCCTCTGG 120 

GTGGCCCCAG TGC6CGG6CT GACACTCATT CAGOCGGGGA AGGTGAGGOG AGTAGAGGCT 180 

GGT6CGGAAC TTGCCGCCCC CAGCAGCGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA 240 

AAGAGGCOGC CCGCGTAGGA AGGCACGGCC GGOGGCGGCG GAGCGCAGCG ATGGCCGGGC 300 

GAGGGGGCAG OGCGCTGCTG GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGG 360 

GCGCOGAAGC CCAGGAGCCC GGGGCGCCCG CGGCGGGCAT GAGGCGGCGC CGGCGGCTGC 420 

AGCAAGAGGA CGGCATCTCC TTCGAGTACC ACCGCTACCC OGAGCTGCGC GAGGCGCTOG 480 

TGTCCGTGTG GCTGCAGTGC ACCGCCATCA GCAGGATTTA CACGGTGGGG CGCAGCTTCG 540 

AGGGCCGGGA GCTCCTGGTC ATCGAGCTGT CCGACAACCC TGGCGTCCAT GAGCCTGGTG 600 

AGCCTGAATT TAAATACATT GGGAATATGC ATGGGAATGA GGCTGTTGGA OGAGAACTGC 660 

TCATTTTCTT GGCCCAGTAC CTATGCAACG AATAGCAGAA GGGGAACGAG ACAATTGTCA 720 

ACCTGATCCA CAGTACCCGC ATTCACATCA TGCCTTCCCT GAACCCAGAT GGCTTTGAGA 780 

AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGGTCGAAGC AATGCCCAGG 840 

GAATAGATCT GAACCGGAAC TTTCCAGACC TGGATAGGAT AGTGTACGTG AATGAGAAAG 900 

AAGGTGGTCC AAATAATCAT CTGTTGAAAA ATATGAAGAA AATTGTGGAT CAAAACACAA 960 

AGCTTGCTCC TGAGACCAAG GCTGTCATTC ATTGGATTAT GGATATTCCT TTTGTGCTTT 1020 

CTGCCAATCT CCATGGAGGA GACCTTGTGG CCAATTATCC ATATGATGAG ACGCGGAGTG 1080 

GTAGTGCTCA CGAATACAGC TCCTCCCCAG ATGACGCCAT TTTCCAAAGC TTGGCCCGGG 1140 

CATACTCTTC TTTCAACCCG GOCATGTCTG ACCCCAATOG GCCACCATGT CGCAAGAATG 1200 

ATGATGACAG CAGCTTTGTA GATGGAACCA CCAACGGTGG TGCTTGGTAC AGCGTACCTG 1260 

GAGGGATGCA AGACTTCAAT TACCTTAGCA GCAACTGTTT TGAGATCACC GTGGAGCTTA 1320 

GCTGTGAGAA GTTCCCACCT GAAGAGACTC TGAAGACCTA CTGGGAGGAT AACAAAAACT 1380 

CCCTCATTAG CTACCTTGAG CAGATACACC GAGGAGTTAA AGGATTTGTC CGAGACCTTC 1440 

AAGGTAAGCC AATTGCGAAT GCCACCATCT CCGTGGAAGG AATAGACCAC GATGTTACAT 1500 

CCGCAAAGGA TGGTGATTAC TGGAGATTGC TTATACCTGG AAACTATAAA CTTACAGCCT 1560 

CAGCTCCAGG CTATCTGGCA ATAACAAAGA AAGTGGCAGT TCCTTACAGC CCTGCTGCTG 1620 

GGGTTGATTT TGAACTGGAG TCATTTTCTG AAAGGAAAGA AGAGGAGAAG GAAGAATTGA 1680 

TGGAATGGTG GAAAATGATG TCAGAAACTT TAAATTTTTA AAAAGGCTTC TAGTTAGCTG 1740 

CTTTAAATCT ATCTATATAA TGTAGTATGA TGTAATGTGG TCTTTTTTTT AGATTTTGTG 1800 

CAGTTAATAC TTAACATTGA TTTATTTTTT AATCATTTAA ATATTAATCA ACTTTCCTTA 1860 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTGATA TATTTCATTC TCTTATATAG 1920 

TATTCATTTT CCTACCTATA TTACACAAAA AAGTATAGAA AAGATTTAAG TAATTTTGCC 1980 

ATCCTAGGCT TAAATGCAAT ATTCCTGGTA TTATTTACAA TGCAGAATTT TTTGAGTAAT 2040 

TCTAGCTTTC AAAAATTAGT GAAGTTCTTT TACTGTAATT GGTGACAATG TCACATAATG 2100 

AATGCTATTG AAAAGGTTAA CAGATACAGC TCGGAGTTGT GAGCACTCTA CTGCAAGACT 2160 

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGCTG ACTAACTATA AGCATGATCT 2220 

TGTTAATGCA TTTTTGATGG GAAGAAAAGG TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280 

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTAGTCCG 2340 

TTAACACTAC TTAAAAGTTT AGGGTTTTCT CTTGGTTGTA GAGTGGCCCA GAATTGCATT 2400 
CTGAATGAAT AAAGGTTAAA AAAAAATCCC CAGTGAAAAA AAA 

Seq ID NO: 576 Protein sequence 
Protein Accession ft: NP 001864.1 



1 
I 

MAGRGGSALL 
EALVSVHLQC 
RELLIFLAQY 
NAQGIDLNRN 
FVLSANLHGG 
RKNDDDSSFV 
NKNSLISYLE 
LTASAPGYLA 



11 

I 

ALCGALAACG 
TAISRIYTVG 
LCNEYQKGNE 
FPDLDRTVYV 
DLVANYPYDE 
DGTTNGGAWY 
QIHRGVKGPV 
ITKKVAVPYS 



21 
I 

WLLGAEAQEP 
RSFEGRBLLV 
TIVNLIHSTR 



SVPGGKQDFN 
RDLQGNPIAN 
PAAGVDFELE 



31 
I 

GAPAAGMRRR 
IBLSDNPGVH 
IHIMPSLNPD 
LLKNMKKXVD 
SSPDDAIFQS 
YLSSNCFBIT 
ATISVEGIDH 



41 
1 

RRLQQEDGIS 
EPGEPEFKYI 
GPEKAASQPG 
QNTKLAPBTK 
LARAYSSPNP 
VELSCEKFPP 
DVTSAKDGDY 



51 
I 

FEYHRYPELR 
GNMHGNEAVG 
BLKDWFVGRS 
AVIHWIMDIP 
AMSDPNRPPC 
EETLKTYWED 
WRLLIPGNYK 
SETLNP 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 577 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..933 



ATGTGCAGCA 
TTCGACAAGA 
TTCCCCTGTG 
GACTGTCCCG 
GCCCGCTACC 
AATAACTGTC 
GGGCAGGTGT 
ATCATCGGCA 
CACCAGCGGA 
CTGCTGTCCC 
AATAATGGCA 
CCACCCTCCT 
COGCCCTACT 
CGGTCGGGGA 
GACACCAGCC 



11 

I 

ATGGAGGGTG 
GTGATGAGAA 
CCAGCGGCAT 
ATGGCAGCGA 
ACTGCAAGAA 
AAGACAACAG 
TTGTQACTTC 
GCTCOGTCAT 
AGCGGAACAA 
GCCTGGTGGT 
TCCAGTATGT 
ACTCCGAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCCGGG 



21 
I 

CATCCCGGGC 
GGAGTGOCCC 
OCATTGCATC 
TGAAGAGAAC 
CGGCCTCTGT 
TGATGAGGAA 
AGAGAACCAA 
TTTTGTGCTG 
CCTCATGACG 
CCTGGACCAC 
GGCCAGCCAG 
CTTGCTGGAC 
GGAATCTCTG 
TGCCAGCTCC 
GCAGCCTGGC 



31 
I 

GCCTGGCAGT 
AAGGCTAAGT 
ATTGGTOGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTGTGTATT 
GTGGTGGCCC 
CTGCCCGTGC 
CCCCACCACT 
GCGGAGCAGA 
CAGAGGCCTG 
AACCAAGCGG 
CAGGCAGCCA 
CCCCAGGAGG 



41 
I 

GTGACGGGCT 
GGAAATGTGG 
TCCGGTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAGA 
ACCCCAGCA? 
TGCTGGCACT 
ACCGGCTGCA 
GCAACGTCAC 
ATGCGTCGGA 
OGTGGTATGA 
ACCTGCCCCC 
GCAGCCTCCT 
GCACTGCTGA 



51 
I 

GCCTGACTGC 
CCCGACCTTC 
TGGGTTTGAG 
TTGCTCCACC 
OGATGGACAG 
ACCCGGCAGT 
CACCTATGCC 
GGTCTTGCAC 
GCACCCTGTG 
CTACAACGTC 
AGTAGGCTCC 
CCTTCCTOCA 
CTACOGCTCC 
GAGOGTGGAA 
GCCCAGGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1174 



WO 03/042661 



TCTGAGCCCA GCCAGGGCAC TGAAGAAGTA TAA 

Seq ID NO: 578 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I S I I 

MCSNGRCIPG AWQCDGLPDC PDKSDEKECP KAKSKCGPTF FPCASGIHCI IGRFRCNGFE 60 

DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS 120 

GQVFVTSENQ LVYYPSITYA IIGSSVIFVL WALLALVLH HQRKRNNLMT LPVHRLQHPV 180 

LLSRLWLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240 

PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300 
SEPSQGTEEV 

Seq ID NO: 579 DNA sequence 
Nucleic Acid Accession §; AF179274.1 
Coding sequence : 1..1125 

1 11 21 31 41 51 

I 1 I I I I 

ATGGTGCTGT GGGAGTCCCC GOGGCAGTGC AGCAGCTGGA CACTTTGOGA GGGCTTTTGC 60 

TGGCTGCTGC TGCTGCCCGT CATGCTACTC ATCGTAGCCC GCCCGGTGAA GCTOGCTGCT 120 

TTCCCTACCT CCTTAAGTGA CTGCCAAACG CCCACCGGCT GGAATTGCTC TGGTTATGAT 180 

GACAGAGAAA ATGATCTCTT CCTCTGTGAC ACCAACACCT GTAAATTTGA TGGGGAATGT 240 

TTAAGAATTG GAGACACTGT GACTTGCGTC TGTCAGTTCA AGTGCAACAA TGACTATGTG 300 

CCTGTGTGTG GCTCCAATGG GGAGAGCTAC CAGAATGAGT GTTACCTGCG ACAGGCTGCA 360 

TGCAAACAGC AGAGTGAGAT ACTTGTGGTG TCAGAAGGAT CATGTGCCAC AGATGCAGGA 420 

TCAGGATCTG GAGATGGAGT CCATGAAGGC TCTGGAGAAA CTAGTCAAAA GGAGACATCC 480 

ACCTGTGATA TTTGCCAGTT TGGTGCAGAA TGTGACGAAG ATGCCGAGGA TGTCTGGTGT 540 

GTGTGTAATA TTGACTGTTC TCAAACCAAC TTCAATCCCC TCTGCGCTTC TGATGGGAAA 600 

TCTTATGATA ATGCATGCCA AATCAAAGAA GCATCGTGTC AGAAACAGGA GAAAATTGAA 660 

GTCATGTCTT TGGGTCGATG TCAAGATAAC ACAACTACAA CTACTAAGTC TGAAGATGGG 720 

CATTATGCAA GAACAGATTA TGCAGAGAAT GCTAACAAAT TAGAAGAAAG TGCCAGAGAA ■ 780 

CACCACATAC CTTGTCCGGA ACATTACAAT GGCTTCTGCA TGCATGGGAA GTGTGAGCAT 840 

TCTATCAATA TGCAGGAGCC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900 

GAAAAAAAGG ACTACAGTGT TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

TTAATCGCAG CTGTGATTGG AACAATTCAG ATTGCTGTCA TCTGTGTGGT GGTCCTCTGC 1020 

ATCACAAGGA AATGCCCCAG AAGCAACAGA ATTCACAGAC AGAAGCAAAA TACAGGGCAC 1080 
TACAGTTCAG ACAATACAAC AAGAGCGTCC ACGAGGTTAA TCTGA 

Seq ID NO: 580 Protein sequence 
Protein Accession fh NP_057276.2 

1 11 21 31 41 51 

1 I I I I I 

MVLWESPRQC SSWTLCEGPC WLLLLPVMLL IVARPVKLAA FPTSLSDCQT PTGWNCSGYD 60 

DRENDLFLCD TNTCKFDGEC LRIGDTVTCV CQFKCNNDYV PVCGSNGESY QNECYLRQAA 120 

CKQQSEILW SEGSCATDAG SGSGDGVHEG SGETSQKETS TCDICQFGAE CDEDAEDVWC 180 

VCNIDCSQTN FNPLCASDGK SYDNAOQIKE ASCQKQEKIB VMSLGROQDN TTTTTKSEDG 240 

HYARTDYAEN ANKLEESARE HHIPCPEHYN GFCMHGKCEH SINMQEPSCR CDAGYTGQHC 300 

EKKDYSVLYV VPGPVRFQYV LIAAVIGTIQ IAVICVWLC ITRKCPRSNR IHRQKQNTGH 360 
YSSDNTTRAS TRLI 

Seq ID NO: 581 DNA sequence 
Nucleic Acid Accession #: S78203.1 
Coding sequence: 1..2190 

1 11 21 31 41 51 

I I I I I I 

ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60 

GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180 

TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 

ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300 

GCAGCGATTG CTGACTCGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360 

TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420 

GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480 

AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540 

ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600 

ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660 

TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720 

ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780 

TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840 

CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900 

AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960 

TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020 

CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCOC GTTGTTTGAC 1080 

TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT AGAGATAAAA 1200 

ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260 

CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320 

GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380 

AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440 

GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 1500 



1175 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



ATGATGGTAA 
AACACTTTGC 
GAAGACTATG 
TGTAGAACAG 
TATCTGTTTG 
ATTCCAGCCA 
GOGGAGGTCA 
ATGAAATCTG 
CTTGTTGTGG 
CTCCTGCTGG 
ACAGAGGATA 
AAACTAGAGA 



AGGATACAGA 
ATAAAGATGT 
GTGTGTCTGC 
AAGATAAOAA 
TTATTACTAA 
AGAAAATGTC 
TGTTCTCTGT 
TGCTCCAGGC 
CACAGTTCAG 
TGATCTGCCT 
TGCGGGGTCC 
CCAAGAAGAC 



AAGCAAAACA 
CAACATCTCC 
TTATAGAACT 
CTTTTCTCTG 
TAACACCAAT 
CATTGCGTGG 
CACAGGTCTT 
AGCTTGGCTA 
TGGCCTGGTA 
GATCTTCTCC 
AGCAGATAAG 
AAAACTCTGA 



ACCAATGGGA 
CTGAGTACAG 
GTGCAAAGAG 
AATTTGGGTC 
CAGGGTCTTC 
CAGCTACCAC 
GAGTTTTCTT 
TTGACAATTG 
CAGTGGGCCG 
ATCATGGGCT 
CACATTCCTC 



Seq ID NO: 582 Protein sequence 
Protein Accession #: AAB343B8.1 



YGMKAVLILY 
YVLGHVIKSL 
TRYFSVFYLS 
IYNKPPPEGN 
RVLFLYIPLP 
PVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFSGLV 
KLBTKKTKL 



11 
I 

ETLPSPVSIE 
PLYFLHWNED 
GALPILGGQV 
INAGSLISTP 
IVAQVFKCIW 
MFWALLDQQG 
GINFSSLRKM 
VGNENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QltPQYALVTA 
QWAEFILPSC 



21 
I 

EVPPRPPSPP 
TSTSIYHAPS 
VHTVLSLIGL 
ITPMLRGDVQ 
FAISNRFKNR 
SRWTLQAIRM 
AVGMILACLA 
ESIKSFQKTP 
MMVKDTESKT 
CRTEDKNPSL 
GEVMFSVTGL 
LLLVICLIPS 



31 
I 

KKPSPTICGS 
SLCYPTPILG 
SLIALGTGGI 
CFGEDCYALA 
SGDIPKRQHW 
NRNLGFFVLQ 
PAVAAAVEIK 
HYSKLHLKTK 
TNGMTTVRFV 
NLGLLDPGAA 
EFSYSQAPSS 
IMGYYYVPVK 



TGACAACCGT 
ATACCTCTCT 
GAGAATACCC 
TTCTAGACTT 
AGGCCTGGAA 
AATATGCCCT 
ATTCTCAGGC 
CAGTTGGGAA 
AATTCATTTT 
ACTACTATGT 
ACATCCAGGG 



41 

1 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGD 
FGVPGLLMVI 
LDWAAEKYPK 
PDQMQVLNPP 
INEMAPAQSG 
SQDFHFHLKY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMRGPADK 



GAGGTTTGTT 
CAATGTTGGT 
TGCAGTGCAC 
TGGTGCAGCA 
GATTGAAGAC 
GGTTACAGCT 
TCCCTCTAGC 
TATCATCGTG 
GTTTTCCTGC 
TCCTGTAAAG 
GAACATGATC 



51 
I 

VNEFCERFSY 
FKTI IYLSLV 



ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD 
PQEVFLQVLN 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAWKIED 
LTIAVGNIIV 
HIPHIOGNMI 



Seq ID NO: 583 DNA sequence 

Nucleic Acid Accession #: NM_032642.1 

Coding sequence: 184.. 1263 " 



1 
I 

GACCATTAGC 
TAGTTTGAAC 
ACAGAGGGAA 
ACCATGCCCA 
CTGACAGACG 
TTTATCATCG 
AAGCTGTGCC 
ATCAAGGAAT 
GCATCTGTCT 
GTGAGCGCCG 
ACCTGCGGCT 
GGCTGTGGGG 
GAGCGAGAGA 
CAAAACAACG 
CACGGCGTCT 
AAGGTCGGGG 
AAGGGCCGGC 
TATGTGGACC 
CAGGGCCGCC 
GGGCGTGGCT 
TGGTGCTGCT 
TAGCCOGGAG 
ATAAATCTAT 
GAAAGATGAA 
GACTTTGCTG 
GAGGACTCTC 
GAGGGAGGTT 
GTTAGAGGAC 
ACTCAGCTTC 
AGGCCTGGTG 
GGAAOGGCTC 
GAGATCTGCA 
GGTTCACTAG 
CCCTAAACTG 
GACGTTATAC 
CTCCACGGAG 
GGCGTGCTCA 
TGGAAAAAAA 



11 
I 

AGGCACCCAG 
CTAGGAACTG 
CCCTACTCTG 
GCCTGCTGCT 
CCAACTCCTG 
GTGCCCAGCC 
AATTGTACCA 
GCCAGCACCA 
TTGGGAGAGT 
CGGGCGTGGT 
GCAGCCGGAC 
ACAACGTGGA 
AGAACTTTGC 
AGGCCGGTCG 
CGGGGTCCTG 
ACCGGCTGAA 
TGGAGCTGGT 
CCAGCCCCGA 
TCTGCAACAA 
ACAACCAGTT 
TCGTCAGGTG 
GGCCTGCTCC 
ATAAATCTAT 
AATGGAAAGG 
GTTCTCTCCT 
AGGATGTAGG 
GTGGTTGGAT 
TGCCTGTGAT 
AACCTCGATG 
CTCTCTTACT 
CTTCCCTAAA 
CCTCCCGGAC 
CTCCTACCTG 
AATGTTTGCG 
TGTCTTCCCC 
GGAGGCCTCA 
TCATCTCTGC 
AAAAAGAAAA 



21 
I 

GCCTGTC TTT 
CAGGACCAGA 
GAAACTGTCA 
GCTGTTCACG 
GTGGTCATTA 
CGTGTGCAGT 
GGAGCACATG 
GTTCCGGCAG 
CATGCAGATA 
CAACGCCATC 
GGCGOGGCCC 
GTACGGCTAC 
CAAAGGATCA 
CAGGGCTGTG 
CAGCCTCAAG 
GGAGAAGTAC 
CAACAGCCGC 
CTACTGCCTG 
GACCTCGGAG 
CAAGAGCGTG 
TAAGAAGTGC 
OGGCCCCCCC 
TTTATATTTG 
AAGAGCTTAT 
CTTGGTGGGT 
GACTTGGAAA 
GGAGGAGATG 
CCTGGCCACT 
TCTTCAGGGT 
CTTTCATCCA 
ATGAGAAGTC 
TTCAGGCCTG 
AAGAGGAAAG 
CCTGGGCTGC 
CACCCCCGGG 
CAAACCACAG 
CCCAGGTGTA 
AAAAAAAAAA 



1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 




Seq ID NO: S84 Protein sequence 
Protein Accession 8i NP_116031.1 

1 11 21 31 41 51 

I I I 1 I I 

MPSLLLLPTA ALLSSWAQLL TDANSWWSLA LNPVQRPEMP I IGAQPVCSQ LPGLSPGQRK 
LCQLYQEHMA YIGEGAKTGI KECQHQFRQR RHNCSTADNA SVPGRVMQIG SRETAPTHAV 



60 
120 



1176 



WO 03/042661 



PCT/US02/36810 



10 



40 



50 



SAAGWNAIS RACREGBLST CGCSRTARPK DLPRDWLWGG CGDNVEYGYR FAKEPVDARB 180 

REKNFAKGSE EQGRVLMNLQ NNEAGRRAVY KMADVACKCH GVSGSCSUCT CWLQLAEPRK 240 

VGDRLKEKYD SAAAMRVTRK GRLELVNSRF TQPTPEDLVY VDPSPDYCLR NESTGSLGTQ 300 
GRLCNKTSEG MDGCELMCOG RGYNQFKSVQ VERCHCKFHW CCFVRCKKCT EIVDQYICK 

Seq ID NO: 585 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1479 



I 11 21 31 41 SI 

II III! 

ATGGCTTTGA ACTCAGGGTC ACCACCAGCT ATTGGACCTT ACTATGAAAA CCATGGATAC 60 

CAACCGGAAA ACCCCTATCC CGCACAGCCC ACTGTGGTCC CCACTGTCTA CGAGGTGCAT 120 

CCGGCTCAOT ACTACCCGTC CCCCGTGCCC CAGTAOGCCC CGAGGGTCCT GACGCAGGCT 180 

ID TCCAACCCCG TOGTCTGCAC GCAGCCCAAA TCCCCATCCG GGACAGTGTG CACCTCAAAG 240 

ACTAAGAAAG CACTGTGCAT CACCTTGACC CTGGGGACCT TCCTCGTGGG AGCTGOGCTG 300 

GCCGCTGGCC TACTCTGGAA GTTCATCGGC AGCAAGTGCT CCAACTCTGG GATAGAGTGC 360 

GACTCCTCAG GTACCTGCAT CAACCCCTCT AACTGGTGTG ATGGCGTGTC ACACTGCCCC 420 

GGCGGGGAGG ACGAGAATCG GTGTGTTCGC CTCTACGGAC CAAACTTCAT CCTTCAGGTG 480 

JA) TACTCATCTC AGAGOAAGTC CTGGCACCCT GTGTGCCAAG AOGACTGGAA CGAGAACTAC 540 

GGGCGGGCGG CCTGCAGGGA CATGGGCTAT AAGAATAATT TTTACTCTAG CCAAGGAATA 600 

GTGGATGACA GCGGATCCAC CAGCTTTATG AAACTGAACA CAAGTGCCGG CAATGTCGAT 660 

ATCTATAAAA AACTGTACCA CAGTGATGCC TGTTCTTCAA AAGCAGTGGT TTCTTTACGC 720 

TGTATAGCCT GCGGGGTCAA CTTGAACTCA AGCCGCCAGA GCAGGATCGT GGGCGGCGAG 780 

ZD AGCGCGCTCC CGGGGGCCTG GCCCTGGCAG GTCAGCCTGC ACGTCCAGAA OGTCCACGTG 840 

TGCGGAGGCT CCATCATCAC CCCCGAGTGG ATCGTGACAG COGCCCACTG CGTGGAAAAA 900 

CCTCTTAACA ATCCATGGCA TTGGACGGCA TTTGCGGGGA TTTTGAGACA ATCTTTCATG 960 

TTCTATGGAG CCGGATACCA AGTAGAAAAA GTGATTTCTC ATCCAAATTA TGACTCCAAG 1020 

„ ACCAAGAACA ATGACATTGC GCTGATGAAG CTGCAGAAGC CTCTGACTTT CAACGACCTA 10 80 

D\J GTGAAACCAG TGTGTCTGCC CAACCCAGGC ATGATGCTGC AGCCAGAACA GCTCTGCTGG 1140 

ATTTCOGGGT GGGGGGCCAC CGAGGAGAAA GGGAAGACCT CAGAAGTGCT GAACGCTGCC 1200 

AAGGTGCTTC TCATTGAGAC ACAGAGATGC AACAGCAGAT ATGTCTATQA CAACCTGATC 1260 

ACACCAGCCA TGATCTGTGC CX5GCTTCCTG CAGGGGAAOG TCGATTCTTG CCAGGGTGAC 1320 

AGTGGAGGGC CTCTGGTCAC TTCGAAGAAC AATATCTGGT GGCTGATAGG GGATACAAGC 1380 

OJ TGGGGTTCTG GCTGTGCCAA AGCTTACAGA CCAGGAGTGT ACGGGAATGT GATGGTATTC 1440 
ACGGACTGGA TTTATCGACA AATGAGGGCA GACGGCTAA 



Seq ID NO: 586 Protein sequence 
Protein Accession #t Eos sequence 



I 11 21 31 41 51 

II till 

MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 60 

SNPWCTQPK SPSGTVCTSK TKKAIrCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120 

*IO DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQV YSSQRKSWHP VCQDDWNENY 180 

GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 240 

CIACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300 

PLNNPWHWTA FAGILRQSPM FYGAGYQVBK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360 

VXPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420 

TPAMICAGFL QGNVDSCQGD SGGPLVTSKN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480 
TDWIYRQMRA DG 



Seq ID NO: 587 DNA sequence 
Nucleic Acid Accession #t NM_005656.i 
•>-> Coding sequence: 57.. 1535 

} 11 21 31 41 51 

I I I I I I 

£fl GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

OU CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCOGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

OD CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

„ n GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

/U ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTAOGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

7 r GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

/J TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

nr\ CCGGGTGGGG GGCCACOGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

OU TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACO GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1S60 
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CGTTTTACAA 
ATGATTCAGA 
TGCCATACTG 
COGCAAGGGG 
GTTGGAGGCT 
TGAGCATGGA 
GGAAAGGGAG 
TCCCCAGCCT 
GATGGTGGCC 
AAGGGGAACA 
GAGGGAAGCA 
CATTGGGTGG 
TCCTAGCACC 
ATGTCGGCCT 
ATGCTCAGTT 
CTGAGTTCAA 



GAAAACAATG 
GGTCACTTCA 
TGCAGGCTGC 
TGATGGCOGG 
GCCCCCATTG 
GCTGTCACTT 
ACAGCCAGGT 
ACTTCACAAG 
AGAAATAAAG 
GAAACATTTT 
ATTGAAAAGG 
GGCTCCTGGG 
CTGGAGAGTG 
CTTCAGGCCT 
TAAGGTACAC 
AGCCATCTT 



GGGCTGGTTT 
TTTTTATTAA 
AGTGGCTCCC 
CTGGTTGTGG 
AGATCTTCCT 
CTCAGCTGCT 
GGCACCTGCA 
GGGATTTTGC 
GGACCAGCCC 
TGTTCTTATG 
AACTTGCCCT 
AGGGAGACTC 
AATGCCCCTT 
GATAGTCATT 
TGTTTCCATG 



TGCTTCCCCG 
ACAGTGAACT 
CTGCCCAGCC 
GCACTGGCGG 
GCTGAGTCCT 
GGATGACTTG 
GCGGCTGCCC 
TGATGGGTTC 
TTCATGGGTG 
GGGTGAGAAT 
GAGCACTCCT 
AGCCTTCCTC 
GGTCCCTGGC 
GGAAATTGAG 
TTATGTTTCT 



TGCATGATTT 
TGTCTGGCTT 
TGCTCTCCCT 
TCAATTGTGG 
TTCCAGGGGC 
AGATGAAAAA 
TCTGGGGCCA 
TTAGAGCCTT 
GTGACGTGGT 
ATAGACAGTG 
GGTGCAGGTC 
CTCATCCTCC 
AGGGCGCCAA 
GTCCATGGGG 
ACACATTGAT 



Seq ID NO: 588 Protein sequence 
Protein Accession ft: NP_005647.i 



1 

i 

MALNSGSPPA 
SNPWCTQPK 
DSSGTCINPS 
GRAACRDMGY 
CIACGVNLNS 
PLNNPWHWTA 
VKPVCLPNPG 
TPAMICAGPL 
TDWIYRQMKA 



11 
I 

IGPYYENHGY 
SPSGTVCTSK 
NWCDGVSHCP 
KNNFYSSQGI 
SRQSRIVGGE 
FAGILRQSFM 
MMLQPEQLCW 
GGNVDSCQGD 
KG 



21 
1 

QPENPYPAQP 
TKKALCITLT 
GGBDENRCVR 
VDDSGSTSPW 
SALPGAWPWQ 
FYGAGYQVQK 
ISGWGATEEX 
SGGPLVTSNN 



31 
I 

TWPTVYEVH 
LGTPLVGAAL 
LYGPNP1LQM 
KLNTSAGNVD 
VSLHVQNVHV 
VISHPNYDSK 
GKTSEVLNAA 
NIWWLIGDTS 



41 
I 

PAQYYPSPVP 
AAGLLWKFMG 



IYKKLYHSDA 
CGGSIITPEW 
TKNNDIALMK 
KVLLIETQRC 
WGSGCAKAYR 



ACTCTTAGAG 
TGGCACTCTC 
AACCCCTTGT 
AAGGAAGAGG 
CAATTTTGGA 
GGAGAGACAT 
CTTGGTAGTG 
AGCAGCCCTG 
AGTCACTTGT 
CCCTTGGTGC 
TCCACCTGCA 
CTGACCCTGC 
GTTTGGCACC 
GAAATCAAGG 
GGTGGTGACC 



51 
I 

QYAPRVLTQA 
SKCSNSGIEC 
VCQDDWNENY 
CSSKAWSLR 
IVTAAHCVEK 
LQKPI/TFNDL 
NSRYVYDNLI 
PGVYGNVMVF 



1620 
1680 
1740 
1800 
1860 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 589 DNA sequence 
Nucleic Acid Accession #: NM_001935.1 
Coding sequence : 1 . . 230 1 



51 



1 11 21 31 41 

1 I I . I I 1 

ATGAAGACAC CGTGGAAGAT TCTTCTGGGA CTGCTGGGTG CTGCTGCGCT TGTCACCATC 60 

ATCACCGTGC CCGTGGTTCT GCTGAACAAA GGCACAGATG ATGCTACAGC TGACAGTCGC 120 

AAAACTTACA CTCTAACTGA TTACTTAAAA AATACTTATA GACTGAAGTT ATACTCCTTA 180 

AGATGGATTT CAOATCATGA ATATCTCTAC AAACAAGAAA ATAATATCTT GGTATTCAAT 240 

GCTGAATATG GAAACAGCTC AGTTTTCTTG GAGAACAGTA CATTTGATGA GTTTGGACAT 300 

TCTATCAATG ATTATTCAAT ATCTCCTGAT GGGCAGTTTA TTCTCTTAGA ATACAACTAC 360 

GTGAAGCAAT GGAGGCATTC CTACACAGCT TCATATGACA TTTATGATTT AAATAAAAGG 420 

CAGCTGATTA CAGAAGAGAG GATTCCAAAC AACACACAGT GGGTCACATG GTCACCAGTG 480 

GGTCATAAAT TGGCATATGT TTGGAACAAT GACATTTATG TTAAAATTGA ACCAAATTTA 540 

CCAAGTTACA GAATCACATG GACGGGGAAA GAAGATATAA TATATAATGG AATAACTGAC 600 

TGGGTTTATG AAGAGGAAGT CTTCAGTGCC TACTCTGCTC TGTGGTGGTC TCCAAACGGC 660 

ACTTTTTTAG CATATGCCCA ATTTAACGAC ACAGAAGTCC CACTTATTGA ATACTCCTTC 720 

TACTCTGATG AGTCACTGCA GTACCCAAAG ACTGTACGGG TTCCATATCC AAAGGCAGGA 780 

GCTGTGAATC CAACTGTAAA GTTCTTTGTT GTAAATACAG ACTCTCTCAG CTCAGTCACC 840 

AATGCAACTT CCATACAAAT CACTGCTCCT GCTTCTATGT TGATAGGGGA TCACTACTTG 900 

TGTGATGTGA CATGGGCAAC ACAAGAAAGA ATTTCTTTGC AGTGGCTCAG GAGGATTCAG 960 

AACTATTCGG TCATGGATAT TTGTGACTAT GATGAATCCA GTGGAAGATG GAACTGCTTA 1020 

GTGGCACGGC AACACATTGA AATGAGTACT ACTGGCTGGG TTGGAAGATT TAGGCCTTCA 1080 

GAACCTCATT TTACCCTTGA TGGTAATAGC TTCTACAAGA TCATCAGCAA TGAAGAAGGT 1140 

TACAGACACA TTTGCTATTT CCAAATAGAT AAAAAAGACT GCACATTTAT TACAAAAGGC 1200 

ACCTGGGAAG TCATCGGGAT AGAAGCTCTA ACCAGTGATT ATCTATACTA CATTAGTAAT 1260 

GAATATAAAG GAATGCCAGG AGGAAGGAAT CTTTATAAAA TCCAACTTAG TGACTATACA 1320 

AAAGTGACAT GCCTCAGTTG TGAGCTGAAT CCGGAAAGGT GTCAGTACTA TTCTGTGTCA 1380 

TTCAGTAAAG AGGCGAAGTA TTATCAGCTG AGATGTTCCG GTCCTGGTCT GCCCCTCTAT 1440 

ACTCTACACA GCAGCGTGAA TGATAAAGGG CTGAGAGTCC TGGAAGACAA TTCAGCTTTG 1500 

GATAAAATGC TGCAGAATGT CCAGATGCCC TCCAAAAAAC TGGACTTCAT TATTTTGAAT 1560 

GAAACAAAAT TTTGGTATCA GATGATCTTG CCTCCTCATT TTGATAAATC CAAGAAATAT 1620 

CCTCTACTAT TAGATGTGTA TGCAGGCCCA TGTAGTCAAA AAGCAGACAC TGTCTTCAGA 1680 

CTGAACTGGG CCACTTACCT TGCAAGCACA GAAAACATTA TAGTAGCTAG CTTTGATGGC 1740 

AGAGGAAGTG GTTACCAAGG AGATAAGATC ATGCATGCAA TCAACAGAAG ACTGGGAACA 1800 

TTTGAAGTTG AAGATCAAAT TGAAGCAGCC AGACAATTTT CAAAAATGGG ATTTGTGGAC 1860 

AACAAACGAA TTGCAATTTG GGGCTGGTCA TATGGAGGGT ACGTAACCTC AATGGTCCTG 1920 

GGATCGGGAA GTGGCGTGTT CAAGTGTGGA ATAGCCGTGG CGCCTGTATC CCGGTGGGAG 1980 

TACTATGACT CAGTGTACAC AGAACGTTAC ATGGGTCTCC CAACTCCAGA AGACAACCTT 2040 

GACCATTACA GAAATTCAAC AGTCATGAGC AGAGCTGAAA ATTTTAAACA AGTTGAGTAC 2100 

CTCCTTATTC ATGGAACAGC AGATGATAAC GTTCACTTTC AGCAGTCAGC TCAGATCTCC 2160 

AAAGCCCTGG TCGATGTTGG AGTGGATTTC CAGGCAATGT GGTATACTGA TGAAGACCAT 2220 

GGAATAGCTA GCAGCACAGC ACACCAACAT ATATATACCC ACATGAGCCA CTTCATAAAA 2280 
CAATGTTTCT ctttacctta G 

Seq ID NO i S90 Protein sequence 
Protein Accession 8: NP_001926.l 

1 11 21 31 41 51 

I I I I ! I 

MKTPWKILLG LLGAAALVTI ITVPWLLNK GTDDATADSR KTYTLTDYLK NTYRLKLYSL 60 

1178 



WO 03/042661 



PCT/US02/36810 



RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTPDEFGH SIHDYSISPD GQPILLBYNY 120 

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL 180 

PSYRITWTGK EDI IYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSP 240 

YSDESLQYPK TVRVPYPKAG AVKPTVKPPV VNTDSLSSVT KATSIQITAP ASMLIGDHYL 300 

5 CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS 360 

EPHPTLD6NS FYKIISNEEG YRHICYFQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN 420 

EYKGMPGGRN LYKIQLSDYT KVTCLSCBLN PBRGQYYSVS FSKEAKYYQL RCSGPGLPLY 480 

TLHSSVNDKG LRVLEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 540 

PLLLDVYAGP CSQKADTVFR mWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 600 

10 PEVEDQIEAA RQFSKMGPVD NKRIAIWGWS YGGYVTSMVL GSGSGVPKCG IAVAPVSRWE 660 

YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 720 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 

Seq ID NO: 591 DMA sequence 
IS Nucleic Acid Accession 8: NM_01 6077.1 
Coding sequence: 128.. 667 

1 11 21 31 41 51 

OA I I I I 1 1 

ZD TCGCTTTGTG ATTCTTGATC CGGAACTTTG TCACCCAGGA ACCCCGGAAG AGGTAGCTCA 60 

CGCGATAGAA ACGTGTTCGC T7GCCCAGAA GAAGGGAAGG OGOGAGTGAG GAAAGGAGGT 120 

ACTGTAGATG CCCTCCAAAT CCTTGGTTAT GGAATATTTG GCTCATCCCA GTACACTCGO 180 

CTTGGCTGTT GGAGTTGCTT GTGGCATGTG CCTGGGCTGG AGCCTTOGAG TATGCTTTGG 240 

GATGCTCCCC AAAAGCAAGA CGAGCAAGAC ACACACAGAT ACTGAAAGTG AAGCAAGCAT 300 

25 CTTGGGAGAC AGCGGGGAGT ACAAGATGAT TCTTGTGGTT OGAAATGACT TAAAGATGGG 360 

AAAAGGGAAA GTGGCTGCCC AGTGCTCTCA TGCTGCTGTT TCAGCCTACA AGCAGATTCA 420 

AAGAAGAAAT CCTGAAATGC TCAAACAATG GGAATACTGT GGCCAGCCCA AGGTGGTGGT 480 

CAAAGCTCCT GATGAAGAAA CCCTGATTGC ATTATTGGCC CATGCAAAAA TGCTGGGACT 540 

GACTGTAAGT TTAATTCAAG ATGCTGGACG TACTCAGATT GCACCAGGCT CTCAAACTGT 600 

30 CCTAGGGATT GGGCCAGGAC CAGCAGACCT AATTGACAAA GTCACTGGTC ACCTAAAACT 660 

TTACTAGGTG GACTTTGATA TGACAACAAC CCCTCCATCA CAAOTGTTTG AAGCCTGTCA 720 

GATTCTAACA ACAAAAGCTG AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 780 
AAACCTATTC CCATGTTCTA AAAAAA 

35 Seq ID NO: 592 Protein sequence 
Protein Accession ft: NPJ)5716l.l 

1 11 21 31 41 51 

An I 1 I I I 1 

4U MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASI1/G 60 

DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVWKA 120 
PDSETLIALXj ahakmlgltv sliqdagrtq IAPGSQTVLG IGPGPADLID kvtghlkly 

Seq ID NO: 593 DNA sequence 
45 Nucleic Acid Accession #i FGENESH predicted 
Coding sequence : 1 . . 1896 

1 11 21 31 41 51 

cr\ I I I I I I 

OU atgcgcgccg tgccgctgcc cgccccgctc ctgccgctgc TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCC6CGCCAG CAGAGCGGAG TCOGTCTCCG OGCCGTGGCC CGAACCCGAG 120 

CGOGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

55 TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGOGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

. . GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

60 GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

65 CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

70 GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260 

ACCCATCCTG TCAGGTTGGC TOGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATOCTC ACCTTGACCT CTTCTACAAA 1440 

_ _ ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

75 GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

G0GAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GG ATGT CACT 1680 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

ori GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

80 TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT I860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



Seq ID NO: 594 Protein sequence 
Protein Accession ft: FGENESH predicted 
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1 11 21 31 41 51 

I I I I I ! 

MRAVPLPAPL LPLLLLALLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60 

AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120 

KKTRKYD1 IT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180 

VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHPVESW SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 420 

THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 

ICLPCCAVEH LREAKRSSVT VUVSFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600 
CERI/TGSHHF SSHSKSWSFL SPRQPLFLSR P 

Seq ID NO: 595 DNA sequence 

Nucleic Acid Accession ft: NMJJ21614.1 

Coding sequence: 1..1740 



1 11 21 31 41 51 

I I I I I I 

ATGAGCAGCT GCAGGTACAA CGGGGGCGTC ATGCGGCCGC TCAGCAACTT GAGCGCGTCC 60 

CGCCGGAACC TGCACGAGAT GGACTCAGAG GCGCAGCCCC TGCAGCCCCC CGCGTCTGTC 120 

GGAGGAGGTG GCGGCGCGTC CTCCCCGTCT GCAGCCGCTG CCGCCGCCGC CGCTGTTTCG 180 

TCCTCAGCCC CCGAGATCGT GGTGTCTAAG CCCGAGCACA ACAACTCCAA CAACCTGGCG 240 

CTCTATGGAA CCGGCGGCGG AGGCAGCACT GGAGGAGGCG GCGGCGGTGG CGGGAGCGGG 300 

CACGGCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 360 

CTGGGCCACC GGCGCGCCCT GTTCGAAAAG CGCAAGCGGC TCAGCGACTA CGCGCTCATC 420 

TTCGGCATGT TCGGCATCGT GGTCATGGTC ATCGAGACCG AGCTGTCGTG GGGCX5CCTAC 480 

GACAAGGCGT CGCTGTATTC CTTAGCTCTG AAATGCCTTA TCAGTCTCTC CACGATCATC 540 

CTGCTCGGTC TGATCATCGT OTACCACGCC AGGQAAATAC AGTTGTTCAT GGTGGACAAT 600 

GGAGCAGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 660 

ATACTGGTGT GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GACGGCCCGG 720 

^ - CTTGCCTTCT CCTATGCCCC ATCCACAACC ACCGCTGATG TGGATATTAT TTTATCTATA 780 

3 J CCAATGTTCT TAAGACTCTA TCTGATTGCC AGAGTCATGC TTTTACATAG CAAACTTTTC 840 

ACTGATGCCT CCTCTAGAAG CATTGGAGCA CTTAATAAGA TAAACTTCAA TACACGTTTT 900 

GTTATGAAGA CTTTAATGAC TATATGCCCA GGAACTGTAC TCTTGGTTTT TAGTATCTCA 960 

TTATGGATAA TTGCCGCATG GACTGTCCGA GCTTGTGAAA GGTACCATGA TCAACAGGAT 1020 

GTTACTAGCA ACTTCCTTGG AGCGATGTGG TTGATATCAA TAACTTTTCT CTCCATTGGT 1080 

TATGGTGACA TGGTACCTAA CACATACTGT GGAAAAGGAG TCTGCTTACT TACTGGAATT 1140 

ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGG CAAGGAAGCT AGAACTTACC 1200 

AAAGCAGAAA AACACGTGCA CAATTTCATG ATGGATACTC AGCTGACTAA AAGAGTAAAA 1260 

AATGCAGCTG CCAATGTACT CAGGGAAACA TGGCTAATTT ACAAAAATAC AAAGCTAGTG 1320 

AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAACGAA AATTCCTGCA AGCTATTCAT 1380 

CAATTAAGAA GTGTAAAAAT GGAGCAGAGG AAACTGAATG ACCAAGCAAA CACTTTGGTG 1440 

GACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATTT CTGACTTAAA CGAAAGGAGT 1500 

GAAGACTTCG AGAAGAGGAT TGTTAOCCTG GAAACAAAAC TAGAGACTTT GATTGGTAGC 1S60 

ATCCACGCCC TCCCTGGGCT CATAAGCCAG AOCATCAGGC AGCAGCAGAG AGATTTCATT 1620 

GAGGCTCAGA TGGAGAGCTA CGACAAGCAC GTCACTTACA ATGCTGAGCG GTCCCGGTCC 1680 
TCGTCCAGGA GGCGGCGGTC CTCTTCCACA GCACCACCAA CTTCATCAGA GAGTAGCTAG 



Seq ID NO: 596 Protein sequence 
Protein Accession #: NP_067627.l 

1 11 21 31 41 51 

I I I I I I 

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPDQPPASV GGGGGASSPS AAAAAAAAVS 60 
SSAPEIWSK PEHNNSNNLA LYGTGGGGST GGGGGGGGSG KGSSSGTKSS KKKNQNIGYK 120 
LGHRRALFEK RKRLSDYALI FGMFGIWMV IETELSWGAY DKASLYSLAL KCLISLSTII 180 
LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP GNYTFTWTAR 240 
LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRSIGA LNKINPNTRF 300 
VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAMW LISITFLSIG 360 
YGDMVPNTYC GKGVCLLTGI MGAGCTALW AWARKLELT KAEKHVHNFM MDTQLTKRVK 420 
NAAANVLRET WLIYKNTKLV KKIDHAKVRK HQRKFLQAIH QLRSVKMEQR KLNDQANTLV 480 
DLAKTQNIMY DMISDLNERS EDFEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 



Seq ID NO: 597 DNA sequence 
„ Nucleic Acid Accession ft: NMJ>16029.1 
/" Coding sequence: 228.. 1097 



1 11 21 31 41 51 

I I I I I 1 

CTGCGATCCC GCAGGGCAGC GACGOGACTC TGGTGCGGGC CGTCTTCTTC CCCCCGAGCT 60 

GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 

TGCTCCTGCT CTTGGTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTGGCAGGGA CGACGCCCAG AATGGGAGCT GACTGATATG GTGGTGTGGG 240 

TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGCTGGCTTA CCAGTTGTCT AAACTAGGAG 300 

TTTCTCTTGT GCTGTCAGCC AGAAGAGTGC ATGAGCTGGA AAGGGTGAAA AGAAGATGCC 360 

TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC CTGACCGACA 420 

CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 480 

TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTGCAT GGATACCAGC TTGGATGTCT 540 

ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 

CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTACTGT GAATAGCATC CTGGGTATCA 660 

1180 



WO 03/042661 
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10 



15 



20 



50 



60 



AGATTGCCAT GAATCTTGCA AA 

Seq ID NO: 598 Protein sequence 
Protein Accession ft: NP_0S7113.1 

1 11 21 

I i I 



CTAGCAAGCA 


TGCTCTCCGG 


GGTTTTTTTA 


720 


CAG6TATAAT 


AGTTTCTAAC 


ATTTGCCCAG 


780 


CCCTAGCTGQ 


AGAAGTCACA 


AAGACTATAG 


840 


CAACCAGTCG 


TTGTGTGCGG 


CTGATGTTAA 


900 


GGATCTCA6A 


ACAACCTTTC 


TTGTTAGTAA 


960 


CCTG6TG6AT 


AACCAACAAG 


ATGGGGAAGA 


1020 


ATGCAGACTC 


TTCTTATTTT 


AAAATCTTTA 


1080 


ACTTTTCAAG 


CCACTGGAGG 


GAGAAATGGA 


1140 


CTGAATAATC 


AAAGACTAAT 


TTGTGATTTT 


1200 


CATGGAATGA 


AATAAAAAAT 


AAATAATAAA 


1260 


31 


41 


51 




1 

DLTLLWAEWQ 


1 

GRRPEWELTD 


1 

MWWVTGASS 


60 


KRRCLENGNL 


KEKDILVLPL 


DLTDTGSHRA 


120 


SLDVYRKLIE 


LNYLGTVSLT 


KCVLPHMIER 


180 


RGPFNGLRTE 


LATYPG1IVS 


NICPGPVQSN 


240 


RLMLISMAND 


LKEVWISEQP 


FLLVTYLWQY 


300 


FKIFKTKHD 









25 Seq ID NO: 599 DNA sequence 

Nucleic Acid Accession ft: NM_000793.2 
Coding sequence: 401.. 1222 " 

™ 1 U 21 31 41 51 

30 I I I I I | 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 

c GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

J J CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AAGGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGCGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

A(>i CTATGACTCG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 540 

W TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGGGTCT GGAAGAGCTT 600 

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGG AGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 780 

CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

*0 GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 900 

AGATGGCTGG GCGATACOGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATOGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 1380 

r r CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

J J GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTAOGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

- AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

03 GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 

n GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

/U TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 

ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520 

TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 

ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640 

/J TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700 

AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 

ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 

AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880 

Sfi ACTATTATGA AAT CAT ACTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 

OU CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 

AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 

CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 

GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 

CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 
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ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAQT TGTTAACTTC 3300 

TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 

ACTGTGGAAC CCTAAAGCAQ TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 

CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 

TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3540 

AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTOGT TTATGTGTGC AGACAATTCT 3600 

CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 

TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 

GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 

TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 3840 

TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900 

AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960 

CGTTTCATGT GTTTTATAAG tf l"l T UTlX!LT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 

GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080 

AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATCTTTCT CCCATAGGCT GTGTTTACAT 4140 

GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 

AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 

TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 

ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380 

GTTTCATACA GCTATTAAAA TGTAACTGAC CTCCTTAOAG GCAGATTAGT AACTGTTCCT 4440 

ACTTTGTATA GCTAAGTGAC AGTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT 4500 

CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC TTGATTTTTG GTAGTATCAA 4560 

CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT 4620 

CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT 4680 

GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC 4740 

ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTCGG GAAAATTCAA 4800 

AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG 4860 

AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAGAGGAT GGATGAATCT 4920 

GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA 4980 

TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG 5040 

AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 5100 

TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC 5160 

TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT S220 

TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG 5280 

TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 5340 

AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG 5400 

GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 5460 

GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 

ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 

CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 

TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT 5700 

CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 

AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 

AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT 5880 

CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940 

AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 6000 

TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC 6060 

GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA 6120 

AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 

TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 6240 

GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 6300 

GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 6360 

TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 6420 
AAAAAAAAA 

Seq ID NO: 600 Protein sequence 
Protein Accession #: NPJJO0784.2 

1 11 21 31 41 51 

I I I I I I 

MGILSVDLLI TLQILPVFFS NCLFLALYDS VILLKHWLL LSRSKSTRGE HRRMLTSEGL 
RCVWKSFLLD AYKQVKLGED APHSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 
BRPLWNFGS ATUPPFTSQL PAFRKLVEEF SSVADFLLVY XDEAHPSDGW AIPGDSSLSF 
EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CIVQRQKIAY 
LGGKGPFSYN LQEVRHWLEK NFSKRUKKTR LAG 

Seq ID NO: 601 DNA sequence 
Nucleic Acid Accession #t NMJ>05233.1 
Coding sequence: 101.. 3052 , 



60 
120 
180 
240 



1 
I 

CCATGGATGG 
ATGGAGATAT 
CCTCCTCCTT 
CAATGAAGTC 
TCCATCACAT 
TTACCAGGTG 
CCCCAGGAAC 
TAGCATTCCA 
TGATGATGAT 
AGCTGATGAA 
GATTAGAGAA 
TGCTTGTGTT 



11 
I 

TAACTTCTCC 
GCTCCTCTCA 
CTCAGCTGCT 
AATCTACTGG 
GGGTGGGAAG 
TGCAATGTCA 
TCAGCTCAGA 
TTGGTTTTAG 
CATGGGGTGA 
AGTTTCACTC 
GTAGGTCCTG 
GCCTTGGTGT 



21 
I 

AGCAATCAGA 
CTGCCCTCTG 
CTGTTCTOGA 
ATTCAAAAAC 
AGATCAGTGG 
TGGACCACAG 
AGATTTATGT 
GAACTTGCAA 
AATTTCGAGA 
AAATGGATCT 
TCAACAAGAA 
CTGTGAGAGT 



31 
I 

GCGCTCCCCC 
CACCAGCAAC 
CAGCTTOGGG 
AATTCAAGGG 
TGTGGATGAA 
TCAAAACAAT 
GGAGCTCAAG 
GGAGACATTC 
GCATCAGTTT 
TGGGGACCGT 
GGGATTTTAT 
ATACTTCAAA 



41 
I 

TCACATCAGT 
ATGGATTGTC 
GAACTGATTC 
GAGCTGGGCT 
CATTACACAC 
TGGCTGAGAA 
TTCACTCTAC 
AACCTGTACT 
AGAAAGATTG 
ATTCTGAAGC 
TTGGCATTTC 
AAGTGCCCAT 



51 
I 

GGCATGCTTC 
AGCTCTCCAT 
OGCAGCCTTC 
GGATCTCTTA 
CCATCAGGAC 
CAAACTGGGT 
GAGACTGCAA 
ACATGGAGTC 
ACACCATTGC 
TCAACACTGA 
AAGATGTTGG 
TTACAGTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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GAATCTGGCT ATGTTTCCAG ACACGGTACC 
AGGGTCTTGT GTCAACAATT CTAAGGAGGA 
AGGCGAATGG CTTGTACCCA TTGGCAAQTG 
TTTTATGTGC CAAGCTTGTC GACCAGGTTT 
TGCTAAGTGC CCGCCTCACA GTTCTACTCA 
GAATAATTAC TTCCGGGCAG ACAAAGACCC 
TTCACCAAGA AATGTTATCT CTAATATAAA 
GCCCCTGGAC ACAGGAGGCC GGAAAGATGT 
GTGGAATATA AAACAGTGTG AGCCATGCAG 
TGGACTCACC AACACCACGG TGACAGTGAC 
TGAGATTGAT GCCGTTAATG GGGTGTCAGA 
GGTCAGCATC ACAACTAATC AGGCTGCTCC 
GACCTCCAGA AATAGCATCT CTTTGTCCTG 
ATTGGACTAC GAGGTCAAAT ACTATGAAAA 
GAGGGCAAGA GGCACAAATG TTACCATCAG 
CCAAATCCGA GCCCGAACAG CCGCTGGATA 
AACTAGTCCA GACTCTTTCT CCATCTCTGG 
TTCAGCGGCA GTAGCAATTA TTCTCCTCAC 
CTGTGGCTAT AAGTCAAAAC ATGGGGCAGA 
TTTAAAACTT CCAGGTCTCA GGACTTATGT 
AGCTGTTCAT GAGTTTGCCA AGGAATTGGA 
TGGAGCAGGT GAATTTGGAG AGGTGTGCAG 
GATTTCAGTG GCCATTAAAA CCCTGAAAGT 
CCTGGGAGAA GCAAGCATTA TGGGACAGTT 
AGTTGTTACC AAAAGTAAGC CAGTTATGAT 
GGATAGTTTC CTACGTAAAC ACGATGCCCA 
TCGAGGGATA GCATCTGGCA TGAAGTACCT 
CGCTGCTCGG AACATCTTGA TCAACAGTAA 
TTCGCGTGTC CTGGAGGATG ACCCAGAAGC 
AATCAGGTGG ACATCACCAG AAGCTATAGC 
ATGGAGTTAT GGGATTGTTC TCTGGGAGGT 
GATGTCCAAT CAGGATGTAA TTAAAQCTQT 
GGACTGCCCA GCTGCCTTGT ATCAGCTGAT 
CAGACCCAAG TTTGAGCAGA TTGTTAGTAT 
CCTGAAGATC ATCACCAGTG CAGCCGCAAG 
TGTGGATATC TCTACCTTCC GCACAACAGG 
CTGCAAGGAA ATCTTCACGG GCGTGGAGTA 
CACAGATGAC ATGAAAAAGG TTGGTGTCAC 
TAGCATTAAA GCTCTAGAAA CGCAATCAAA 
GGAAGTGCTT CTGGACGGAA GTGGTGGCTG 
CAATAATTCT GGAGATACTG GTGGAAGTT 



CATGGACTCC CAGTCCCTGG TGGAGGTTAG 
AGATCCTCCA AGGATGTACT GCAGTACAGA 
TTCCTGCAAT GCTGGCTATG AAGAAAGAGG 
CTACAAGGCA TTGGATGGTA ATATGAAGTG 
GGAAGATGGT TCAATGAACT GCAGGTGTGA 
TCCATCCATG GCTTGTACCC GACCTCCATC 
CGAGACCTCA GTTATCCTGG ACTGGAGTTG 
TACCTTCAAC ATCATATGTA AAAAATGTGG 
CCCAAATGTC CGCTTCCTCC CTCGACAGTT 
AGACCTTCTG GCACATACTA ACTACACCTT 
GCTGAGCTCC CCACCAAGAC AGTTTGCTGC 
ATCACCTGTC CTGACGATTA AGAAAGATCG 
GCAAGAACCT GAACATCGTA ATGGGATCAT 
GCAGGAACAA GAAACAAGTT ATACCATTCT 
TAGCCTCAAG CCTGACACTA TATACGTATT 
TGGGACGAAC AGCCGCAAGT TTGAGTTTGA 
TGAAAGTAGC CAAGTGGTCA TGATCGCCAT 
TGTTGTCATC TATGTTTTGA TTGGGAGGTT 
TGAAAAAAGA CTTCATTTTG GCAATGGGCA 
TGACCCACAT ACATATGAAG ACCCTACCCA 
TGCCACCAAC ATATCCATTG ATAAAGTTGT 
TGGTCGCTTA AAACTTCCTT CAAAAAAAGA 
TGGCTACACA GAAAAGCAGA GGAGAGACTT 
TGACCACCCC AATATCATTC GACTGGAAGG 
TGTCACAGAA TACATGGAGA ATGGTTCCTT 
GTTTACTGTC ATTCAGCTAG TGGGGATGCT 
GTCAGACATG GGCTATGTTC ACCGAGACCT 
CTTGGTGTGT AAGGTTTCTG ATTTCGGACT 
TGCTTATACA ACAAGAGGAG GGAAGATCCC 
CTACCGCAAG TTCACGTCAG CCAGCGATGT 
GATGTCTTAT GGAGAGAGAC CATACTGGGA 
AGATGAGGGC TATCGACTGC CACCCCCCAT 
GCTGGACTGC TGGCAGAAAG ACAGGAACAA 
TCTGGACAAG CTTATCCGGA ATCCCGGCAG 
GCCATCAAAC CTTCTTCTGG ACCAAAGCAA 
TGACTGGCTT AATGGTGTCC GGACAGCACA 
CAGTTCTTGT GACACAATAG CCAAGATTTC 
CGTGGTTGGG CCACAGAAGA AGATCATCAG 
GAATGGCCCA GTTCCCGTGT AAAGCACGAC 
TGGAAGGCGT CAAGTCATCC TGCAGACAGA 



Seq ID NO: 602 Protein sequence 
Protein Accession #: NP 005224.1 



1 
I 

MDOQLSILLL 
HYTPIRTYQV 
NLYYMESDDD 
LAFQDVGACV 
RMVCSTBGEW 
SMNCRCENNY 
IICKKCGWNI 
PPRQFAAVSI 
ETSYTILRAR 
QWMIAISAA 
TYEDPTQAVH 
EKQRRDFLGE 
IQLVGMLRGI 
TRGGKIPIRW 
YRLPPPMDCP 
LLLDQSNVDI 
PQKKIISSIK 



11 

i 

LSC9VLDSFG 
CNVMDHSQNN 
HGVKFREHQF 
ALVSVRVYFK 
LVPIGKCSCN 
PRADKDPPSM 
KQCEPCSPNV 
TTNQAAPSPV 
GTNVTISSLK 
VAIILLTWI 
EFAKELDATN 
ASIMGQFDHP 
ASGMKYLSDM 
TSPEAIAYRK 
AALYQLMLDC 
STPRTTGDWL 
ALETQSKNGP 



21 
I 

ELIPQPSNEV 
WLRTNWVPRN 
TKIDTIAADE 
KCPFTVKNIiA 
AGYEERGFMC 
ACTRPPSSPR 
RFLPRQFGLT 
LTIKKDRTSR 
PDTIYVPQIR 
YVLIGRFCGY 
ISIDKWGAG 
KI1RLEGWT 
GYVHRDLAAR 
PTSASDVWSY 



31 
I 

NUuDSKTIQG 
SAQKIYVELK 
SFTQMDLGDR 
MPPDTVPMDS 
QACRPGFYKA 
NVISNINETS 
NTTVTVTDIJj 
NSISLSWQEP 
ARTAAGYGTN 



KSKPVMIVTE 
NILIKSNLVC 
GIVLWEVMSY 
FEQIVSILDK 
IFTGVEYSSC 



41 
I 

ELGWISYPSH 
FTLRDCNSIP 
ILKLNTEIRE 
QSLVEVRGSC 
LDGNMKCAKC 
VILDWSWPLD 
AHTNYTPEID 
EHPNGIILDY 
SRKFEPETSP 
LHFGKGHLKL 
KLPSKKEISV 
YMENGSLDSF 
KVSOFGI^RV 
GERPYWEMSN 
LIRNPGSLKI 
DTIAKISTDD 



51 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



LVLGTCKETF 
VGPVNKKGFY 
VNNSKEEDPP 
PPHSSTQEDG 
TGGRKDVTFN 
AVNGVSELSS 
EVKYYEKQEQ 
DSFSISGESS 
PGLRTYVDPH 
AIKTLKVGYT 
LRKHDAQFTV 
LEDDPEAAYT 
QDVIKAVDBG 
ITSAAARPSN 
MKKVGVTWG 



VPV 



Seq ID NO: 603 DNA sequence 

Nucleic Acid Accession #: NMJ>05727.1 

Coding sequence : 122 . . 847 



1 
I 

GCCAGGCGTC 
TGGAGCCTCA 
CATGCAGTGC 
GTGTGGTGCA 
GAAGATCTTC 
CATCGCAGCC 
TGAGAGCAAG 
GGTTGCAGCT 
GCTGGTAGTG 
GAACACCACC 
CTCACCCTAC 
CAACACAGCC 
CTTCAATCAG 



11 
I 

CCTCTGCCTG 
GCAGTTCCCT 
TTCAGCTTCA 
GCCCTGTTGG 
GGGCCACTGT 
GGCGTTGTGG 
TGTGCCCTCG 
GCTGTGGTCG 
CCTGCCATCA 
ATGAAAGGGC 
TTCAAAGAGA 
AATGAAACCT 
CTTTTGTATG 



21 
I 

CCCACTCAGT 
CTTTCAGAAC 
TTAAGACCAT 
CAGTGGGCAT 
CGTCCAGTGC 
TCTTTGCTCT 
TGACGTTCTT 
CCTTGGTGTA 
AGAAAGATTA 
TCAAGTGCTG 
ACAGTGCCTT 
GCACCAAGCA 
ACATCCGAAC 



31 

I 

GGGAACACCC 
TCACTGCCAA 
GATGATCCTC 
CTGGGTGTCA 
CATGCAGTTT 
TGGTTTCCTG 
CTTCATCCTC 
CACCACAATG 
TGGTTCCCAG 
TGGCTTCACC 
TCCCCCATTC 
AAAGGCTCAC 
TAATGCAGTC 



41 
I 

GGGAGCTGTT 
GAGCCCTGAA 
TTCAATTTGC 
ATCGATGGGG 
GTCAACGTGG 
GGCTGCTATG 
CTCCTCATCT 
GCTGAGCACT 
GAAGACTTCA 
AACTATACGG 
TGTTGCAATG 
GACCAAAAAG 
ACCGTGGGTG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
640 
900 
960 



51 
I 

TTGTCCTTTG 
CAGGAGCCAC 
TCATCTTTCT 
CATCCTTTCT 
GCTACTTCCT 
GTGCTAAGAC 
TCATTGCTGA 
TCCTGAOGTT 
CTCAAGTGTG 
ATTTTGAGGA 
ACAACGTCAC 
TAGAGGGTTG 
GTGTGGCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TGGAATTGGG 
ACAATAAGTC 
CCCTGGCAAG 
AATGGACCTG 
ATGCCTGACT 
GTAGCCAGTT 
TAGTGGTGAT 
AAGTGAAATC 
TGTTACAATG 



GGCCTCGAGC 
CACTTCTGCC 
CAGCAGTGAT 
CCCTTTCTGC 
TTCCTTCCAT 
CTGTTGCCCA 
CCCAGTGCTC 
AGCAGAGCCT 
TTAAAAAA 



TGGCTGCCAT 
TCTGCCACTA 
TGGGGGAGGG 
TCCAGACTTG 
TGGTQGOTGG 
TTCCOCCAGT 
TACTGGGGGA 
CTGGGTGGAT 



GATTGTGTCC 
CTGCTGCCAC 
GACAGGATCT 
GGGCTAGATA 
ATGGGTGGGG 
CTATTAAACC 
TGAGAGAAAG 
GTGTAGAAGG 



ATGTATCTGT 
ATGGGAACTG 
AACAATGTCA 
GGGACCACTC 
GGCATTCCAG 
CTTQATATGC 
GCATTTTATA 
CACTTCAAAA 



ACTGCAATCT 840 

TGAAGAGGCA 900 

CTTGGGCCAG 960 

CTTTTAGGOG 1020 

AGCCTCTAAO 1080 

CCCCTAGGCC 1140 

QCCTGGGCAT 1200 

TGCATAAACC 1260 



Seq ID NO: 604 Protein sequence 
Protein Accession fh NP_005718.1 

11 



SI 



1 11 21 31 41 

I 1 I I I I 

MQCFSFIKTM MILFNLLIFIi CGAALLAVGI WVSIDGASFL KIFGPLSSSA MQFVNVGYPL 
IAAGVWFAL GFLGCYGAKT ESKCALVTPP FILLLIFIAE VAAAWALVY TTMAEHFLTL 
LWPAIKKDY GSQEDFTQVW NTTMKGLKCC GFTNYTDFED SPYFKENSAF PPPCCNDNVT 
NTANBTCTBQ KAHDQKVEGC FNQLLYDIRT NAVTVGGVAA GIGGLE1AAM IVSMYLYCNL 
Q 

Seq ID NO: 60S DNA sequence 

Nucleic Acid Accession ft: NM_000729.2 

Coding sequence; 



1 
I 

GGCTCAGCTG 
AGCCATGAAC 
GACGCAGCCG 
GCCCCGTAGG 
CGCCCTGCTG 
CGTTAAGAAC 
CTGGATGGAT 
GCCGCCATCA 
TCACACTCAT 
TCAATGTGAA 
TGTGCAAACT 
TATGCTATTA 



11 
I 

CCGGGCTGCT 
AGOGGCGTGT 
GTGCCTCCCG 
CAGCTGAGG6 
GCAAGATACA 
CTGCAGAACC 
TTTGGCCGTC 
GCCCAAOGGA 
AACTCATTGT 
AATTGTGTCT 
GAAGACAAAA 
AAGTGATTTC 



21 
I 

CCGGTTGGAA 
GCCTGTGCGT 
CAGATCCCGC 
TATOGCAGAG 
TCCAGCAGGC 
TGGACCCCAG 
GCAGTGCCGA 
AGCAACCTCC 
CTGTGGAGTT 
GTAAGATTGT 
CTGTTTTCTT 
ATTCTGCC 



31 
I 

ACGCCAAGCC 
GCTGATGGCG 
GGGCTCCGGG 
AACGGATGGC 
CCGGAAAGCT 
CCACAGGATA 
GGAGTATGAG 
CAACCCAGAG 
TGACATTGAA 
CCAGTGCAAC 
CATCTGTGAC 



41 

I 

AGCTGCCGTC 
GTACTGGCGG 
CTGCAGCGGG 
GAGTCCCGAG 
CCTTCTGGAC 
AGTGACCGGG 
TACCCCTCCT 
GAGGCAGAAT 
TGTATCTATT 
CACACACGCT 
TCCTGTTCTG 



51 
I 

CTAATCCAAA 
CTGGCGCCCT 
CAGAGGAGGC 
CGCACCTGGG 
GAATGTCCAT 
ACTACATGGG 
AGAGGACCCA 
AAGACAACAA 
TATTAAGTTC 
CACCAGAAGT 
AAAATGTTGT 



Seq ID NO: 606 Protein sequence 
Protein Accession #: NP_000720.1 

21 



51 



1 11 21 31 41 

1 1 I I I 

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGLQRAEEAP RRQLRVSQRT DGESRAHLGA 
LLARYIQQAR KAPSGRMSIV KNLQNLDPSH RISDRDYMGW MDFGRRSAEE YEYPS 

Seq ID NO: 607 DNA sequence 

Nucleic Acid Accession #: NM_001423.1 

Coding sequence: 219.. 692 



AGCACTCTCC 
ATAACCTCGG 
CAGGGCTGCT 
CTCTTTGCTC 
TCTTTGTGGT 
GGTTGGTTTC 
GCTGCAGTGA 
TGATTCTCTC 
CCATGGAGAA 
GCATTCTTGT 
ATCACCACGG 
GCGTTCTCTA 
GGTGGGGAGG 
COQAGATAGG 
TGAGGGGATT 
GATGCTCCCT 
GTTCTTAGTG 
TGGGTCATGC 
TGGCAAGAGC 
TCTGACCCAA 
GTGAGCCATT 
GGAGGAGGAA 
CTTTTCACAA 
GTCCCTCATG 
GCTCTTCTGG 
AACTATGGGA 
TTCCCCTCAA 
TCATTGTAGC 
ATCAATATTG 
GAGTGATCAC 



11 
I 

AGCCTCTCAC 
GAGGCGGGTC 
GCCAGCACCT 
ACAAGTTACC 
CCACATCGCT 
CAATACGGTA 
CAGCCTGTCA 
TATCATCTTC 
GGGAAACCGG 
GGGGGTGTCC 
CTATTCCTAC 
TCTGGTCCTG 
AGGAAGCCGT 
GGAGGGGGGA 
CTCTACTGCC 
TGATGGGGTC 
ACACACACTG 
ACTGAGGTCC 
AGATACTGTC 
AGCAAAACAT 
GTCCCTCTTT 
AGAGAGTTTG 
TGGAAAACCT 
GAGACCTCAT 
AGTTTCTCTA 
TCCAAGGGGC 
TGCAGTCAGT 
ACAATCCTAT 
AGGATCAGGG 
CTCTTGGGGA 



21 
I 

CGCAAAATTA 
CTTCCCCTCA 
GCCACTCAGA 
AAAAAAAAAA 
ACTGTTATTA 
GATGCATCAG 
TATGCCAGTG 
TGTGTCATTG 
TTCTTCCTCT 
ATCTACACTA 
ATCCTGGGCT 
AGAAAGAAAT 
TGAATCTGGG 
GGGGGAAGCA 
AAGCCCCTGC 
CAGAGAGCCT 
TCTGGGGCCC 
ACAGACCTAC 
TTTGTGCTGA 
CACATTCCAG 
GGAACAGATA 
TAAGGTCATG 
GGGGGATGGT 
GCCATGGTCT 
AAGTCACTAG 
AGTCTTGCAA 
GTTTCTTTTA 
TACTCTTCCT 
CTCCTAGGCT 
CCCTGCCTAT 



31 
I 

CACACCCCAG 
GTGCGGTCAC 
GCGCCTCTGT 
GAGCCAACAT 
TGCTATTTGT 
TAGGTCTTTG 
AAGATGCCCT 
CCCTCCTGGT 
CAGGGGCCAC 
GTCATTATGC 
GGATCTGCTT 
AAGGCCGGAC 
AGGGAAGTGG 
AAGGGGGGAG 
CCTGGGGAGA 
CCCTGCAGCC 
CATCAGCTGC 
TGCACTGAGT 
ATACGCTAAG 
TCTGAAGTGC 
TTTAGCTCTG 
CTGGTGGGTT 
CAGAGCCCAG 
TTGCTAGGCC 
TGAACAATTC 
CAGTGCCATG 
AGTATACAAC 
CTAACATTTT 
CAGTGGTAGC 
CCCACTTCAC 



41 
I 

TACACCAGCA 
ATACTTCCAG 
CGCTGGGACC 
GTTGGTATTG 
TAQCACCATT 
GAAAAACTGT 
CAAGACAGTG 
CTTCGTGTTC 
CACACTGGTG 
GAATOGTGAT 
CTGCTTCAGC 
GAGTTCATQG 
AGGTTGCTGT 
GTCAAATCOC 
AAGTAGTTGG 
ACCAGACTTG 
CACAACACCA 
TAAAATAGCG 
CCTGGAAGCC 
CTACTGGGGG 
TGGAATTCAG 
AGCTAAACCA 
TCGAGACCTC 
TCTTGCTGAA 
GGTGGTAAAA 
TTAGGGTTAT 
AGGAGAGAGA 
TGAGGAAGTT 
TCTGGCTTAG 
AGGTGAGGCA 



51 
I 

GAGGAAACTT 
AAGAGCGGAC 
CTTCAGAACT 
CTGGCTGGTA 
GCCAATGTCT 
ACCAACATTA 
CAGGCCTTCA 
CAGCTCTTCA 
TGCTGGCTGT 
GGAACGCAGT 
TTCATCATCG 
GGATCTGGGG 
ACAGGAAAAA 
AAACCATTAC 
CTAGTACTTT 
GCCTCCAGCT 
GCCCCACTTC 
GTACAAGTTC 
ATCCTGCCCT 
GCTTTGGCCT 
TGACAAAATG 
AGAAGGAGAC 
ACACACGGCT 
AGCCAAGGCA 
GTACCACACA 
GTTTTTAGGA 
TGGACATGGC 
TTGTCTAATT 
ACACCACCTG 
TGGCAATTCT 



60 
120 
180 
240 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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GGAAGCTGAT 
. CTATATAATT 
GCATTCCCAG 
TCTTTTCTTT 
CTGTCTGCTG 
TAAGCAGGCA 
GGG GTTTTTT 
TGAGACATCT 
TCCTTTTTGG 
ATTCTATTTC 
CTAGGCTGAG 
TGATTTCATT 
CAGACATATC 
ACCATGTATT 
ATTACTCTGG 
ATACTATAAT 
TTCTGATTCC 



TAAAACACAC 
GTGAAGTATT 
GAAAATACGA 
AAAATAAAAA 
COGCAGGAGC 
TTGCTTTGCC 
GGGAAGACGT 
TGCCTACTTT 
GGAGTTGTTA 
TCTATGTTTA 
GTTAGAGAGA 
ATCACATGAT 
CAAAGGGAAT 
CCCTTATCTT 
TGGATTGTTC 
TGTAAATATT 
CTTCAAAAAA 



ATAAACCAAA 
AAGCCTACCG 
AAATCCCATG 
AGCAAAAACT 
ACCTCTATAC 
CTGGAGCAGC 
TT T C T TT A TC 
TCTTTATTAG 
TGCCATGATT 
TTCTAGTTAA 
TTGGCCAGCA 
TATAGAAGGC 
ACTCACATTT 
TTACTTTTTT 
TAGTACTGTA 
TTGATACAAA 
AAAAAA 



ACCAAACAAC 
TATTTCAGCC 
AGATAAATAA 
CTTGTGGTAC 
AGGACTTAGA 
TATTTTAAGC 
GCCCTGAGAA 
CTTTCTCCTC 
TTTGGTATTT 
GGAAATGTTG 
AAAACTGTGG 
TGTCTTAGTG 
TGTTAAGAAG 
TCTGTGACAT 
TTGGGCTTCT 
TGTTTATAAC 



AGGCCCTTGG 
ATGATAAGAA 
AAATATAGGT 
CTAGTCAGAT 
AGTAGTATGT 
CATCTCAGAT 
GATCTACCCC 
ATCCATTTCT 
ATGTAAAAGG 
AGGGCAAGCC 
GAAGATGAAC 
CAAAAAACAT 
TTGAACTATG 
TTATGTCTCA 
TCGTTAATAG 
TCTAGGGATA 



GTGAAAGGTG 
CAGAGTGCCT 
GATGGGCAGA 
GGTAGACGAG 
TATTCCTGGT 
TCTGTCTAAA 
AGGGAGAATC 
TTTATACCTT 
ATTATTACTA 
ACCAAATTAC 
TTTGTCA7TA 
ACTTACATTT 
ACTGGAGTAA 
TGTAATTTGC 
ATTATTTCAT 
TAAAAACAGA 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID NO: 608 Protein sequence 
Protein Accession #: NP 001414.1 



11 



21 



31 41 SI 

I I i I 1 1 

MLVLLAGIFV VHIATVIMLP VSTIANVWLV SNTVDASVGL WKNCTNISCS DSLSYASEDA 
LKTVQAFMIL SIIFCVIALL VFVFQLFTME KGNRPPLSGA TTLVCWLCIL VGVSIYTSHY 
ANRDGTQYKH GYSYILGWIC FCFSFIIGVL YLVLRKK 

Seq ID NO: 609 DNA sequence 

Nucleic Acid Accession #* NM_004961.2 

Coding sequence: 55.. 1575 



60 
120 



11 



21 



31 



41 

t 



51 



GCCAGAGCGT GAGCCGCGAC CTCCGCGCAa GTGGTCGCGC OGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 
CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 
CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 
AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 
GACCACAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 
GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 
TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 
AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 
ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 
GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 
CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 
ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 
TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 
GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 
CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 
ACAGAGTCTG CTCCAGCCCG GACCTCTCXA GGGATCACCT CTGTTCTGAC CATGACCACG 
TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 
TATATOGCCA TCTGCTTOGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 
TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 
AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCG CCAACATCAG 
GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 
TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 
AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT. 
TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 
TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 
TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 
GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGOGACTAG 
AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 
CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 
TGGGCCACCT CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 
ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 
AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 
CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 
GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 
TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCOC TTTCTCTGCA GATAGATAGA 
CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 
TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 
ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 
TACTCCCTGC TTTATATGCC AOCCTCTTOC TTCTCTTTGA COCCTGTGAC TCTTTCTGTA 
ACTTTCCCAG TGACTTCCCC TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 
ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 
CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 
GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 



60 
120 
180 
240 
300 
360 
420 
480 
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600 
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900 
960 
1020 
1080 
1140 
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1260 
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1185 



WO 03/042661 
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TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 3120 
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 

Seq ID NOt 610 Protein sequence 
5 Protein Accession #: NP_004952.1 

1 11 21 31 41 SI 

I t I I I I 

MLSKVLFVLL GILLILQSRV EGPQTESKNE ASSRDWYGP QPQPLENQLL SEETKSTSTE 60 

TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 120 

IFSQTWYDER LCYNDTFESL VLNGNWSQL WIPDTFFRNS KRTHBHEITM PNQMVRIYKD 190 

GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NEMIYKWENP KLEINBKNSW 240 

KLFQFDFTGV SNKTEIITTP VGDFMVMTIF FNVSRRFGYV APQNYVPSSV TTMLSWVSFW 300 

IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEFAV 360 

LNFLIYNQTK AHASPKLRHP RINSRAHART RARSRACARQ HQEAFVOQIV TTEGSDGEER 420 

PSCSAQQPPS PGSPBGPRSL CSKLACCEWC KRFKKYFCMV PDCEGSTWQQ GRLCIHVYRL 480 
DNYSRWFPV TFPFFNVLYW LVCLNL 



10 



15 



Seq ID NO: 611 DNA sequence 
20 Nucleic Acid Accession Jh NM_021984.1 
Coding sequence : 572 . . 1753 
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1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCC1TTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAGTCTC TTGTTCTGAA 660 

TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT CTAAGAGGAC 720 

CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 780 

GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC 840 

AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 900 

GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 960 

CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080 

AAACTATGTC CCTTCTTCCG TGACCACGAT GCTCTCCTGG GTTTCCTTTT GGATCAAGAC 1140 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT GTTCTGACCA TGACCACGTT 1200 

GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260 

TATCGCCATC TGCTTOGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 1380 

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCCGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 1500 

CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCCGCCT CTGCATCCAT GTCTACCGCC TGGATAACTA 1680 

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA OCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCGCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC OCTCTTCCTT CTCTTTGAOC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640 

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT OGCCATTGAT TGGTGCCCAC 2700 

CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC TGGACCCATA AACCAGTCCA 2760 

CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 2820 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGG AGCTTCATGA 2880 

TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG AAAACCCTGA 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 3000 

CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC ATGATTCTCT TCTGTCACCT 3060 

CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGAOCTC TCTTTAGCTC AATTTCTGGG 3120 

CCTGAGGTGC TCAGACTGCC CCGAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 3180 

ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 3240 

TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 3300 
AAATATGTAA ATAAATATAT CAGCAAAGC 

Seq ID NO: 612 Protein sequence 
Protein Accession fi: NP 068819.1 



1186 



WO 03/042661 



1 11 21 31 41 SI 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INEKNSWKLF QLDPTGVSNK TBI ITTPVGD FMVMTIPFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC Uth 

Seq ID NO: 613 DMA sequence 
Nucleic Acid Accession #: NMJ)21987.1 
Coding sequence: S72..1657 " 

1 11 21 31 41 SI 

I ! I I 1 I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGOG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT ' TCCGTGACCA CGATGCTCTC 1020 

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTOGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380 

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740 

CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 

CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 

GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGGAG TATAACAGGA 2160 

GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 

CCTTTCTCTG CAGATAGATA GACACTGGCA TTATCCCTTT AGGAAGAGGG GGGGGCAGCA 2280 

AGAGAGCCTA TTTGGGAGAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT 2340 

GCTGGCTCGA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 2400 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG CCACCCTCTT CCTTCTCTTT 2460 

GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG ACCAGGCACT 2520 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC AACAGGCATT 2580 

ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTOGG AGTTCTATCA CTTGCTTGAC 2640 

CCCTGGACCC ATAAACCAGT CGACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 

TG TGTC TGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820 

CATTTTGCAG ATGAAAACCC TGAGTCACAG ATTTCTGTGG GACTGTGGAT CTCACTGGAA 2880 

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGCCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC GCAATGCTTC TATATGCTAA 3120 

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180 
TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 

Seq ID NO: 614 Protein sequence 
Protein Accession #: NP_068822.l 

1 11 21 31 41 51 

i I I I i I 

MEYTIDIIFS QTWNSKRTHB HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 

SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKL.FQF DFTGVSNKTB I ITTPVGDFM 120 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 240 

AHARTRARSR ACARQHQEAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 300 

CCEWCKRFKK YFCMVPDCEG STWQQGRLCI HVYRLDNYSR WFPVTFPFF NVLYWLVCLN 360 
L 



1187 



WO 03/042661 



PCT/US02/36810 



20 



Seq ID NO: 615 PNA sequence 

Nucleic Acid Accession 8: NM_021990.1 

Coding sequence: 1309.. 2490 

5 1 11 21 31 41 SI 

i I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGOGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTGGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

1U GTGTAAAGAA AGCCAAATCA AGGACCOGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

1!) GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTT T TGT TGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

0 - TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TGATGTTGTC 1080 

Zj TATGGCCCGC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAAOGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCOGCATC 1500 

TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCOGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCOGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

TACCGOCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTITCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TT G TT T GOCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGGAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

jS ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 

TGCTTGOCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 

CTAT TTGG GA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 

TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 

GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 

ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 

CCCAT AAAC C AGTCCACTGT TATACCOGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 

ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AOTTGTGTCT 3600 

GATTGGAGCT TCATGATAGC CTTOTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 

CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 

AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 

CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT TTTTGTCACC 4020 
ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC AAAGC 
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Seq ID NO: 616 Protein sequence 
Protein Accession #: NP 068630.1 



OA 1 11 21 31 41 51 

NEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHKLRFP MDSHSCPLSF SSFSYPBNEM IYKWENPKLE 120 

INEKNSWKLP QFDPTGVSNK TBIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPAHTSI/3 ITSVLTMTTL GTPSRXNPPR VSYITALDPY IAICFVPCFC 240 

1188 



WO 03/042661 



ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVOQIVTTE 300 

GSDGEBRPSC SAQQPPSPGS PEGPRSLCSK LACCEVfCKRF KKYFCMVPDC EGSTWQQGRL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 

Seq ID NO: 617 DMA sequence 

Nucleic Acid Accession ft: NM 004864-1 

Coding sequence: 2 6.. 952 

1 U 21 31 41 si 

I I I I I I 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCOGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACOG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTOGTGG GACGTGACAC GACCGCTGCG GOGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TOGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGOGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA AGGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TOGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 



Seq ID NO: 618 Protein sequence 
Protein Accession ft: NP_0 04855.1 

} 11 21 31 41 51 

I I I I I | 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAI* SLAEASRASP PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120 

HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASIi EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 
LAKDCHCI 

Seq ID NO: 619 DNA sequence 

Nucleic Acid Accession #: NMJ>03979.2 

Coding sequence: 254.. 1357 ~ 

1 11 21 31 41 51 

I I I I I I 

ATAACAGCAT GAAGTGCCGT GGAACTGGAA TAGGCGTGTC CTCTCCCTCG ACCCTCCCCC 60 

TCCTTGTCCC TCTGCTCAOC CCTCGCTCGT TCCCTCCCTC GGGCGAGGGC CGCCTTTATA 120 

ACAACTGCTC AGAGTGCGAG GGOGGGATAG CTGTCCAAGG TCTCOCCCAG CACTGAGGAG 180 

CTCGCCTGCT GCCCTCTTGC GCGOGGGAAG CAGCACCAAG TTCACGGCCA ACGCCTTGGC 240 

ACTAGGGTCC AGAA TGGCTA CAACAGTCCC TGATGGTTGC CGCAATGGCC TGAAATCCAA 300 

GTACTACAGA CTTTGTGATA AGGCTGAAGC TTGGGGCATC GTCCTAGAAA CGGTGGCCAC 360 

AGCCGGGGTT GTGACCTCGG TGGCCTTCAT GCTCACTCTC CCGATCCTCG TCTGCAAGGT 420 

GCAGGA CTCC AACAGGCGAA AAATGCTGCC TACTCAGTTT CTCTTCCTCC TGGGTGTGTT 480 

GGGCATCTTT GGCCTCACCT TOGCCTTCAT CATCGGACTG GACGGGAGCA CAGGGCCCAC 540 

ACGCTTCTTC CTCTTTGGGA TCCTCTTTTC CATCTGCTTC TCCTGCCTGC TGGCTCATGC 600 

TGTCAGTCTG ACCAAGCTCG TCCGGGGGAG GAAGCCCCTT TCCCTGTTGG TGATTCTGGG 660 

TCTGGCCGTG GGCTTCAGCC TAGTCCAGGA TGTTATCGCT ATTGAATATA TTGTCCTGAC 720 

CATGAATAGG ACCAACGTCA ATGTCTTTTC TGAGCTTTCC GCTCCTCGTC GCAATGAAGA 780 

CTTTGTCCTC CTGCTCACCT ACGTCCTCTT CTTGATGGCG CTGACCTTCC TCATGTCCTC 840 

CTTCACCTTC TGTGGTTCCT TCAOGGGCTG GAAGAGACAT GGGGCCCACA TCTACCTCAC 900 

GATGCTCCTC TCCATTGCCA TCTGGGTGGC CTGGATCACC CTGCTCATGC TTCCTGACTT 960 

TGACCGCAGG TGGGATGACA CCATCCTCAG CTCCGCCTTG GCTGCCAATG GCTGGGTGTT 1020 

CCTGTTGGCT TATGTTAGTC CCGAGTTTTG GCTGCTCACA AAGCAACGAA ACCCCATGGA 1080 

TTATCCTGTT GAGGATGCTT TCTGTAAACC TCAACTCGTG AAGAAGAGCT ATGGTGTGGA 1140 

GAACAGAGCC TACTCTCAAG AGGAAATCAC TCAAGGTTTT GAAGAGACAG GGGACACGCT 1200 

CTATGCCCCC TATTCCACAC ATTTTCAGCT GCAGAACCAG CCTCCCCAAA AGGAATTCTC 1260 

CATCCCACGG GCCCACGCTT GGCCGAGCCC TTACAAAGAC TATGAAGTAA AGAAAGAGGG 1320 

CAGCTAACTC TGTCCTGAAG AGTGGGACAA ATGCAGCCGG GOGGCAGATC TAGCGGGAGC 1380 

TCA AAGGG AT GTGGGCGAAA TCTTGAGTCT TCTGAGAAAA CTGTACAAGA CACTACGGGA 1440 

ACAGTTTGCC TCCCTCCCAG CCTCAACCAC AATTCTTCCA TGCTGGGGCT GATGTGGGCT 1500 

AGTAAGACTC CAGTTCTTAG AGGCGCTGTA GTATTTTTTT TTTTTTGTCT CATCCTTTGG 1560 

ATAv -" A " i ^ riri " TAAGTGGGAG TCTCAGGCAA CTCAAGTTTA GACCCTTACT Cmi'rUTTT 1620 

GTTTTTTGAA ACAGGATCTT GCTCTGTCAC CCAGGCTTGA GTGCAGTGGT GCGATCACAG 1680 

CCCAGTGCAG CCT CGAC CAC CTGTGCTCAA GCAATCCTCC CATCTCCATC TCCCAAAGTG 1740 

CTGGGATGAC AGGCGTGAGC CACAGCTCCC AGCCTAGGCC CTTAATCTTG CTGTTATTTT 1800 

CCATGGACTA AAGGTCTGGT CATCT GAGCT CACGCTGGCT CACACAGCTC TAGGGGCCTG 1860 

CTCCTCTAAC TCACAGTGGG TTTTGTGAGG CTCTGTGGCC CAGAGCAGAC CTGCATATCT 1920 

GAGCAAAAAT AGCAAAAGCC TCTCTCAGCC CACTGGCCTG AATCTACACT GGAAGCCAAC 1980 

TTGCTGGCAC CCCCGCTCCC CAACCCTTCT TGCCTGGGTA GGAGAGGCTA AAGATCACCC 2040 
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TAAATTTACT 
ATTCACAGGT 
TAATCTCCCC 
CTCCTTGTCA 
CTCATCTTGC 
GTGGGCATGG 
GCAATAAAGA 



CATCTCTCTA 
CACCCCTCTC 
CTAOGCTCTG 
GGAGAATTTG 
ACCCCAACCT 
TCTCCTAATG 
TGTGGTGGCC 



GTGCTGCCTC 
TTCTTGCACT 
CCAGGAATTC 
TAGATCATTC 
CTGTAAATAG 
GAGGAGTGTT 
ACTCTTTCAT 



ACATTGGGCC 
GTCCCCAAAC 
TTTCAGAOCT 
TCACTTCAAA 
ATTTACCGCA 
CATTGTATAA 
GGTGGTGGCA 



TCAGCAGCTC CCCAGCACCA 2100 

TTGCTGTCAA TTCCGAGATC 2160 

CACTAGCACA AGCCCGGTTG 2220 

TTCCTGGGGC TGATACTTCT 2280 

TTTACGGCTG CATTCTGTAA 2340 

TAAGTTATTC ACCTGAGTAT 2400 
GCAAAAAAAA AAAAAA 



Seq ID NO: 620 Protein sequence 
Protein Accession #: NP_003970.1 

1 11 21 31 

I I I I 

MATTVPDGCR NGLKSKYYRL CDKAEAWGIV LETVATAGW 
RRKMLPTQFL FLLGVLGIFG LTFAFIIGLD GSTGPTRFFL 
KLVRGRKPLS LLVILGLAVG FSLVQDVIAI BYIVLTMNRT 
LTYVLFLMAL TFLMSSFTFC GSFTGWKRHG AHIYLTMLLS 
DDTILSSALA ANGWVFLLAY VSPBFWLLTK QRNPMDYPVE 
SQEBITQGFE ETGDTLYAPY STHFQLQNQP PQKBFSIPRA 

Seq ID NO: 621 DNA sequence 

Nucleic Acid Accession #: NM_002423.2 

Coding sequence: 48.. 851 



41 
I 

TSVAFMLTLP 
FGILFSICFS 
NVNVFSELSA 
IAIWVAWITL 
DAFCKPQLVK 
HAHPSPYKDY 



1 
I 

ACCAAATCAA 
TGCTGTGTGC 
GAGGCATGAG 
ATGACTCAGA 
TCTTTGGCCT 
CCAGATGTGG 
CTTCCAAAGT 
TGGATCGATT 
GGAAAGTTGT 
ACTCCTACCC 
GTCTCGGAGG 
GGATTAACTT 
CCTCTGATCC 
AACTTTCCCA 
GAAAGAAATA 
TGTTGCACAA 
CTTTTTTATT 
ATGGTGTGAC 
ATGTTACATA 



11 
1 

CCATAGGTCC 
TGTGTGCCTG 
TGAGCTACAG 
AACAAAAAAT 
ACCTATAACT 
AGTGCCAGAT 
GGTCACCTAC 
AGTGTCAAAG 
ATGGGGAACT 
ATTTGATGGG 
AGATGCTCAC 
CCTGTATGCT 
TAATGCAGTG 
GGATGATATT 
GAAACTTCAG 
TCAGAATTGA 
GCAGTTGGTT 
TGTGTCTTAT 
CACAAATAAA 



21 
I 

AAGAACAATT 
CTGCCTGGCA 
TGGGAACAGG 
GCCAACAGTT 
GGAATGTTAA 
GTTGCAGAAT 
AGGATOGTAT 
GCTTTAAACA 
GCTGACATCA 
CCAGGAAACA 
TTCGATGAGG 
GCAACTCATG 
ATGTATCCAA 
AAAGGCATTC 
GCAGAACATC 
TAAGCACTGT 
TTTGAATGTC 
TCCATCTATG 
TAAAATGTTT 



31 
I 

GTCTCTGGAC 
GCCTGGCCCT 
CTCAGGACTA 
TAGAAGCCAA 
ACTCCCGCGT 
ACTCACTATT 
CATATACTCG 
TGTGGGGCAA 
TGATTGGCTT 
CGCTGGCTCA 
ATGAACGCTG 
AACTTGGCCA 
CCTATGGAAA 
AGAAACTATA 
CATTCATTCA 
TCCTCCACTC 
TTTCACTCCT 
AGCTTTGTCA 
ATTCCATGGT 



41 

! 

GGCAGCTATG 
GCCGCTGCCT 
TCrCAAGAGA 
ACTCAAGGAG 
CATAGAAATA 
TCCAAATAGC 
AGACTTACCG 
AGAGATCCCC 
TGCGCGAGGA 
TGCCTTTGOO 
GACGGATGGT 
TTCTTTGGGT 
TGGAGATCCC 
TGGAAAGAGA 
TTCATTGGAT 
CATTTAGCAA 
TTTATTGGTT 
GTGCGCGTAG 
AAATTTA 



51 
I 

ILVCKVQDSN 
CLLAHAVSLT 
PRRNEDFVLL 
LMLPDFDRRW 
KSYGVENRAY 
EVKKEGS 



51 
I 

CGACTCACCG 
CAGGAGGCGG 
TTTTATCTCT 
ATGCAAAAAT 
ATGCAGAAGC 
CCAAAATGGA 
CATATTACAG 
CTGCATTTCA 
GCTCATGGGG 
CCTGGGACAG 
AGCAGTCTAG 
ATGGGACATT 
CAAAATTTTA 
AGTAATTCAA 
TGTATATCAT 
TTATGTCACC 
AAACTCCTTT 
ATGTCAATAA 



Seq ID NO: 622 Protein sequence 
Protein Accession #s NP_002414.1 

1 11 21 31 41 51 

I I I 1 1 I 

MRLTVLCAVC LLPGSIALPI. PQEAGGMSEL QWEQAQDYUC RFYLYDSETK NANSLEAKLK 
EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKWT YRIVSYTRDL 
PHITVDRLVS KALNMWGKEI PLHFRKWWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 
APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 

Seq ID NOs 623 DNA sequence 

Nucleic Acid Accession ft: NMJJ314S7.1 

Coding sequence: 204.. 956 



1 

i 

AAACAGGAAA 
CAGGAAATAG 
ATTCTCTCAG 
ACGTGGCTGC 
CTGTGTTGGT 
TGCCCCTGTA 
TGGTGTCGAA 
GGGCCATCCA 
TTCTCGTAGG 
TGTGGTTTAT 
GCCTGCTGTC 
TCATACTCTT 
CTTACGCCTG 
TCCTGGAGTT 
AATCAAGCAA 
CAGAACCGGT 
ACAGATTCTG 
CCATAACCCA 
TCTCACCTTC 
ACAGAGATTT 
TATGTGGGCA 



11 
I 

TAAATAOGAA 
AGGACTTCGG 
GAAAAAAAAC 
TGGCAGAGCA 
GGTGGCACCC 
TCCAAACAGC 
TGTGAATGGG 
GATCATCATT 
GGAATACCTG 
CATTTCAGGA 
TGGCAGTTTG 
CATCACAGAT 
GGGTGTGAAC 
TGGCATOGCA 
TGTGAGTGTC 
GACCTCACCA 
GAAGCATCTT 
GGTCGTTCCT 
ATTCTTCAAT 
TAAACAOATG 
TCCAGCCTCT 



21 
I 

TGAAACTGAG 
ATCCTTCTAA 
AAGGTCCCCA 
AGCATGAATT 
CACAATGGTT 
CAGCOGCAAG 
CAGCCTGTGC 
GGCCTGGCTC 
TCTATTTCAT 

GGCTTGAACA 
CTAAGTATTC 
CCTGGAATGG 
TGOGCATCTT 
ATCTATCCAA 
CCAAGTTATT 
TCACTGGGAC 
GTTCTGACAG 
TCAGTCTAGG 
TTAACCAAGA 
GGGGCCTTGG 



31 
I 

CTCTAAGCAG 
CCCTACCACC 
CAGCAAAGAA 
OGATGACTTC 
ATCCTGTGAC 
TCCACCTAGT 
AGAAAGCTCT 
ACATCGGCCT 
TCTACGGAGG 
TGGCAGCAGA 
TCGTCAGTGC 
CCCACCCATA 
CGATTTCTGG 
CCCACTTTGG 
ACATCTATGC 
CCAGTGAGAT 
CAAAAGAAGT 
CTGAGGAAAC 
AAACCATGCT 
GGGACTCCCT 
CACACACACA 



41 

i 

CATGTAACCT 
CAACTGGCCC 
AAGGAATAGG 
AGCAGTTCCG 
CCCAGGAATT 
TCCTGGGAAC 
GAAAGAAGGC 
CGGCTCCATC 
CTTTCCCTTC 
AAATCAGCCA 
AATCTGCTCT 
TGCCTACCCC 
CGTGCTGCTG 
CTGCCAGTTG 
AGCAAACCCA 
CCAAGCAAAT 
CCTCCTCCCT 
GTCTCTCCCA 
GTTTCTCTAT 
AGGGCACATG 
TTCGTGTGCT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 



51 
I 

GGCCTGCATC 
CAGTACATTC 
ATCAAGAGAT 
GTGGCCAATT 
ATGTCTCACG 
CCACCTAGTT 
AAAACCTTGG 
ATGGCGACGG 
TGGGGAGGCT 
TATTCTTATT 
GCAGTTGGAG 
GACTATTATC 
GTCTTCTGCC 
GTCTGCTGTC 
GTGATCACCC 
AAGTAAGGCT 
TTCTGGGCTT 
CTGTTTGTAC 
CAAGAAGAAG 
CATCAGCACA 
CTGCTGCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1190 



WO 03/042661 



PCT/US02/36810 



TGAGCTTGTG GGTTAGAGGA ACAAATATCT AGACATTCAA TCTTCACTCT TTCAATTGTG 1320 
CATTCATTTA ATAAATAGAT ACTGAGCATT CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 624 Protein sequence 
J Protein Accession #: NP_113645.1 

1 11 21 31 41 SI 

I I I I I I 

MNSMTSAVPV ANSVLWAPH NGYPVTPGIM SHVPLYPNSQ PQVHLVPGNP PSLVSNVNGQ 60 

1U PVQKALKEGK TLGAIQUIG LAHIGLGSIM ATVLVGBYLS ISFYGGFPFW GGLWFIISGS 120 

LSVAAENQPY SYCLLSGSLG LNIVSAICSA VGVILPITDL SIPKPYAYPD YYPYAWGVNP 180 

GMAISGVLLV FCLLEPGIAC ASSHFGCQLV COQSSNVSVI YPNIYAANPV ITPEPVTSPP 240 
SYSSEIQANK 

15 Seq ID NO: 625 DNA sequence 

Nucleic Acid Accession #: NMJ)05221.3 
Coding sequence: 1..870 

„ 1 11 21 31 41 51 

20 | | | | | | 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

ZD AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCAOCAACCA GCCAGAGAAA 360 

GAAGTGAGCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCOGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 



30 
35 
40 



50 
55 
60 
65 



75 



Seq ID NO: 626 Protein sequence 
Protein Accession #: NP 00S212.1 



1 11 21 31 41 51 

I I t i I I 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 

PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 

EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 

40 VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 



Seq ID NO: 627 DNA sequence 
Nucleic Acid Accession #: NM_014420 
Coding sequence: 118., 792 

1 11 21 31 41 51 

i i r i i i 

GCACGAGAGA CGACGTGCTG AGCTGCCAGC TTAGTGGAAG CTCTGCTCTG GGTGGAGAGC 60 

AGCCTCGCTT TGGTGACGCA CAGTGCTGGG ACCCTCCAGG AGCCCCGGGA TTGAAGGATG 120 

GTGGCGGCCG TCCTGCTGGG GCTGAGCTGG CTCTGCTCTC CCCTGGGAGC TCTGGTCCTG 180 

GACTTCAACA ACATCAGGAG CTCTGCTGAC CTGCATGGGG CCCGGAAGGG CTCACAGTGC 240 

CTGTCTGACA CGGACTGCAA TACCAGAAAG TTCTGCCTCC AGCCCCGCGA TGAGAAGCCG 300 

TTCTGTGCTA CATGTCGTGG GTTGCGGAGG AGGTGCCAGC GAGATGCCAT GTGCTGCCCT 360 

GGGACACTCT GTGTGAACGA TGTTTGTACT ACGATGGAAG ATGCAACCCC AATATTAGAA 420 

AGGCAGCTTG ATGAGCAAGA TGGCACACAT GCAGAAGGAA CAACTGGGCA CCCAGTCCAG 480 

GAAAACCAAC CCAAAAGGAA GCCAAGTATT AAGAAATCAC AAGGCAGGAA GGGACAAGAG S40 

GGAGAAAGTT GTCTGAGAAC TTTTGACTGT GGCCCTGGAC TTTGCTGTGC TCGTCATTTT 600 

TGGACGAAAA TTTGTAAGCC AGTCCTTTTG GAGGGACAGG TCTGCTCCAG AAGAGGGCAT 660 

AAAGACACTG CTCAAGCTCC AGAAATCTTC CAGOGTTGCG ACTGTGGCCC TGGACTACTG 720 

TGTCGAAGCC AATTGACCAG CAATCGGCAG CATGCTCGAT TAAGAGTATG CCAAAAAATA 780 

GAAAAGCTAT AAATATTTCA AAATAAAGAA GAATCCACAT TGCAAAAAAA AAAAAAAAAA 840 
A 



70 Seq ID NO: 628 Protein sequence 
Protein Accession #: NP 055235 



1 11 21 31 41 51 

I I 1 I I I 

MVAAVLLGLS WLCSPLGALV LDFNNIRSSA DLHGARKGSQ CLSDTDCNTR KFCLQPRDEK 60 
PFCATCRGLR RRCQRDAMCC PGTLCVNDVC TTMEDATPIL ERQLDEQDGT HAEGTTGHPV 120 
QENQPKRKPS IKKSQGRKGQ EGESCLRTFD CGPGLCCARH PWTKICKPVL LEGQVCSRRG 180 
HKDTAQAPEI FQRCDCGPGL LCRSQLTSNR QHARLRVCQK IEKL 



80 Seq ID NO: 629 DNA sequence 

Nucleic Acid Accession #s NM_002448.1 
Coding sequence: 241.. 1134 



11 21 31 41 SI 

1191 



WO 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



GOGCGAGTGC 
ACGCCGGAGC 
AGGCCCCGGG 
CGGCTGGCCA 
ATGACTTCTT 
GGAGGCGCGG 
GAGGAGGGGG 
ATGGCCGACC 
CAGGCGGCGG 
GACGOGCCCT 
CCAGAAGATG 
CAGAGCCCCC 
OGCAAACACA 
CTGGAGCGCA 
AGCTCGCTCA 
GCAAAGAGAC 
CCACCGGCTG 
GCGGGTGCCT 
CCCGTGGGAC 
CCAGGTCCCC 
CTCCCCTGCT 
CTCTTTGCTC 
GTCCCTTAGT 
AGGGCAGAGA 
CTAGAGGCCA 
GAGAAATCGG 



TCCATTGGAA 
ACACAGATGT 



TCCCGGGAAC 
TGGCCTGCTG 
CCCTOGCAGA 
GTGCTGCGGC 
TGCCACTCGG 
GCCAGGCCCC 
CCAAGCCCAA 
ACAGGAAGCC 
GTGGCTCGGC 
CTTCGCCGCG 
CGCTCGTCAA 

AGACGAACCG 
AGTTCCGCCA 
GCCTCACTGA 
TACAAGAGGC 
CCTTCGGCCT 
CGCTCTACGG 
TCTACACGGC 
ACCTGTGGGC 
CGGCACCGCC 
CCTGAGTTCA 
ACTCTTCTAG 
GGTTAACAGA 
TGTCTCCTGC 
TGTCTGACGA 
GAAAAGAGAA 
AATTTAAGAC 
GTTGCAAAGG 



TCTGCCTGOG 
GGGAGGGGOG 
GGCCGGCCGC 
AGAAGGGGGG 
TGTCAAAGTG 
CAGCGCCGCC 
AGTGTCCCCT 
GGGGGCCAAG 
GCAGCCACTG 
GCOGCTOGGC 
AGCOGAGAGC 
GCCGCCGGCC 
TAAGCCGCGG 
GAAGCAGTAC 
GACGCAGGTG 
AGAGCTGGAG 
CTCCTTCCCT 
TGCCTCTGGC 
CCATGTGGGC 
CAGCCGATTC 
AGCCGCCTTC 
CTCTCCGAAG 
CATTTAGATC 
TTTATCTAGG 
ATAGCTTTTC 
TTTTGGAAAT 
AAAAAAGACT 
AAGTTCAACA 
TAGGTTGAAG 



I I 
CGGCGGCAGC GACGGGAGGC 
GGAGGCGOGC GCGGGAGGGT 
GCTCCCAGCC CGCCOGGAGC 
GOCOGGCTCT GCATGGCCCC 
GAGGACTCCG CCTTCGGCAA 
GCGGCCACGG CAGCCGCCAT 
TOGCTCCTGC CCTTCAGCGT 
GAGAGCGCCC TGGOGCCCTC 
GGCGTCCCGC CGGGGTCGCT 
CATTTCTCGG TGGGGGGACT 
CCOGAGAAGC CCGAGAGGAC 
AGGCGGCTGA GCCCCCCAGC 
AOGCCCTTCA CCACCGCGCA 
CTGTCCATCG CCGAGCGCGC 
AAGATATGGT TCCAGAACCG 
AAGCTGAAGA TGGCOGCCAA 
CTCGGCGGCC CCGCAGCTGT* 
CCCTTCCAGC GCGCCGCGCT 
TACAGCATGT ACCACCTGAC 
CTCCAGCCCT GGTGCTGTAC 
CCTTTAACCC TCACACTGCT 
TCTGATCCCT GCCAAAAAGT 
TACACTCTOG AGTTAAAGAT 
GTCCCCAGCA GAATTGACAG 
CCTGTCCTGA CACCAGGCAA 
GAGAACAATC TCAAAAAAAA 
AGCCAGCCAG GAAGATGAAT 
ACAAAACATT TGCTCTGGGG 
GGA 



I 

CAGGCCCAGC 
COGCCCGGCC 
CCATGCCCGG 
GGCTGCTGAC 
GCCGGCGGGG 
GGGCGCGGAC 
GGAGGCGCTC 
CGAGGGCGTG 
GGGAGCCCCG 
CCTCAAGCTG 
CCCGTGGATG 
CTGCACCCTC 
GCTGCTGGCG 
GGAGTTCTCC 
CCGCGCCAAG 
GCCCATGCTG 
AGCGGCCGCG 

ATAGAGGGTC 
CCCCGACGTG 
CCAGTTTCAC 
GGCTGGAAGA 
GGGGAAACTG 
TTGAACAGAG 
GAAAAGCGCA 
AAAAAAAAAA 
CCTAGCTTCT 
GGCAGGGAAA 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 630 Protein sequence 
Protein Accession #: NP 002439.1 



51 



1 11 21 31 41 

I I I I I I 

HTSItPLGVKV EDSAFGKPAG GGAGQAPSAA AATAAAMGAD EBGAKPKVSP SLLPFSVEAL 
MADHRKPGAK ESALAPSEGV QAAGGSAQPL GVPPGSL6AP DAPSSPRPLG HFSVGGLUO. 
PKDALVKAES PEKPERTFWM QSPRPSPPPA RRLSPPACTL RKHKTNRKPR TPPTTAQLLA 
LERKFRQKQY LSIAERAEFS SSLSLTETQV KIWFQNRRAK AKRLQEAELE KLKMAAKPML 
PPAAFGLSFP LGGPAAVAAA AGASLYGASG PFQRAALPVA PVGLYTAHVG YSMYHLT 

Seq ID NO i 631 DNA sequence 

Nucleic Acid Accession #> NM_002SS7.1 

Coding sequence: 13.. 2049 



51 



60 
120 
180 
240 



1 11 21 31 41 

1 I I I I I 

CAGACCATTG AGATGTGGAA GCTGTTGCTG TGGGTTGGGC TGGTTCTTGT GCTGAAACAC 
CACGATGGTG CTGCCCATAA ACTCGTGTGT TATTTCACCA ACTGGGCACA CAGTCGGCCA 
GGCCCTGCCT CGATCTTGCC CCATGACCTG GACCCCTTTC TCTGCACCCA CCTGATATTT 
GCCTTTGCCT CAATGAACAA CAATCAGATT GTTGCTAAGG ATCTCCAGGA TGAGAAAATT 
CTCTACCCAG AGTTCAACAA ACTAAAGGAG AGGAACAGAG AGCTGAAAAC ACTACTGTCC 
ATCGGCGGGT GGAACTTTGG CACCTCAAGA TTCACCACTA TGTTGTCCAC ATTTGCCAAC 
CGTGAAAAGT TTATTGCTTC AGTTATATCC CTTCTGAGGA CACATGACTT TGATG GTCTT 
GACCTTTTCT TCTTATATCC TGGACTAAGA GGCAGCCCCA TGCATGACOG GTGGACTTTT 
CTCTTCTTAA TTGAAGAGCT CCTGTTTGCC TTCCGGAAGG AGGCACTGCT CACCATGCGC 
CCGAGGCTGC TGCTGTCTGC TGCTGTTTCT GGGGTCCCAC ACATCGTCCA AACATCCTAT 
GATGTGCGCT TTCTAGGAAG ACTCCTGGAT TTCATCAATG TCTTGTCTTA TGACTTACAT 
GGAAGTTGGG AAAGGTTCAC AGGACATAAT AGCCCCCTCT TCTCTCTGCC TGAAGACCCC 
AAATCTTOGG CATATGCTAT GAATTATTGG AGAAAGCTTG GGGCACCCTC AGAGAAGCTC 
ATCATGGGGA TCCCCACCTA TGGACGTACC TTTOGCCTCC TCAAAGCCTC TAAGAATGGG 
TTGCAGGCCA GAGCX3ATCGG ACCAGCATCT CCAGGGAAGT ACACCAAGCA AGAAGGCTTC 
TTGGCTTATT TTGAGATTTG TTCCTTTGTC TGGGGAGOGA AGAAGCACTG GATTGATTAC 
CAGTATGTCC OGTATGCCAA CAAGGGGAAA GAGTQGGTTG GCTATGACAA TGCCATCAGC 
TTCAGTTACA AGGCATGGTT TATAAGGCGA GAGCATTTTG GGGGGGCCAT GGTGTGGACA 
TTGGACATGG ATGACGTCAG GGGCACGTTC TGTGGCACTG GCCCTTTCCC CCTTGTCTAC 
GTATTGAATG ATATCCTGGT GCGGGCTGAG TTCAGTTCAA CTTCTTTACC ACAATTTTGG 
CTGTCATCTG CTGTGAATTC TTCAAGCACT GACCCTGAAA GGCTGGCTGT GACCACGGCA 
TGGACCACTG ATAGTAAGAT TTTGCCCCCA GGAGGAGAGG CTGGGGTCAC TGAGATCCAC 
GGAAAGTGTG AAAATATGAC TATAACCCCT AGAGGTACAA CTGTGACCCC TACAAAGGAA 
ACTGTATCCC TTGGAAAGCA CACTGTAGCT CTAGGAGAGA AGACTGAGAT CACTGGGGCA 
ATGACCATGA CTTCTGTGGG TCATCAGTCC ATGACCCCTG GAGAGAAGGC CCTGACCCCT 
GTGGGTCATC AATCTGTGAC CACTGGACAG AAGACCCTGA CCTCTGTGGG TTATCAGTCT 
GTGACCCCTG GGGAAAAGAC CCTGACCCCT GTGGGTCATC AGTCTGTOAC CCCTGTGAGT 
CATCAGTCTG TGAGCCCTGG AGGAACGACT ATGACCCCTG TCCATTTTCA GACTGAGACC 
CTTAGACAGA ATACAGTGGC CCCTAGAAGG AAGGCTGTGG CCCGTGAAAA GGTGACTGTC 
CCCTCCAGAA ACATATCAGT CACCCCTGAA GGGCAGACTA TGCCTTTAAG AGGGGAGAAT 
TTGACTTCTG AGGTGGGCAC TCACCCCAGG ATGGGTAACT TGGGTCTTCA GATGGAAGCT 
GAAAACAGGA TGATGCTGTC CTCCAGCCCC GTCATCCAGC TCCCGGAACA AACTCCTCTA 
GCTTTTGACA ACCGCTTTGT TCCCATCTAT GGAAACCATT CCTCTGTCAA CTCAGTAACC 
CCTCAAACAA GTCCTCTTTC TCTAAAAAAA GAAATCCCAG AAAACTCTGC TGTGGATGAA 
GAAGCCTAAG CCCCTCTGGT GTCAGAAACC AGGGAAAACC CTTGTCTTTT CTTCTAAGTQ 
ACATGTTGGA AGCCTTCTCA TCCCGGGGCA AAGCAGGCAT CAAAACCAGA ATAGGCCAAT 

1192 
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WO 03/042661 



PCT/US02/36810 



5 
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CTCTTTTCCA TTAAATAAAC TGTAAACACA AGAACCCA 

Seq ID NO: 632 Protein sequence 
Protein Accession ft: NP 002S48.1 



1 11 21 31 41 51 

I I I I I I 

MWKLLLWVGL VLVLKHHDGA AHKLVCYPTN WAHSRPGPAS ILPHDLDPFL CTHLIPAFAS 60 

^ MNNNQIVAKD LQDEKILYPE FNKLKERNRE LKTLLSIGGW NFGTSRFTTM LSTFANREKF 120 

1U IASVISLLRT HDFDGLDLFF LYPGLRGSPM HDRWTFLFLI EELLFAFRKE ALLTMRPRLL 180 

LSAAVSGVPH IVQTSYDVRF LGRLLDFINV LSYDLHGSWE RFTGHNSPLP SLPEDPKSSA 240 

YAMNYWRKLG APSEKLIKGI PTYGRTFRLL KASKNGLQAR AIGPASPGKY TKQEGFLAYF 300 

EICSFVWGAK KHWIDYQYVP YANKGKEWVG YDNAISFSYK AWFIRREHFG GAMVWTLDMD 360 

DVRGTFCGTG PFPLVYVLND ILVRAEFSST SLPQFWLSSA VNSSSTDPER LAVTTAWTTD 420 

ID SKILPPGGEA GVTEIHGKCE NMTITPRGTT VTPTKETVSL GKHTVALGEK TEITGAMTMT 480 

SVGHQSMTPG EKALTPVGHQ SVTTGQKTLT SVGYQSVTPG EKTLTPVGHQ SVTPVSHQSV S40 

SPGGTTMTPV HFQTETLRQN TVAPRRKAVA REKVTVPSRN ISVTPEGQTM PLRGENLTSE 600 

VGTHPRMGNL GLQMEAENRM MLSSSPVIQL PEQTPLAFDN RFVPIYGKHS SVNSVTPQTS 660 
PLSLKKBIPE NSAVDEEA 

Seq ID NO: 633 DMA sequence 
Nucleic Acid Accession «: NMJ>03885.1 
Coding sequence: 98.. 1021 ~ 



1 11 21 31 41 51 

I I I I I I 

AAACTCAGAA TTTTCGCGGG CTCGGTGAGC GGTTTTATCC CTCCGGCCGG CAGGCTGGGC 60 

GCAGGGGGCG AGCCCCCGCC CGGCGCGCAG CAGCACCATG GGCACGGTGC TGTCCCTGTC 120 

^ A TCCCAGCTAC CGGAAGGCCA CGCTGTTTGA GGATGGCGCG GCCACCGTGG GCCACTATAC 180 

J\) GGCCGTACAG AACAGCAAGA ACGCCAAGGA CAAGAACCTG AAGCGCCACT OCATCATCTC 240 

CGTGCTGCCT TGGAAGAGAA TCGTGGCCGT GTCGGCCAAG AAGAAGAACT CCAAGAAGGT 300 

GCAGCCTAAC AGCAGCTACC AGAACAACAT CACGCACCTC AACAATGAGA ACCTGAAGAA 360 

GTCGCTGTCG TGCGCCAACC TGTCCACATT CGCCCAGCCC CCACCGGCCC AGCCGCCTGC 420 

_ _ ACCCCCGGCC AGCCAGCTCT OGGGTTCCCA GACCGGGGGC TCCTCCTCAG TCAAGAAAGC 480 

J J CCCTCACCCT GCCGTCACCT CCGCAGGGAC GCCCAAACGG GTCATCGTCC AGGCGTCCAC 540 

CAGTGAGCTG CTTCGCTGCC TGGGTGAGTT TCTCTGCCGC CGGTGCTACC GCCTGAAGCA 600 

CCTGTCCCCC ACGGACCCCG TGCTCTGGCT GCGCAGCGTG GACCGCTCGC TGCTTCTGCA 660 

GGGCTGGCAG GACCAGGGCT TCATCACGCC GGCCAACGTG GTCTTCCTCT ACATGCTCTG 720 

Ar . CAGGGATGTT ATCTCCTCCG AGGTGGGCTC GGATCACGAG CTCCAGGCCG TCCTGCTGAC 780 

HU ATGCCTGTAC CTCTCCTACT CCTACATGGG CAACGAGATC TCCTACCCGC TCAAGCCCTT 840 

CCTGGTGGAG AGCTGCAAGG AGGCCTTTTG GGACCGTTGC CTCTCTGTCA TCAACCTCAT 900 

GAGCTCAAAG ATGCTGCAGA TAAATGCCGA CCCACACTAC TTCACACAGG TCTTCTCCGA 960 

CCTGAAGAAC GAGAGCGGCC AGGAGGACAA GAAGCGGCTC CTCCTAGGCC TGGATCGGTG 1020 

A AGCACTGTAG CCTGCGTCAT GGCTCAAGGA TTCAATGCAT TTTTAAGAAT TTATTATTAA 1080 
ATCAGTTTTG TGTACAG 



Seq ID NO: 634 Protein sequence 
Protein Accession #j NP_003876.1 

1 11 21 31 41 51 

I I I I I I 

KGTVI^LSPS YRKATLFEDG AATVGHYTAV QNSKNAKDKN LKRHSIISVL PWKRIVAVSA 60 
KKKNSKKVQP NSSYQNNITH LNNENLKKSL SCANLSTPAQ PPPAQPPAPP ASQLSGSQTG 120 
GSSSVKKAPH PAVTSAGTPK RVIVQASTSE LLRCLGEFLC RRCYRLKHLS PTDPVLWLRS 180 
VDRSLLLQGW QDQGFITPAN WFLYMLCRD VISSEVGSDH ELQAVLLTCL YLSYSYKGNE 240 
ISYPLKPFLV ESCKEAFWDR CLSVINLMSS KMLQINADPH YFTQVPSDLK NESGQEDKKR 300 
LLLGLDR 



1193 



WO 03/042661 



PCT/US02/36810 



TABLE 79A: 



Pkey: Unique Eos probeset identifier number 

ExAocru Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Seq (D No.: Sequence identification number Unking information in Table 79A to sequences in Table 8 
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Pkey 


ExAccn 


UnigenelD 


424212 


NMJJ05814 


Hs.143131 


424503 


NM_002205 


Hs.149509 


418007 


M13509 


Hs.83169 


418007 


M13509 


Hs.83169 


418738 


AW388633 


Hs.6682 




AI085198 


Hs.164226 




AW103364 


Hs.727 


422867 


132137 


Ks.1584 


444381 


BE387335 


Hs.283713 


421582 


AI910275 


Hs^50470 


411789 


AF245505 


Hs.72157 


452281 


T93500 


Hs.28792 


428898 


AA852773 


Hs.334838 


421552 


AF026692 


Hs.1Q570O 


425247 


NM_00594O 


Hs. 155324 


432201 


AI538613 


Hs.298241 


447377 


X77343 


Hs.334334 


446921 


AB012113 


Hs.16530 


418888 


AU076801 


Hs.89436 


432179 


X75208 


Hs.2913 


422578 


AF239666 


Hs.1545 


409889 


AW630041 


Hs.56937 


447033 


AI357412 


Hs.157601 


447033 


AI357412 


Hs.157601 


411975 


AI916056 


Hs.144583 


434206 


AW136973 


Hs.362915 


423936 


U77629 


Hs.1 35639 


447400 


AK000322 


Hs.18457 


449032 


AA045573 


Hs.22900 


415214 


AI445236 


Hs.1 251 24 


443247 


BE614387 


Hs.333893 


422048 


NMJJ12445 


Hs.288126 


410418 


D31382 


Hs.63325 


446342 


BE298665 


Hs.1 4846 


411274 


NM_002776 


Hs.69423 


104978 


A1199268 


Hs.19322 


422260 


AA315993 


Ks.105484 


409041 


A6033025 


Hs.50081 


420344 


BE463721 


Hs.97101 


422163 


AF027208 


Hs.1 12360 


437935 


AW939591 


Hs.5940 


422330 


D30783 


Hs.115263 


408908 


BE296227 


Hs.250822 


407811 


AW190902 


Hs.40098 


437852 


BE001836 


Ks.256897 


408243 


Y00787 


Hs.624 


426088 


AF038007 


Hs.166196 


439738 


BE246502 


Hs.9598 


419741 


NM.007019 


Hs.93002 


450983 


AA305384 


Hs.25740 


417771 


AA804698 


Hs.82547 


421379 


Y15221 


Hs.1 03982 


442006 


AW975183 


Hs.372210 


413048 


M93221 


Hs.75182 


443324 


R44013 


Hs.164225 


424917 


AI63620B 


Hs.96901 


424917 


AI636208 


Hs.96901 


444527 


NM.005408 


Hs.1 1383 


442652 


AI005163 


Hs^01378 


450726 


AW204600 


Hs.355462 


416965 


N26223 


Hs.1 60436 


442275 


AW449467 


Hs.54795 


431745 


AW972448 


Hs.163425 


431745 


AW972448 


Hs.163425 


453142 


AA033648 


Hs.7473 


421659 


NMJJ14459 


Hs.106511 


444090 


S69115 


Hs.1 0306 


421563 


NH.006433 


Hs.105806 


430413 


AW842182 


Hs.241392 


414991 


CI 7898 




419833 


AA251131 


Hs .220697 


424943 


AU077260 


Hs.1 53924 



glycoprotein A33 {transmembrane) 
integrin, alpha 5 (fibronecfjn receptor, 
matrix metaltoproteinase 1 (interstitial 
matrix metaltoproteinase 1 (interstitial 
sohite carrier tamBy 7, (cationic amino 
Thrombospondin 1 
inhibln, beta A (adMn A, acfivin AB a 
cartilage oflgomeric matrix protein (pse 
hypolhetical protein BC014245 
trefoil factor 1 (breast cancer, estroge 
Adfican 

Homo sapiens cDNA FL) 1 1 041 fis, done PL 

KIAA1866 protein 

secreted fhzzled-related protein 4 

matrix metaltoproteinase 1 1 (strometysin 

Transmembrane protease, serine 3 

transcription factor AP-2 alpha 

small Inducible cytokine subfamBy A (Cy 

cadherin 17, U cadherin (liver-intesfin 

EphB3 

caudal type homeo box transcription fad 
suppression of tumorigenicity 14 (colon 
Predicted gene: Eos cloned; secreted w/V 
Predicted gene: Eos cloned; secreted w/V 
3'UTR of: dead ringer (DrosophHaJ-fike 
ESTs, Weakly similar to S69890 mitogen i 
achaete-scute complex (Drosophlia) homo) 
hypothetical protein FU20315 
nuclear factor (erythroid-derived 2)-0k 
EphB2 

c-Myc target JP01 

spondin 2, extracellular matrix protein 
transmembrane protease, serine 4 
solute carrier family 7 (cationic amino 
kaflikreln 10 

Homo sapiens. Similar to RiKEN cDNA 2010 

regenerating gene type (V 

Hypothetical protein, XPJ)51860 (WAA119 

putative G protein-coupled receptor 

prominin (rnouse)-fike 1 

mucin 13, epithelial transmembrane 



serine/threonine kinase 15 

cysteine knot superfamily 1, BMPantagon 

putative GPCR 

intedeukin 8 

ATPase, Oass I, type 8B, member 1 

sema domain, immunoglobulin domain (lg)> 

ubiquitin carrier protein E2-C 

ER01(S.cerevisiae)-like 

refirtoic add receptor res pond or (tazaro 

small inducible cytokine subfamily B (Cy 

ESTs. Weakly similar to S72482 hypotheti 

man nose receptor, C type 1 

ESTs 

hypotheGcal protein FU23049 

hypothetical protein FU23049 

small inducible cytokine subfamily A (Cy 

Homo sapiens cDNA FU 40427 fis 

HUMPS PBA Human pulmonary surfactant-asso 

MOAC1 

Homo sapiens secretogtobin, family 3A, m 
Novel FGENESH predicted cadherin repeat 
Novel FGENESH predicted cadherin repeat 
Homo sapiens gap junction protein, alpha 
protocadherin 17 

natural kfDer ceO group 7 sequence 
granulysin 

small Inducible cytokine A5 (RANTES) 
Homo sapiens up-regulated by BCG-CWS (LO 
Homo sapiens tryptophanyMRNA synthetas 
death-associated protein kinase 1 



Seq ID No. 

SeqlDNo.C1&C217 

SeqiDNaC2&C218 

SeqlDNo.C3SC219 

SeqIDNo.C4&C220 

SeqiONo.C5&C221 

SeqlDNo.C6&C222 

SeqlDNo.C7&C223 

SeqlDNo.C8&C224 

SeqiONo.C9&C225 

SeqlDNo.C10&C226 

SeqlDNo,C1UC227 

Seq ID No. C12 

SeqlDNo.C13&C228 

SeqlDNo.C144C229 

Seq(DNo.C15&C230 

SeqlDNo.C16&C231 

SeqlDNo.C17&C232 

SeqlONaC16&C233 

SeqlONaC19&C234 

SeqlDNaC20&C235 

SeqlDNo.C2UC236 

SeqlDNaC22&C237 

SeqtONo.C23aC238 

Seq ID No, C24 & C239 

SeqlONaC25&C240 

SeqiDNo.C26&C241 

SeqlDNaC27&C242 

SeqlDNo.C28&C243 

SeqlDNo.C29&C244 

SeqlDNo.C30&C245 

SeqiDNo.C3UC246 

SeqlDNo.C32&C247 

SeqlONo.C33&C248 

SeqlDNo.C34&C249 

SeqlDNo.C35&C250 

SeqlDNo.C36&C251 

SeqIDNo.C37&C252 

SeqlDNaC38&C253 

Seq ID NO.C39&C254 

SeqlDNo.C40&C255 

SeqlDNo.C41&C256 

SeqtDNo.C42&C257 

SeqlDNo.C43&C258 

SeqlDNaC44&C259 

SeqlDNaC45&C260 

SeqtDNo.C46&C261 

SeqtDNo.C47&C262 

SeqIDNo.C48&C263 

SeqlONo.C49&C264 

SeqlDNo,C50&C265 

SeqlDNaC51&C266 

SeqlDNaC52&C267 

SeqlDNaC53&C268 

SeqtDNo.C64&C269 

SeqlDNo.C55&C270 

SeqlDNo.C56&C271 

Seq(DNo.C57&C272 

Seq IDNO.C58&C273 

Seq ID Na C59&C274 

Seq ID NO.C80&C275 

SeqlDNaC6UC276 

Seq IDNO.C62&C277 

Seq ID No. C83 & C278 

SeqiDNo.C64&C279 

Seq ID No. C65 & C280 

Seq ID No.C66&C281 

Seq ID Na C67 & C282 

Seq ID No. C68 & C283 

Seq ID No. C69 & C284 

Seq ID No. C70&C285 

Seq ID No. C71&C286 

Seq ID NO.C72&C287 
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430890 

452401 

439180 

410407 

418526 

422627 

430486 

423673 

423673 

431846 

431958 

448733 

426440 

428484 

429211 

423634 

457819 

424687 

418462 

439606 

407720 

418543 

417512 

415817 

415817 

415817 

415817 

415B17 

415817 

415817 

415817 

421817 

421817 

409420 

440659 

408790 

408790 

408790 

450701 

414774 

413691 

453357 

449101 

429263 

421474 

421753 

408482 

426761 

429736 

430985 

431890 



X54232 

NM_007115 

AI393742 



BE019020 



Hs.35406 
Hs.151738 
Hs, 85266 
Hs.58561 
Hs.38002 
Hs.85962 
Hs.82226 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 



446872 

453102 

428513 

430280 

428486 

457489 

432874 

445891 

445891 

404682 

429547 

404287 

404287 

404287 

418318 

444754 

432596 

444006 

428505 

448844 

448844 

428392 

448030 

422109 

449048 

417931 

419216 

426227 

413554 

445417 

426322 



BE062109 

BE003054 

BE003054 

BE019924 

X63629 

NM_005629 

BE382756 

AF104032 

AF052693 

AW959908 

AA057484 

J05070 

BE00159S 

W79123 

AB037776 

NNL005329 

X76534 

U88957 

U88967 

U88967 

U88967 

U88967 

U88967 

U88967 

U88967 

AF146074 

AF146074 

Z15008 

AF134160 

AW560227 

AW580227 

AW580227 

H39960 

X02419 

AB023173 

AL080235 

AA205847 

AA019004 

U76362 

BE314328 

NM_000676 

AI015709 

AF125304 

AA490232 

X17033 

AW023624 

X97058 

NML007197 

BE220806 

AA361258 

AW583497 

AI693815 

W94322 

AW391342 

AW391342 



AW009166 Hs.99376 



gfypicanl 

tumor necrosis factor, alpha-induced pro 
Hs.199067 v-ert)-b2 avian erythroblastic leukemia v 
Hs.63287 carbonic anhydrase DC 

solute carrier family 16 (monocarboxyllc 
transforming growth factor, beta-induced 
chloride channel calcium activated, tarn 
matrix metaHoproteinase 1 2 (macrophage 
matrix rretaDopmtemase 12 (macrophage 
uroplakrn 18 

cadherin 3, type 1, P-cadherin (placenta 
Hs.187958 solute carrier family 6 (neurotransmflte 
Hs.169902 solute carrier family 2 (facilitated g!u 
Hs.184601 solute carrier family 7 (cationte amino 
Hs.198249 gap junction protein, beta 5 (connexln 3 
Hs.1690 heparin-binding growth factor binding pr 
FU20522 Hypothetical protein FU20522 
matrix metaBoproteinase 9 (gelatinase B 
integrin,beta4 

G protein-coupled receptor 87 
irrrrrumogtobuHn superfamily, member 9 
hyaluronan synthase 3 
glycoprotein (transmembrane) nmb 
protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase, receptoM 



Hs.2699 
Hs.29352 



Hs.85838 

Hs.118787 

Hs.241551 

Hs.1695 

Hs.1695 

Hs.271580 

Hs.2877 



U47732 

T83911 

AJ224741 

BE395085 

AL035461 

AI581519 

AI581519 

H10233 

N30714 

S73265 

245051 

W95642 

AU076718 

U67058 

AA319146 

AK001058 

J05068 



protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase. receptoM 
Hs.108660 ATP-binding cassette, sub-family C (CFTR 
Hs.108660 ATP-bindlng cassette, sub-family C (CFTR 
laminin, gamma 2 (nicein (1001(D), kaQni 
daudin 1 

neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
hypothetical protein XPJJ981 51 (leucine- 
plasminogen activator, urokinase 
ATPase, Class VI, type 11B 
Ras-induced senescence 1 (RIS1) 
G protein-coupled receptor 
Hs. 198396 ATP-binding cassette, sub-family A (ABC1 
Hs.104637 solute carrier lamBy 1 (glutamate trans 
Hs.10791 1 ATP-binding cassette, sub-famlty B (MDR/ 
Hs.45743 adenosine A2b receptor 
Hs.1 72089 PORiMIN Prooncosis receptor inducing me 
Hs.2 1 2680 tumor necrosis factor receptor supertami 
Hs.27323 ESTs, Weakly simflar to 178885 serme/lh 
Hs.271986 integrin, alpha 2 (CD49B, alpha 2 subuni 
Hs. 162282 potassium channel TASK-4; potassium chan 
Hs. 16382 pyrirrddinergJc receptor P2Y, G-proteln c 
Hs.31 664 frizzled prosophila) homolog 10 
Hs.184697 ptexinCI 
Hs.237868 interteuJdn 7 receptor 
Hs.1 84604 pancreatic polypeptide 
Hs.127179 cryptic gene 
Hs.279651 melanoma Inhibitory activity 
Hs.199460 DPCR1 protein 
Hs.199460 DPCR1 protein 

orthobg of mouse polydomain protein 
FGENESH predicted novel secreted protein 
FGENESH predicted novel CUWomain conta 
FGENESH predicted novel CUB-domaln conta 
FGENESH predicted novel CUB-domain conta 
Hs .84072 transmembrane 4 superfamily member 3 
Hs.1 1881 transmembrane 4 sirperfamiry member 4 
Hs278461 matriEn3 

Hs334762 type I terrernbrane protein Fn14 
Hs.2281 chrornograrrin B (secretogranrn 1) 
Hs.1 771 64 FGENESH predicted novel ceB surface pr 
FGENESH predicted novel ceil surface pr 
secretory granule, neuroendocrine protei 
rrarruVane^parining 4^Jomains, subfam2y A 
gastrirweteasing peptide 
similar to S68401 (cattle) gtucose indue 
trefoil factor 3 (intestinal) 
small inducible cytokine subfamily B (Cy 



Hs.54451 

Hs.7327 

Hs.47860 

Hs.47860 

Hs.47860 

Hs.288467 

Hs.77274 

Hs.75478 

Hs.35861 

Hs.23016 



Hs.177164 
HS2265 
Hs325960 
Hs.1473 
Hs.22920 
Hs.82961 
Hs.1 64021 

Hs.1 54299 Human proteinase activated receptor-2 mR 
Hs.75426 
Hs.1 2680 
Hs.2012 



n li (chromogranin C) 
a dsintegrm-tite and rnetafioprotease w 
rranscobaternin I (vitamin B12 binding pr 



SeqlDNo.C73&C288 
SeqtDNo. C74&C289 
SeqlDNo.C75SC290 
SeqlDNo.C76&C291 
SeqtDNo.C77&C292 
SeqlDNo.C78&C293 
SeqlO No. C79&C294 
SeqlDNaC80&C295 
SeqlDNo.C8UC296 
SeqlDNo.C82&C297 
SeqlDNo.C83&C298 
SeqlDNo. C84&C2S9 
SeqlDNo.C85&C300 
SeqlDNaC86&C301 
SeqlDNaC87&C3G2 
SeqlDNo.C88&C303 
SeqlDNo.C89&C304 
SeqlDNo.C90&C305 
SeqlDNaC91&C306 
SeqlDNo.C92&C307 
SeqlDNo.C93&C308 
SeqlO NO.C94&C309 
SeqlDNo.C95&C310 
SeqIDNo.C96&C311 
SeqlDNo.C97&C312 
SeqlDNo.C98&C313 
SeqlDNo. C99&C314 
SeqtDNo.C100&C315 
Seq IDNO.C101&C316 
SeqlO No, C102&C317 
SeqlDNo.C103&C318 
SeqlDNaC104&C319 
SeqiDNaC105&C320 
SeqlDNo.C1Q6&C321 
SeqlDNo.C107&C322 
SeqlONo.C108&C323 
SeqIDNo.C109&C324 
Seqi0No.C1t0&C325 
SeqiDNo.C11UC326 
SeqlDNo.C112&C327 
SeqiDNo.C113&C328 
SeqlDNo.C114&C329 
SeqlO No. C115&C330 
SeqlONo.C116&C331 
.SeqtDNaC117&C332 
SeqlDNaC118&C333 
SeqlDNo.C119&C334 
SeqlDNaC120&C335 
Seq!DNo.C121&C336 
SeqlDNaC122&C337 
SeqlDNaC123&C338 
SeqlDNo,C124&C339 
SeqlDNo.C125&C340 
Seq(DNaC126&C341 
SeqlDNo.C127&C342 
SeqtDNo. C128&C343 
SeqlDNaC129&C344 
SeqtDNo.C130&C345 
SeqlDNo.C131&C346 
SeqlDNaC132&C347 
SeqlDNo.C133&C348 
SeqlDNo.C134&C349 
Seq(DNaC135&C350 
SeqlDNaC136&C351 
SeqlDNaC137&C352 
SeqlDNo.C138&C353 
SeqtONo.C139&C354 
SeqlDNo.C140&C355 
SeqlDNo.C1418.C35S 
SeqlDNo.C1428.C357 
SeqlDNo.C143&C358 
. SeqtDNaC144&C359 
SeqlDNo.C145&C360 
SeqlO No. C146&C361 
SeqlO No. C147&C362 
SeqlDNaC148&C363 
SeqlDNaC149&C364 
Seq ID NO.C150&C355 
SeqlONaC151&C366 
Seq ID NO.C152&C357 
SeqlDNo.C153&C368 
SeqlDNo.C154&C369 
SeqtDNo.C155&C370 
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413719 


BE439580 


Hs.75498 


431462 


AW583672 


Hs.256311 


416498 


U33632 


Hs.79351 


413095 


AM94359 


Hs.30715 


426125 


X87241 


Hs.166994 


436729 


6E621807 


Hs.351316 


437145 


AF007216 


Hs.5462 


451820 


AW058357 


Hs.199248 


427557 


NM.002659 


Ks.179657 


408308 


AL033377 


Hs.44197 


421340 


F07783 


Hs.1369 


428187 


AI687303 


Hs.285529 


428187 


AI687303 


Ks^85529 


422278 


AF072873 


Hs.1 14218 


446619 


AU076643 


Hs.313 


419452 


U33635 


Hs.90572 


428242 


H55709 


Hs.2250 


439659 


AW970780 


Hs.594S3 


411825 


AK000334 


HsJ52415 


412314 


AA825247 


Hs.356084 


429150 


AF120103 


Hs.1 97366 


419073 


AW372170 


Hs.183918 


411628 


AW161449 


Hs.72290 


419508 


AW997938 


Hs.90786 


421779 


AI879159 


Hs.108219 


439668 


AI091277 


Hs.302634 


433336 


AF017986 


Hs.31386 


436972 


AA284679 


Ks.25640 


410268 


AA316181 


Hs.61635 


416370 


N90470 


Hs.203697 


437052 


AA861697 


Hs.1 20591 


421481 


AW391972 


Hs.1 04696 


444151 


AW972917 


Hs.1 28749 


426174 


AA547959 


Hs.1 15838 


410037 


AB020725 


Hs.58009 


425071 


NM.013989 


Hs.1 54424 


421829 


AB018330 


Hs.1 08708 


418576 


AW968159 


Hs,302740 


419693 


AA133749 


Hs.301350 


419693 


M133749 


Hs.301350 


446988 


Y09763 


Hs.22785 


448988 


Y09763 


Hs.22785 


448988 


Y09763 


Hs.22785 


448988 


Y09763 


Hs.22785 


430144 


AI732722 


Hs.98927 


408833 


AW612232 


Hs^254835 


452017 


AF109302 


Hs.27495 


415992 


CO 5837 


Hs.145807 


415992 


C05837 


Hs.145807 


443991 


NNL0Q225Q 


Hs.1 0082 


425976 


C75094 


Hs334514 


432800 


BE391046 


Hs.278952 


452955 


AW390282 


Hs.31130 


424339 


BE257148 


Hs.145416 


425263 


NM.001197 


Hs.155419 


421537 


BE383488 


Hs.1 05547 


434293 


NM.004445 


Hs.3796 


427715 


BE245274 


Hs.180428 


413049 


NW_002151 


Hs.823 


414555 


N98569 


Hs.76422 


422424 


AI186431 


Hs.296538 


432378 


AI493046 


Hs.146133 


409041 


AB033O25 


Hs.50081 



small Indudbte cytokine subfamily A (Cy 
granin-Gke neuroendocrine peptide precu 
potassium channel. subfamily K, member 1 
potassium voltage-gated channel tsk-rel 
FAT tumor suppressor (Drosophila) homoto 
transmembrane 4 superfamUy member 1 
solute carrier family 4, sodium btcarbon 
ESTs 

plasminogen activator, urokinase recepto 
hypothetical protein DKFZp564O0462 
decay accelerating factor for complement 
G protein-coupled receptor 49 
G protein-coupled receptor 49 
frizzled (Drosophila) homotog 6 
secreted phosphoprotein 1 (osteoponlin, 
PTK7 protein tyrosine kinase 7 
leukemia inhibitory factor (cholinergic 
leudne-nch repeat-containing G protein 
solute carrier famiy 39 (zinc transport 
G protein-coupled receptor 27 (GPR27) (S 
smoothened (OrosophBa) hornolog 
transmembrane receptor Unc5H2 mRNA 
wingless-type MMTV alteration site fami 
ATP-binding cassette, sub-family C (CFTR 
wingless-type MMTV integration site fami 
frizzled (DrosophBa) homotog 8 
secreted mzzled-related protein 2 (str 
claudln3 

six transmembrane epithelial antigen of 

C038 antigen (p45) 

ESTs 

KIAA1324 protein 
alpha-methylacyVCoA racemase 
Homo sapiens similar to EcNnoidin (LOC1 
WAA0918 protein 
deiodinase, bdothyronine, type II 
caldum/caimodutin-dependent protein kin 
Eplthenal calcium channel 2, CaT-like A 
FXYD domain-containing ion transport reg 
FXYD domain-containing ion transport reg 
gamnia-aminobutync acid (GABA) A recepto 
gamma-aminobutyric acid (GABA) A recepto 
gamma-aminobu^ric add (GABA) A recepto 
gamma-aminobutyrc acid (GABA) A recepto 
ERGL protein; ERGlC-534Ike protein 
ESTs 

prostate cancer associated protein 7 
hypothetical protein FU13593 
hypothetical protein FU13593 
potassium intermediate/small conductance 
NG22 protein 
AUM protein 

transmembrane 7 superfamQy member 2 
endoglycan 

BCL2-inieracting Hter (apoptosis-toduc 
neural proliferation, differentiation an 



KIAA1 181 protein 

hepsln (transmembrane protease, serine 1 
phospholipase A2. group 1IA (platelets, 
prostate differentiation factor 
ESTs 

Hypothetical protein, XP__051880 (KIAA1 19 



TABLE 79B 



SeqlDNo.C156&C371 
SeqlDNo.C157iC372 
SeqlDNo.C158&C373 
SeqlDNo.C159&C374 
SeqlDNaC160&C375 
Seq(ONaC161&C376 
SeqlDNaC162&C377 
SeqlONaC163&C378 
SeqlONaC164&C379 
SeqlDNo.C165&C380 
SeqlONaC166&C381 
SeqlDNo.C167&C382 
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Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

414991 1785136 J D78831 CI 7898 078863 

TABLE 79C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column ere Genbank IdenSfier (Gl) numbers. 'Dunham 1. et al" refers to the publication entitled The DNA 

sequence of human chromosome 22" Dunham I. et aJ., Nature (1999) 402:489495. 
Strand: Indicates DMA strand from which exoro were predicted. 
NLposition: indicates nudeotMe positions of predicted exons. 
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Seq ID NO: C2 DNA Sequence 

Nucleic Acid Accession fh Eos sequence 

Coding sequence t 1..3150 



Table 80: 

Seq ID NO: CI DNA Sequence 
Nucleic Acid Accession <h NM_005814 
J Coding sequence: 345.. 1304 

1 11 21 31 41 51 

I I I I I I 

CTACCCCTTT GTGAGCAGTC TAGGACTTTG TACACCTGTT AAGTAGGGAG AAGGCAGGGG 60 

1U AGGTGGCTGG TTTAAGGGGA ACTTQAGGGA AGTAGGGAAG ACTCCTCTTQ GGACCTTTGG 120 

AGTAGGTGAC ACATGAGCCC AGCCCCAGCT CACCTGCCAA TCCAGCTGAG GAGCTCACCT 180 

GCCAATCCAG CTGAGGCTGG GCAGAGGTGG GTGAGAAGAG GGAAAATTGC AGGGACCTCC 240 

AGTTGGGCCA GGCCAGAAGC TGCTGTAGCT TTAACCAGAC AGCTCAGACC TGTCTGGAGG 300 

1 ~ CTGCCAGTGA CAGGTTAGGT TTAGGGCAGA GAAGAAGCAA GACCATGGTG GGGAAGATGT 360 

ID GGCCTGTGTT GTGGACACTC TGTGCAGTCA GGGTGACCGT CGATGCCATC TCTGTGGAAA 420 

CTCCGCAGGA CGTTCTTCGG GCTTCGCAGG GAAAGAGTGT CACCCTGCCC TGCACCTACC 480 

ACACTTCCAC CTCCAGTCGA GAGGGACTTA TTCAATG6GA TAAGCTCCTC CTCACTCATA 540 

CGGAAAGGGT GGTCATCTGG CCGTTTTCAA ACAAAAACTA CATCCATGGT GAGCTTTATA 600 

^ AGAATCGCGT CAGCATATCC AACAATGCTG AGCAGTCCGA TGCCTCCATC ACCATTGATC 660 

ZKj AGCTGACCAT GGCTGACAAC GGCACCTACG AGTGTTCTGT CTCGCTGATG TCAGACCTGG 720 

AGGGCAACAC CAAGTCACGT GTCCGCCTGT TGGTCCTCGT GCCACCCTCC AAACCAGAAT 780 

GCGGCATCGA GGGAGAGACC ATAATTGGGA ACAACATCCA GCTGACCTGC CAATCAAAGG 840 

AGGGCTCACC AACCCCTCAG TACAGCTGGA AGAGGTACAA CATCCTGAAT CAGGAGCAGC 900 

CCCTGGCCCA GCCAGCCTCA GGTCAGCCTG TCTCCCTGAA GAATATCTCC ACAGACACAT 960 

CGGGTTACTA CATCTGTACC TCCAGCAATG AGGAGGGQAC GCAGTTCTGC AACATCACGG 1020 

TGGCCGTCAG ATCTCCCTCC ATGAACGTGG CCCTGTATGT GGGCAXCGCG GTGGGCGTGG 1080 

TTGCAGCCCT CATTATCATT GGCATCATCA TCTACTGCTG CTGCTGCCGA GGGAAGGACG 1140 

ACAACACTGA AGACAAGGAG GATGCAAGGC CGAACCGGGA AGCCTATGAG GAGCCACCAG 1200 

AGCAGCTAAG AGAACTTTCC AGAGAGAGGG AGGAGGAGGA TGACTACAGG CAAGAAGAGC 1260 

AGAGGAGCAC TGGGCGTGAA TCCCCGGACC ACCTCGACCA GTGACAGGCC AGCAGCAGAG 1320 

GGCGGCGGAG GAAGGGTTAG GGGTTCATTC TCCCGCTTCC TGGCCTCCCT TCTCCTTTCT 1380 

AAGCCCTGTT CTCCTGTCCC TCCATCCCAG ACATTGATGG GGACATTTCT TCCCCAGTGT 1440 

CAGCTGTGGG GAACATGGCT GGCCTGGTAA GGGGGTCCCT GTGCTGATCC TGCTGACCTC 1500 

ACTGTCCTGT GAAGTAACCC CTCCTGGCTG TGACACCTGG TGCGGGCCTG GCCCTCACTC 1560 

AAGACCAGGC TGCAGCCTCC ACTTCCCTCG TAGTTGGCAG GAGCTCCTGG AAGCACAGCG 1620 

CTGAGCATGG GGCGCTCCCA CTCAGAACTC TCCAGGGAGG CGATGCCAGC CTTGGGGGGT 1680 

GGGGGCTGTC CTGCTCACCT GTGTGCCCAG CACCTGGAGG GGCACCAGGT GGAGGGTTTG 1740 

CACTCCACAC ATCTTTCTTG AATGAATGAA AGAATAAGTG AGTATGCTTG GGCCCTGCAT 1800 

TGGCCTGGCC TCCAGCTCCC ACTCCCTTTC CAACCTCACT TCCCGTAGCT GCCAGTATGT 1860 

TCCAAACCCT CCTGGGAAGG CCACCTCCCA CTCCTGCTGC ACAGGCCCTG GGGAGCTTTT 1920 

GCCCACACAC TTTCCATCTC TGCCTGTCAA TATCGTACCT GTCCCTCCAG GCCCATCTCA 1980 

AATCACAAGG ATTTCTCTAA CCCTATCCTA ATTGTCCACA TACGTGGAAA CAATCCTGTT 2040 

ACTCTGTCCC ACGTCCAATC ATGGGCCACA AGGCACAGTC TTCTGAGCGA GTGCTCTCAC 2100 

TGTATTAGAG CGCCAGCTCC TTGGGGCAGG GCCTGGGCCT CATGGCTTTT GCTTTCCCTG 2160 

AAGCCCTAGT AGCTGGCGCC CATCCTAGTG GGCACTTAAG CTTAATTGGG GAAACTGCTT 2220 

TGATTGGTTG TGCCTTCCCT TCTCTGGTCT CCTTGAGATG ATCGTAGACA CAGGGATGAT 2280 

TCCCACCCAA ACCCACGTAT TCATTCAGTO AGTTAAACAC GAATTGATTT AAAGTGAACA 2340 

CACACAAGGG AGCTTGCTTG CAGATGGTCT GAGTTCTTGT GTCCTGGTAA TTCCTCTCCA 2400 

<n GGCCAGAATA ATTGGCATGT CTCCTCAACC CACATGGGGT TCCTGGTTGT TCCTGCATCC 2460 

CGATACCTCA GCCCTGGCCC TGCCCAGCCC ATTTGGGCTC TGGTTTTCTG GTGGGGCTGT 2520 

CCTGCTGCCC TCCCACAGCC TCCTTCTGTT TGTCGAGCAT TTCTTCTACT CTTGAGAGCT 2580 

CAGGCAGCGT TAGGGCTGCT TAGGTCTCAT GGACCAGTGG CTGGTCTCAC CCAACTGCAG 2640 

TTTACTATTG CTATCTTTTC TGGATGATCA GAAAAATAAT TCCATAAATC TATTGTCTAC 2700 

TTGCGATTTT TTAAAAAATG TATATTTTTA TATATATTGT TAAATCCTTT GCTTCATTCC 2760 

AAATGCTTTC AGTAATAATA AAATTGTGGG TGG 2793 



1 11 21 31 41 51 

I I I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

/U TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GOCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

1198 



WO 03/042661 



PCT/US02/36810 



CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCQTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG OGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1S60 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAAOG AGTCAGAATT TOGAGACAAA 1800 

CTCTCGCOGA TTCACATOGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCT6AGCTTC GGGTCACOGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCOG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TOGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG OGAOGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCOGGAGG CTGAGTGTTT CAGGCTGOGC 2760 

Zj TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA COGCCATGGA AAAAGCTCAG 3120 

CTCAAGCCTC CAGCCACCTC TGATGCCTGA 3150 



10 
15 
20 



30 



Seq ID NO: C3 DNA Sequence 
„ Nucleic Aci>d Accession #: NMJJ02421.2 
3D Coding sequence: 1..1410 

1 11 21 31 41 51 

I I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

4U CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

A c GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATAT CTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

rr\ GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

^ A GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

OU GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410 

Seq ID NO: C4 DNA Sequence 
CO Nucleic Acid Accession 8: Eos sequence 
Coding sequence: 1. .1410 



70 
75 
80 



1 11 21 31 41 51 

I 1 1 I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGA GTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660 

G GCCA TTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTG ATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 
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TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCGACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGOCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410 

Seq ID NO: C5 DNA Sequence 

Nucleic Acid Accession #: NM_014331.2 

Coding sequence : 1 . . 1506 

1 11 21 31 41 SI 

! I I I I I 

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TOCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGOGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACX3 CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAAOGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGOGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1S00 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATA AGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGC7TACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 

CCATCTCCAA AAAAAAAAAA AAAA 3144 

Seq ID NO: C6 DNA Sequence 

Nucleic Acid Accession fh NM.003246.1 

Coding sequence: 112.. 3624 *~ 

1 11 21 31 41 51 

I I I I I I 

GGAOGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTCGCCAC CGCTCCCGGC 60 
CGCCGCGCTC CGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120 
GCCTGGGGAC TAGGOGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180 
TCTGGCGGAG ACAACAGOGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 240 
TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 300 
GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 360 
GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 420 
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CTGCTGGCCC 
AAGGCGGGCA 
GAAGAAGCTC 
AGGGCCCAGC 
CAAAGC6TCT 
GTCAATGACA 
GAAGACATCC 
AACAACGTGG 
AAGGACTTGC 
CTCAGGGGCC 
GAGAACAAAG 
CAGTACAGAA 
TCAGTTACCA 
CCTGATGGAG 
CCATGGTCCG 
CGCTGCTGOG 
CACATTCAGG 
TGGTCATCTT 
TCTCCCAGCC 
TGCAAGAAAG 
TGTTCTGTCA 
CCCCAGTTTG 
CAGGACTGTC 
AGCTACCCTG 
ATCCAGTGCA 
GGAGAGCACC 
TTCACCGGCT 
TGCAAGCCCC 
AACTACCTGG 
GGCAATGGCA 
GTGTGCGTGG 
TCAGGGCAGG 
AATGATAAAA 
GACTATGACA 
GATCAGGCAG 
GACGGTATCC 
ACTGATATGG 
CAGCTGGACT 
GAAGATGGCC 
GACCATGACA 
CCTGATGACA 
GATGGTOGAG 
GACATCTGTC 
CCTCTGGACC 
AAAGAACTCG 
AATGCTGTGG 
GGATTTGTCT 
ACCCAGTCCT 
AAAGTTGTAA 
GGAAACACCC 
GATTTCACCG 
GTGATGTATG 
GCTGGTGGTA 
AAATACGAAT 
AATGCTGGTA 
CTTGGCTTCC 
TCAAGAAAAT 
ATCTTCCAAG 
AGTTGGGAAG 
CATCTCT 



TGGAGCGGAA 
CCCTGGACCT 
TCCTGGCAAC 
TGTACATOGA 
TCACCAGAGA 
ATTTCCAGGG 
TCAGGAACAA 
TGAATGGTTC 
AAGCCATCTG 
TGCGCACCAT 
AGTTGGCCAA 
ATAACGAGGA 
TCTGCAAAAA 
AATGCTGTCC 
AGTGGACCTC 
ATAGCCTCAA 
AGTGTGACAA 
GTTCTGTGAC 
CCCAGATGAA 
AOGCCTGCCC 
CCTGTGGAGG 
GAGGCAAGGA 
CAATTGATGG 
ATGGCAGCTG 
CAGATGTTGA 
GGTGTGAGAA 
CACAGCCCTT 
GTAACCCCTG 
GCCACTATAG 
TCATCTGCGG 
CCAATGCGAC 
AAGACTATGA 
TTCCAGATGA 
GAGATGATGT 
ACACAGACAA 
TCAATGAACG 
ATGGGGTTGG 
CTGACTCAGA 
ACCAGAACAA 
AAGATGGCAA 
AGGACAACTG 
GTGATGCCTG 
CTGAGAATGT 
CCAAAGGGAC 
TCCAGACTGT 
ACTTCAGTGG 
TTGGCTACCA 
ACTGGGACAC 
ACTCCACCAC 
CTGGCCAGGT 
CCTACAGATG 
AAGGGAAGAA 
GACTAGGGTT 
GTAGAGATCC 
TTGCACCTTC 
TTCTTTTCTG 
GCAGTTTTCA 
CATATAAACA 
GTGGCCATTC 



AGACCACTCT 
CAGCCTGACC 
CGGCCAGTGG 
CTGTGAAAAG 
CCTGGCCAGC 
GGTGCTGCAG 
AGGCTGCTCC 
CAGCCCTGCC 
GGGCATCTCC 
TGTGACCACG 
TGAGCTGAGG 
ATGGACTGTT 
GGTGTCCTGC 
TCGCTGTTGG 
CTGTTCTACG 
CAACCGATGT 
AAGATTTAAA 
ATGTGGTGAT 
TGGGAAACCC 
CATCAATGGA 
AGGGGTACAG 
CTGCGTTGGT 
ATGCCTGTCC 
GAAATGTGGT 
TGAGTGCAAA 
CACGGACCCC 
CGGCCAGGGT 
CACGGATGGG 
CGACCCCATG 
GGAGGACACA 
TTACCACTGC 
CAAGGATGGA 
CAGGGACAAC 
GGGAGACCGC 
CAATGGGGAA 
GGACAACTGC 
AGATCAGTGT 
CCGCATTGGA 
TCTGGACAAC 
GGGAGATGCC 
CAGACTCGTG 
CAAAGATGAT 
TGACATCAGT 
ATCCCAAAAT 
CAACTGTGAT 
CACCTTCTTC 
GTCCAGCAGC 
CAACCCCACG 
AGGGCCTGGC 
GCGCACCCTG 
GCGTCTCAGC 
AATCATGGCT 
GTTTGTCTTC 
CTAATCATCA 
TGGAACTATG 
TGCTTGCATC 
AAAACAGACT 
ATTGCTTTGG 
CACTCTGCCT 



GGCCAGGTCT 
GTCCAAGGAA 
AAGAGCATCA 
ATGGAGAATG 
ATCGCCAGAC 
AATGTOAGGT 
AGCTCTACCA 
ATCCGCACTA 
TGTGATGAGC 
CTGCAGGACA 
CGGCCTCCCC 
GATAGCTGCA 
CCCATCATGC 
CCCAGCGACT 
AGCTGTGGCA 
GAGGGCTCCT 
CAGGATGGTG 
GGTGTGATCA 
TGTGAAGGCG 
GGCTGGGGTC 
AAACGTAGTC 
GATGTAACAG 
AATCCCTGCT 
GCTTGTCCCC 
GAAGTGCCTG 
GGCTACAACT 
GTCGAACATG 
ACCCACGACT 
TACCGCTGCG 
GACCTGGATG 
AAAAAGGATA 
ATTGGTGATG 
TGTCCATTCC 
TGTGACAACT 
GGAGACGCCT 
CAGTACGTCT 
GACAATTGCC 
GATACCTGTG 
TGTCCCTATG 
TGTGACCACG 
CCCAATCCCG 
TTTGACCATG 
GAGACCGATT 
GACCCTAACT 
CCTGGACTCG 
ATCAACACCG 
CGCTTTTATG 
AGGGCTCAGG 
GAGCACCTGC 
TGGCATGACC 
CACAGGCCAA 
GACTCAGGAC 
TCTCAAGAAA 
AATTGTTGAT 
GGCTTGAGAA 
AGTGTGGACT 
CATCAGCATT 
TTTCCTTTTG 
TTGTCACAGA 



TCAGCGTGGT 
AGCAGCACGT 
CCCTGTTTGT 
CTGAGTTGGA 
TCCGCATCGC 
TTGTCTTTGG 
GTGTCCTCCT 
ACTACATTGG 
TGTCCAGCAT 
GCATCCGCAA 
TATGCTATCA 
CTGAGTGTCA 
CCTGCTCCAA 
CTGCGGACGA 
ATGGAATTCA 
CGGTCCAGAC 
GCTGGAGCCA 
CAAGGATCCG 
AAGCGCGGGA 
CTTGGTCACC 
GTCTCTGCAA 
AAAACCAGAT 
TTGCCGGCGT 
CTGGTTACAG 
ATGCCTGCTT 
GCCTGCCCTG 
CCACGGCCAA 
GCAACAAGAA 
AGTGCAAGCC 
GCTGGCCCAA 
ATTGCCCCAA 
CCTGTGATGA 
ATTACAACCC 
GTCCCTACAA 
GTGCTGCAGA 
ACAATGTGGA 
CCTTGGAACA 
ACAACAATCA 
TGCCCAATGC 
ATGATGACAA 
ACCAGAAGGA 
ACAGTGTGCC 
TCCGCCGATT 
GGGTTGTACG 
CTGTAGGTTA 
AAAGGGACGA 
TTGTGATGTG 
GATACTCGGG 
GGAACGCCCT 
CTCGTCACAT 
AGACGGGTTT 
CCATCTATGA 
TGGTGTTCTT 
TGAAAGACTG 
AACCCCCAGG 
CCTAGAACGT 
CAGCCTCCAA 
AAAAAGCATC 
GCAGGGTGCT 




GCTCTGCAAC 
GACCAAAGCC 
ATGGGACATC 
CAACCCCGCA 
CTGCAACAAG 
GAAGTGTACT 
TGGAAATGGC 
CAACCACAAT 
CCCCCCACGC 
CAAACAGGTG 
CGCCAAGTGC 
TGGCTACGCT 
TGAGAACCTG 
CCTTCCCAAC 
TGACGATGAC 
AGCTCAGTAT 
CCACAACCCA 
CATTGATGGA 
CCAGAGAGAC 
CAATCCGGAT 
GGATATTGAT 
CAACCAGGCT 
CGATGGCATT 
CTCTGACGGC 
AGACATCGAT 
CCAGATGATT 
CCATCAGGGT 
TGATGAGTTT 
TGACTATGCT 
GAAGCAAGTC 
CCTTTCTGTG 
GTGGCACACA 
AGGCTGGAAA 
CATTAGAGTG 
TAAAACCTAT 
CTCTGACCTG 
ATCATAAACC 
ATCACTTCTC 
GCGACCTGCC 
TGAATAAGAC 
TACTTGCTTC 
ATTGTGAGGC 



Seq ZD NO: C7 UNA Sequence 

Nucleic Acid Accession #: NM_002192 

Coding sequencei 86. .1366 



TCCACACACA 
AATCACAACA 
AAGTTGCTGG 
CCCCGACTGT 
AGAGATGGTG 
CGATGTCACC 
GGGCAAAGTC 
AATGAATGAA 
CAGGAAGACG 
TGCAGAAGTC 
CATCCGCCTC 
CGAGGAAGTG 
CGCTCGGAAG 
CCAGGGCAAG 
CAGCTTGGTT 
GGGCGGAGGT 
CCTCATGCTG 



11 
I 

CAAAAAACCT 
ACTTTTGCTG 
ATTATAGTGA 
CCGTCCTGTG 
GAGGCCGTCA 
CAGCCGGTAC 
GGGGAGAACG 
CTTATGGAGC 
CTGCACTTCG 
TGGCTCTTCC 
TTCCAGCAGC 
GGCTTAAAGG 
AGCACCTGGC 
AGCTCCCTGG 
CTCCTGGGCA 
GAAGGTGGGG 
CAGGCCCGGC 



21 
I 

GCGCGTGAGG 
CCAGGATGCC 
GGAGTTCCCC 
CGCTGGCCGC 
AGAAGCACAT 
CCAAGGCGGC 
GGTATGTGGA 
AGACCTCGGA 
AGATTTCCAA 
TAAAAGTCCC 
AGAAGCACCC 
GGGAGAGGAG 
ATGTCTTCCC 
ACGTTCGGAT 
AGAAGAAGAA 
CAGGAGCAGA 
AGTCTGAAGA 



31 
I 

GGGGAGGAAA 
CTTGCTTTGG 
CACCCCAGGA 
CCTCCCAAAG 
TTTAAACATG 
GCTTCTGAAC 
GATAGAGGAT 
GATCATCACG 
GGAAGGCAGT 
CAAGGCCAAC 
GCAGGGCAGC 
TGAACTGTTG 
TGTCTCCAGC 
TGCCTGTGAG 
GAAAGAAGAG 
TGAGGAAAAG 
CCACCCTCAT 



41 
I 

AQCAGGGCCT 
CTGAGAGGAT 
TCCGAGGGGC 
GATGTACCCA 
CTGCACTTGA 
GCGATCAGAA 
GACATTGGAA 
TTTGCOGAGT 
GACCTGTCAG 
AGGACCAGGA 
TTGGACACAG 
CTCTCTGAAA 
AGCATCCAGC 
CAGTGCCAGG 
GAGGGGGAAG 
GAGCAGTCGC 
CGCOGGCGTC 



51 
I 

TTAAAAAGGC 
TTCTGTTGGC 
ACAGCGCGGC 
ACTCTCAGCC 
AGAAGAGACC 
AGCTTCATGT 
GGAGGGCAGA 
CAGGAACAGC 
TGGTGGAGCG 
CCAAAGTCAC 
GGGAAGAGGC 
AAGTAGTAGA 
GGTTGCTGGA 
AGAGTGGCGC 
GGAAAAAGAA 
ACAGACCTTT 
GGCGGGGCTT 
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GGAGTGTGAT GGCAAGGTCA ACATCTGCTG TAAGAAACAG TTCTTTGTCA GTTTCAAGGA 1080 

CATCGGCTQG AATGACTGGA TCATTGCTCC CTCTGGCTAT CATGCCAACT ACTGCGAGGG 1140 

TGAGTGCCCG AGCCATATAG CAGGCAOGTC CGGGTCCTCA CTGTCCTTCC ACTCAACAGT 1200 

CATCAACCAC TACCGCATGC GGGGCCATAG CCCCTTTGCC AACCTCAAAT 0GT6CTGTGT 1260 

GCCCAOCAAG CTGAGACCCA TGTCCATGTT GTACTATGAT GATGGTCAAA ACATCATCAA 1320 

AAAGGACATT CAGAACATGA TCGTGGAGGA GTGTGGGTGC TCATAGAGTT GCCCAGCCCA 1380 

GGGGGAAAGG GAGCAAGAGT TGTCCAGAGA AGACAGTGGC AAAATGAAGA AATTTTTAAG 1440 

GTTTCTGAGT TAACCAGAAA AATAGAAATT AAAAACAAAA CAAAACAAAA AAAAAAACAA 1500 

AAAAAAACAA AAGTAAATTA AAAACAAACC TGATGAAACA GATGAAACAG ATGAAGGAAG 1560 

ATGTGGAAAT CTTAGCCTGC CTTAGCCAGG GCTCAGAGAT GAAGCAGTGA AGAGACAGAT 1620 

TGGGAGGGAA AGGGAGAATG GTGTACCCTT TATTTCTTCT GAAATCACAC TGATGACATC 1680 

AGTTGTTTAA ACGGGGTATT GTCCTTTCCC CCCTTGAGGT TCCCTTGTGA GCTTGAATCA 1740 

ACCAATCTGA TCTGCAGTAG TGTGGACTAG AACAACCCAA ATAGCATCTA GAAAGCCATG 1800 

AGTTTGAAAG GGCCCATCAC AGGCACTTTC CTAGCCTAAT 1840 

Seq ID NO: C8 DNA Sequence 

Nucleic Acid Accession 8: NM 000095.1 

Coding sequence: 26.. 2299 

1 11 21 31 41 51 

I I I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGGGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGOGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACOGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 



O^x CCCCTGCCCC GOGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

J\J CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCOG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA S40 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GGGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

JD CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGOGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGOG ACACTGACCT 840 

AGACGGCTTC CCGGAOGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

Aft CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

^CCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

45 



GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGGG ATGCGTCCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGOGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCAOGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGOG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCOGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

JU GGCCTTCCAG ACAGTCGTGC TGGACOOGGA GGGTGAOGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGOGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACAOGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT I860 

re CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

30 GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCOGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

OU CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GAOCAGGGTG AGGACCCGCG GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 

AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TQTGCAGGG 2439 



Seq ID NO: C9 DNA Sequence 

Nucleic Acid Accession #: XMJ)57014 

Coding sequence t 143.. 874 

1 11 21 31 41 51 

I I I I I I 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC OGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGOG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAQAC GGGAGCCCTG GGGCCAATGG 360 

CATTCCGGGT AC ACCTG GGA TCCCAGGTQG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC S40 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
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CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACT6TT 1020 

TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 

CAACCTTAAA AAAAAAAAAA AAAA 1284 

Seq ID NO: CIO DKA Sequence 
Nucleic Acid Accession 8: NM_003225 
Coding sequence: 41 J. 2 95 

1 11 21 31 41 51 

I I I I I I 

ATCCCTGACT CGGGGTCGCC TTTGGAGCAQ AGAGGAGGCA ATGGCCACCA TGGAGAACAA 60 

GGTGATCTGC GCCCTGGTCC TGGTGTCCAT GCTGGCCCTC GGCACCCTGG CCGAGGCCCA 120 

GACAGAGACG TGTACAGTGG CCCCCCGTGA AAGACAGAAT TGTGGTTTTC CTGGTGTCAC 180 

GCCCTCCCAG TGTGCAAATA AGGGCTGCTG TTTCGACGAC ACCGTTCGTG GGGTCCCCTG 240 

GTGCTTCTAT CCTAATACCA TCGACGTCCC TCCAGAAGAG GAGTGTGAAT TTTAGACACT 300 

TCTGCAGGGA TCTGCCTGCA TCCTGACGGG GTGCCGTCCC CAGCACGGTG ATTAGTCCCA 360 

GAGCTCGGCT GCCACCTCCA CCGGACACCT CAGACACGCT TCTGCAGCTG TGCCTCGGCT 420 

CACAACACAG ATTGACTGCT CTGACTTTGA CTACTCAAAA TTGGCCTAAA AATTAAAAGA 480 

GATCGATATT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 540 



Seq ID NO: Cll DNA Sequence 
Nucleic Acid Accession #: NM_015419.1 
jU Coding sequence: 1..8487 



1 11 21 31 41 51 

I I I I I i 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCOGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC S40 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT OGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

_ n GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

/U GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAQAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

_ GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

fJ TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGOCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 
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TCTACTATAG GGGAACCAGO TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

5 GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3S40 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

10 GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCOCCA AACAGTTGGA AATGGAGAAG 3660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC AOGGGAAGAG GCCAAACAAA 3720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 

15 AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

20 GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC GACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

25 ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAAGACAG CATATGTGAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTOCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

30 GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 

CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

35 GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTOCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 

40 ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 
CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC OCTGTGAGGC AACAGGAAAA 5640 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 
45 AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 
CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC AOGGCCTGGA CAGGATGGTG 5820 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 
CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 
50 OGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 
GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATOCGCCTG 6120 
CAOGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 
55 GTTTTCCCCA A06GGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 
GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACX3TGCAG 6420 
CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCOGC GGAGGACGGA CGTCAGGTAC 6480 
GGAGGAACCC TCAAGCTGGA CTGCAGOGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 
£ AGGCTGCCGT CCAAGAGGAT GATOGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 
60 TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGAOGGACA AAGATGCCGG AGATTACCTG 6660 
TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 
AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 
. GACGGGAGTC TGGTGAACTC CTTCATGCAG TOGGATGACA GCGGTGGAOG CACCAAGCGC 6900 
65 TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 
GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 
GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 
GGAGAOGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 
70 GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 
AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 
AAGATCAAOG GTAACCCCAA COCCATCACC ACOGTGGGGG AGATAGCAGC CGGGGGCAGT 7380 
CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 
CCGGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 
75 GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGOGACTCCG TCCAGCTGGT ATGCATGGCA 7560 
CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 
AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 
AGCCTCAACT GCTCTGOCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 
0 GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 
80 CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCOGCAAT 7860 
GCCGCTGGCC ACAOGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 
AAGCAGTATC ATAACCTGGT CAGCATCATG AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 
CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 
GGCCCCCAAA CCCTGGGAOG OGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 
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GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGQATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GQATCACCAO OGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACOG GATAAGTCGC ATCTGAAGGC AOGGGTTCAG 8340 

J GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8S20 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTPGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

W GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTOTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

1 * T CACC TAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

13 TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT 6ATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATA TGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ZAJ ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TT TTTGT TGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9S40 

ZD CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 

ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 9645 

Seq ID NO: C12 DNA Sequence 
JU Nucleic Acid Accession #: AK001903 
Coding sequence: none 

1 11 21 31 41 51 

;c I I 1 I I I 

D J TATCATGCAT GTGGGAAGGT GGGTGTGGTG AGAAAAGTTT TAAGGCAAGA GTAGATGGCC 60 

ATGTTCAACT TTACAAAATT TCTTGGAAAA CTGGCAGTAT TTTGAACTGC ATCTTCTTTG 120 

GTACCGGAAC CTGCAGAAAC AGTGTGAGAA ATTAAGTCCT GGTTCACTGC GCAGTAGCAA 180 

AGATGGTCAA GGCCATGGAA AAAGCAGAAA TTTACCAAGA AAGCTGATAC CCATGTATAG 240 

A( . TTCCCACTCA TCTCAAATAC ATCTGCTATC TTTTTAAGCT AAGTCCTAGA CATATCGGGG 300 

4U ATAACATGGG GGTTGATTAG TGACCACAGT TATCAGAAGC AGAGAAATGT AATTCCATAT 360 

TTTATT TGAA ACTTATTCCA TATTTTAATT GGATATTGAG TGATTGGGTT ATCAAACACC 420 

CACAAA CTTT AATTTTGTTA AATTTATATG GCTTTGAAAT AGAAGTATAA GTTGCTACCA 480 

TTTTTTGATA ACATTGAAAG ATAGTATTTT ACCATCTTTA ATCATCTTGG AAAATACAAG 540 

. . TCCTGTGAAC AACCACTCTT TCACCTAGCA GCATGAGGCC AAAAGTAAAG GCTTTAAATT 600 

H-J ATAACATATG GGATTCTTAG TAGTATGTTT TTTTCTTGAA ACTCAGTGGC TCTATCTAAC 660 

CTTACTATCT CCTCACTCTT TCTCTAAGAC TAAACTCTAG GCTCTTAAAA ATCTGCCCAC 720 

ACCAATCTTA GAAGCTCTGA AAAGAATTTG TCTTTAAATA TCTTTTAATA GTAACATGTA 780 

TTTTATGGAC CAAATTGACA TTTTCGACTA TTTTTTCCAA AAAAGTCAGG TGAATTTCAG 840 

C/\ CACACTGAGT TGGGAATTTC TTATCCCAGA AGACCAACCA ATTTCATATT TATTTAAGAT 900 

->U T GATTCCATA CTCCGTTTTC AAGGAGAATC CCTGCAGTCT CCTTAAAGGT AGAACAAATA 960 

CTTTCTATTT TTTTTTTCAC CATTGTGGGA TTGGACTTTA AGAGGTGACT CTAAAAAAAC 1020 

AGAGAACAAA TATGTCTCAG TTGTATTAAG CACX3GACCCA TATTATCATA TTCACTTAAA 1080 

AAAATGATTT CCTGTGCACC TTTTGGCAAC TTCTCTTTTC AATGTAGGGA AAAACTTAGT 1140 

_ _ CACCCTGAAA ACCCACAAAA TAAATAAAAC TTGTAGATGT GGGCAGAAGG TTTGGGGGTG 1200 

GACATTGTAT GTGTTTAAAT TAAACCCTGT ATCACTGAGA AGCTGTTGTA TGGGTCAGAG 1260 

AAAATGAATG CTTAGAAGCT GTTCACATCT TCAAGAGCAG AAGCAAACCA CATGTCTCAG 1320 

CTA TATTA TT ATTTATTTTT TATGCATAAA GTGAATCATT TGTTCTGTAT TAATTTCCAA 1380 

AGGGTTTTAC CCTCTATTTA AATGCTTTGA AAAACAGTGC ATTGACAATG GGTTGATATT 1440 

TTTC TTTAAA AGAAAAATAT AATTATGAAA GCCAAGATAA TCTGAAGCCT GTTTTATTTT 1500 

AAAACTTTTT ATGTTCTGTG GTTGATGTTG 1 T1 OTJ . T G TT TGTTTCTATT TTGTTGGTTT 1560 

TTTACTTTGT TTTTTGTTTT G TTTT G T TTT GTTTTGCATA CTACATGCAG TTCTTTAACC 1620 

AATGTCTGTT TGGCTAATGT AATTAAAGTT GTTAATTTAT ATGAGTGCAT TTCAACTATG 1680 

TCAATGGTTT CTTAATATTT ATTGTGTAGA AGTACTGGTA ATTTTTTTAT TTACAATATG 1740 

TTTAAAGAGA TAACAGTTTG ATATGTTTTC ATGTGTTTAT AGCAGAAGTT ATTTATTTCT 1800 

OJ ATGGCATTCC AGCGGATATT TTGGTGTTTG CGAGGCATGC AGTCAATATT TTGTACAGTT 1860 

AGTGGACAGT ATTCAGCAAC GCCTGATAGC TTCTTTGGCC TTATGTTAAA TAAAAAGACC 1920 

TGTTTGGGAT GT 1932 

70 Seq ID NO: 03 Protein Sequence 

Nucleic Acid Accession 8: Eos sequence 
Coding sequence: 1..5001 

ne 1 11 21 31 41 51 

75 I I I I I I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCOGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GOGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 
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15 
20 
25 
30 



TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTA TCCTGQ AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

J ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

1 n CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

IU CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTOGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGC CTCT C CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGG AGGA CCCCCCATTC AGGGGCOGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCOGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2S80 

D J ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCOGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGOGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCAOQACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA OTTGTT CTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTm i TITU TTTGTTTGTA ATAGCACATC 5580 
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AAAAAAAAAA 
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10 Seq ID NO: C14 DNA Sequence 

Nucleic Acid Accession #: NM_003014 
Coding sequence: 238.. 1278 ~ 

tr } 11 21 31 41 51 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60 

CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120 

AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180 

GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTOGCAG CGCGAGAGGG CAGTGCCATG 240 

ZU TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300 

GCGCCCTGCG AGGCGGTGOG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360 

ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420 

GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 

^ _ GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 

ZD CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600 

AGCCTGGCCT GCGAOGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660 

ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 

orj AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

3U AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020 

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

*r AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140 

5 J AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 

ACi ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

*fU GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 

CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTOGACC TAATATGTGC ATTGTAAAAT 1620 

J^TGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680 

ac TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 

*fr J AAGGAACAGT AGTGGAATQA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 1800 

TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860 

TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920 

AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 1980 

en GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040 

D\J ACAGTGAGTT TGTCTOTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 

ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 2160 

AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 2220 

TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 

_ AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2340 

J J AAAAA GAAC T TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 

ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 2460 

AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520 

ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 

GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640 

OU CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 

TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 2820 

Seq ID NO; CIS DNA Sequence 
OD Nucleic Acid Accession 8: NMJ)05940 
Coding sequence: 23.. 1489 " 

1 11 21 31 41 51 

7A 1 1 I 1 I t 

l\J AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60 

CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

7 C CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

/J CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

8n TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 

OU CGACTATGAT GAGA CCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

■ CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 

1207 



WO 03/042661 



PCT7US02/36810 



CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TQGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTAOGAC GGTGAAAAGC CAGTCCTGGG 1140 

0 CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCOGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC OACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

^ CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

1U GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

ID GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

rjr^ ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

ZU GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 

TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 2260 

25 Seg ID NO: Cl 6 DMA Sequence 

Nucleic Acid Accession #: NM_024022 
Coding sequence t 202.. 1563 

o n * J 1 21 31 41 51 

ACCGGGCACC GGACGGCTGG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

^ AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

3D TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

A( . GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

4U AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTCG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

. TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

4D AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

J\J CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

JJ TTAGTGGGAO CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

£(\ CGGC ACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

OU GCTGCTTTTT GTTTTTT6TT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TOCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CC 3 C ^GCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

OD CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

n(\ AAC CCACC CT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTOGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 

AAAAA 2465 

Seq ID NO: C17 DNA Sequence 
O Nucleic Acid Accession ft: NM 003220 
Coding sequence: 63.. 1376 ~ 



80 



1 11 21 31 41 51 

I I I I I I 

GAATTCCGGC TCTCTGGGTG AGAGACCGAG AGGGGCATAT CCGTTCACGC CGATCCATGA 60 

AAATGCTTTG GAAATTGACG GATAATATCA AGTACGAGGA CTGCGAGGAC CGTCACGACG 120 

GCACCAGCAA CGGGACGGCA CGGTTGCCCC AGCTGGGCAC TGTAGGTCAA TCTCCCTACA 180 

CGAGCGCCCC GCCGCTGTCC CACACCCCCA ATGCCGACTT CCAGCCCCCA TACTTCCCCC 240 

CACCCTACCA GCCTATCTAC CCCCAGTCGC AAGATCCTTA CTCCCACGTC AACGACCCCT 300 
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ACAGCCTGAA CCCCCTGCAC GCCCAGCCGC AGCCGCAGCA CCCAGGCTGG CCCGGCCAGA 360 

GGCAGAGCCA GGAGTCTGGG CTCCTGCACA CGCACCGGGG GCTGCCTCAC CAGCTGTOGG 420 

GCCTGGATCC TOGCAGGGAC TACAGGCGGC ACGAGGACCT CCTGCACGGC CCACACGCGC 480 

TCAGCTCAGG ACTCGGAGAC CTCTCGATCC ACTCCTTACC TCACGCCATC GAGGAGGTCC 540 

CGCATGTAGA AGACCCGGGT ATTAACATCC CAGATCAAAC TGTAATTAAG AAAGGCCCCG 600 

TGTCCCTGTC CAAGTCCAAC AGCAATGCCG TCTCCGCCAT CCCTATTAAC AAGGACAACC 660 

TCTTCGGCGG CGTGGTGAAC CCCAACGAAG TCTTCTGTTC AGTTCCGGGT CGCCTCTOGC 720 

TCCTCAGCTC CACCTCGAAG TACAAGGTCA CGGTGGCGGA AGTGCAGCGG CGGCTCTCAC 780 

CACCCGAGTG TCTCAACGCG TCGCTGCTGG GCGGAGTGCT CCGGAGGGCG AAGTCTAAAA 840 

ATGGAGGAAG ATCTTTAAGA GAAAAACTGG ACAAAATAGG ATTAAATCTG CCTGCAGGGA 900 

GACGTAAAGC TGCCAACGTT ACCCTGCTCA CATCACTAGT AGAGGGAGAA GCTGTCCACC 960 

TAGCCAGGGA CTTTGGGTAC GTGTGCGAAA CCGAATTTCC TGCCAAAGCA GTAGCTGAAT 1020 

TTCTCAACCG ACAACATTCC GATCCCAATG AGCAAGTGAC AAGAAAAAAC ATGCTCCTGG 1080 

CTACAAAACA GATATGCAAA GAGTTCACCG ACCTGCTGGC TCAGGACCGA TCTCCCCTGG 1140 

GGAACTCACO GCCCAACCCC ATCCTGGAGC CCGGCATCCA GAGCTGCTTG ACCCACTTCA 1200 

ACCTCATCTC CCAOGGCTTC GGCAGCCCCG CGGTGTGTGC CGOGGTCACG GCCCTGCAGA 1260 

ACTATCTCAC CGAGGCCCTC AAGGCCATGG ACAAAATGTA CCTCAGCAAC AACCCCAACA 1320 

GCCACACGGA CAACAACGCC AAAAGCAGTG ACAAAGAGGA GAAGCACAGA AAGTGAGGCT 1380 

CTCCTCCCGC CCCGCCCCTC CCACGCCTCA CCAGCCCCCC GCGCGCCCAC CCTCCGGCGG 1440 

ZU GTGACAGCTC CGGGATCAGC AACCCTTCCT GCTGCTGCTA CTGCTGCTGC TGCTGCOGCC 1500 

GCCGCCGCCG CCGCTGCCCT TGGGTCCCCC CGAGTCTCCG GQACTGCCCT CTCGACTGTC 1S60 

AGTGGGGCAG CCTCTCCGAC TCTGCACCCG CCTCGACCTC CCCACCCGCT CCCACACCCC 1620 

TGTGCCCCCG GAATTC 1636 

25 seq ID NO: CI 8 DMA Sequence 

Nucleic Acid Accession #: NMJ>02988 
Coding sequence: 71.. 340 

ork 1 11 21 31 41 51 

30 | | | | || 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 

GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 

CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 

GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 

AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 

GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 

GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

CCAC CCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 

CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 

ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 

TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 

CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 

ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 

AAAAAAAAAA AAAAAAAAAA AAA 803 
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Seq ID NO: CI 9 DNA Sequence 
Nucleic Acid Accession th NM_004063 
Coding sequence: 121.. 2619 



1 11 21 31 41 51 

I I I I I I 

AGGGAGTGTT CCCGGGGOAG ATACTCCAGT CGTAGCAAGA GTCTCGACCA CTGAATGGAA 60 

GAAAAGGACT TTTAACCACC ATTTTGTGAC TTACAGAAAG GAATTTGAAT AAAGAAAACT 120 

ATGATACTTC AGGCCCATCT TCACTCCCTG TGTCTTCTTA TGCTTTATTT GGCAACTGGA 180 

TATGGCCAAG AGGGGAAGTT TAGTGGACCC CTGAAACCCA TGACATTTTC TATTTATGAA 240 

GGCCAAGAAC CGAGTCAAAT TATATTCCAG TTTAAGGCCA ATCCTCCTGC TGTGACTTTT 300 

GAACTAACTG GGGAGAGAGA CAACATATTT GTGATAGAAC GGGAGGGACT TCTGTATTAC 360 

AACAGAGCCT TGGACAGGGA AACAAGATCT ACTCACAATC TCCAGGTTGC AGCCCTGGAC 420 

GCTAATGGAA TTATAGTGGA GGGTCCAGTC CCTATCACCA TAGAAGTGAA GGACATCAAC 480 

GACAATCGAC CCACGTTTCT CCAGTCAAAG TACGAAGGCT CAGTAAGGCA GAACTCTCGC 540 

CCAGGAAAGC CCTTCTTGTA TGTCAATGCC ACAGACCTGG ATGATCCGGC CACTCCCAAT 600 

GGCCAGCTTT ATTACCAGAT TGTCATCCAG CTTCCCATGA TCAACAATGT CATGTACTTT 660 

CAGATCAACA ACAAAACGGG AGCCATCTCT CTTACCCGAG AGGGATCTCA GGAATTGAAT 720 

CCTGCTAAGA ATCCTTCCTA TAATCTGGTG ATCTCAGTGA AGGACATGGG AGGCCAGAGT 780 

GAGAATTCCT TCAGTGATAC CACATCTGTG GATATCATAG TGACAGAGAA TATTTGGAAA 840 

GCACCAAAAC CTGTGGAGAT GGTGGAAAAC TCAACTGATC CTCACCCCAT CAAAATCACT 900 

CAGGTGCGGT GGAATGATCC CGGTGCACAA TATTCCTTAG TTGACAAAGA GAAGCTGCCA 960 

AGATTCCCAT TTTCAATTGA CCAGGAAGGA GATATTTACG TGACTCAGCC CTTGGACCGA 1020 

GAAGAAAAGG ATGCATATGT TTTTTATGCA GTTGCAAAGG ATGAGTACGG AAAACCACTT 1080 

TCATATCCGC TGGAAATTCA TGTAAAAGTT AAAGATATTA ATGATAATCC ACCTACATGT 1140 

CCGTCACCAG TAACCGTATT TGAGGTCCAG GAGAATGAAC GACTGGGTAA CAGTAXCGGG 1200 

ACCCTTACTG CACATGACAG GGATGAAGAA AATACTGCCA ACAGTTTTCT AAACTACAGG 1260 

ATTGTGGAGC AAACTCCCAA ACTTCCCATG GATGGACTCT TCCTAATCCA AACCTATGCT 1320 

GGAATGTTAC AGTTAGCTAA ACAGTCCTTG AAGAAGCAAG ATACTCCTCA GTACAACTTA 1380 

I J ACGATAGAGG TGTCTGACAA AGATTTCAAG ACCCTTTGTT TTGTGCAAAT CAAOGTTATT 1440 

GATATCAATG ATCAGATCCC CATCTTTGAA AAATCAGATT ATGGAAACCT GACTCTTGCT 1500 

GAAGACACAA ACATTGGGTC CACCATCTTA ACCATCCAGG CCACTGATGC TGATGAGCCA 1560 

TTTACTGGGA GTTCTAAAAT TCTGTATCAT ATCATAAAGG GAGACAGTGA GGGAOGCCTG 1620 

GGGGTTGACA CAGATCCCCA TACCAACACC GGATATGTCA TAATTAAAAA GCCTCTTGAT 1680 

TTTGAAACAG CAGCTGTTTC CAACATTGTG TTCAAAGCAG AAAATCCTGA GCCTCTAGTG 1740 

TTTGGTGTGA AGTACAATGC AAGTTCTTTT GCCAAGTTCA CGCTTATTGT GACAGATGTG 1800 

AATGAAGCAC CTCAATTTTC CCAACACGTA TTCCAAGCGA AAGTCAGTGA GGATGTAGCT 1860 

ATAGGCACTA AAGTGGGCAA TOTGACTGCC AAGGATCCAG AAGGTCTGGA CATAAGCTAT 1920 

TCACTGAGGG GAGACACAAG AGGTTGGCTT AAAATTGACC ACGTGACTGG TGAGATCTTT 1980 
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AGTGTGGCTC CATTGGACAG AGAAGCCGGA AGTCCATATC GGGTACAAGT GGTGGCCACA 2040 

GAAGTAGGGG GGTCTTCCTT GAGCTCTGTG TCAGAGTTCC ACCTGATCCT TATGGATGTG 2X00 

AATGACAACC CTCCCAGGCT AGCCAAGGAC TACACGGGCT TGTT C TTCT G CCATCCCCTC 2160 

AGTG CACCTG GAAGTCTCAT TTTOGAGGCT ACTGATGATG ATCAGCACTT ATTTOGGGGT 2220 

J CCCCATTTTA CATTTTCCCT CGGCAOTGGA AGCTTACAAA ACGACTGGGA AGTTTCCAAA 2280 

ATCAATGGTA CTCATGCCCG ACTGTCTACC AGGCACACAG AGTTTGAGGA GAGGGAGTAT 2340 

GTCGTCTTGA TCCGCATCAA TGATGGGGGT CGGCCACCCT TGGAAGGCAT T GT TTCTT T A 2400 

CCAGTTACAT TCTGCAGTTG TGTGGAAGGA AGTTGTTTCC GGCGAGCAGG TCACCAGACT 2460 

GGGATACCCA CTGTGGGCAT GGCAGTTGGT ATACTGCTGA CCACCCTTCT GGTGATTGGT 2520 

1U ATAATTTTAG CAGTTGTGTT TATCCGCATA AAGAAGGATA AAGGCAAAGA TAATGTTGAA 2580 

AGTGCTCAAG CATCTGAAGT CAAACCTCTG AGAAGCTGAA TTTGAAAAGG AATGTTTGAA 2640 

TTTATATAGC AAGTGCTATT TCAGCAACAA CCATCTCATC CTATTACTTT TCATCTAACG 2700 

TGCATTATAA TTTTTTAAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC 2760 

1 ^ TTTTTTGAGG TGGAGTCTTG CTCTGTCGCC CAGGCTGGAG TACAGTGGTG TGATCCCAGC 2820 

IJ TCACTGCAAC CTCCGCCTCC TGGGTTCACA TGATTCTCCT GCCTCAGCTT CCTAAGTAGC 2880 

TGGGTTTACA GGCACCCACC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAQAGACGGG 2940 

GTTTCGCCAT TTGGCCAGGC TGGTCTTGAA CTCCTGACGT CAAGTGATCT GCCTGCCTTG 3000 

GTCTCCCAAT ACAGGCATGA ACCACTGCAC CCACCTACTT AGATATTTCA TGTGCTATAG 3060 

ACATTAGAGA GATTTTTCAT TTTTCCATGA CATTTTTCCT CTCTGCAAAT GGCTTAGCTA 3120 

ZU CTTGTGTTTT TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC A TT TT TT C TT 3180 

TATCAAGGAG ATATATCAGT GTTGTCTCAT AGAACTGCCT GGATTCCATT TATGTTTTTT 3240 

CTGATTCCAT CCTGTGTCCC CTTCATCCTT GACTCCTTTG GTATTTCACT GAATTTCAAA 3300 

CATTTGTCAG AGAAGAAAAA CGTGAGGACT CAGGAAAAAT AAATAAATAA AAGAACAGCC 3360 

TTTTCCCTTA GTATTAACAG AAATGTTTCT GTGTCATTAA CCATCTTTAA TCAATGTGAC 3420 

ZD ATGTTGCTCT TTGGCTGAAA TTCTTCAACT TGGAAATGAC ACAGACCCAC AGAAGGTGTT 3480 

CAAACACAAC CTACTCTGCA AACCTTGGTA AAGGAACCAG TCAGCTGGCC AGATTTCCTC 3540 

ACTACCTGCC ATGCATACAT GCTGCGCATG TTTTCTTCAT TCGTATGTTA GTAAAGTTTT 3600 

GGTTATTATA TATTTAACAT GTGGAAGAAA ACAAGACATG AAAAGAGTGG TGACAAATCA 3660 

AGAATAAACA CTGGTTGTAG TCAGTTTTGT TTGTTAA 3697 



30 
35 



Seq ID NO: C20 DNA Sequence 
Nucleic Acid Accession #: NMJ)04443 
Coding sequence : 28 . . 3024 



1 11 21 31 41 51 

I 1 I I I I 

GGCTCGGCTC CTAGAGCTGC CACGGCCATG GCCAGAGCCC GCCCGCCGCC GCCGCCGTCG 60 

CCGCCGCCGG GGCTTCTGCC GCTGCTCCCT CCGCTGCTGC TGCTGCCGCT GCTGCTGCTG 120 

4U CCCGCCGGCT GCCGGGCGCT GGAAGAGACC CTCATGGACA CAAAATGGGT AACATCTGAG 180 

TTGGCGTGGA CATCTCATCC AGAAAGTGGG TGGGAAGAGG TGAGTGGCTA CGATGAGGCC 240 

ATGAATCCCA TCCGCACATA CCAGGTGTGT AATGTGCGCG AGTCAAGCCA GAACAACTGG 300 

CTTCGCACGG GGTTCATCTG GCGGCGGGAT GTGCAGCGGG TCTACGTGGA GCTCAAGTTC 360 

ACTGTGCGTG ACTGCAACAG CATCCCCAAC ATCCCCGGCT CCTGCAAGGA GACCTTCAAC 420 

4D CTCTTCTACT ACGAGGCTGA CAGCGATGTG GCCTCAGCCT CCTCCCCCTT CTGGATGGAG 480 

AACCCCTACG TGAAAGTGGA CACCATTGCA CCCGATGAGA GCTTCTCGCG GCTGGATGCC 540 

GGCCGTGTCA ACACCAAGGT GCGCAGCTTT GGGCCACTTT CCAAGGCTGG CTTCTACCTG 600 

GCCTTCCAGG ACCAGGGCGC CTGCATGTCG CTCATCTCCG TGCGCGCCTT CTACAAGAAG 660 

cr\ TGTGCATCCA CCACCGCAGG CTTCGCACTC TTCCCCGAGA CCCTCACTGG GGCGGAGCCC 720 

ACCTCGCTGG TCATTGCTCC TGGCACCTGC ATCCCTAACG CCGTGGAGGT GTOGGTGCCA 780 

CTCAAGCTCT ACTGCAACGG CGATGGGGAG TGGATGGTGC CTGTGGGTGC CTGCACCTGT 840 

GCCACCGGCC ATGAGCCAGC TGCCAAGGAG TCCCAGTGCC GCCCCTGTCC CCCTGGGAGC 900 

TACAAGGCGA AGCAGGGAGA GGGGCCCTGC CTCCCATGTC CCCCCAACAG CCGTACCACC 960 

TCCCCAGCCG CCAGCATCTG CACCTGCCAC AATAACTTCT ACCGTGCAGA CTCGGACTCT 1020 

3 J GCGGACAGTG CCTGTACCAC CGTGCCATCT CCACCCCGAG GTGTGATCTC CAATGTGAAT 1080 

GAAACCTCAC TGATCCTCGA GTGGAGTGAG CCCCGGGACC TGGGTGGCCG GGATGACCTC 1140 

CTGTACAATG TCATCTGCAA GAAGTGCCAT GGGGCTGGAG GGGCCTCAGC CTGCTCACGC 1200 

TGTGATGACA AOGTGGAGTT TGTGCCTCGG CAGCTGGGCC TGACGGAGCG CCGGGTCCAC 1260 

^ ATCAGCCATC TGCTGGCCCA CACGCGCTAC ACCTTTGAGG TGCAGGCGGT CAACGGTGTC 1320 

OU TCGGGCAAGA GCCCTCTGCC GCCTCGTTAT GCGGCCGTGA ATATCACCAC AAACCAGGCT 1380 

GCCCCGTCTG AAGTGCCCAC ACTACGCCTG CACAGCAGCT CAGGCAGCAG CCTCACCCTA 1440 

TCCTGGGCAC CCCCAGAGCG GCCCAACGGA GTCATCCTGG ACTACGAGAT GAAGTACTTT 1500 

GAGAAGAGCG AGGGCATCGC CTCCACAGTG ACCAGCCAGA TGAACTCCGT GCAGCTGGAC 1560 

GGGCTTCGGC CTGACGCCCG CTATGTGGTC CAGGTCCGTG CCCGCACAGT AGCTGGCTAT 1620 

CD GGGCAGTACA GCCGCCCTGC CGAGTTTGAG ACCACAAGTG AGAGAGGCTC TGGGGCCCAG 1680 

CAGCTCCAGG AGCAGCTTCC CCTCATCGTG GGCTCCGCTA CAGCTGGGCT TGTCTTCGTG 1740 

GTGGCTGTCX3 TGGTCATCGC TATCGTCTGC CTCAGGAAGC AGCGACACGG CTCTGATTCG 1800 

GAGTACACGG AGAAGCTGCA GCAGTACATT GCTCCTGGAA TGAAGGTTTA TATTGACCCT 1860 

„ TTTACCTACG AGGACCCTAA TGAGGCTGTT CGGGAGTTTG CCAAGGAGAT CGACGTGTCC 1920 

/U TGCGTCAAGA TCGAGGAGGT GATCGGAGCT GGGGAATTTG GGGAAGTGTG CCGTGGTCGA 1980 

CTGAAACAGC CTGGCCGCOG AGAGGTGTTT GTGGCCATCA AGACGCTGAA GGTGGGCTAC 2040 

ACCGAGAGGC AGCGGCGGGA CTTCCTAAGC GAGGCCTCCA TCATGGGTCA GTTTGATCAC 2100 

CCCAATATAA TCCGGCTCGA GGGCGTGGTC ACCAAAAGTC GGCCAGTTAT GATCCTCACT 2160 

„ GAGTTCATGG AAAACTGCGC CCTGGACTCC TTCCTCCGGC TCAACGATGG GCAGTTCACG 2220 

/J GTCATCCAGC TGGTGGGCAT GTTGCGGGGC ATTGCTGCCG GCATGAAGTA CCTGTCCGAG 2280 

ATGAACTATG TGCACCGCGA CCTGGCTGCT CGCAACATCC TTGTCAACAG CAACCTGGTC 2340 

TGCAAAGTCT CAGACTTTGG CCTCTCCCGC TTCCTGGAGG ATGACCCCTC CGATCCTACC 2400 

TACACCAGTT CCCTGGGCGG GAAGATCCCC ATCCGCTGGA CTGCCCCAGA GGCCATAGCC 2460 

TATCGGAAGT TCACTTCTGC TAGTGATGTC TGGAGCTACG GAATTGTCAT GTGGGAGGTC 2520 

OU ATGAGCTATG GAGAGCGACC CTACTGGGAC ATGAGCAACC AGGATGTCAT CAATGCCGTG 2580 

GAGCAGGATT ACCGGCTGCC ACCACCCATG GACTGTCCCA CAGCACTGCA CCAGCTCATG 2640 

CTGGACTGCT GGGTGCGGGA CCGGAACCTC AGGCCCAAAT TCTCCCAGAT TGTCAATACC 2700 

CTGGACAAGC TCATCCGCAA TGCTGCCAGC CTCAAGGTCA TTGCCAGCGC TCAGTCTGGC 2760 

ATGTCACAGC CCCTCCTGGA CCGCACGGTC CCAGATTACA CAACCTTCAC GACAGTTGGT 2820 
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GATTGGCTGG 
GCATCTTTTG 
CTGGCCGGCC 
CAGACGCTGC 
GATGCCAAGC 
TGTGGCCCAG 
TOCCTCCCCA 
TGATGACCCC 
CCACAACCTC 
TAAGCCGGGG 
AGCAGTCCCT 
CCATCCTGAA 
CCCTGTCCCC 
AGTTGOCCTT 
TGTGTGTGTG 
GCATGGGTGA 
AGCAGAATAA 



ATGCCATCAA 
ACCTGGTGGC 
ACCAGAAGAA 
CTGTGCAGGT 
AGCCGGCTGG 
AAGCTGGAAG 
GGAAGTGCGC 
TCCCCAAGCC 
ACACTTGTCT 
TTCCACAGGG 
CCCTCAGGAA 
GCCAGCTTGC 
ACCCCCGCCC 
TGCCCCCCAG 
TGTGCGCGCG 
GOGTGTAAAA 
AGGCAATAAG 



GATGGGGCGG 
CCAGATGACG 
GATCCTGAGC 
CTGACACCGG 
ACTTTCGGAC 
TTTGGGAAAG 
CCCAAACCTC 
CCTCAGGGCC 
GTTCTTCAGT 
CCCAGCCCTG 
CTGQAGGAGG 
ACCTCCAGTT 
TTGGTGCTGT 
AGACTGACTC 
CGCGCGCGTG 
GCTTGGCCCT 
ATGAA 



TACAAGGAGA 
GCAGAAGACC 
AGTATCCAGG 
CTCCCACGGG 
TCTTGGACTT 
GCCCAAGCTG 
TTCATATTGA 
CAGACCTTCC 
GCTGGAGGTC 
GCAGGGGTCT 
GGACTCCAGG 
TGCACAGGGA 
CATAAAAGGG 
TCAGAGCCAG 
TGTGTGTGCA 
GTGOCCTACA 



GCTTCGTCAG 
TGCTCCGTAT 
ACATGCGGCT 
GACCCTGAGG 
TTGGATGCCT 
GGACTTCTCC 
AGATGGATTA 
TGCTCTCCAG 
CTGGCAGGGT 
GGCCCCCCAG 
AATGGGGAAA 
TTTGTCCTGG 
CAGGCAGGGG 
AGATGGGATG 
CGCACTGGCC 
ATGGGGCCAG 



TGCGGGGTTT 
TGGGGTCACC 
GCAGATGAAC 
ACCGTGCAGG 
GGCCTTAGGC 
AGGCCTGTGT 
GGAGAGGOGG 
CAGGGGATCC 
CAGGCTGGGG 
GTAGGCGGAG 
TGTGACACCA 
GGGCTGAGGG 
CAGGCTGAGG 
TGTGAGTGTG 
TGCACAGAGA 
CTGGGCCGAC 



Seq ID NO: C2l DNA Sequence 
Nucleic Acid Accession ft: NM_001804 
Coding sequence: 82.. 879 



1 
I 

AGGTGAGCGG 
GCGGGGQACC 
TACCCCGGCC 
GCCCCGCCCC 
GCCCCCGCGC 
GCCGCCTACG 
CCCCCTCCAG 
CAGCCCCTCG 
GAGTGGATGC 
AAGGACAAGT 
CATTACAGCC 
ACTGAACGGC 
AAGAAGAAAC 
ACCCCAGCCG 
TCCTCTCCAA 
CGGGGGACCT 
CCGAGTCTCA 
CCCCTTGGCC 
TAGACCTCTG 
AAGCCCCAAA 
TACGTTCAGA 
ATGGAGCTGA 
AACTCACACC 
GCCCTTCCCA 
CACTGGGTTA 
GGAGGCCCTG 
CAGGCTGCCC 
CACAGCTGGA 
TTTTTCTTCT 



11 

I 

TTGCTCGTCG 
CCGCGGCCAC 
CAGCCAGGCC 
CGGCGCOCCC 
CCCCGACGGC 
GCCCGGGCCC 
ACTTTAGCCC 
GGGGCCCGGG 
GGOGCAGCGT 
ACCGCGTGGT 
GTTACATCAC 
AGGTGAAGAT 
AGCAGCAGCA 
GGCCATCCCT 
TGCCTGTGAA 
GGGGACTCGG 
GCCCTGACCT 
CATTGTGTGC 
GGGGATAAGG 
TGGTTGGGGG 
GGTGCAGCTG 
AAAAGATGGA 
TGCCTCTTCT 
GGCTGGATAC 
CAGAATCACA 
GGAGGAGAGA 
ATCAGGGCCC 
CAGGTGTTGT 
CTTTTCAAA 



21 
I 

TCGGGGCGGC 
CATGTATGTG 
AGCCAGCCTC 
GCAGTACCCC 
CTGGGGGGCG 
CGCGGCCCCT 
GGTGCCGGCG 
CACACCGTCC 
GGCGGCCGGA 
CTACACCGAC 
AATCCGGCGG 
CTGGTTCCAA 
ACAGCCCCCA 
GGGGGGCCTG 
AGAGGAGTTT 
GTGCTGGGAG 
TCTGGGACAT 
AGTAAGCCTG 
GAGTCCAGGG 
AGGGGCCTAG 
GAGGCCTGTG 
ATGCTTGCAG 
GCAGCCTCAC 
TAAGCACAAA 
GCCCTCATGA 
AGGGGGCAGA 
AGGGAGCCCC 
ATATAGAGTG 



31 
I 

CGGCAGCGGC 
GGCTATGTGC 
GGCCTGGGCC 
GACTTCTCCA 
CCCTTCCCTG 
GCCGCCAGCC 
CCCCCTGGGC 
TCGCCCGGAG 
GGCGGCGGTG 
CACCAACGCC 
AAATCAGAGC 
AACCGGCGGG 
CAGCCGCCGA 
TGTCCCAGCA 
CTGCCATAGC 
TGTGGCTCCT 
GGTGGACAGT 
TTGGATAAAG 
TGGATGATCT 
ACAAGGCTCC 
TGGGGACCAC 
AGCATGACCT 
CTCTACCTGC 
GCCCATAGCA 
TCATTCTCAG 
GTCTTCCCTA 
CAGAGGACTT 
GAATCTCTTG 



41 

I 

GGCTCCAGGG 
TGGACAAGGA 
CGGCAAACTA 
GCTACTCTCA 
CGCCCAAGGA 
CAGCTTCGCT 
CCGGCCCGGG 
CGCAGAGGCC 
GCAGCGGTAA 
TGGAGCTGGA 
TGGCTGCCAA 
CAAAGGAGCG 
TGGCCCACGA 
ACACCAGCCT 
CCCATGCCCA 
GTGGGCCCAG 
CACCTATCCA 
ACCTTCCAGC 
CAATCTCCCG 
AGGCCCCACC 
ACTGATCCTG 
GAGGAGGGAG 
CCCCATCATA 
CTGGGCTCTG 
TGAGGGCTCT 
CCAGGTTTCT 
TATTCGGACC 
GATGCAGCTT 



51 
I 

CCCAGCATGC 
TTCGCCCGTG 
CGGCCCCCOG 
CGTGGAGCCG 
CGACTGGGCC 
GGCATTCGGG 
CCTCCTGGCG 
GACGCCCTAC 
GACTCGGACC 
GAAGGAGTTT 
TCTGGGGCTC 
CAAAGTGAAC 
CATCACGGCC 
CCTGGCCACC 
GCCTGTGCGC 
GAGGTCTGGT 
CCCTCTGCAT 
TCCTGTGTTC 
TGGGCATCTC 
TCCTCCTCCA 
GAGAAAAGGG 
GAACGTGGTC 
AGGGCACTGA 
ATGGCTGCTC 
GGATTGAGAG 
ACACCCCCGC 
AAGCAGAGCT 
CAAGAATAAA 



Seq ID NO: C22 DNA Sequence 
Nucleic Acid Accession ft: NM 021978 
Coding sequence: 36.. 2603 



GACGCCTGTG 
GCGGAGGGGG 
TGAATGGCTT 
AGCATGGCCC 
TGCTGGGGAT 
TCTTCAATGG 
ACTCCACTGA 
GCGGAGTCCC 
GCAGCGTCAT 
CCGAGCGCGT 
AGTCCTTTGT 
CCCAGGACAA 
CCACGCCCGG 
GGGGGGACGC 
ACGAGOGCGG 
CCCTGGTGCA 
AGAACGTCCT 
CCACCTTCTT 
GGACATTCAA 
ACATTGAGGT 
AGCCCGGCGT 
ACTGCGGAGA 
TCCACTCAGA 



11 
I 

AGACCCGCGA 
CCCGAAGGAC 
GGAGGAAGGC 
GGGGCGCTGG 
CGGCTTCCTG 
CTACATGAGG 
GTTTGTAAGC 
ATTCCTGGGC 
CGCCTACTAC 
CATGGCCGAG 
GGTCACCTCA 
CAGCTGCAGC 
CTTCCCTGAC 
CGACTCAGTG 
CAGCGACCTG 
GTTGTGTGGC 
GCTCATCACA 
CCAGCTGCCT 
CAGCCCCTAC 
GCCCAACAAC 
GCCTGCGGGC 
GAGGTCCCAG 
TCAGTCCTAC 




AGCCCCTACC 
CTGAGCCTCA 
GTGACGGTGT 
ACCTACCCTC 
CTGATAACCA 
AGGATGAGCA 
TACCCAGGCC 
CAGCATGTGA 
ACCTGCCCCA 
TTCGTCGTCA 
ACCGACACCG 



31 
I 

GGACCATGGG 
GACTCAAGTA 
TGCCAGTCAA 
CAGCCGTGCT 
TGCAGTACCG 
AGAATTTTGT 
AGGTGAAGGA 
AGGAGTCGGC 
TCAGCATCCC 
TCATGCTGCC 
TCCCCACGGA 
ACGCCCGCGG 
CCGCTCATGC 
CCTTCCGCAG 
ACAACACCCT 
CCTCCTACAA 
ACACTGAGCG 
GCTGTGGAGG 
ACTACCCACC 
AGGTGCGCTT 
AGGACTACGT 
C CAGCAA CAG 
GCTTCTTAGC 



41 

I 

GAGCGATCGG 
CAACTCCCGG 
CAACGTCAAG 
GATCGGCCTC 
GGACGTGCGT 
GGATGCCTAC 
CGCGCTGAAG 
TGTGACGGCC 
GCAGCACCTG 
CCCGCGGGCG 
CTCCAAAACA 
TGTGGAGCTG 
CCGCTGCCAG 
CTTTGACCTT 
GAGCCCCATG 
CCTGACCTTC 
GOGGCATCCC 
CCGCTTACGT 
CAACATTGAC 
CAAATTCTTC 
GGAGATCAAT 
CAACAAGATC 
TGAATACCTC 



51 

I 

GCCCGCAAGG 
CACGAGAAAG 
AAGGTGGAAA 
CTCTTGGTCT 
GTCCAGAAGG 
GAGAACTCCA 
CTGCTGTACA 
TTCAGCGAGG 
GTGGAGGAGG 
CGCTCCCTGA 
GTACAGAGGA 
ATGCGCTTCA 
TGGGCCCTGC 
GCGTCCTGCG 
GAGCCCCACG 
CACTCCTCCC 
GGCTTTGAGG 
AAAGCCCAGG 
TGCACATGGA 
TACCTGCTGG 
GGGGAGAAAT 
ACAGTTCGCT 
TCCTACGACT 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3805 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1699 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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CCAGTGACCC ATCCCCGGGG CAGTTCACGT GCCGCACGGG GCGGTGTATC CGGAAGGAGC 1440 

TGCGCTGTGA TGGCTGGGCC GACTGCACCG ACCACAGCGA TGAGCTCAAC TGCAGTTGOG 1500 

ACGCCGGCCA CCAGTTCACG TGCAAGAACA AGTTCTGCAA GCCCCTCTTC TGGGTCTGCG 1560 

ACAGTGTGAA CGACTGCGGA GACAACAGCG ACGAGCAGGG GTGCAGTTGT CCGGCCCAGA 1620 

CCTTCAGGTG TTCCAATGGG AAGTGCCTCT CGAAAAGCCA GCAGTGCAAT GGGAAGGACG 1680 

ACTGTGGGGA CGGGTCCGAC GAGGCCTCCT GCCCCAAGGT GAACGTOGTC ACTTGTACCA 1740 

AACACACCTA CCGCTGCCTC AATGGGCTCT GCTTGAGCAA GGGCAACCCT GAGTGTGACG 1800 

GGAAGGAGGA CTGTAGCGAC GGCTCAGATG AGAAGGACTG CGACTGTGGG CTGCGGTCAT 1860 

TCACGAGACA GGCTCGTGTT GTTGGGGGCA CGGATGOGGA TGAGGGCGAG TGGCCCTGGC 1920 

AGGTAAGCCT GCATGCTCTG GGCCAGGGCC ACATCTGCGG TGCTTCCCTC ATCTCTCCCA 1980 

ACTGGCTGGT CTCTGCCGCA CACTGCTACA TCGATGACAG AGGATTCAGG TACTCAGACC 2040 

CCACGCAGTG GAOGGCCTTC CTGGGCTTGC ACGACCAGAG CCAGCGCAGC GCCCCTGGGG 2100 

TGCAGGAGCG CAGGCTCAAG OGCATCATCT CCCACCCCTT CTTCAATGAC TTCACCTTCG 2160 

ACTATGACAT CGCGCTGCTG GAGCTGGAGA AACCGGCAGA GTACAGCTCC ATGGTGCGGC 2220 

CCATCTGCCT GCCGGACGCC TCCCATGTCT TCCCTGCOGG CAAGGCCATC TGGGTCACGG 2280 

GCTGGGGACA CACCCAGTAT GGAGGCACTG GOGOGCTGAT CCTGCAAAAG GGTGAGATCC 2340 

GCGTCATCAA CCAGACCACC TGCGAGAACC TCCTGCCGCA GCAGATCACG CCGCGCATGA 2400 

TGTGCGTGGG CTTCCTCAGC GGCGGCGTGG ACTCCTGCCA GGGTGATTCC GGGGGACCCC 2460 

TGTCCAGCGT GGAGGCGGAT GGGOGGATCT TCCAGGCOGG TGTGGTGAGC TGGGGAGACG 2520 

GCTGCGCTCA GAGGAACAAG CCAGGCGTGT ACACAAGGCT CCCTCTGTTT CGGGACTGGA 2580 

TCAAAGAGAA CACTGGGGTA TAGGGGCCGG GGCCACCCAA ATGTGTACAC CTGGGGGGCC 2640 

ACCCATCGTC CACCCCAGTG TGCACGCCTG CAGGCTGGAG ACTGGACCGC TGACTGCACC 2700 

AGCGCCCCCA GAACATACAC TGTGAACTCA ATCTCCAGGG CTCCAAATCT GCCTAGAAAA 2760 

CCTCTCGCTT CCTCAGCCTC CAAAGTGGAG CTGGGAGGTA GAAGGGGAGG ACACTGGTGG 2820 

TTCTACTGAC CCAACTGGGG GCAAAGGTTT GAAGACACAG CCTCCCCCGC CAGCCCCAAG 2880 

CTGGGCCGAG GCGCGTTTGT ' GTATATCTGC CTCCCCTGTC TGTAAGGAGC AGCGGGAACG 2940 

GAGCTTCGGA GCCTCCTCAG TGAAGGTGGT GGGGCTGCCG GATCTGGGCT GTGGGGCCCT 3000 

TGGGCCACGC TCTTGAGGAA GCCCAGGCTC GGAGGACCCT GGAAAACAGA CGGGTCTGAG 3060 

ACTGAAAATG GTTTACCAGC TCCCAGGTGA CTTCAGTGTG TGTATTGTGT AAATGAGTAA 3120 

AACATTTTAT TTCTTTTTAA AAAAAAAAA 3149 

Seq ID NO: C23 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2268 



1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAAGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CAOGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG S40 

TTTGCTGTGG GGGTGAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTGCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG OGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1S00 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1S60 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGOGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAQAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TAOGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CGGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGA TTCTTGAGAC GCCCCTGA 2268 



Seq ID NO: C24 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequencer 1..2424 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC T G T GTTTT CC TGTTTTCCAG AGTGCCCCCA 60 
TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

1212 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



AGCAAAATGA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA 
GGGTTGCCTG 
AAGTCCCAGG 
TTTGCTGTGG 
AGAGGGCAGC 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGTCGACC 
CGGGCCAAAG 
CGAGTGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAGGACC 
GTTGCGGGCC 
GAGGCCGTGC 
TCGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCCGGGATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



TGTGGTGCTC 
AAGGGAGCTT 
GCCCCGAGAG 
CCTTGGATTC 
AAGGAGGGCG 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT 
GCTCGGCCAT 
ACAGGACGCT 
GGCGGACCCT 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGCG 
GGCCACGTAG 
CAGCGCGTCA 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
CGGAGAATTT 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGGGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT 
GCTGGACAGC 
GCGGTTTGTG 
CAGCAGGGAG 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGCG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGGCTCAGCC 
TGCCCGGCCT 
TGCAGCCGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGGACCA 
GTTCCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 
CGGGCCGTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG 
CGGGCTGCGA 
GG CACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TAOGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTQ 



TTCTGTTAGA 
CCATCACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 
CGGTGCCTGT 
CCTTCCGTGG 
TCGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 
TCCTGCGGGC 
TGCTGCACAT 
AAGCTGTGGT 
AGCTGAGGAA 
GTCTGCGGAG 
TGCGGTACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTTGCTCTGT 
CCGTGCAGGA 
AAATGGTGCC 



TGGGTCTAAC 
CTGTGACGGT 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTG 
CAGCGAGGCT 
CCTCTTCAGC 
CGAGGCTCAC 
CCCATGCTGG 
CAGCTGGAAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTGGAATGC 
CGGCTTCCTG 
CTCTCGGGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAGGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGTCTAC 
GTGCAGCCGG 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGTGCAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGCATC 
GCTTGCAGGT 
CCAGGACGTG 
ACCCAGCCCG 
GTGTCGGGAT 
ATGTGTGAGC 
GGGCAGCAGC 
TACCTTCTGG 



Seq ID NO: C25 DNA Sequence 

Nucleic Acid Accession #: XM_097386.3 

Coding sequence : 142 . . 79S 



1 
I 

CTCGCAGAAC 
GGGACCCCCG 
CTACCTCAGG 
TCCCGTGCAC 
CCTGGACCCC 
GGGTCGAGGG 
GTGTCTGTGG 
TTGAGCCTGC 
CGAGGCTGGG 
GGAAATGGGA 
CCGTGTCTGC 
TCTCCTGCCA 
CAGCACCCCC 
CTGGATGGGT 
CACAGTCAAG 
GGAACCTTCC 
ACGGGGGGCA 
TCTTATCATG 
CCTTGGCAAA 
GTTTGTCAGA 
AACTTAAAAA 
GTCTCGCCCT 
GTATTAAAAG 



11 
I 

CACCTGGACT 
TCCCACGCCT 
TCTGACTTTT 
CAGGGTCTGC 
TGGGGCCCCC 
CTGGGTOGTC 
GCGATCCGGC 
TGTTAACTCC 
TCCCGGCCCA 
GGCGGTTGCA 
AGAGGCTCCT 
CCGCCTGTCG 
CCTTATGCAG 
TCTAGTTCAC 
GCCAACGGGG 
GTGGGACCOG 
GCTGGGGTCG 
TTTTACCCAC 
TGTAAACTCA 
CGGCGTGGGA 
AATTTTTAAA 
GGCCGCGGCA 
GAAAAAA 



21 
I 

CTGTCCGTGT 
GGGCCCCGCG 
GATGCCAAAA 
AGCAGCCACT 
GTGCACCTGC 
TCCCTGGGGC 
CTCCCGGCGG 
GATGATTGTA 
GGAGAAGGAA 
GAGGGTCTAT 

GTGGGGGTTT 
ACTGGGAGGG 
TTGGGACCGT 
GCTCCCCCTG 
TAAGTGGCTG 
TTGTTAAGGG 
CTTGTCCCTT 
GCCTTTCATT 
ACCACGCCTG 
AAACATAAAA 
GGGTGGCCTG 



31 
I 

CTGTCCCCCG 
CCGGGGGAAG 
TCTGAGCCCC 
GGGGCCTGGC 
CCACCTCGGA 
TGOGTGTGTG 
TGGGTGGACC 
GGGACAGGCG 
GTCGCTGAAG 
GGGGCCOGGT 
GGTGGCCCCG 
AAATTCGGTG 
GGTCGGGCAG 
GGGGCCTGGC 
CTCTGAGATG 
TCCAGAAAGG 
TCACGCATCT 
TTTTCCCCAA 
CATGACGTGT 
AAACTCAGGT 
CTACTCTCTA 
TAACAATTTC 



41 

1 

GCCTCCAGGG 
CGCCTGCTGC 
TGGGGTGCCT 
TGCCTGCTGC 
GCCTGGGGAG 
TGTCGGGAAT 
TGGATTCTAA 
GGGTGGGTGG 
GCAGTGGCCA 
CCTGGATACT 
TTCAACCCCA 
TGGCTTTCTG 
TCCCCTCAGC 
TGCGTACTGA 
TTGGGAGAAA 
CGGGAGGGTG 
GTACAGTTGA 
TTGTGCTTTT 
GAAATTTCAG 
AATAGGAGGA 
CCTCTGGCTG 
AGTTTTCGCA 



51 
I 

CTCCTCTCCC 
CTATCTCTGT 
CTCCCCCGCC 
ATCTGGCGGC 
GGGCCGTGCA 
CCTGCGTGTC 
CTCAGAGGAC 
GGGGTGGGCG 
TGCTGGCCGT 
CGGCAGGAAG 
GCCGTGCCCA 
GGGTGCAGCT 
CACGAGGACC 
GTGGGTGCCC 
GGCGGCTTCT 
GGCACGGGGC 
ATTTCCTTTC 
GCATTTTTTT 
TTTCTCTGGA 
AAAAAAAAAA 
GGCCCAGCCT 
GAACATTCAG 



Seq ID NO: C26 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 95.. 2128 



1 
I 

GGGGTAGTTT 
CTCCTGGAAA 
TGCTTCCTGG 
CACACTGAGA 
GATCAATATA 
GCTTCTGAAA 
CTTCACCCCT 
GCGAGTCAGC 



11 
I 

GTAGGGACGC 
GGAGAGACAC 
GAGCTTGTGG 
GTATCTGGAG 
TCCCAAGACT 
ACCCACTGGA 
CAGCATAAAA 
TTCTCAGCTG 



21 
I 

AGCTCTCCAC 
CAGCATTTGC 
TCCGCGTTGA 
ACCTTCATGT 
GGTCAGACTT 
CCCTGGACAA 
TGCTGOGCCT 
TGGTTTTTAA 



31 

i 

GTGCGCGACT 
CACAATGCTG 
CCATCCCAAT 
TGGAGGAGTG 
TGCTCTTTGG 
ATATGGGGTC 
TCGTCTGCCG 
AGCTGTCAGT 



41 
I 

GCGAGGCTGG 
TCATCCACTG 
GAAGAGCAGC 
ATGCTCAAGT 
TGGGAACAGA 
CAGGCAGATG 
AATTTGAAGA 
GATATCTGCA 



51 
I 

ACGCTACGGG 
ACTTTACATT 
AGAAAGACGT 
TAGTAGAACA 
AGCATTGCTG 
CAAAGCTTCT 
TGGTGAGGTT 
AAATCCTGAA 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2424 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1337 



60 

120 

180 

240 

300 

360 

420 

480 



1213 



V 



WO 03/042661 



PCT/DS02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TATTAGAAGA 
GAAGAAAAAA 
TTCTCCAACA 
TATATATGAC 
COCTTTGACG 
AGCACTTGCG 
TTGGCTAGAC 
CTTAOGATTT 
AAACCAACTC 
GGAAGAAATG 
TGAAACACAG 
TTTGGAAGTA 
CCCTAAACTT 
CAAACAATAT 
TGAACAAGGA 
AAATGTAGCA 
TGAAATGTAT 
GTTGGCATCG 
CCTTTCATTT 
AAACATGGAT 
CAAACAGCTG 
CGAAGCCAAG 
CTACCTTGTC 
GTTGATTAAA 
CAAACAGTGG 
CTTTACTGCT 
CTACATTTTC 
CCACAAATTG 
GGCAAGCCAA 
GACTAACAGA 
GGCACATCCC 
TTTTCTATAA 
GCATGTGGTT 
TCGTGCTGAT 
TAGCAAGAAC 
GAACTGCTTT 
CCCTGGAGGC 
ACAACCCAAG 
ATGACCACTG 
GCAAGCCCCA 
GGGTGACACA 
CTACTCTGGG 
CCAGTGTCAA 
CCATTCTCAA 
GGCTTTAGGC 
TGGATTTGGT 
GTAAATCACA 
TGCCTTGAGA 
CTGAATTGTC 
CAAAAGCAAT 
TGTTCTGCCA 
TCAGTTCTCC 
TCAGCTGTTG 
GAGCCACATA 
TGGAAATTCC 
CGCTGAATTT 
GTTCACATCA 
CGTATTATAC 
CCTAGTCAGT 
GATTTTACCT 
TTCAACATGC 
AACTTTGTTT 
CACACAGCAA 
AATGCTACTG 
CCCAAATTCC 
TATCTACAGT 
GATTTATATG 



TCAGAAGAGC 
GACAAAAATA 
GCTTCAGGTT 
CCCATCAATG 
GAACAAAACT 
GATATGTACC 
TCCTCACGCT 
AAATATTATT 
TATGAGCAAG 
TTGATCTTTG 
GATTTTGCAG 
ACCCTAGAAG 
GCAGATAATC 
TGGTTTATCT 
GAACCACTAG 
GGAAGAAAAT 
TTGAGATGTG 
AAGGGCAAAA 
CTGAGGATGA 
ATGAACCCAG 
GCCGCCCGGA 
CTGCGGTTCA 
AGATTTAAAG 
ATTGATGCAG 
AATGTAAACT 
TTCACCTGCC 
TTGTCCACCC 
ACCGGOGGTC 
AGGOGCCCCT 
CAACATACAT 
TTTCCTTACT 
AATGGGTAGG 
ACAGTTCTCT 
CTGTCTCATT 
AACTCTTATT 
TAGCTCCTGT 
CAAGGTCTAG 
GTCAGAGGAG 
ACCCACATCT 
TTCCATTCCT 
GGGGGACAGG 
TTGGGATTCA 
GGATTCCAGT 
AGACCACCAT 
TGGTCTGGGT 
ATCTATGTTG 
AGCCAATAAT 
CTGCCTTGAG 
ATTTTCCATT 
ACCAGAAGTA 
CACAAAGAAC 
TAGGATGCAC 
CTTTATTTTC 
GAGACAGACT 
TCTTTTAGCC 
CTGAGGCCTT 
TAAAGAACTT 
TTGATGTTGG 
TCTTTAAGAT 
GTTTGCTGCT 
TTCCTTAGCT 
TAACCAAACT 
CATCCCAAAG 
TTCCAAAGAG 
TATGATGAAA 
TGATTCAGAA 
GATTTGAAAA 



TTTCCTTGTT 
ATAAGGAACC 
CATCAGTAAG 
GAACACCAGC 
GCAGCATCCT 
AGCCTCGGTC 
CCCTTATGGA 
CTTTCTTCGA 
CCAGGTGGGC 
CAGCTCTACA 
GCGAGTCCGA 
GTGGAAAAGC 
TCAAATTATT 
TTAAAGACAC 
AAAAACTAAA 
TTGGAATCAA 
ACCATGAGAA 
CCATGGCAGA 
AAAACAGGAA 
AATGTTTTGT 
TCCTGGAGGC 
TCCAGGCGTG 
GAAGCAAAAA 
CCACCGGGAT 
GGGAAACCCG 
TGAGTGCAGA 
GCTCCAAGGA 
AGGATTGAAA 
CCCCAGAGGG 
TCACCGCTGG 
TTGCCCTGTG 
CAGGAGAAAA 
GACTTGCAGA 
ACTAAGTCAA 
TCACAAACTC 
CTTTTCAAAA 
GGGTAGAAAG 
TGGCCCTCAG 
GGTCTTAGTC 
ACAGATCTCT 
AAAACCCATT 
GGAAGACAGG 
CAGGTGTCTA 
GTCCAAGGTC 
CATGGGGAAG 
GTACGACTCC 
AGACTTTTTT 
ACAGTGCTTG 
TTGGTTTGTT 
AAGGGAAATA 
TTGGGGTGTA 
CCCTCAGGGA 
CATCAAAGCC 
TGGCAAGGGA 
TCTCCTTAGA 
GCTTAAAGCT 
GATTTGAAAT 
TCATTTCTCA 
TATAACTGGT 
GAGAACATCT 
TTTCATAGTT 
TTGTTTGGTT 
AAATAAACAA 
CAACTTGATG 
TTTTAAATTA 
ATCTGCATTT 
AAAAAAAAAA 



AAAGCCGTCT 
CATAATTGAA 
TCCTGGTTTA 
ATCATCCACC 
CGCATTCAGC 
TCTGGTTGAT 
ACAAGGCATC 
CTTGAATCCT 
CATTCTCTTA 
GTACCACATT 
GGTTGATGAA 
GGACAGCCTT 
TAGGCCCAAG 
ATCCATAGCA 
TCTTAGAGGC 
GTTACTAATC 
TCAATACGCC 
CAGCTCCTAC 
CTCTGCATCT 
GTCACCACGG 
GCACCAGAAC 
GCAGTCACTG 
AGATGACATT 
TCCAGTGACA 
GCAGGTGGTC 
TTGCAAGATT 
CCAGAATGAA 
CAAGCACGCG 
ATCCCTAACG 
TCACCCAGAT 
CTACCAGCCA 
GCAGGTGCCC 
ACCTGCCAGG 
TGGAGAAGAC 
AGGTATGAAA 
TGGCAGAGGG 
GGGAGGGGTG 
TGTCATCTGT 
ATTGGTCTCC 
CAGCCACCTG 
TCTCAACCCA 
CACAGTCCTC 
TCCCAACTGG 
TGACAGCTCC 
CGTCCCTCTT 
TGGCCTTTTA 
CTCCCCCTCT 
CCTTGAGAGA 
AGAGGTGGGA 
TCAGACAATA 
AGGATAAGGC 
GCCTGGCCAG 
CTCTGAGAAG 
CCCCCTGGTT 
GGTGAATGTG 
CAGAAGTGGT 
GTTTTCTATA 
GTCCTATTTC 
CCTACATTAA 
CTGCCTAATT 
GTCTGACATT 
ACAGTTTTCA 
GTGTGACAAA 
GTTTTTTTTA 
ATGGGCACCT 
TTTATTCTTT 
A 



GGTGACTATT 
GATATTCTAA 
TACAGTAAAA 
ATGACTTGGT 
CAACCCCCCC 
AAAGCCAAGC 
CAAGAGGATG 
AAATATGATG 
GAAGAAATTG 
AGCAAACTGT 
ATAGAAGCGG 
TTGGAGGACA 
AAGTTACTAC 
TACTTTAAAA 
TGCGAAGTTG 
CCTGTTGCCG 
CAATGGATGG 
CAGCCAGAGG 
CAGGTGGCTT 
TGTGCAAAAA 
GTGGCCCAGA 
CCTGAGTTTG 
CTGGGAGTTT 
ACATGGAGAT 
ATCGAGTTTG 
GTGCACGAGT 
ACACTCGATG 
TGCTCGGCTC 
TGCCCAGCAT 
CCTCATTCAA 
CGGAAGGAGC 
TAAGATTGCT 
TGTATGGCTA 
AGAAAGGTAA 
CGAAACGCCT 
AGTTCCTACA 
GGGCTACCAG 
CCACAGTGAT 
TCAGATTTCT 
TAAGTCCTTT 
GATCCATGTC 
TCTGTTCATA 
TCAGGGAGAG 
CCACTGGCTG 
ATCGCTGGTC 
TCTAAAGGAC 
GTTTTTTGCT 
GTGAGCCAAT 
GGGGTGGGTT 
TTTTATTATT 
AAAAGCTCCA 
AGTTCCGAGG 
TGAGACCTCA 
CTGAGCCAGT 
AATGAAGCCT 
TTAGGCATTT 
GAAACAAGTG 
TCAGTTCTAT 
AATAATGCTT 
TACCAAAGCC 
TCCATGAAAA 
GGGGAGCGTT 
AAAAAAAAAA 
ATACTGAGTG 
TTCAACATCA 
TATATGACTT 



GTGTCATCTC 
TAACAGCTGC 
TTGAGAAGGT 
TTTTCATAGA 
ATCCCATTTT 
CCCGTGAGCG 
GCAATTCCGG 
AGCTGCCATC 
CCCAGGCACC 
GGAAAATCTG 
CTAAGTGTAC 
TATTTTAGAA 
CTCGATGTCA 
AGACCTTCAG 
CAAAGGAACC 
TCTTCCATGA 
AACAAACCTA 
CAAAAGGTCA 
TTTGCTTCCT 
TTAAGTAAAA 



Seq ID NO: C27 Protein Sequence 
Protein Accession #: NP_005161.1 

1 11 21 31 41 51 

1 I I 1 I I 

MDGGTLPRSA PPAPPVPVGC AARRRPASPE LLRCSRRRRP ATABTGGGAA AVARRNERER 
NRVKLVNLGF QALRQHVPHG GASKKLSKVE TLRSAVEYIR ALQRLLAEHD AVRNALAGGL 
RPQAVRPSAP RGPPGTTPVA ASPSRASSSP GRGGSSEPGS PRSAYSSDDS GCEGALSPAE 
RELLDFSSWL GGY 

Seq ID NO: C28 DNA Sequence 
Nucleic Acid Accession NM 017763 
Coding sequence: 169.. 2520 ~ 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3460 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4471 



60 
120 
180 
193 



11 



21 
I 



31 
I 



41 
I 



51 



1214 



WO 03/042661 



AAAAAAAAAA AACTTTAGAG AAAGGAAGGG CCAAAACTAC GACTTGGCTT TCTGAAACGG 60 

AAGCATAAAT GTTCTTTTCC TCCATTTGTC TGGATCTGAG AACCTGCATT TGGTATTAGC 120 

TAGTGGAAGC AGTATGTATG GTTGAAGTGC ATTGCTGCAG CTGGTAGCAT GAGTGGTGGC 180 

CACCAGCTGC AGCTGGCTGC CCTCTGGCCC TGGCTGCTGA TGGCTACCCT GCAGGCAGGC 240 

TTTGGACGCA CAGGACTGGT ACTGGCAGCA GCGGTGGAGT CTGAAAGATC AGCAGAACAG 300 

AAAGCTGTTA TCAGAGTGAT CCCCTTGAAA ATGGACCCCA CAGGAAAACT GAATCTCACT 360 

TTGGAAGGTG TGTTTGCTGG TGTTGCTGAA ATAACTCCAG CAGAAGGAAA ATTAATGCAG 420 

TCCCACCCAC TGTACCTGTG CAATGCCAGT GATGACGACA ATCTGGAGCC TGGATTCATC 480 

AGCATCGTCA AGCTGGAGAG TCCTCGACGG GCCCCCCGCC CCTGCCTGTC ACTGGCTAGC 540 

AAGGCTCGGA TGGCGGGTGA GCGAGGAGCC AGTGCTGTCC TCTTTGACAT CACTGAGGAT 600 

CGAGCTGCTG CTGAGCAGCT GCAGCAGCGG CTGGGGCTGA CCTGGCCAGT GGTGTTGATC 660 

TGGGGTAATG ACGCTGAGAA GCTGATGGAG TTTGTGTACA AGAACCAAAA GGCCCATGTG 720 

AGGATTGAGC TGAAGGAGCC CCCGGCCTGG CCAGATTATG ATGTGTGGAT CCTAATGACA 780 

GTGGTGGGCA CCATCTTTGT GATCATCCTG GCTTCGGTGC TGCGCATCCG GTGCOGCCCC 840 

CGCCACAGCA GGCCGGATCC GCTTCAGCAG AGAACAGCCT GGGCCATCAG CCAGCTGGCC 900 

ACCAGGAGGT ACCAGGCCAG CTGCAGGCAG GCCCGGGGTQ AGTGGCCAGA CTCAGGGAGC 960 

AGCTGCAGCT CAGCCCCTGT GTGTGCCATC TGTCTGGAGG AGTTCTCTGA GGGGCAGGAG 1020 

CTAOGGGTCA TTTCCTGCCT CCATGAGTTC CATCGTAACT GTGTGGACCC CTGGTTACAT 1080 

CAGCATCGGA CTTGCCCCCT CTGOGTGTTC AACATCACAG AGGGAGATTC ATTTTCCCAG 1140 

TCCCTGGGAC CCTCTCGATC TTACCAAGAA CCAGGTCGAA GACTCCACCT CATTCGCCAG 1200 

CATCCCGGCC ATGCCCACTA CCACCTCCCT GCTGCCTACC TGTTGGGCCC TTCCCGGAGT 1260 

GCAGTGGCTC GGCCCCCACG ACCTGGTCCC TTCCTGCCAT CCCAGGAGCC AGGCATGGGC 1320 

CCTCGGCATC ACCGCTTCCC CAGAGCTGCA CATCCCCGGG CTCCAGGAGA GCAGCAGCGC 1380 

CTGGCAGGAG CCCAGCACCC CTATGCACAA GGCTGGGGAA TGAGCCACCT CCAATCCACC 1440 

TCACAGCACC CTGCTGCTTG CCCAGTGCCC CTACGCCGGG CCAGGCCCCC TGACAGCAGT 1SO0 

GGATCTGGAG AAAGCTATTG CACAGAACGC AGTGGGTACC TGGCAGATGG GCCAGCCAGT 1560 

GACTCCAGCT CAGGGCCCTG TCATGGCTCT TCCAGTGACT CTGTGGTCAA CTGCACGGAC 1620 

ATCAGCCTAC AGGGGGTCCA TGGCAGCAGT TCTACTTTCT GCAGCTCCCT AAGCAGTGAC 1680 

TTTGACCCCC TAGTGTACTG CAGCCCTAAA GGGGATCCCC AGCGAGTGGA CATGCAGCCT 1740 

AGTGTGACCT CTCGGCCTCG TTCCTTGGAC TCGGTGGTGC CCACAGGGGA AACCCAGGTT 1800 

TCCAGCCATG TCCACTACCA CCGCCACCGG CACCACCACT ACAAAAAGCG GTTCCAGTGG 1860 

CATGGCAGGA AGCCTGGCCC AGAAACCGGA GTCCCCCAGT CCAGGCCTCC TATTCCTCGG 1920 

ACACAGCCCC AGCCAGAGCC ACCTTCTCCT GATCAGCAAG TCACCGGATC CAACTCAGCA 1980 

GCCCCTTCGG GGCGGCTCTC TAACCCACAG TGCCCCAGGG CCCTCCCTGA GCCAGCCCCT 2040 

GGCCCAGTTG ACGCCTCCAG CATCTGCCCC AGTACCAGCA GTCTGTTCAA CTTGCAAAAA 2100 

TCCAGCCTCT CTGCCCGACA CCCACAGAGG AAAAGGCGGG GGGGTCCCTC CGAGCCCACC 2160 

CCTGGCTCTC GGCCCCAGGA TGCAACTGTG CACCCAGCTT GCCAGATTTT TCCCCATTAC 2220 

ACCCCCAGTG TGGCATATCC TTGGTCCCCA GAGGCACACC CCTTGATCTG TGGACCTCCA 2280 

GGCCTGGACA AGAGGCTGCT ACCAGAAACC CCAGGCCCCT GTTACTCAAA TTCACAGCCA 2340 

GTGTGGTTGT GCCTGACTCC TCGCCAGCCC CTGGAACCAC ATCCACCTGG GGAGGGGCCT 2400 

TCTGAATGGA GTTCTGACAC CGCAGAGGGC AGGCCATGCC CTTATCCGCA CTGCCAGGTG 2460 

CTGTCGGCCC AGCCTGGCTC AGAGGAGGAA CTCGAGGAGC TGTGTGAACA GGCTGTGTGA 2520 

GATGTTCAGG CCTAGCTCCA ACCAAGAGTG TGCTCCAGAT GTGTTTGGGC CCTACCTGGC 2580 

ACAGAGTCCT GCTCCTGGGA AAGGAAAGGA CCACAGCAAA CACCATTCTT TTTGCCGTAC 2640 

TTCCTAGAAG CACTGGAAGA GGACTGGTGA TGGTGGAGGG TGAGAGGGTG CCGTTTCCTG 2700 

CTCCAGCTCC AGACCTTGTC TGCAGAAAAC ATCTGCAGTG CAGCAAATCC ATGTCCAGCC 2760 

AGGCAACCAG CTGCTGCCTG TGGCGTGTGT GGGCTGGATC CCTTGAAGGC TGAGTTTTTG 2820 

AGGGCAGAAA GCTAGCTATG GGTAGCCAGG TGTTACAAAG GTGCTGCTCC TTCTCCAACC 2880 

CCTACTTGGT TTCCCTCACC CCAAGCCTCA TGTTCATACC AGCCAGTGGG TTCAGCAGAA 2940 

CGCATGACAC CTTATCACCT CCCTCCTTGG GTGAGCTCTG AACACCAGCT TTGGCCCCTC 3000 

CACAGTAAGG CTGCTACATC AGGGGCAACC CTGGCTCTAT CATTTTCCTT TTTTGCCAAA 3060 

AGGACCAGTA GCATAGGTGA GCCCTGAGCA CTAAAAGGAG GGGTCCCTGA AGCTTTCCCA 3120 

CTATAGTGTG GAGTTCTGTC CCTGAGGTGG GTACAGCAGC CTTGGTTGCT CTGGGGGTTG 3180 

AGAATAAGAA TAGTGGGGAG GGAAAAACTC CTCCTTGAAG ATTTCCTGTC TCAGAGTCCC 3240 

AGAGAGGTAG AAAGGAGGAA TTTCTGCTGG ACTTTATCTG GGCAGAGGAA GGATGGAATG 3300 

AAGGTAGAAA AGGCAGAATT ACAGCTGAGC GGGGACAACA AAGAGTTCTT CTCTGGGAAA 3360 

AGTTTTGTCT TAGAGCAAGG ATGGAAAATG GGGACAACAA AGGAAAAGCA AAGTGTGACC 3420 

CTTGGGTTTG GACAGCCCAG AGGCCCAGCT CCCCAGTATA AGCCATACAG GCCAGGGACC 3480 

CACAGGAGAG TGGATTAGAG CACAAGTCTG GOCTCACTGA GTGGACAAGA GCTGATGGGC 3540 

CTCATCAGGG TGACATTCAC CCCAGGGCAG CCTGACCACT CTTGGCCCCT CAGGCATTAT 3600 

CCCATTTGGA ATGTGAATGT GGTGGCAAAG TGGGCAGAGG ACCCCACCTG GGAACCTTTT 3660 

TCCCTCAGTT AGTGGGGAGA CTAGCACCTA GGTACCCACA TGGGTATTTA TATCTGAACC 3720 

AGACAGACGC TTGAATCAGG CACTATGTTA AGAAATATAT TTATTTGCTA ATATATTTAT 3780 

CCACAAAAAA AAAAAAAAAA AA 3802 



Seq ID NO: C29 Protein Sequence 
Protein Accession #: NP_004280.2 

1 11 21 31 41 51 

I I I I I I 

MNSSAHYHVN PSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPPLQ LNSHTTNPBQ 60 

TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDBINL MSLATEDNPD PIDVSQLFDE 120 

PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHB SSSHHDLEGA VGGYYPBPSK 180 

LCHLDQSDSD PHGDLTFQHV FHNHTYHLQP TAPESTSEPP PWPGKSQKIR SRYLEDTDRN 240 

LSRDEQRAKA LHIPFSVDEI VGMPVDSPNS MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 300 

CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IPSRLRDDQG 360 

RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 400 

Seq ZD NO: C30 DNA Sequence 
Nucleic Acid Accession #* NM_004442 
Coding sequence : 1 9 . . 2 982 

X 11 21 31 41 51 



1215 



WO 03/042661 



PCTYUS02/36810 



I I I I I 1 

GCCCCGGGAA GCGCAGCCAT GGCTCTGCGG AGGCTGGGGG CC6CGCTGCT GCTGCTGCCG 60 

CTGCTCGCCG CCGTGGAAGA AACGCTAATG GACTCCACTA CAGCGACTGC TGAGCTGGGC 120 

TGGATGGTGC ATCCTCCATC AGGGTGGGAA GAGGTGAGTG GCTACGATGA GAACATGAAC 180 

ACGAT CCGC A CGTACCAGGT GTGCAACGTG TTTGAGTCAA GCCAGAACAA CTGGCTACGG 240 

ACCAAGTTTA TCCGGCGCCG TGGCGCCCAC CGCATCCACG TGGAGATGAA GTTTTCGGTG 300 

CGTGACTGCA GCAGCATCCC CAGCGTGCCT GGCTCCTGCA AGGAGACCTT CAACCTCTAT 360 

TACTATGAGG CTGACTTTGA CTCGGCCACC AAGACCTTCC CCAACTGGAT GGAGAATCCA 420 

TGGGTGAAGG TGGATACCAT TGCAGCCGAC GAGAGCTTCT CCCAGGTGGA CCTGGGTGGC 480 

CGOGTCATGA AAATCAACAC CGAGGTGCGG AGCTTCGGAC CTGTGTCCCG CAGCGGCTTC 540 

TACCTGGCCT TCCAGGACTA TGGCGGCTGC ATGTCCCTCA TCGCCGTGCG TGTCTTCTAC 600 

CGCAAGTGCC CCCGCATCAT CCAGAATGGC GCCATCTTCC AGGAAACCCT GTCGGGGGCT 660 

GAGAGCACAT CGCTGGTGGC TGCCCGGGGC AGCTGCATCG CCAATGCGGA AGAGGTGGAT 720 

GTACCCATCA AGCTCTACTG TAACGGGGAC GGCGAGTGGC TGGTGCCCAT CGGGCGCTGC 780 

ATGTGCAAAG CAGGCTTCGA GGCCGTTGAG AATGGCACCG TCTGCOGAGG TTGTCCATCT 840 

GGGACTTTCA AGGCCAACCA AGGGGATGAG GCCTGTACCC ACTGTCCCAT CAACAGCCGG 900 

ACCACTTCTG AAGGGGCCAC CAACTGTGTC TGCCGCAATG GCTACTACAG AGCAGACCTG 960 

GACCCCCTGG ACATGCCCTG CACAACCATC CCCTCCGCGC CCCAGGCTGT GATTTCCAGT 1020 

GTCAATGAGA CCTCCCTCAT GCTGGAGTGG ACCCCTCCCC GCGACTCCGG AGGCCGAGAG 1080 

GACCTCGTCT ACAACATCAT CTGCAAGAGC TGTGGCTCGG GCOGGGGTGC CTGCACCCGC 1140 

TGGGGGGACA ATGTACAGTA CGCACCACGC CAGCTAGGCC TGACCGAGCC ACGCATTTAC 1200 

ATCAGTGACC TGCTGGCCCA CACCCAGTAC ACCTTCGAGA TCCAGGCTGT GAACGGCGTT 1260 

ACTGACCAGA GCCCCTTCTC GCCTCAGTTC GCCTCTGTGA ACATCACCAC CAACCAGGCA 1320 

GCTCCATCGG CAGTGTCCAT CATGCATCAG GTGAGCCGCA CCGTGGACAG CATTACCCTG 1380 

TCGTGGTCCC AGCCAGACCA GCCCAATGGC GTGATCCTGG ACTATGAGCT GCAGTACTAT 1440 

GAGAAGGAGC TCAGTGAGTA CAACGCCACA GCCATAAAAA GCCCCACCAA CACGGTCACC 1S0O 

GTGCAGGGCC TCAAAGCCGG CGCCATCTAT GTCTTCCAGG TGOGGGCACG CACCGTGGCA 1560 

GGCTACGGGC GCTACAGCGG CAAGATGTAC TTCCAGACCA TGACAGAAGC CGAGTACCAG 1620 

ACAAGCATCC AGGAGAAGTT GCCACTCATC ATCGGCTCCT CGGCGGCTGG CCTGGTCTTC 1680 

CTCATTGCTG TGGTTGTCAT CGCCATCGTG TGTAACAGAA GACGGGGGTT TGAGCGTGCT 1740 

GACTCGGAGT ACACGGACAA GCTGCAACAC TACACCAGTG GCCACATGAC - CCCAGGCATG 1800 

AAGATCTACA TCGATCCTTT CACCTACGAG GACCCCAACG AGGCAGTGCG GGAGTTTGCC 1860 

AAGGAAATTG ACATCTCCTG TGTCAAAATT GAGCAGGTGA TCGGAGCAGG GGAGTTTGGC 1920 

GAGGTCTGCA GTGGCCACCT GAAGCTGCCA GGCAAGAGAG AGATCTTTGT GGCCATCAAG 1980 

ACGCTCAAGT CGGGCTACAC GGAGAAGCAG OGCOGGGACT TCCTGAGCGA AGCCTCCATC 2040 

ATGGGCCAGT TCGACCATCC CAACGTCATC CACCTGGAGG GTGTCGTGAC CAAQAGCACA 2100 

CCTGTGATGA TCATCACCGA GTTCATGGAG AATGGCTCCC TGGACTCCTT TCTCCGGCAA 2160 

AACGATGGGC AGTTCACAGT CATCCAGCTG GTGGGCATGC TTCGGGGCAT CGCAGCTGGC 2220 

ATGAAGTACC TGGCAGACAT GAACTATGTT CACCGTGACC TGGCTGCCCG CAACATCCTC 2280 

GTCAACAGCA ACCTGGTCTG CAAGGTGTCG GACTTTGGGC TCTCACGCTT TCTAGAGGAC 2340 

GATACCTCAG ACCCCACCTA CACCAGTGCC CTGGGGGGAA AGATCCCCAT CCGCTGGACA 2400 

GCCCCGGAAG CCATCCAGTA CCGGAAGTTC ACCTCGGCCA GTGATGTGTG GAGCTACGGC 2460 

ATTGTCATGT GGGAGGTGAT GTCCTATGGG GAGCGGCCCT ACTGGGACAT GACCAACCAG 2520 

GATGTAATCA ATGCCATTGA GCAGGACTAT CGGCTGCCAC CGCCCATGGA CTGCCCGAGC 2580 

GCCCTGCACC AACTCATGCT GGACTGTTGG CAGAAGGACC GCAACCACCG GCCCAAGTTC 2640 

GGCCAAATTG TCAACAOGCT AGACAAGATG ATCCGCAATC CCAACAGCCT CAAAGCCATG 2700 

GCGCCCCTCT CCTCTGGCAT CAACCTGCCG CTGCTGGACC GCACGATCCC CGACTACACC 2760 

AGCTTTAACA CGGTGGAOGA GTGGCTGGAG GCCATCAAGA TGGGGCAGTA CAAGGAGAGC 2820 

TTCGCCAATG CCGGCTTCAC CTCCTTTGAC GTCGTGTCTC AGATGATGAT GGAGGACATT 2880 

CTCCGGGTTG GGCTCACTTT GGCTGGCCAC CAGAAAAAAA TCCTGAACAG TATCCAGGTG 2940 

ATGOGGGCGC AGATGAACCA GATTCAGTCT GTGGAGGTTT GACATTCACC TGCCTCGGCT 3000 

CACCTCTTCC TCCAAGCCCC GCCCCCTCTG CCCCACGTGC CGGCCCTCCT GGTGCTCTAT 3060 

CCACTGCAGG GCCAGCCACT CGCCAGGAGG CCACGGGCCA OGGGAAGAAC CAAGCGGTGC 3120 

CAGCCACGAG ACGTCACCAA GAAAACATGC AACTCAAACG ACGGAAAAAA AAAGGGAATG 3180 

GGAAAAAAGA AAACAGATCC TGGGAGGGGG CGGGAAATAC AAGGAATATT TTTTAAAGAG 3240 

GATTCTCATA AGGAAAGCAA TGACTGTTCT TGCGGGGGAT AAAAAAGGGC TTGGGAGATT 3300 

CATGCGATGT GTCCAATCGG AGACAAAAGC AGTTTCTCTC CAACTCCCTC TGGGAAGGTG 3360 

ACCTGGCCAG AGCCAAGAAA CACTTTCAGA AAAACAAATG TGAAGGGGAG AGACAGGGGC 3420 

- A CGCCCTTGGC TCCTGTCCCT GCTGCTCCTC TAGGCCTCAC TCAACAACCA AGCGCCTGGA 3480 

OU GGACGGGACA GATGGACAGA CAGCCACCCT GAGAACCCCT CTGGGAAAAT CTATTCCTGC 3540 

CACCACTGGG CAAACAGAAG AATTTTTCTG TCTTTGGAGA GTATTTTAGA AACTCCAATG 3600 

AAAGACACTG TTTCTCCTGT TGGCTCACAG GGCTGAAAGG GGCTTTTGTC CTCCTGGGTC 3660 

AGGGAGAACG CGGGGACCCC AGAAAGGTCA GCCTTCCTGA GGATGGGCAA CCCCAGGTCT 3720 

GCAGCTCCAG GTACATATCA CGCGCACAGC CTGGCAGCCT GGCCCTCCTG GTGCCCACTC 3780 

CCGCCAGCCC CTGCCTCGAG GACTGATACT GCAGTGACTG CCGTCAGCTC CGACTGCOGC 3840 

TGAGAAGGGT TGATCCTGCA TCTGGGTTTG TTTACAGCAA TTCCTGGACT OGGGGGTATT 3900 

TTGGTCACAG GGTGGTTTTG GTTTAGGGGG TT T G TTTGTT GGGTTGTTTT TTGTTTTTTG 3960 

GTTTTTTTTA ATGACAATGA AGTGACACTT TGACATTTCC TACCTTTTGA GGACTTGATC 4020 

CTTCTCCAGG AAGAAGGTGC TTTCTGCTTA CTGACTTAGG CAATACACCA AGGGCGANAT 4080 

TTTATATGCA CATTTCTGGA TTTTTTTATA CGGTTTTCAT TGACACTCTT CCCTCCTCCC 4140 

ACCTGCCACC AGGCCTCACC AAAGCCCACT GCCATGGGGC CATCTGGGCC ATTCAGAGAC 4200 

TGGAGTGAGA TTTGGGTGTG GAGGGGGAGG CGCCAAGGTG GAGGAGCTTC CCACTCCAGG 4260 

ACTGTTGATG AAAGGGACAG ATTGAGGAGG AAGTGGGCTC TGAGGCTGCA GGGCTGGAAG 4320 

TCCTTGCCCA CTTCCCACTC TCCTGCCCCA ATCTATCTAG TACTTCCCAG GCAAATAGGC 4380 

CCCTTTGAGG CTCCTGAGTG CCCTCAGATG GTCAAAACCC AGTTTTCCCT CTGGGAGCCT 4440 

AAACCAGGCT GCATCGGAGG CCAGGACCOG GATCATTCAC TGTGATACCC TGCCCTCCAG 4500 

AGGGTGCGCT CAGAGACACG GGCAAGCATG CCTCTTCCCT TCCCTGGAGA GAAAGTGTGT 4560 

GATTTCTCTC CCACCTCCTT CCCCCCACCA GACCTTTGCT GGGCCTAAAG GTCTTGGCCA 4620 

TGGGGACGCC CTCAGTCTAG GGATCTGGCC ACAGACTCCC TCCTGTGAAC CAACACAGAC 4680 

ACCCAAGCAG AGCAATCAGT TAGTGAATTG GAATTCCCCA AGTCTTTGCT ATTGTGAATA 4740 

GTGCTGCAAT AAACATACGT GTGCATGTGT CTTTATAGTA GAATGATCTA TAATCCTCTG 4800 

GGTATGTACC CAGTAATGGG ATTGCTGGGT CAAATGGTTT TTCTGGTTCT AGATCCTTGA 4860 

GGAATTGCCA CACTGTCTTT CACAATGGTT GAACTAATTT ACACTCCTAC CAACAGTGTA 4920 

AAAGTCTTCC TGTTTCTCCA CATCCTCTCC AGCATCTGTT GTTTCCTGAC TTTTTAATGA 4980 
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TTGCCATTCT 
TGACCAGTGA 
TGAGAAGTGT 
TAAACTTAAT 
ACAAAGCGTC 
GCTGAAAATG 
GAGAACTTCT 
CCCAAAGATA 
TTGAAATGAA 
GGAAGCCCAT 
GGGGACCAAT 
CCAAACTAAG 
GAGATTTTGT 
AAAAGAAAAA 
ATGAAGAAAC 
AGATTCACAC 
ACGCAGACTG 
CCATCTCACG 



AACTGGTGTG 
AGATGAGCTT 
CTGTTCATAA 
GAAATAAAGC 
CAAGAAATAT 
ACAGGGAGAA 
CCAACCTAGC 
TTTCTCGAGA 
GGAAAAAATG 
CAGACTAACA 
ATTCAACATT 
CTTCATAAGC 
CACCACCAGG 
CTGGTGCCAG 
CGTGTCAACT 
ATAACAATAT 
GCAAATTGGT 
TGCAAAGACA 



AGATGGTATC 
TTTTTCATAT 
CCTTCACCAA 
ATGAAGACAA 
GGGACTATX5T 
TGAAACCAAG 
AAGACAGACC 
AGAGCAACCC 
CTAAGGGCAG 
GCAGATCTCT 
CACAAAGAAA 
AAAGGAGAAA 
CCTGCCTTAC 
CCACTGCAAA 
AATGGGCAAA 
TAACCTTATA 
TAAAGAGTCA 
CACATAGGCT 



TCATTGTGGG 
GTTTGTTGGC 
TTTTTGATGA 
GATTAGAAGA 
GACAAGAACA 
TTGGAAAACA 
AACGTTCAAA 
GAAGACACAT 
CCAGAGAGAA 
CTGCAGAAAC 
AGAATTTTCA 
TAAAATCCTT 
AAGACATCCT 
AAATACCAAA 
ATAACCAGCT 
TCTAAATGGG 
AGACTCATTG 
CAGAGTAAAA 



TTTGATTTGC 
CACATGTTTG 
GGTTGTTTGT 
AAAAGAATGA 
AACTTACGTT 
CTCTTCAGGA 
TTCAGGAAAT 
AATTGTCAGA 
AGGTCAGGTT 
CCTACAACCT 
ATCCAGAATT 
TACAGACAAG 
GAAGGAAGCA 
TTGTAGAGAC 
AGTATCATAA 
CTAAATCCCC 
GTGTGCCGTA 
GGGATACAGG 



ATTTCTCTAA 

TCTTTTCTTG 
AAAGGAACAA 
TGACTGGTGT 
TATTATCCAG 
ACAGAGAACA 
TTCACCAAGG 
ACTCACAAAA 
AGAAGAGAGT 
TCATATCCAG 
CAAATCCTGG 
CTAAATATGG 
CATTGACACT 
TGACAGGATC 
CAATTAAAAG 
TTCAGGAGAC 
GGAATTC 



Seq ID NO: C31 DMA Sequence 

Nucleic Acid Accession ft: NM_031942.1 

Coding sequence: 145.. 1260 



CCCGAGCCCC 
TGCTCCTCCT 
CCGATCTGGG 
GTAAAGAAGA 
TCCTCTGATG 
TCAGTTCGGG 
GCGATGAAGT 
CAGCCCTCAG 
AATTTTTTGG 
ATGTCTGAAT 
GACTCACAAT 
CCTGAACGGA 
GCTCTACCCA 
ACCGTGGATG 
GTGACCCTTC 
GTCTGCAGCA 
TGCCGTCAGA 
OGAGGCCAGT 
CTGCTGGATC 
CGGCAGCGAG 
TTTGGGAATG 
TATCTGGAAA 
TTTTTTCACT 
AATCAAGTTA 
TACACTTTGC 
TCTATTTCCA 
TATGAAAGCA 
GAAACACAAT 
CTTGTTTACA 
TCAAGGCACA 
CTCTCAATCC 
CTCCTAATTT 
AAGGTTGGTG 
ATGAGTAAGC 
TTATTTCAGT 
TTGATATTAA 
CAAGGTTCAA 
GTAACTTTTA 
CTGTGTCAGT 
AAGTGTTTGT 
GATGCATAAT 
TATACAAAAG 



11 

I 

GCCCCTCCGG 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT 
AGAAAAGGGC 
TAGAAAGCTT 
CAAGGAGACC 
GAGCTCGTCC 
TGGAGGAGGA 
GCTACATGAA 
CGCATATAAT 
ATTCTCGAGA 
AGACTATTGA 
TCTGTGGCCC 
CGAACTGGCA 
ATGGACGGTG 
TGCATGCCTA 
ATTTGCTGCC 
GAAACCTGAG 
ATCTTAGCAG 
CCTCCTGCAG 
ATGCTCCTCT 
TATTTTATTT 
AATAGTATTA 
CAAAAACGAG 
AAAGTCTTAA 
CATGTATTGC 
CTTCTGCCCG 
GGCAATGTAA 
TGATTTGAAT 
TCACATGTAA 
AAACTAGTCT 
GTTTAGATTT 
GCAGTTTGTT 
ATTCCCCCTC 
ATGTCCAATT 
TGGACCTTGA 
TTTATTTTAA 



21 
I 

GCCCGGGTCG 
CGCTGACCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 
GAGTACCAGG 
GACTGATTCC 
TTTAAATATA 
CCCTGGCTCG 
GCGAAGGCGT 
TCTTACCAGG 
GGAGGAAGAG 
TGAAGATGAC 
TCGCCCAGTG 
GAAGATATAT 
TACCAAAACA 
CTGCCTTCGA 
TTGCCCGCCT 
TGCGACTGGG 
CTTGAAAAGC 
TGCCTTCTAC 
TTAAAAATCT 
ACATGTGTTT 
TTTCTTCTCT 
CCAACCGCTT 
ACTTGGTGTT 
ACTAACTAGA 
TATGATTTAG 
AACCATGTGG 
GCTTATGTTA 
AAGGGTAAGT 
TACTTAATTA 
TTTCAGTATA 
GGTATTGCAA 
GTGGTTCTTT 
TAAGCACTTT 
AACCTGACAT 
CTCTTTGCAT 
TACTTGCATA 
ATCGATAAGT 
TAATAAAATG 



31 
I 

GCGCGCCCAG 
GCGGCTGCTC 
GCTCGCCGCG 
GTGAAGTTGA 
TCTGATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT 
AAGCAAAACA 
TTCCGTGGAA 
ACATTCCCGG 
TCAAGGTCCC 
GATAAGTACA 
CTGCCCAGAA 
GAAGAAATTA 
AACCGTTCAC 
AACTGCAGAA 
AACCGTTATG 
TGTCQAGGAA 
GTCCTTGTGT 
CTGAAACAGG 
TTCTCAAATC 
TGATGATCAG 
CTGGAGCATC 
GCTCCCAACC 
AGTTTCTGAA 
GAAATAGCCC 
TCTATTGAAT 
CACTCATACT 
AAAAATTAGG 
CAAGTTGTTG 
GGTGCGTCCA 
AAATAATGAT 
AAACTTTAGT 
ATAAATTCTT 
GCAGTTTCTT 
TATAACAATG 
CTCTGCCAGT 
TAATCAAGGT 
TGTAAACCAT 
GTAAATACAG 
TTTGTTCTAA 



41 
I 

CCTGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA 
AAGCAATGCT 
GACATCCCCT 
GTGTTGCTTC 
GGATCCTCGG 
TGTTGGTGAG 
GCCGTCGCTC 
CAGAGGAGGA 
TGGGCTCTAC 
ACCCAGACTG 
GTGAAGAGGT 
TCTGCAACTG 
ATTTAGCCAA 
AATTTGAAAT 
TTTCTTGTAA 
CCTGTTTCAT 
ACAGAAGGTA 
CCCATCTCAT 
TTTCTTTTAA 
TCATAAAACC 
TTCAGAGAAG 
AGTTGAAATT 
TAATTATTGC 
TCACAGTTGA 
GCTTACACGA 
GGAAGAGCTA 
ATAATTGTAG 
GGACAATTTT 
GTAAATTTAT 
ATAAGTGCCT 
CTAGTTTCTG 
ATTTGGTAGA 
TGCTGTGCCA 
CTTTTGATCT 
AAAAAAAAAA 



51 
I 

GCGCTGCTGC 
GCTCCAAGCG 
AGATCTCAGA 
AACCTCGTCA 
GAGGCTGCAG 
TCTCAGGGTG 
AGAGTCCCGC 
AAGTGGAATG 
TGCAAAACTC 
CCCAGGCTCC 
CAGGAGAAAC 
GTCCCTTGAC 
AAAGAGGAAG 
CAGATCATCC 
GTTGGAGAAC 
TTGTCATCAA 
CTGGGGCGTT 
CAGGGATGCT 
CAGTTTCTGC 
ATATCATGGC 
GCAAGCATAA 
AAGTTTCCAA 
AAGAAACTCC 
TATTGCTAGT 
AGCATCCCCC 
ATTACAGTTT 
TAAGCACTTG 
AGCCTTCTAA 
TTTAATAGAA 
AGATTGATGT 
GACTTAATTT 
TCATAATTCA 
TCTGGAGATT 
TTTGCAAAGT 
GTATGGAAAC 
AAACCAGGCA 
TTTTGGAGAT 
GGCAGGTTTC 
GGTGGAATCT 
TTCAATGTTT 
GTAATGCTTT 



Seq id NO: C32 DNA Sequence 

Nucleic Acid Accession ft: NM_012445.1 

Coding sequence: 276.. 1271 



1 

I 

GCACGAGGGA 
GCGGCAGCCC 
TCTCGCTGGA 
GGCCCGGGGC 
GCTGCCGGCC 
CCCTGGGCAA 
TTGGGGGAGA 
GCAAGTGGAG 
GGTCTTCGCT 
ACGTCAGTAA 



11 
I 

AGAGGGTGAT 
CCGCCGCCCC 
GGCCAGGCCG 
GCCGGCCTCG 
GCGCTCCCGC 
GGCCCTCTGC 
GTCCATCTGT 
CCAGACGGCC 
GCTGGGGGCC 
CGGGCTGCGC 



21 

I 

CCGACCCGGG 
CGCAGCCCCT 
TGCAGCATCG 
GGCTTAAATA 
TGCTCCTGCC 
GCTCTCCTCC 
TCOGCCAGAG 
TTCCCCAAGC 
GO GCATA GCT 
GACTTTGCGG 



31 
I 

GAAGGTCGCT 
TCTCCTCCTT 
AAGACAGGAG 
GGAGCTCCGG 
GGGTGATGGA 
TGGCCACTCT 
CCCCGGCCAA 
AGTACCCOCT 
CCGACTACAG 
AGCGCGGCGA 



41 
I 

GGGCAGGGCG 
TCTCCCACGT 
GAACTGGAGC 
GCTCTGGCTG 
AAACCCCAGC 
CGGCGCCGCC 
ATACAGCATC 
GTTCCGCCCC 
CATGTGGAGG 
GGCCTGGGCG 



SI' 
I 

AGTTGGGAAA 
CCTATCTGCC 
CTCATTGGCC 
GGACCCGACC 
CCGGCCGCCG 
GGCCAGCCTC 
ACCTTCACGG 
CCTGCGCAGT 
AAGAACCAGT 
CTGATGAAGG 



5040 
5100 
5160 
5220 
S2B0 
5340 
5400 
S460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6057 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2510 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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AGATCGAGGC GGCGGGGGAG GOGCTGCAGA GCGTGCACGC GGTGTTTTCG GCGCCCGCCG 660 

TCCCCAGCGG CACCGGGCAG AOGTCGGCGG AGCTGGAGGT GCAGCGCAGG CACTCGCTGG 720 

TCTCGTTTGT GGTGCGCATC GTGCCCAGCC CCGACTGGTT CGTGGGCGTG GACAGCCTGG 780 

5ACCTGTGCGA CGGGGACCGT TGGCGGGAAC AGGCGGCGCT GGACCTGTAC CCCTACGACG 840 

CCGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT CGCCACCATC CCGCAGGACA 900 

CGGTGACCGA GATAACGTCC TCCTCTCCCA GCCACCCGGC CAACTCCTTC TACTACCCGC 960 

GGCTGAAGGC CCTGCCTCCC ATCGCCAGGG TGACACTGGT GCGGCTGCGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

- CCTCAGTTCC AGAAACGCCG CTGGACTGCG AGGTCTCCCT GTGGTCGTCC TGGGGACTGT 1140 

1U GCGGAGGCCA CTGTGGGAGG CTCGGGACCA AGAGCAGGAC TCGCTACGTC CGGGTCCAGC 1200 

CCGCCAACAA CGGGAGCCCC TGCCCCGAGC TCGAAGAAGA GGCTGAGTGC GTCCCTGATA 1260 

ACTGCGTCTA AGACCAGAGC CCCGCAGCCC CTGGGGCCCC CGGAGCCATG GGGTGTCGGG 1320 

GGCTCCTGTG CAGGCTCATG CTGCAGGCGG CCGAGGCACA GGGGGTTTCG CGCTGCTCCT 1380 

GACCGCGGTG AGGCCGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGGCACG 1440 

ID GGCATTGGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC CCGTGTCCCG 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGGAT AAAGTCATCC CCAAGGCTCC AGCTACTCTA 1560 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCAGGAGAT TGTCCTTCAT CGTCCAGGGG 1620 

CCTGGCTCCC ACGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCTCCCGAG GGCGCATCCA AGCGGGGGCC ACTTGAGAAG TGAATAAATG GGGCGGTTTC 1740 

Z\J GGAAGCGTCA GTGTTTCCAT GTTATGGATC TCTCTGCGTT TGAATAAAGA CTATCTCTGT 1800 

TGCTCAC 1807 



25 



55 



Seq ID NO: C33 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1314 



1 U 21 31 41 51 

I I I I I I 

„ ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

JU AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

or GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

J J GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGGGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACOCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

^•J GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTQACCA GTGGCATGTG 1200 

_ n GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 

AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 1314 



Seq ID NO: C34 DNA Sequence 

Nucleic Acid Accession #: NM 003045.1 

Coding sequence: 148.. 2037 ~ 



1 11 21 31 41 51 

I I I I I I 

CGATCCTGCC GGAGCCCCGC CGCCGCCGGC TTGGATTCTG AAACCTTCCT TGTATCCCTC 60 

- CTGAGACATC TTTGCTGCAA GATCGAGGCT GTCCTCTGGT GAGAAGGTGG TGAGGCTTCC 120 
DU CGTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAG TCCTGCTCAA CATTGGGCAG 180 

CAGATGCTGC GGCGGAAGGT GGTGGACTGT AGCCGGGAGG AGACGCGGCT GTCTCGCTGC 240 

CTGAACACTT TTGATCTGGT GGCCCTCGGG GTGGGCAGCA CACTGGGTGC TGGTGTCTAC 300 

GTCCTGGCTG GAGCTGTGGC CCGTGAGAAT GCAGGCCCTG CCATTGTCAT CTCCTTCCTG 360 

- ATCGCTGCGC TGGCCTCAGT GCTGGCTGGC CTGTGCTATG GCGAGTTTGG TGCTCGGGTC 420 
CO CCCAAGACGG GCTCAGCTTA CCTCTACAGC TATGTCACCG TTGGAGAGCT CTGGGCCTTC 480 

ATCACCGGCT GGAACTTAAT CCTCTCCTAC ATCATCGGTA CTTCAAGCGT AGCGAGGGCC 540 

TGGAGCGCCA CCTTCGACGA GCTGATAGGC AGACCCATCG GGGAGTTCTC ACGGACACAC 600 

ATGACTCTGA ACGCCCCCGG CGTGCTGGCT GAAAACCCCG ACATATTOGC AGTGATCATA 660 

„ ATTCTCATCT TGACAGGACT TTTAACTCTT GGTGTGAAAG AGTCGGCCAT GGTCAACAAA 720 

/U ATATTCACTT GTATTAACGT CCTGGTCCTG GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 780 

GGATCGGTTA AAAACTGGCA GCTCACGGAG GAGGATTTTG GGAACACATC AGGCCGTCTC 840 

TGTTTGAACA ATGACACAAA AGAAGGGAAG CCCGGTGTTG GTGGATTCAT GCCCTTCGGG 900 

TTCTCTGGTG TCCTGTCGGG GGCAGCGACT TGCTTCTATG CCTTCGTGGG CTTTGACTGC 960 

ATCGCCACCA CAGGTGAAGA GGTGAAGAAC CCACAGAAGG CCATCCCCGT GGGGATCGTG 1020 

'3 GCGTCCCTCT TGATCTGCTT CATCGCCTAC TTTGGGGTGT CGGCTGCCCT CACGCTCATG 1080 

ATGCCCTACT TCTGCCTGGA CAATAACAGC CCCCTGCCCG ACGCCTTTAA GCACGTGGGC 1140 

TGGGAAGGTG CCAAGTACGC AGTGGCCGTG GGCTCCCTCT GCGCTCTTTC CGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 1260 

orv TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATCGC CACATTAGCC 1320 

OU TCGGGTGCCG TTGCTGCTGT GATGGCCTTC CTCTTTGACC TGAAGGACTT GGTGGACCTC 1380 

ATGTCCATTG GCACTCTCCT GGCTTACTCG TTGGTGGCTG CCTGTGTGTT GGTCTTACGG 1440 

TACCAGCCAG AGCAGCCTAA CCTGGTATAC CAGATGGCCA GTACTTCCGA CGAGTTAGAT 1500 

CCAGCAGACC AAAATGAATT GGCAAGCACC AATGATTCCC AGCTGGGGTT TTTACCAGAG 1560 

GCAGAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGGAGCC TTCCAAAATC 1620 
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TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC 1680 

ATTGTGACCG TGCTTGGAAG GGAGGCTCTC ACCAAAG66G CGCTGTGGGC AGTCTTTCTG 1740 

CTCGCAGGGT CTGCCCTOCT CTGTGOCGTG GTCACGGGCG TCATCTGGAG GCAGCCCGAG 1800 

AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTCGTGAACG TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGGTCCG GTTTGCTGTG 1920 

TGGATGCTGA TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 

TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 

ACAGCCCCGC CCCCOGGAGG TGGCAGCAGC CCCGAGGGAC GCCCCCAGAG GACCGGGAGG 2100 

CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 2157 

Seq ID NO: C35 DNA Sequence 

Nucleic Acid Accession #: NM_002776.1 

Coding sequence: 82.. 912 

1 11 21 31 41 51 

I I I I I I 

ACCAGCGGCA GACCACAGGC AGGGCAGAGG CACX3TCTGGG TCCCCTCCCT CCTTCCTATC 60 

GGCGACTCCC AGATCCTGGC CATGAGAGCT CCGCACCTCC ACCTCTCOGC CGCCTCTGGC 120 

GCCCGGGCTC TGGCGAAGCT GCTGCCGCTG CTGATGGCGC AACTCTGGGC CGCAGAGGCG 180 

GCGCTGCTCC CCCAAAACGA CACGCGCTTG GACCCCGAAG CCTATGGOGC CCOGTGCGCG 240 

CGCGGCTCGC AGCCCTGGCA GGTCTCGCTC TTCAACGGCC TCTCGTTCCA CTGCGCGGGT 300 

GTCCTGGTGG ACCAGAGTTG GGTGCTGAOG GCCGCGCACT GOGGAAACAA GCCACTGTGG 360 

GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG AGCAGCTCCG CCGGACGACT 420 

CGCTCTGTTG TCCATCCCAA GTACCACCAG GGCTCAGGCC CCATCCTGGC AAGGCGAAOG 480 

GATGAGCACG ATCTCATGTT GCTAAAGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC 540 

CGGGCCCTGC AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG ACCAGTGCCA GGTTGCTGGC 600 

TGGGGCACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG CTCCAGCATC 660 

ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG GCGTGGTCAC CAACAACATG 720 

ATATGTGCTG GACTGGACCG GGGCCAGGAC CCTTGCCAGA GTGACTCTGG AGGCCCCCTG 780 

GTCTGTGACG AGACCCTCCA AGGCATCCTC TCGTGGGGTG TTTACCCCTG TGGCTCTGCC 840 

CAGCATCCAG CTGTCTACAC CCAGATCTGC AAATACATGT CCTGGATCAA TAAAGTCATA 900 

CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT CCTGCTGATC 960 

CAQATGCCCA GAGGCTCCAT CGTCCATCCT CTTCCTCCCC AGTCGGCTGA ACTCtCCCCT 1020 

TGTCTGCACT GTTCAAACCT CTGCCGCCCT CCACACCTCT AAACATCTCC CCTCTCACCT 1080 

CATTCCCCCA CCTATCCCCA TTCTCTGCCT GTACTGAAGC TGAAATGGAG GAAGTGGTGG 1140 

CAAAGGTTTA TTCCAGAGAA GCCAGGAAGC CGGTCATCAC CCAGCCTCTG AGAGCAGTTA 1200 

CTGGGGTCAC CCAACCTGAC TTCCTCTGCC ACTCCCCGCT GTGTGACTTT GGGCAAGCCA 1260 

AGTGCCCTCT CTGAACCTCA GTTTCCTCAT CTGCAAAATG GGAACAATGA CGTGCCTACC 1320 

TCTTAGACAT GTTGTGAGGA GACTATGATA TAACATGTGT ATGTAAATCT TCATGTGATT 1380 

GTCATGTAAG GCTTAACACA GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG 1440 

AAAAAAAAAA AAAA 1454 



Seq ID NO i C36 DNA Sequence 
Nucleic Acid Accession Ut XM_095088 
45 Coding sequence: 1..4074 



1 11 21 31 41 51 

I I I I I I 

ATGACCCGGG CCGCGACAGC CGAGCCGGGC CGCGTGTCTC CCGCCTCGCC CGCGCGCAGC 60 

ACGGCGGGCC TGCCGCGCGC CTTCCTGCAG AGCCTGCGCA CCCTGTTAGA CATCCTGGAC 120 

GACTGGCAGC GCGGCTGCGT GCACCTGCGA GAGATCCAGT CCCTCTGGGT CGAAGCGCGG 180 

GAGCTGCCAA GCGGGGTGCT GGAGGGCCTG AGCCAGCGGC GCGGGCCGCA GCCGGGAGCA 240 

GCTGTACGCT CCCGCCGAGG CGGCGCAGTG CCCCGCGGGG CCCGAGCGGT TCCAGAGOGC 300 

TGCGCTGGAA CGGAGACCOG GCGCGGACGC AGGTGCAGCG GCCTGCAGCG CCTTGGAGGC 360 

GGATTCAGGG GATGCCCGGC GGACCOCTGT GCCCGAGGGG AACACCGGAG GCACACCATC 420 

ACCAGCGGCG TGGACTGCGG CCTGCTGAAG CAGATGAAGG AGCTGGAGCA GGAGAAGGAG 480 

GTGCTGCTAC AGGGTTTGGA GATGATGGCG CAGGGCCGCG ATTGGTACCA GCAGCAGCTG 540 

CAACAAGTGC AGGAGCGCCA GTGCCGCCTG GGCCAGAGCA GAGCCAGCGC CGACTTTGGG 600 

GCCGTGGGGA GCCCTCGCCC ACTGGGACGG CTACTGCCCA AGGTACAGGA GGTGGCCCGG 660 

TGGCTGGGGG AGCTGCTGGC TGAGGCCTGT GCCGGTCGGG CCCTGCCCAC ATCCTCCTCC 720 

GGGCCCCCTT GCTCTGCCCT GACGTCCACC TCGTCCCCGG GCTGGCAGCA GCAGATCATC 780 

CTCATGCTGA AGGAGCAGAA CCGACTCCTC ACCCAGGAGG TGACCGAGAA GAGTGAGCGC 840 

ATCACGCAGC TGGAGCAGAA GTCGGCGCTC ATTAAGCAGC TATTTGAGGC CCGCGCCCTG 900 

AGCCAGCAGG ATGGGGGCCT GTCCCCGGCT GGCCCCCACA TTGAGCCCTT GACTCGGTTT 960 

CGGCTTCCGG TGCTGACATG GGCTGGGGCT CTCTTGAGTC CGCATAGTCC GCAGCTACTA 1020 

CTGCOGCTGT CAGCGGACAG TGGGGGACCC CTCCACGAGT TACCAGATAC CTGGTTTCCA 1080 

GOGGTGCTGC TTTGGGTCCC ATCTCCAGGG AAAAGAACTG CTCACGCCAG GCTGCACTTC 1140 

CACCAAAGGC CAGCAGAGGG CGCGTGGCAG CTCGGATGCG GCGCTGAGGC TGCGCCCGAG 1200 

ACCTGCGGGA CGCTGCCCCA CTTTGAGTCC CACAAAACAA CCTGTGAGCC TGACTCCCTC 1260 

GGAGGGCCCT GTCCCCAGGA GGGGGATCGC AGCTGGAGCC ACCTGGGCGC AGCGTTTGAT 1320 

GTGGCACCTG CAGTGGCCAA AGTGACACCC AACCGTGAGG AOGCTGCAGG GAGTCGGCAC 1380 

GGAGACATCT GTCCCCTCTG TCCCAAGGGA CTGTTGACAT TCAGAGACAT AGCTATAGAA 1440 

TTCTCTCTGG CGGAGTGGCA ATGCCTGGAT CATGCTCAGC AGAATTTATA TAGAGATGTG 1500 

ATGTTAGAGA ACTACAGAAA CCTGTTCTCC CTGGGTATGA CTGTCTCTAA GCCAGACTTG 1560 

ATCGCCTGTC TGGAGCAAAA TAAAGAGCCC CAGAATATAA AGAGAAATGA GATGGCAGCC 1620 

AAACACCCAG TTACATGTTC TCATTTCAAC CAAGACCTTC AGCCAGAGCA GAGCATAAAA 1680 

GATTCACTCC AAAAAGTAAT ACCAAGAACA TATGGAAAAT GTGGACATGA GAATTTACAA 1740 

TTAAAAAAAT GTTGTAAAAG AGTAGATGAG TGTGAGGTGC ACAAAGGAGG TTATAATGAC 1800 

CTTAACCAAT GTTTGTCAAA TACCCAAAAC AAAATATTTC AGACTCATAA ATGTGTCAAA 1860 

GTCTTCAGTA AATTTTCAAA TTCCAATAGA CACAATGCAA GATATACTGG AAAGAAACAT 1920 

TTGAAATGTA AAAAATATGG CAAATCATTT TGCATGTTTT CACACCTAAA TCAACATCAG 1980 

ATAATTCATA CTAAGGAGAA GTCCTACAAA TGTGAAGAAT GTGGCAAATC CTTTAACCAC 2040 

TCCTCAAGCG GTACTACACA TAAAAGAATT CTTACTGGAG AGAAACCCTA CAGATGTGAG 2100 

GAATGTGGCA AAGCCTTTAG GTGGCCCTCA AACCTTACTA GACATAAGAG AATTCACACT 2160 



1219 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GGAGAGAAAC 
ACTAACCACA 
GCCTTTAGOG 
TACACATGTG 
ATAATTCATA 
TCCTCAACTG 
GAATGTGACA 
GGAGAGAAAC 
AAGGTCTACT 
GGCTGGCTTG 
GTGAGACCTA 
CAGAGGAAGG 
CAAAGAAATA 
CTGGAGGAGT 
GAGAACTACA 
TGTCTGGAGC 
CCAGACCTTC 
TATGGAAAAA 
GAGGTGCAAA 
ATATTTCAGA 
AAGACAAGAC 
ATGGTTTCAC 
GAAGAATGCG 
ACTGGAGAGA 
CTTACTAAAC 
CAAGCCTTTA 
CCATACACAT 
AAGAGAATTC 
TGCTCCTCAA 
AAAGAATGTG 
ACTGGAGAGG 
CTTGCTAATC 



CCTACGCATG 
AGAGAATTCA 
TATCCTCAGC 
AAGAATGTGG 
CTGGAGAGAA 
TAAAGGCACA 
AAGCTTTTAA 
CCTACAAATG 
CTGGAGATGG 
TGAGAAACAA 
ACAGGGACCC 
AGGAACCTGA 
CTGGAGTGGG 
GGCAATGCCT 
GAAACCTGGT 
AAAATAAAGA 
CGCCAGAGCT 
GTGGACATGA 
AAGGAGGTTG 
CTCATAAATG 
ATACTGGAAA 
AACTACATCA 
GCAAACCCTT 
AACCCTACAG 
ATAGGAGAAT 
GCCGCTCCTC 
GTGAAGAATG 
ATACTGGAGA 
CCCTTAAGAA 
GGAAAGCCTT 
AACCCTACAA 
ATAAGAGTAT 



TGAAGAATGT 
TACTGGAGAG 
OCTCATTTAC 
CAAAGCCTTT 
ACCCTACACA 
TAAGAGAATT 
GTGGCATTCA 
CAGTGACAGC 
GGAAAATGGA 
GAACGAAAAT 
TTCATGGGGA 
CCTTCAAAAT 
TGGACTGTTG 
GGATCACGCT 
CTCCCTGGGT 
GCCTTGGAAT 
AGGCATAAAA 
CAATTTACAA 
TAATGAAGTT 
TGTCAAAGTC 
GAAACATTTC 
ACATCAGATA 
CAACTGCTCT 
ATGTGAGGAA 
TCATACTGGA 
AACACTTGCT 
TGGCAAAGCC 
GAAACCCTAC 
ACATAAGATA 
TGCCTTCTCC 
ATGTGAAGAA 
GCATACTGGA 



GGCCAAGCCT 
AGACCCTACA 
CACAAGAGAA 
AACTGCTCCT 
TGTGAAGAAT 
CATACTGGAG 
AGTCTTGCTA 
AAAGCCTTAG 
ATCCGTGTAC 
AGAACAGGGC 
CAGCAAGAAG 
CACTATGACC 
ACATTCAGAG 
CAGCAGAATT 
ATTGCTGTCT 
ATAAAGAGAA 
GATTCACTCC 
GTAAAAACAT 
AACCAATGTT 
TTCGGCAAAT 
AAATGTAAAA 
ATTCATACTA 
TCAACCCTTT 
TGTGGCAAAG 
GAAAAACCCT 
AACCACAAGA 
TTTAGCTTAT 
ACATGTGAAG 
ATTCATACTG 
TCAACTCTTA 
TGTGACAAAG 
GAGAAACCCT 



TTAGGCGCTC 
AATGTGAAGA 
TTCATACTGG 
CGACTCTTAA 
GTGGCAGAAC 
AGAAACCATA 
AACATAAGAT 
CCAAATCATC 
ATAAGAAAAA 
TGTTCCAGAT 
GTTCACTGAC 
ATCAGAATGC 
ATGTAGTCAT 
TATATAGAGA 
CTAAGCCAGA 
ATGAGATGGT 
AAAAAGTAAT 
GTAAAAGCAT 
TGTCAACTAC 
TTTCAAATTC 
AATATGGCAA 
GGGAGAATTC 
CTAAACATAA 
CTTTTACCTG 
ACACATGTGA 
GAATTCATAC 
CCTCATCCCT 
AATGTGGCAA 
GAGAGAAACC 
ATACTCATAA 
CTTTTAAGTG 
ACAAATGTGA 



CTCAACACTT 
ATGTGGCAAA 
AGAGAAACCC 
GACACATAAG 
CTTTAACTGC 
CAAATGTGAA 
AATTCACACT 
AGAAGTGCAA 
GGAGACACAG 
CGGGGCTGCC 
TGACCCAATT 
CTTAGAAGAT 
AGAATTCTCT 
TGTGATGTTA 
CTTGATCACC 
AACCAAACAC 
ACCAAGAAGA 
GGGTGAGTGT 
CCAAAACAAA 
CAATAGACAT 
ATCATTTTGC 
CTACCAATGT 
AAGAATTCAT 
GTCCTCAACC 
AGAATGTGGC 
TGGAGAGAAA 
CAC TTAC CAC 
AGCCTTTAAC 
CTACAAATGT 
GAGGATTCAT 
GTCCTCAAGT 
ATAA 



Seq ID NO: C37 DMA Sequence 
Nucleic Acid Accession lh NM 032044 
Coding sequence: 162.. 658 



1 
I 

AAGATATAAA 
GCCCTTAGAG 
AGAAGCGCTA 
GATGGCTTCC 
CCTGGGTGAT 
TTGCTATGGT 
TTAOGGAAAC 
AGAGTACATA 
GAAGAGGCAG 
CAAGTCCATG 
TTGGAGCAGC 
CAAGAATCAA 
GCTAAATCTG 
ACTACACTGG 
CTAGCTCTAA 
GTCTCTGGCT 
CTTTGGCCAT 
AGACCCCTTC 
CACTCAACTC 
TAGGTTGATG 



11 

I 

AGCTCCAGAA 
TCTTGGTTGC 
GTAAGGTCTC 
AGAAGCATGC 
ATCATCATGA 
TACTTCAGGA 
GGAGCCCACC 
AGTGGCTArC 
CAGTGGCAGT 
GGTGGGAACA 
AACGAATGCA 
GATTCTGCTA 
CTCATTATTT 
CTTTTTTAGG 
ATGTTTGCCC 
GTCTCGAGCA 
AAGAAGTAAA 
AGCTTCTACA 
CTGCTTGTTT 
TGGGCCATAC 



21 

I 

ACGTTGACTG 
CAAACAGATT 
TGAGATCCTT 
GGCTGCTCCT 
GACCCAGCTG 
AGCTGAGGAA 
TGGCATCTAT 
AGAGAAGCCA 
GGATTGATGG 
AGCACTGTGC 
ACAAGCGCCA 
ACTCCTGCAC 
CAGAGGGGAA 
CTTAGAGACA 
CGCCATCCCT 
GTCTAGAAGA 
GATTTGAAGA 
CCCTTCTGCC 
TTCCTTTGGC 
ATTCCTTTAA 



31 

1 

GGACCACTGG 
TGCAGATCAA 
GCACTAGCTA 
•ATTGCTGAGC 
TGCTCCTGGA 
CTGGTCTGAT 
CCTGAGTTTA 
GCCGATATGG 
GGCCATGTAT 
TGAGATGAGC 
ACACTTCCTG 
AGCCCCGTCC 
ACCTAGCAAA 
GAAACTTTAG 
TTCCACAGTA 
GTGCATCTCC 
CAGAAGGAAG 
CTCTCTCCAT 
CATAGGAAGG 
TAAACCATTG 



41 
I 

AGACACTGAA 
GGAGAACCCA 
CATCCTCAGG 
TGCCTGGCCA 
TGGTTTTACC 
GCCGAGCTCG 
AAGGAAGCCA 
ATTGGCCTGC 
CTGTACAGAT 
TCCAATAACA 
TGCAAGTACC 
TCTTCCTTTC 
CTAAGAGTGA 
CATTGGCCCA 
TCCTTCTTCC 
AGCCTATGAA 
AAACTCAGGA 
TGCCTGCACC 
TTTACCAGTA 
TGTACATAAG 



51 
I 

GAAGGCAGGG 
GGAGTTTCAA 
GTAGGAGGAA 
AAACAGGAGT 
ACAAGTCCAA 
AGTGTCAGTC 
GCACCATAGC 
ACGACCCACA 
CCTGGTCTGG 
ACTTTTTAAC 
GACCATAGAG 
TGCTAGCCTG 
TAAGGGCCCT 
GTAGTGGCTT 
CTCCTCCCCT 
ACAGCTGGGT 
GTAAGCTTCT 
CCACCCCAGC 
GAATCCTTGC 
AAAAAAAAAA 



Seq ZD NO> C38 dna Sequence 

Nucleic Acid Accession fh Eos sequence 

Coding sequence: 52.. 3042 



1 
I 

GCTCACCCAG 
GTTAACCTCA 
GACCGGGGCA 
CCCAAACTCA 
AATGTACAGT 
TACCAGGCAG 
GTGGCAGGGA 
GCGGAGGTTG 
TA CCCC TACA 
AAGTTTGCTC 
CAGCAGCTGG 
GGAGGTTATG 
TGCGTCACAG 
TTGGGCCACT 
CTTGGCCTCC 
AAGATQATCA 
GCTGTGTCCA 
GGATCTGAGG 



11 

I 

GAAAAATATG 
GCACCGAGGT 
GAGCCTGCCG 
CAGTCACCAT 
CATGGAAACC 
AAGAGTTCCA 
AACCAATGTA 
GGCTTCTGAG 
GAAACCACAT 
TGGGATTTAA 
TGGGTCAGTA 
ACCCACCCAC 
TCCATGGCTC 
GCTTCTTCAC 
TTGTCAAGTC 
CAGGAGACTC 
CCTTCTGGAT 
AAACTGGATT 



21 
I 

CAATCGTCCC 
TGTCTACAAA 
GAGCTACCGT 
TGACACCAAT 
TGGAGATACC 
GGTGCTTCCC 
CCTGCACATC 
CCGGAACATC 
CTGCAATTTC 
GGCAGCACAC 
CCCGATTCAC 
ATACATCAGG 
CAATGGCTTG 
GGAAGATGGG 
TGGAACCCTC 
CTACCCAGGG 
GGCCAATCCC 
TTGGTTTATT 



31 

I 

ATTGATATAC 
AAAGGCCAGG 
GTACGGTTCC 
GTGAACAGCA 
CTGGTCATTG 
TGCAGATCCT 
GGGGAGGAGA 
ATAGTGATGG 
TTTGACTTCG 
TTGGAGGGCA 
TTCCACCTGG 
GACCTCTCCA 
TTGATCAAGG 
CCGGAGGAAC 
CTCCCCTCGG 
TACATCCCCA 
AACAACAACC 
TTTCACCACG 



41 
I 

AGGCCACTAC 
ATTATAGGTT 
TCTGTGGGAA 
CCATTCTGAA 
CCAGTACTGA 
GCGCCCCCAA 
TAGACGGCGT 
GGGAGATGGA 
ATACCTTTGG 
CGGAGCTGAA 
CCGGTGATGT 
TCCATCATAC 
ACGTTGTGGG 
GCAACACTTT 
ACGGTGACAG 
AGCCCAGGCA 
TCATCAACTG 
TACCAACGGG 



51 
I 

AATGGATGGA 
TGCTTGCTAC 
GCCTGTGAGG 
CTTGGAGGAT 
TTACTCCATG 
CCAGGTCAAA 
GGACATGCGG 
GGACAAATGC 
GGGCCACATC 
GCATATGGGA 
AGACGAAAGG 
ATTCTCTCGC 
CTATAACTCT 
TGACCACTGT 
CAAGATGTGC 
AGACTGCAAT 
TGCCGCTGCA 
CCCCTCCGTG 



2220 
22S0 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4074 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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5 

10 



GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

6CACATTCCA ACTAOOGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GAOGCOGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGAGGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 

CG CCTG AATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCGGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1600 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTOG ACGGCTCCGT GTOCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

ID GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT OGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACGA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCOCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 

CCCTCCTGCT CCCAGCGCAC ACAAAOCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

„ CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

/U AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTOTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTC TTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5S80 

ACGCTCCTCT GAAATGCTTQ TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 

GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700 

TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 

AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 5808 
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1 11 21 31 41 51 

I I I I I I 

GTGGCCTCGA GGTGGTGGCA GGGCCGCCCC CTGCAGTCCG GAGACGAACG CACGGACOGG 60 

GCCTCCGGAG GCAGGTTCGG CTGGAAGGAA CCGCTCTGGC TTCGTCCTAC ACTTGCGCAA 120 

ATGTCTCCGA GCTTACTCAC ATAGCATATT GGTATATCAA AATGAAATGC AAGGAACCAA 180 

AAATAACATA ATTGAAGGCA GTAAAAGTGA AATTAAATAG GAAGATCATC AGTCAAGGAA 240 

GACCCACTGG AGAGGACAGA AAATGAAGCA GTGTTTTATC ATGTGTATTT CAGCAGGTCT 300 

TCTTGAAATT TAACTAAAAA TATGACTGCT CTCTCTTCAG AGAACTGCTC TTTTCAGTAC 360 

CAGTTACGTC AAACAAACCA GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT 420 

GGGAAAATAT TATTAAATAT CCTTACACTA GGAATGAGAA GAAAAAACAC CTGTCAAAAT 480 

TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTCGTTGATC TTTTACTTTT GGTAAACATT 540 

TCCATTATAT TGTATTTCAG GGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 600 

ATCTGCCTAT TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 660 

CTGACAGCTT GTATAGATTA TTGCCTGAAT TTCTCTAAAA CAACCAAGCT TTCATTTAAG 720 

TGTCAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATTTCAGT CCTTGCTTAT 780 

GTTTTGGGAG ACCCAGCCAT CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

TTTGTAGCTT TCATAACCTG TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960 

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080 

CCATTTGTAC TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AGCATATATT 1140 

GAGATGAATA TTCCCTGGTT ATACTTTGTC AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

TGGAAGTGCT GCTTCATTCC ACTTACAATT CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGTCATAAG ATCATAATTT 1380 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGCC 1440 

CCCTGACTGA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

GTTTTATTTT AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT TGGCTATACT 1620 

GATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 

AAAAAAAAA 1749 



Seq ID NO: C40 DNA Sequence 
DO Nucleic Acid Accession 8: BC012089 



Coding sequence: 1..2571 

1 11 21 31 41 51 

I I I I I I 

ATGGCCCTCG TACTCGGCTC CCTGTTGCTG CTGGGGCTGT GCGGGAACTC CTTTTCAGGA 60 

GGGCAGCCTT CATCCACAGA TGCTCCTAAG GCTTGGAATT ATGAATTGCC TGCAACAAAT 120 

TATGAGACCC AAGACTCCCA TAAAGCTGGA CCCATTGGCA TTCTCTTTGA ACTAGTGCAT 180 

ATCTTTCTCT ATGTGGTACA GCCGCGTGAT TTCCCAGAAG ATACTTTGAG AAAATTCTTA 240 

CAQAAGGCAT ATGAATCCAA AATTGATTAT GACAAGATTG TCTACTATGA AGCAGGGATT 300 

ATTCTATGCT GTGTCCTGGG GCTGCTGTTT ATTATTCTGA TGCCTCTGGT GGGGTATTTC 360 

TTTTGTATGT GTCGTTGCTG TAACAAATGT GGTGGAGAAA TGCACCAGCG ACAGAAGGAA 420 

AATGGGCCCT TCCTGAGGAA ATGCTTTGCA ATCTCCCTGT TGGTGATTTG TATAATAATA 480 

AGCATTGGCA TCTTCTATGG TTTTGTGGCA AATCACCAGG TAAGAACCCG GATCAAAAGG 540 

AGTCGGAAAC TGGCAGATAG CAATTTCAAG GACTTGCGAA CTCTCTTGAA TGAAACTCCA 600 

GAGCAAATCA AATATATATT GGCCCAGTAC AACACTACCA AGGACAAGGC GTTCACAGAT 660 

CTGAACAGTA TCAATTCAGT GCTAGGAGGC GGAATTCTTG ACCGACTGAG ACCCAACATC 720 

ATCCCTOTTC TTGATGAGAT TAAGTCCATG GCAACAGCGA TCAAGGAGAC CAAAGAGGCG 780 

TTGGAGAACA TGAACAGCAC CTTGAAGAGC TTGCACCAAC AAAGTACACA GCTTAGCAGC 840 

AGTCTGACCA GCGTGAAAAC TAGCCTGCGG TCATCTCTCA ATGACCCTCT GTGCTTGGTG 900 

CATCCATCAA GTGAAACCTG CAACAGCATC AGATTGTCTC TAAGCCAGCT GAATAGCAAC 960 

CCTGAACTGA GGCAGCTTCC ACCCGTGGAT GCAGAACTTG ACAACGTTAA TAACGTTCTT 1020 

AGGACAGATT TGGATGGCCT GGTCCAACAG GGCTATCAAT CCCTTAATGA TATACCTGAC 1080 

AGAGTACAAC GCCAAACCAC GACTGTCGTA GCAGGTATCA AAAGGGTCTT GAATTCCATT 1140 

GGTTCAGATA TCGACAATGT AACTCAGCGT CTTCGTATTC AGGATATACT CTCAGCATTC 1200 

TCTGTTTATG TTAATAACAC TGAAAGTTAC ATCCACAGAA ATTTACCTAC ATTGGAAGAG 1260 

TATGATTCAT ACTGGTGGCT GGGTGGCCTG GTCATCTGCT CTCTGCTGAC CCTCATCGTG 1320 

ATTTTTTACT ACCTGGGCTT ACTGTGTGGC GTGTGCGGCT ATGACAGGCA TGCCACCCCG 1380 

ACCACCCGAG GCTGTGTCTC CAACACOGGA GGCGTCTTCC TCATGGTTGG AGTTGGATTA 1440 

AGTTTCCTCT TTTGCTGGAT ATTGATGATC ATTGTGGTTC TTACCTTTGT CTTTGGTGCA 1500 

AATGTGGAAA AACTGATCTG TGAACCTTAC ACGAGCAAGG AATTATTCCG GGTTTTGGAT 1560 

ACACCCTACT TACTAAATGA AGACTGGGAA TACTATCTCT CTGGGAAGCT ATTTAATAAA 1620 

TCAAAAATGA AGCTCACTTT TGAACAAGTT TACAGTGACT GCAAAAAAAA TAGAGGCACT 1680 

TACGGCACTC TTCACCTGCA GAACAGCTTC AATATCAGTG AACATCTCAA CATTAATGAG 1740 

CATACTGGAA GCATAAGCAG TGAATTGGAA AGTCTGAAGG TAAATCTTAA TATCTTTCTG 1800 

TTGGGTGCAG CAGGAAGAAA AAACCTTCAG GATTTTGCTG CTTGTGGAAT AGACAGAATG 1860 
AATTATGACA GCTACTTGGC TCAGACTGGT AAATCCCCCG CAGGAGTGAA TCTTTTATCA , 1920 
TTTGCATATG ATCTAGAAGC AAAAGCAAAC AGTTTGCCCC CAGGAAATTT GAGGAACTCC * 1980 

CTGAAAAGAG ATGCACAAAC TATTAAAACA ATTCACCAGC AACGAGTCCT TCCTATAGAA 2040 

CAATCACTGA GCACTCTATA CCAAAGCGTC AAGATACTTC AACGCACAGG GAATGGATTG 2100 

TTGGAGAGAG TAACTAGGAT TCTAGCTTCT CTGGATTTTG CTCAGAACTT CATCACAAAC 2160 

AATACTTCCT CTGTTATTAT TGAGGAAACT AAGAAGTATG GGAGAACAAT AATAGGATAT 2220 

TTTGAACATT ATCTGCAGTG GATCGAGTTC TCTATCAGTG AGAAAGTGGC ATCGTGCAAA 2280 

CCTGTGGCCA CCGCTCT AGA TACTGCTGTT GATGTCTTTC TGTGTAGCTA CATTATCGAC 2340 

CCCTTGAATT TGTTTTGGTT TGGCATAGGA AAAGCTACTG TATTTTTACT TCCGGCTCTA 2400 

ATTTTTGCGG TAAAACTGGC TAAGTACTAT CGTCGAATGG ATTCGGAGGA CGTGTACGAT 2460 

GATGTTGAAA CTATACCCAT GAAAAATATG GAAAATGGTA ATAATGGTTA TCATAAAGAT 2520 

CATGTATATG GTATTCACAA TCCTGTTATG ACAAGCCCAT CACAACATTG A 2S71 
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Nucleic Acid Accession 8: NM_033049 
Coding sequence: 28.. 1566 



CCACGCGTCC 
CTCCTTTCTG 
GAAACTGCGA 
GAAACTGCTA 
CCCATAATTA 
ACACATAGTT 
GTAAATTCAT 
ACAATGGTTC 
CAATCATCAG 
GGTCCCAGCA 
CATAATACAA 
AAAGGAAAG6 
GAGAAACATT 
GTATTTGGCA 
CCAAGATCTG 
GCAGAAACCA 
AGTAGCTCAA 
AACCAGACTG 
AGGCCTAACC 
AACGCACAGC 
GTGCCCGGCT 
GGACTCGACT 
ATTGTCATTC 
AAGCATATTG 
ACAGGCTTCA 
TCCAGAGACA 
TATTAGAATC 
TTAGAGTGTT 
TTTTTCTTCC 
AATCCAGCTC 
CTTGTCAGAG 
TTTTCTTAAT 
CCCAGCATTG 
GGAGACATAC 
CAAGGGTCAC 
ATGGAGAGAT 
AGACTGGAAA 
TTTTTGATGC 
TTTTTGAAAT 
AGAGAGGCAA 
CTTCTTCCTT 
GAAGACTGGC 
TTTGAAATTG 
CCAGCCGTCC 
CTGGTGACAG 
ACAAATGGGG 
ATTTCTAATG 
TGTTAATTGT 
AAAAAAA 



Seq ID NO j C42 DNA Sequence 

Nucleic Acid Accession 8t NM_001432.1 

Coding sequence: 167.. 676 




41 
I 

TTCATCTTAC 
CTGATGCTGT 
CCACTGAAAC 
CAGCTACTTC 
CTGCTCCCCC 
ACAGTGAGTC 
CTCCAAATGA 
CCCCCACCAC 
CCAGCACCCT 
ATTCGTTATG 
ACAACTCTTC 
GAGAAACATT 
TTACTAGCTT 
CTGTAAGCAC 
TAACAATAGT 
AAATTAATAA 
GGTGTGATTA 
ATTGCAAATC 
TCAAGTGTCC 
GGGCCCCTGA 
AGTGTGCATT 
TTGTGGGCAC 
CAAGATCAAA 
AAAATCTAAA 
CTAAGGTCAG 
GCAGCATGCC 
ACCAATGTAC 
TAAAGATCTG 
AGTTGTGAAT 
TGTAGTAGAT 
TGAGACAATG 
TCCAATCTAG 
TAAGTGACCT 
AAGGGATACC 
ACACTGCTTC 
CAACCTAGCC 

CTATGGGGTG 
AACATGACCT 
AGGAGGCACC 
TTCTCTAATC 
GAGTAAGAGC 
GGGAGCTTGT 
CATTGGGGCC 
ACACACACAC 
TGTTATTAGT 
TATTATTATT 
GATGG TGAAA 




TGGCTACAGT 
CATCGCTGGC 
TAACAAAACG 
ACTGCGGTCG 
GATAACGGCC 
CCGCCCTGAC 
AAGCTATTAT 
GCCTCCGGGG 
GTTTGCAACG 
GCTATTAGCG 
GTTAGGGTTG 
AGGAAAGCTC 
AATTCCCCTG 
CACCACCTTT 
TATCCCTCCA 
TCACCCAAGA 
ATCCAGGAAC 
CACTCTACAG 
GGTAGAAGGA 
TGGGATCCAC 
ATGCCCTCCC 
CTTAGGTCAG 
GCTCAGGAGT 
TCAGCCTTCT 
AACCACACAC 
GTCCTTTTTT 
TGTTCTTGAC 
AAAAAAAAAA 



1 
I 

TCACTTGCCT 
TGACAGCCGC 
TCCGCAGCCG 
GAGGATGGAG 
TCTTCTACAG 
TGATAACTGC 
AACAAAGTGT 
GGACATGAGT 
CTTCTTTTTA 
TATTATTTTG 
AAATCGAAAA 
GTTGCCGCAA 
GAATGGCGCC 
CATTTTATTA 
GTACTTGAAA 
TGTGCAGAAA 
GCTAAATGCT 
AGCAATTCCC 
ATATATAGTC 
GGTTTGACTG 
TAATGCAATT 
AAACAAGCTA 
TATATGAGGG 



11 
I 

GATATTTCCA 
TCTCCAGCCA 
CCCTCCGCCA 
ATGCTCTGTG 
GCAGTCCTCA 
ACAGCTTTAG 
AGCTCTGACA 
CAAAACTACT 
ACCGTCCACC 
TTTCTTATCA 
AGTAAAGAAC 
GTCTGAATAT 
ATCAAACTTA 
ATAATATTTA 
AATGTTTTTA 
ATATTTAATA 
TCATTGAAAG 
AGGTCATAGC 
AAATCGATTT 
TACCATGGTT 
AGAATTTGGG 
ACCCTGGGGA 
GCAGTGGACA 



21 

I 

GTGTCAGAGG 
CTGCCGCGAG 
AGCCCCAGCG 
CCGGCAGGGT 
GTACAACTGT 
TTCAGACAGA 
TGAATGGCTA 
GCAGGTGTGA 
AACCTTTAAG 
CAGTCGTCGG 
CAAAGAAGGA 
GAGAGAGTTA 
TGGGCAGGGA 
TGTTGGGTCA 
rTTTTGTTTT 
TCAAAAGAAA 
CTTCAAAGTT 
TCAAGAATTG 
AGTAAGTATG 
TGATATGTAG 
AGAAGCAAAT 
GTCTATGGTC 
GTTCCCTATG 



31 
I 

GACACAGCCA 
CCCGTCTGCT 
CCCGCTCCCA 
CCCTGCGCTG 
GATTCCATCA 
AGACAATCCA 
TTGTTTGCAT 
AGTGGGTTAT 
CAAAGAGTAT 
TTCCACATAT 
ATATGAGAGA 
CCTCAGGGGA 
TAACAGTGTG 
AGTGTTAGGT 
ATTTTTGACA 
ATTGATATTT 
TATATGCCTG 
TTAGCAAATG 
TTTTTTATGT 
TTGGCACCAT 
ATAGGTCCTG 
TCTTCACTCA 
CCAACTCACG 



41 

I 

ACGTGGGGTC 
CCCGCCCTGC 
TCGCCGATGA 
CTGCTCTGCC 
TGTATCCCAG 
CGTGTGGCTC 
GGACAGTGCA 
ACTGGTGTCC 
GTGGCTTTGA 
TATTTCTGCA 
GTTACCTCAG 
TCCAGAGTTG 
CCTGGTTAAT 
CAATAACACT 
GACTATTTGC 
TTATACAAGT 
GTGCACAGTG 
ACAGATTTCT 
TCCTCAAATC 
GGTATCATAT 
TGTTAAACAC 
GGTCTCAGCT 
ACTCCTACAG 



SI 
I 

CCTTCTAGGC 
CCGTGCACTC 
CCGCGGGGAG 

GAGAGTCCAG 
AAGTGTCAAT 
TCTATCTGGT 
GATGTGAACA 
CCGTGATTCT 
GATGGTACAG 
GGGATCCAGA 
CCGCAAGTCT 
ATTAATATTC 
GTATTTTAAT 
TAATGTATAA 
AATTTCCTGA 
CTTAGAAGTA 
GTAAGCCTAT 
AGTGATAATT 
ATTAAAACAA 
TACACATTTG 
ATAATTCTGT 
GTACTAGTCA 
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960 
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1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

16B0 

1740 

1800 

1860 

1920 

I960 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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2880 

2887 
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CTCATCTACC AGATTCTGCC TATGTAAAAT GAATTGAAAA ACAATTTTCT GTAATCTTTT 1440 

ATTTAAGTAG TGGGCATTTC ATAGCTTCAC AATGTTCCTT TTTTGTATAT TACAACATTT 1500 

ATGTGAGGTA ATTATTGCTC AACAGACAAT TAGAAAAAAG TCCACACTTG AAGCCTAAAT IS SO 

5TTGTGCTTTT TAAGAATATT TTTAGACTAT TTCTTTTTAT AGGGGCTTTG CTGAATTCTA 1620 

ACATTAAATC ACAGCCCAAA ATTTGATGGA CTAATTATTA TTTTAAAATA TATGAAGACA 1680 

ATAATTCTAC ATGTTGTCTT AAGATGGAAA TACAGTTATT TCATCTTTTA TTCAAGGAAG 1740 

TTTTAACTTT AATACAGCTC AGTAAATGGC TTCTTCTAGA ATGTAAAGTT ATGTATTTAA 1800 

AGTTGTATCT TGACACAGGA AATGGGAAAA AACTTAAAAA TTAATATGGT GT ATTTTT CC 1860 

- AAATGAAAAA TCTCAATTGA AAGCTTTTAA AATGTAGAAA CTTAAACACA CCTTCCTGTG 1920 

. 1U GAGGCTGAGA TGAAAACTAG GGCTCATTTT CCTGACATTT GTTTATTTTT TGGAAGAGAC 1980 

AAAGATTTCT TCTGCACTCT GAGCCCATAG GTCTCAGAGA GTTAATAGGA GTATTTTTGG 2040 

GCTATTGCAT AAGGAGCCAC TGCTGCCACC ACTTTTGGAT TTTATGGGAG GCTCCTTCAT 2100 

CGAATGCTAA ACCTTTGAGT AGAGTCTCCC TGGATCACAT ACCAGGTCAG GGAGGATCTG 2160 

TTCTTCCTCT ACGTTTATCC TGGCATGTGC TAGGGTAAAC GAAGGCATAA TAAGCCATGG 2220 

I J CTGACCTCTG GAGCACCAGG TGCCAGGACT TGTCTCCATG TGTATCCATG CATTATATAC 2280 

CCTGGTGCAA TCACACGACT GTCATCTAAA GTCCTGGCCC TGGCCCTTAC TATTAGGAAA 2340 

ATAAACAGAC AAAAACAAGT AAATATATAT GGTCCTATAC ATATTGTATA TATATTCATA 2400 

TACAAACATG TATGTATACA TGACCTTAAT GGATCATAGA ATTGCAGTCA TTTGGTGCTC 24 60 

on TGCTAACCAT TTATATAAAA CTTAAAAACA AGAGAAAAGA AAAATCAATT AGATCTAAAC 2520 

ZU AGTTATTTCT GTTTCCTATT TAATATAGCT GAAGTCAAAA TATGTAAGAA CACATTTTAA 2580 

ATACTCTACT TACAGTTGGC CCTCTGTGGT TAGTTCCACA TCTGTGGATT CAACCAACCA 2640 

AGGACGGAAA ATGCTTAAAA AATAATACAA CAACAACAAA AAATACATTA TAACAACTAT 2700 

TTACTTTTTT TTTTTTCTTT TTGAGATGGA GTCTCGCTCT GTTGCCCAGG TTGGAGTGCA 2760 

GTGGCAGGAT CTCGGpTCAC TGCAACCTCA CCTCCCGGGT TCAAGAGATC CTCCTGCCTC 2820 

ZD AGCCTCCTGA GCAGCTGGGA CTACAGGCGC ATGCCACCAT GCCCAGCTAA TTTTTGTATT 2880 

TTTAGTAGAG GCGGGGTTTC ACCATGTTGG CCAGGATGGT CTCAATCTCC TAACCTTGAG 2940 

ATCCACCCTC CACAGCCTCC CAAACTGCTG GGATTACAGG CGTGAGCCAC CGCACGTAGC 3000 

ATTTACATTA GGTATTACAA GTAATGTAAA GATGATTTAA GTATACAGGA GGATGTGAAT 3060 

AGGTTATATG CAAGCACTAT GCCCTTTTAT ATAAGTGACT TGAACATCTG TGCCCGATTT 3120 

J\) TAGTATGTGC AGGGGGGOGA TCTGGGAATC AGTCCCCTGT GGATACCAAG GTACAACTGT 3180 

ATTTATTAAC GCTTACTAGA TGTGAGGAGA GTCTGAATAT TTTCAGTGAT CTTGGCTGTT 3240 

TCAAAAAAAT CTATTGACTT TTCAATAAAT CAGCTGCAAT CCATTTATTT CATTTACAAA 3300 

AGATTTATTG TAAGCCTCTC AATCTTGGTT TTTCAGTTGA TCTTAAGCAT GTCAATTCAT 3360 

AAAAACAAGT CATTTTTGTA TTTTTCATCT TTAAGAATGC TTAAAAAAGC TAATCCCTAA 3420 

-50 AATAGTTAGA TCTTTGTAAA TGCATATTAA ATAATAAAGT ATGACCCACA TTACTTTTTA 3480 

TGGG TGAAA A TAAGACAAAA ATAATAGTTT TAGTGAGGAT GGTGCTGAGT AAACATAAAA 3540 

ACTGATTTGC TCTCAGCTGA TGTGTCCTGT ACACAGTGGG AAGATTTTAG TTCACACTTA 3600 

GTCTAACTCC CCCATTTTAC AGATTTCTCA CTATATATAT TTCTAGAAGG GGCTATGCAT 3660 

. ~ ATTCAATGTA TTOAGAACCA AAGCAACCAC AAATGCATAA ATGCATAATT TATGGTCTTC 3720 

4U AACCAAGGCC ACATAATAAC CCAGTTAACT TACTCTTTAA CCAGGAATAT TAAGTTCTAT 3780 

AACTAGTACT CAAGGTTTAA CCTTAAAATT AAGATTTCCT TAACCTTAAC CTTAAAATTG 3840 

ATATTATATT AAACATACAT AATACAATGT AACTCCACTG TTCTCCTGAA TATTTTTTGC 3900 

TCTAATCTCT CTGCCGAAAG TCAAAGTGAT GGGAGAATTG GTATACTGGT ATGACTACGT 3960 

CTTAAGTCAG ATTTTTATTT ATGAGTCTTT GAGACTAAAT TCAATCACCA CCAGGTATCA 4020 

43 AATC AACTTT TATGCAGCAA ATATATGATT CTAGTGTCTG ACTTTTGTTA AATTCAGTAA 4080 

TGCAGTTTTT AAAAACCTGT ATCTGACCCA CTTTGTAATT TTTGCTCCAA TATCCATTCT 4140 

GTAGACTTTT GAAAAAAAAG TTTTTAATTT GATGCCCAAT ATATTCTGAC CGTTAAAAAA 4200 

TTCTTGTTCA TATGGGAGAA GGGGGAGTAA TGACTTGTAC AAACAGTATT TCTGGTGTAT 4260 

cr\ ATTTTAATGT TTTTAAAAAG AGTAATTTCA TTTAAATATC TGTTATTCAA ATTTGATGAT 4320 

DU GTTAAATGTA ATATAATGTA TTTTCTTTTT ATTTTGCACT CTGTAATTGC ACTTTTTAAG 4380 

TTTGAAGAGC CATTTTGGTA AAOGGTTTTT ATTAAAGATG CTATGGAACA TAAAGTTGTA 4440 

TTGCATGCAA TTTAAAGTAA CTTATTTGAC TATGAATATT ATCGGATTAC TGAATTGTAT 4500 

CAATTTGTTT GTGTTCAATA TCAGCTTTGA TAATTGTGTA CCTTAAGATA TTGAAGGAGA 4560 

AAATAGATAA TTTACAAGAT ATTATTAATT TTTATTTATT T7TCTTGGGA ATTGAAAAAA 4620 

D J ATTGAAATAA ATAAAAATGC ATTGAACATC TTGCATTCAA AATCTTCACT GAC 4673 

60 



Seq ID NO j C43 DNA Sequence 
Nucleic Acid Accession #: AF011468.1 
Coding sequence: 2S7..1468 



1 11 21 31 41 51 

I I I I I I 

^ GGAAOACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGGGAT 60 

OD ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 120 

CAA GTCCCC T GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA 180 

GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 240 

TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300 

TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 360 

/U TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 420 

GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480 

GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 540 

CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 600 

ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 660 

/3 TGG TCGC CCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 720 

CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA 780 

GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 840 

ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900 

p« TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 960 

OU TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TT ATT CAT AG 1020 

AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 1080 

TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTGGACTA 1140 

CCTGCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGGATC TCTGGAGCCT 1200 

TGGAGTTCTT TGCTATGAAT TTTTAGTTGG GAAGCCTCCT TTTGAGGCAA ACACATACCA 1260 
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AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACAGAGGG 1320 

AGCCAGGGAC CTCATTTCAA GACTGTTGAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 1380 

AGAAGTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 1440 

CAAAGAATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 1S00 

J GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT TGACTGCTGC 1560 

CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTTTACT CAGCAGGTGT GCCTTAACCT 1620 

CCCTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAG 1680 

AATTGTGCTA CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCOGCCCCGT 1740 

- CAGCCTGTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800 

1U AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAGTAC 1860 

CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT AAAGCTGTTG GAATGAGTAT 1920 

GTGATTCTTT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 1980 

GGCATTCCTT TAGGAATGCT GTGTGTCTGT CCGGCACOCC GGTAGGCCTG ATTGGGTTTC 2040 

TAGTCCTCCT TAACCACTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 2100 

ACCTCCATTT AGGGATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 2160 

GGCTTATTTT TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 2220 

AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 2253 

Seq id NO: C44 DNA Sequence 
ZU Nucleic Acid Accession ft: NM_013372 
Coding sequence: 63.. 617 

1 11 21 31 41 51 

9< I I I ! I I 

Z,J GCGGCCGCAC TCAGCGCCAC GCGTOGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCOGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

„ GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

J\J CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CC ACGA GGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

3 J CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTQTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

*c ATCTCI'xwi-i- TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

*tO TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

C(\ ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

J\J ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

OK) TCTGATTAAA CT TGGC CTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

OJ TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGG CGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

/U ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTOGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTA QTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

ort GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

OU TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA COGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 
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AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 

TAAATTAAAC CTATTCTTTC AAAAAAAAA 4049 

10 Seq ID NO: C45 DNA Sequence 

Nucleic Acid Accession 8: Eos sequence 
Coding sequence: 200.. 2932 

- - 1 11 21 31 41 SI 

15 I I I I I I 

ATT GCTG ATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 

ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120 

TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 

GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240 

CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 300 

CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 

GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 420 

TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG 480 

^ - CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 540 

25 CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 600 

ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 660 

CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 720 

CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 780 

TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 840 

TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 900 

TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960 

AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 1020 

CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 1080 

CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAGTATGAT 1140 

TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200 

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGTCGA TTCTGAGCAA 1260 

TATTTCATCT CTGTCACTGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 1320 

CATCAGTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 1380 

GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA TTAGAAAACA TCAGCACTCT 1440 

GGTGCCTCGG ACAGCTCTTC CTCTGAATTT TTCTCGGAAA TTCATTGACT GGAAAGGGAT 1500 

TCCAGTGAAC AAAAGCCAAC TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 1S60 

AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 1680 

CAAAAATGGA AATGCTCAGG TCAATGGACC TGTGATATCC ACGGTTATTC AAAACTATTC 1740 

4j CATAAATGAA GTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCCACC TAGTGAATGA 1860 

AACTCAAGAC ATCGTGACGT GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

ACCTTTTGTC CCCTCTACAA TCTTCCCCGT TGTAAAATGG ATCACCTATG TGGGACTGGG 1980 

TATCTCCATT GGAAGTCTCA TTTTATGCCT GATCATCGAG GCTTTGTTTT GGAAGCAGAT 2040 

TAAAAAAAGC CAAACCTCTC ACACACGTCG TATTTGCATG GTGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GATGTCTGGT TTATTGTTGG TGCCACAGTG GACACCACGG TGAACCCTTC 2160 

TGGAGTCTGC ACAGCTGCTG TGTTCTTTAC ACACTTCTTC TACCTCTCTT TGTTCTTCTG 2220 

GATGCTCATG CTTGGCATCC TGCTGGCTTA CCGGATCATC CTCGTGTTCC ATCACATGGC 2280 

CCAGCATTTG ATGATGGCTG TTGGATTTTG CCTGGGTTAT GGGTGCCCTC TCATTATATC 2340 

TGTCATTACC ATTGCTGTCA CGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTGTGTTG 2400 

GCrrAACTGG TCCAATGGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTG CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TGGTGCTGCT AGTTCTCACA AAGCTCTGGA GGCCGACTGT 2520 

TGGGGAAAGA CTGAGTCGGG ATGACAAGGC CACCATCATC CGCGTGGGGA AGAGCCTCCT 2580 

CATTCTGACC CCTCTGCTAG GGCTCACCTG GGGCTTTGGA ATAGGAACAA TAGTGGACAG 2640 

CCAGAATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTTGTC 2760 

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTGCA AAACAAAGGC CATTATGCAT TTTCTCATAC 2880 

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAGGCAAGGA 2940 

ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACTTGACAT TTAGAGACAA ATGTCAATGA 3000 

AGAAATTATG CTCAGTATTC GATCGGGTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 

AATGTCTCAG TGGCTTCA 3078 

Seq ID NO: C46 DNA Sequence 
/U Nucleic Acid Accession #: nm_000584.1 
Coding sequence: 75.. 374 ~ 

1 11 21 31 41 SI* 

I I I I I I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60 

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120 

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600 
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CAAAAAACAA 
TACAAATAGC 
GAATGGGTTT 
TTTGCCATAA 
CCTAGTCTGC 
TTTCTAAGTG 
TGGAAGCACT 
CTATTTATTA 
AGAAGATGAA 
GATATTAAAT 
ACAATTGGGT 
AGTACATTAT 
AGTCTTGTCA 
AACTATTAAA 
TGTTTATTAT 
AAGGCTTTAT 
TTCTGATTGT 
AAGTAAAAAA 



CAAATAATTT 
AAAATTGA6C 
GCTAGAATGT 
AGTCAAATTT 
TAGCCAGGAT 
GAAAAAGTAT 
TTAAGTTTTT 
TTTATGTATT 
TCATTGATTG 
GATGTTTTAT 
ACCCAGTTAA 
TGTTTATCTG 
TTGCCAGCTG 
ACAGCCAAAA 
GTACAAATAG 
ATTTTTAACT 
ATGGAAATAT 
AAAAAAAAA 



TTAAATATAA 
CAAGGGCCAA 
GATATTTGAA 
AGCTGGAAAT 
CCACAAGTCC 
TAGCCACCAT 
TCATCATAAC 
TATTTAAGCA 
AATAGTTATA 
TAGATAAATT 
ATTTTCATTT 
AAAGTTTTAA 
TGTTGGTAGT 
CTCCACAGTC 
ATTCTTATAA 
TTAAGATGTT 
AAAAGTAAAT 



GGATTTTCCT 
GAGAATATCC 
GCATCACATA 
CCTGGATTTT 
TTGTTCCACT 
CTTACCTCAC 
ATAAATTATT 
TCAAATATTT 
AAGATGTTAT 
TCAATCAGGG 
CAGATAAACA 
TTGAACTAAC 
GCTGTGTTGA 
AATATTAGTA 
TATTATTTAA 
TTTATGTGCT 
ATGAAACATT 



AGATATTGCA 
GAACTTTAAT 
AAAATGATGG 
TTTCTGTTAA 
GTGCCTTGGT 
AGTGATGTTG 
TTCAAGTGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCCTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 
CTCCAAATTT 
TAAAATATAA 



CGGGAGAATA 
TTCAGGAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 
GGTTGAAACT 
TTTTAAATAC 
TTTTTACTGT 
TTTGTTGTCA 



Seq ID NO: C47 DNA Sequence 

Nucleic Acid Accession ft: NM_005603.1 

Coding sequence: 1..3756 



l 
I 

ATGAGTACAG 
GTGGTTCCCT 
GAACCAGAAC 
GAATGTACAT 
AACACAAAAT 
AAGTACAACG 
AATTTATATT 
TGGTACACCA 
GTGGACGATG 
ATTAAGGATG 
OGTCTGAAAA 
AACAGCCTCT 
ATGTCACTTG 
GGTTTTATTG 
TGGAGAAACA 
AGGAACACCG 
AAGAATAGTG 
GTTTACACGA 
TATTGGGAAG 
CCCTCCTACC 
CCCATCTCTC 
TGGGACCTGC 
CTCAATGAAC 
CAAAATATCA 
GATGCCTCTC 
GCTGATGGGA 
GAGCCAGAAG 
AGGACTGATG 
GCTGCCAGGA 
GAACTGGGCA 
AAGCGAATGT 
GCTGACACTG 
GATGCCCTGG 
ATTGAAGAAA 
ACCAACCGGG 
CTGGGAGCTA 
CTTGCAAAAG 
AATATAGGAT 
A3TAATTCTC 
AAGTTTGCAC 
ATCACTGGTT 
CTGAAGCTGA 
AGGCTAGAAG 
AGCGCAGTCA 
AAGAGGTACA 
ATCAAAACTG 
TCGAGTGACT 
CGATGGTCTT 
TTTACTTTGG 
GAGGATTGGT 
GGGCTGCTCG 
GTGGGACAAA 
GTCCTAACAT 
CAGGATGGAG 
GTAATAACAG 
TTTTCAATTT 
GGAATACATG 
AGACAGCCAT 



11 
I 

AAAGAGACTC 
ACAGTGATGA 
AAAACCGAGT 
GGCAAGTCAA 
TCTTGTGTAT 
CATTTACCTT 
TCCTGGCTCT 
CACTAGTGCC 
TGGCTCGCCA 
GCAGGTTCAA 
AAAATGATTT 
GCTATGTGGA 
AAATCACAGA 
AATGTGAAGA 
CAAGTTTTCC 
ATTTCTGCCA 
GGAAAACCAG 
TCTTTGTTGT 
CACAGGTGGG 
GTGGATTCCT 
TCTATGTCAG 
AAATGTACTA 
AGCTOGGGCA 
TGACCTTTAA 
AACACAACCA 
AGCTTGCATT 
TACGACAGTT 
GTCAGCTCAA 
ACTTTGGCTT 
CTGAAAGGAC 
CTATCATTGT 
TTATTTATGA 
ATATCTTTGC 
AAGAATTTAC 
ACGAAGCTCT 
CAGCTATTGA 
CTGACATTAA 
TTGCTTGTGA 
TTCTTCATGC 
CTGCTGTGCA 
CTTGGTTGAA 
AGTTCCCAAG 
CTAAGAAAGA 
TCTGCTGCCG 
AGAAAGCCAT 
CCCACATTGG 
ATTCCTTTGC 
ACATAAGGAT 
TTCATTTCTG 
TCATCAOOCT 
ACCAGGATGT 
GAGACTTACT 
CGATGATCCT 
AGGCACCTTC 
TCAATTTCCA 
TTGGAAGCAT 
TTCTCTTTCC 
ACATTTGGTT 



21 

! 

AGAAACGACA 
TGAAACAGAA 
CAACAGGGAA 
AGCAAACGAT 
TAAGGAGAGT 
TATACCAATG 
TCTTATCTTA 
CCTGCTTGTG 
TAAAATGGAT 
AGTTGCTAAG 
TGTTCCAGCT 
AACAGCAGAA 
CCAGTACCTC 
ACCCAATAAC 
TTTGGATGCT 
CGGCTTAGTC 
ATTTAAAAGA 
TCTTATTCTG 
CAATTCCTCT 
CATTTTCTGG 
CGTGGAAGTG 
TGCTGAGAAG 
GATCCATTAT 
AAAGTGCTGT 
CAACAAAATA 
TTATGACCAC 
CTTCTTCTTG 
CTAOCAGGCA 
TGCCTTCCTC 
TTACAATGTT 
AAGAACCCCA 
AGGGTTACAT 
AAATGAAACT 
AGAATGGAAT 
GGATAAAGTA 
AGACAAGCTA 
GATCTGGGTG 
ACTTCTGACT 
AAGGATGGAA 
GGAATCTTTT 
TGAAATTCTT 
AACAGAAGAA 
GCAGCGGCAG 
CGTCACCCCC 
CAOG CTGGCC 
CGTTGGAATA 
TCAGTTCCGA 
GTGCAAGTTC 
GTACTCCTTC 
CTACAACGTG 
GAGTGACAAA 
ATTCAACTAT 
CTTCTTCATA 
OGACTACCAG 
GATTGGCTTG 
TCCACTTTAT 
ATCTGCATTT 
AACTATCATC 



31 
I 

TTTGACGAGG 
GATGAACTTG 
GCAGAGGAGA 
CGCAAGTACC 
AAATATGCGA 
AATCTGTTTG 
CAGGCAGTTC 
GTGCTGGGCG 
AAGGAAATCA 
TGGAAAGAAA 
GACATTCTCC 
CTGGACGGAG 
CAAAGAGAAG 
CGACTAGATA 
GATAAAATTT 
ATTTTTGCAG 
ACTAAAATTG 
CTTTCTGCTG 
TGGTACCTCT 
GGCTATATCA 
ATTCGTCTTG 
GACACACCCG 
ATCTTCTCTG 
ATCAACGGGC 
GAGCAAGTTG 
TATCTTATTG 
CTCGCAGTTT 
GCCTCTCCCG 
GCCAGGACCC 
CTTGCCATTT 
GAAGGCAATA 
CGAATGAATC 
CTTAGAACCC 
AAAAAGTTTA 
TATGAGGAGA 
CAGGATGGAG 
CTTACTGGAG 
GAAGACACCA 
AACCAGAGGA 
TTTCCACCCG 
CTCGAGAAAA 
GAAAGACGGA 
AAAAACTTTG 
AAGCAGAAGG 
ATCGGAGATG 
AGTGGACAAG 
TATCTGCAGA 
CTACGATACT 
TTCAATGGCT 
CTGTACACCA 
CTGAGCCTCC 
AAGAGATTCT 
CCTCTTGGAG 
TCTTTTGCCG 
GATACTTCTT 
TTTGGCATCA 
CAATTTACAG 
CTGACTGTTG 



41 

I 

ATTCTCAGCC 
ATGACCAGGG 
ACCGGGAGCC 
ACGAACAACC 
ATAATGCAAT 
AGCAGTTTAA 
CTCAAATCTC 
TCACTGCAAT 
ACAATAGGAC 
TTCAAGTTGG 
TGCTGTCTAG 
AAACCAATTT 
ATACATTGGC 
AGTTTACAGG 
TGTTACGTGG 
GTGCTGACAC 
ATTACTTGAT 
GTCTTGCCAT 
ATGATGGAGA 
TTGTTCTCAA 
GACAGAGTCA 
CAAAAGCTAG 
ATAAGACGGG 
AGATATATGG 
ATTTTAGCTG 
AGCAAATCCA 
GCCACACAGT 
ATGAAGGTGC 
AGAACACCAT 
TGGACTTCAA 
TCAAGCTTTA 
CTACTAAGCA 
TATGCCTTTG 
TGGCTGCCAG 
TTGAAAAAGA 
TTCCAGAAAC 
ACAAAAAGGA 
CCATCTGCTA 
ATAGAGGTGG 
GTGGAAACCG 
AGACCAAGAG 
TGCGGACCCA 
TGGACCTGGC 
CCATGGTGGT 
GGGCCAATGA 
AAGGAATGCA 
GGCTACTGCT 
TCTTTTACAA 
ACTCTGCGCA 
GCCTGCCCGT 
GATTCCCTGG 
TTGTAAGCTT 
CTTATCTGCA 
TCACCATTGC 
ATTGGACTTT 
TGTTTGACTT 
GCACAGCTTC 
CTGTGTGCTT 



51 
I 

TAATGACGAA 
GTCTGCTGTT 
ATTCAGAAAA 
TCACTTTATG 
TAAAACATAC 
GAGAGCAGCC 
TACCCTGGCT 
CAAAGACCTG 
GTGTGAAGTC 
AGACGTCATT 
CTCTGAGCCT 
AAAATTTAAG 
TACATTTGAT 
AACACTATTT 
CTGTGTAATT 
TAAAATAATG 
GAACTACATG 
CGGCCATGCT 
AGACGATACA 
CACCATGGTA 
CTTCATCAAC 
AACCACCACA 
GACACTCACA 
GGACCATCGG 
GAATACATAT 
GTCAGGGAAA 
CATGGTGGAT 
CCTGGTAAAC 
CACCATCAGT 
CAGTGACCGG 
CTGTAAAGGT 
AGAAACACAG 
CTACAAGGAA 

CTTAATTCTC 
CATTTCAAAA 
AACTGCTGAA 
TGGGGAGGAT 
CGTCTACGCA 
TGCCTTAATC 
AAATAAGATT 
AAGTAAAAGG 
CTGCGAGTGC 
GGACCTGGTG 
CGTGAACATG 
AGCTGTCATG 
GGTGCATGGC 
AAACTTTGCC 
GACTGCATAC 
GCTCCTCATG 
GTTATACATA 
GTTGCATGGG 
AACCGTAGGG 
CTCTGCTCTT 
TGTGAATGCT 
TCATAGTGCT 
AAAOGCTCTG 
ACTACCCGTC 
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1920 
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GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GGCGTGTCAA CGCGGOGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CXJCTCGCCGC TTGATGCCAT CGTGGOGGAT 3720 

J GGCACCGCGG AGTACAGGCG CACCGGGGAC AGCTGA 3756 



10 



80 



Seq ID NO: C48 DNA Sequence 
Nucleic Acid Accession 8: XM_044533 
Coding sequence: 238.. 2751 



1 11 21 31 41 51 

I ! I I I 1 

GCTCTGCCCA AGCCGAGGCT GCGGGGCOGG CGOCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

10 GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCX3COGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

^ TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

ZU TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

ZD AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

„ GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

3U TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

«r GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

J J ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

ACi CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

W GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

4 J ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGOGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

DU GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GOGTGCACCC CAAGACCTGC 2520 

OD CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

OU TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

s-C GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

Oj GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACOGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGOGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

/U GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 34 B0 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAAGCOGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

7 r TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 

fO ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 3766 



Seq ID NO i C49 DNA Sequence 

Nucleic Acid Accession #: NMJ307019.1 

Coding sequence; 41.. 580 ~~ 

1 11 21 31 41 51 

I I I i I I 

GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA 60 
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CCCAGCCGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCT GAGCCGAGCG GGGGCGCCGC 120 

CGGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATG ACCCTCATGA TGTCTGGCGA 180 

TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 240 

TGGAGCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG AGTTCCCCAG 300 

TGGCTACCCT TACAATGCGC CCACAOTGAA GTTCCTCACG CCCTGCTATC ACCCCAACGT 360 

GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 420 

TGTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 480 

CTTGAACACA CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 540 

AGAAACCTAC TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 600 

CTTGTGTCGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 660 

GACTCTTTAT CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA TTAAGCCTCG 720 

GTTGAGCCCT TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 780 

AAA 783 

Seq ID NO: C50 DMA Sequence 

Nucleic Acid Accession tt: NMJJ14584.1 

Coding sequence: 227.. 1633 ~~ 



ori 1 11 21 31 41 51 

20 I I I I I I 

GCACGAGCCC CGGGCTGCCG GCGCGGGCGC CGCGGCACGT CCACAGGCTG GGTCGCGAGG 60 

TGGCGATCGC TGAGAGGCAG GAGGGCOGAG GCGGGCCTGG GAGGCGGCCC GGAGGTGGGG 120 

CGCCGCTGGG GCCGGCCCGC ACGGGCTTCA TCTGAGGGCG CACGGCCCGC GACCGAGCGT 180 

GCGGACTGGC CTCCCAAGCG TGGGGCGACA AGCTGCCGGA GCTGCAATGG GCCGCGGCTG 240 

ZD GGGATTCTTG TTTGGCCTCC TGGGCGCCGT GTGGCTGCTC AGCTCGGGCC ACGGAGAGGA 300 

GCAGCCCCCG GAGACAGCGG CACAGAGGTG CTTCTGCCAG GTTAGTGGTT ACTTGGATGA 360 

TTGTACCTGT GATGTTGAAA CCATTGATAG ATTTAATAAC TACAGGCTTT TCCCAAGACT 420 

ACAAAAACTT CTTGAAAGTG ACTACTTTAG GTATTACAAG GTAAACCTGA AGAGGCCGTG 480 

TCCTTTCTGG AATGACATCA GCCAGTGTGG AAGAAGGGAC TGTGCTGTCA AACCATGTCA 540 

J\J ATCTGATGAA GTTCCTGATG GAATTAAATC TGCGAGCTAC AAGTATTCTG AAGAAGCCAA 600 

TAATCTCATT GAAGAATGTG AACAAGCTGA ACGACTTGGA GCAGTGGATG AATCTCTGAG 660 

TGAGGAAACA CAGAAGGCTG TTCTTCAGTG GACCAAGCAT GATGATTCTT CAGATAACTT 720 

CTGTGAAGCT GATGACATTC AGTCCCCTGA AGCTGAATAT GTAGATTTGC TTCTTAATCC 780 

TGAGCGCTAC ACTGGTTACA AGGGACCAGA TGCTTGGAAA ATATGGAATG TCATCTACGA 840 

5 J AGAAAACTGT TTTAAGCCAC AGACAATTAA AAGACCTTTA AATCCTTTGG CTTCTGGTCA 900 

AGGGACAAGT GAAGAGAACA CTTTTTACAG TTGGCTAGAA GGTCTCTGTG TAGAAAAAAG 960 

AGCATTCTAC AGACTTATAT CTGGCCTACA TGCAAGCATT AATGTGCATT TGAGTGCAAG 1020 

ATATCTTTTA CAAGAGACCT GGTTAGAAAA GAAATGGGGA CACAACATTA CAGAATTTCA 1080 

/in ACAGOGATTT GATGGAATTT TGACTGAAGG AGAAGGTCCA AGAAGGCTTA AGAACTTGTA 1140 

4U TTTTCTCTAC TTAATAGAAC TAAGGGCTTT ATCCAAAGTG TTACCATTCT TCGAGCGCCC 1200 

AGATTTTCAA CTCTTTACTG GAAATAAAAT TCAGGATGAG GAAAACAAAA TGTTACTTCT 1260 

GGAAATACTT CATGAAATCA AGTCATTTCC TTTGCATTTT GATGAGAATT CATTTTTTGC 1320 

TGGGGATAAA AAAGAAGCAC ACAAACTAAA GGAGGACTTT CGACTGCATT TTAGAAATAT 1380 

TTCAAGAATT ATGGATTGTG TTGGTTGTTT TAAATGTOGT CTGTGGGGAA AGCTTCAGAC 1440 

4 J TCAGGGTTTG GGCACTGCTC TGAAGATCTT ATTTTCTGAG AAATTGATAG CAAATATGCC 1500 

AGAAAGTGGA CCTAGTTATG AATTCCATCT AACCAGACAA GAAATAGTAT CATTATTCAA 1560 

CGCATTTGGA AGAATTTCTA CAAGTGTGAA AGAATTAGAA AACTTCAGGA ACTTGTTACA 1620 

GAATATTCAT TAAAGAAAAC AAGCTGATAT GTGCCTGTTT CTGGACAATG GAGGCGAAAG 1680 

rft AGTGGAATTT CATTCAAAGG CATAATAGCA ATGACAGTCT TAAGCCAAAC ATTTTATATA 1740 

DU AAGTTGCTTT TGTAAAGGAG AATTATATTG TTTTAAGTAA ACACATTTTT AAAAATTGTG 1800 

TTAAGTCTAT GTATAATACT ACTGTGAGTA AAAGTAATAC TTTAATAATG TGGTACAAAT 1860 

TTTAAAGTTT AATATTGAAT AAAAGGAGGA TTATCAAATT CATATATGAT AAAAGTGAAT 1920 

GTTCTAAGTC TCTCAAACTA GCGTTTTATG TAATAATATG TAATATAAAT AAAACTATGG 1980 

TAAATGTGAC AAGCATTTAA TAGGAAAATG CTAAGGAGGC CTCATAAATG ACCCATAATT 2040 

accaaogtag aatttttcag tacatttagg GTTGCTGGAT TTAGCAAATA AAAATAAAGA 2100 

TTGCCCAGTT AGATTTGAAT TTCAGATAAA CAATTAGTTT TTTAATATTT TACATGGAAT 2160 

ATTTG GAAAA TACTTATACT AAAAAATTAT TTGTTTGAAA TTCACATTTA ACTGGGAGTC 2220 

TTGTATTTTA TCTGGCAATC CTAAAATACA TTGGTATGAA ACAAATCACT TTTAGAAGTA 2280 

- ' TATTGCTATT TTGATTGGGT TGTTTTTGTG TGTAGAAACG TACAATAACA ACTCAAAGGC 2340 

OU ACAGGAGATT TCTAAACATT GTGAAAAGTT GAATAGATTA TATATTTATT CTCATAATAC 2400 

TTTCACTAAT ACTAAATAAA ATTTGGGGAA CACTTTTTAT TTTTATATAA TTTCCAATTT 2460 

ACAGAAAAGT TTCAAAAATA GTACAAAGAG CTCTCTTACC CAGATTCACT AATTGTTCAT 2520 

ACGTGCTTTA TCTTTCATGC TTTCTCTGTA CACACACACA CAGACAGAAA TTTTTCCTCA 2580 

s-c ATCATTTGAA AGTCAGTTAT AGGCATCATG CCCCTTAAAC CCTAAATACT TCAGTGTGTA 2640 

OD ATACTGAATA ATTACTAAAA ATGATTTTCT CAGAAAAAAA AACTCCCACA ATTCTGGAAC 2700 

TA TAATA CTG TAAGCCTTAG AATAAATAAT ACTTTCAAGT TCCAATCTAA AGTTCTTTTT 2760 

GAGTTTTGTT GCCCGTTTTA TGCTTGATGT GTATAGTAAT AGGGTAGGCT ATTTATTTTA 2820 

TTAAAATTTT TTTTAGAGAC AAGGTTTTGC TGTGTTGCCC AAGCTGGAAC TTGAACGACT 2880 

GGGCTGAAGT GATCTTCCCA CCTCAGCCTC CCAAGTAGCT GGGAATACAG GTGTCTGCCA 2940 

/U CCATACCCAG TTTCATTTTT GTTTTTTATA CCOGAAGTTC ATTTCCTTTG TCTCCCTAAA 3000 

ACTGAACTGT AATTTTGGGA GGTTTTCATT AGTGGAAGCT CTTCATTTAT AAAGCTATTT 3060 

GAAGGGGTTT AGGAATTTAT ATCACATGGT AATTGTAGAG AAAAAGAAGC TATATACCTC 3120 

AAAATCGTGC CCTCTTTACA TATGTCTTAT CAGGTATAAC ATGTTGAAAT GTCACATTAG 3180 

TAGTAAAGTG GGGTTTATTT ATATAGTGGT TAAGAAATGT CAGTTTACAC TGCTGTATAC 3240 

/D TTCTTCTTCT GTGTCCCTAA GGCCTGGTAC AGTGCCAAGC ACATACTTGG TATCCAATAA 3300 

ATATTTGTTG GATGAAAAAA AAAAAAAAAA AAAA 3334 

_ n Seq ID NO: C51 DNA Sequence 
oU Nucleic Acid Accession 8: NMJJ02888.1 
Coding sequence: 37.. 723 
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CCACGTCCGG GGTGCCGAGC CAACTTTCCT GCGTCCATGC AGCCCCGCCG GCAACGGCTG 60 

CC CGCT CCCT GGTCOGGGCC CAGGGGCCCG CX3CCCCACCG CCCCGCTGCT OGOGCTGCTG 120 

CTGTTGCTCG CCCOGGTGGC GGCGCCCGCG GGGTCCGGGG GCCCCGACGA CCCTGGGCAG 180 

CCTCAGGATG CTGGGGTCCC GCGCAGGCTC CTGCAGCAGA AGGCGCGOGC GGOGCTTCAC 240 

TTCTTCAACT TCCGGTCCGG CTCGCCCAGC GCGCTGCGAG TGCTGGCCGA GGTGCAGGAG 300 

GGCCGCGCGT GGATTAATCC AAAAGAGGGA TGTAAAGTTC ACGTGGTCTT CAGCACAGAG 360 

CGCTACAACC CAGAGTCTTT ACTTCAGGAA GGTGAGGGAC GTTTGGGGAA ATGTTCTGCT 420 

CGAGTGTTTT TCAAGAATCA GAAACCCAGA CCAACCATCA ATGTAACTTG TACACGGCTC 480 

ATCGAGAAAA AGAAAAGACA ACAAGAGGAT TACCTGCTTT ACAAGCAAAT GAAGCAACTG S40 

AAAAACCCCT TGGAAATAGT CAGCATACCT GATAATCATG GACATATTGA TCCCTCTCTG 600 

AGACTCATCT GGGATTTGGC TTTCCTTGGA AGCTCTTACG TGATGTGGGA AATGACAACA 660 

CAGGTGTCAC ACTACTACTT GGCACAGCTC ACTAGTGTGA GG CAGTGGGT AAGAAAAACC 720 

TGAAAATTAA CTTGTGCCAC AAGAGTTACA ATCAAAGTGG TCTCCTTAGA CTGAATTCAT 780 

GTGAACTTCT AATTTCATAT CAAGAGTTGT AATCACATTT ATTTCAATAA ATATGTGAGT 840 

TCCTGC 846 

Seq ID NO: C52 DMA Sequence 

Nucleic Acid Accession Sj NM_005409.3 

Coding sequence t 94 . . 378 

1 11 21 31 41 51 

I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTG G TTT G T 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGG GATACT G GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 

GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 1493 



Seq ID NOt C53 DNA Sequence 
jU Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..609 



1 11 21 31 41 51 

I I I I I I 

ATGCTGCGGC AGGTGCTTCG CAGAGGGCTC CAGTCGTTCT GCCACAGGCT GGGTTTGTGC 60 

GTGAGCCGGC ACCCGGTCTT TTTCCTCACC GTGCCCGCAG TCCTGACAAT CACCTTCGGC 120 

CTCAGCGCGC TCAACCGCTT CCAGCCCGAG GGCGACCTGG AGCGCCTGGT CGCTCCCAGC 180 

CACAGCCTGG CCAAGATCGA GCGCAGCCTG GCCAGCAGCC TTTTCCCCCT GGACCAGTCC 240 

AAAAGCCAGC TCTATTCGGA CTTACACACC CCPGGGAGGT ATGGCAGGGT GATCCTCCTC 300 

TCCCCAACOG GGGACAATAT TTTGCTCCAG GCTGAGGGGA TCCTGCAGAC CCACCGAGCC 360 

GTGCTGGAAA TGAAGGTGAA CCACAAGGGC TATAATTATA CTTTTTCCCA TCTGTGTGTG 420 

TTGAGAAATC AGGATAAGAA ATGCGTGCTG GATGATATTA TTTCAGTGCT AGAGGATCTC 480 

AGGCAGGCTG CCGTCTCCAA TAAGACAACA GCCAGGGTGC AAGTGAGGTA TCCCAACACT 540 

AAATTAAAGG TATGCTCCTT CTGCATGCTT CTGCCAATTA AAGAGGCAGC ACTTCATTTC 600 

TTGCCCTAA 609 

Seq ID NO: CS4 DNA Sequence 

Nucleic Acid Accession #: NMJJ02438.1 

Coding sequence: 104 -.4474 

1 11 21 31 41 51 

I I I I I I 

GGGAACTTGG ATTAGGTGGA GAGGCAGTTG GGGGGCCTCG TTGTTTTGCG TCTTAGTTCC 60 

GCCCTCCTGT CCATCAGGAG AAGGAAAGGA TAAACCCTGG GCCATGAGGC TACCCCTGCT 120 

CCTGGTTTTT GCCTCTGTCA TTCCGGGTGC TGTTCTCCTA CTGGACACCA GGCAATTTTT 180 

AATCTATAAT GAAGATCACA AGCGCTGCGT GGATGCAGTG AGTCCCAGTG CCGTCCAAAC 240 

CGCAGCTTGC AACCAGGATG CCGAATCACA GAAATTCCGA TGGGTGTCCG AATCTCAGAT 300 

TATGAGTGTT GCATTTAAAT TATGCCTGGG AGTGCCATCA AAAACAGACT GGGTTGCTAT 360 

C ACTC TCTAT GCCTGTGACT CAAAAAGTGA ATTTCAGAAA TGGGAGTGCA AAAATGACAC 420 

ACTTTTGGGG ATCAAAGGAG AAOATTTATT TTTTAACTAC GGCAACAGAC AAGAAAAGAA 480 

TATTATGCTC TACAAGGGAT CGGGTTTATG GAGCAGGTGG AAGATCTATG GAACCACAGA 540 

CAATCTGTGC TCCAGAGGTT ATGAAGCCAT GTATACGCTA CTAGGCAATG CCAATGGAGC 600 

AACCTGTGCA TTCCCGTTCA AGTTTGAAAA CAAGTGGTAC GCAGATTGCA CGAGTGCTGG 660 

GCGGTCGGAT GGATGGCTCT GGTGCGGAAC CACTACTGAC TATGACACAG ACAAGCTATT 720 
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TGGATATTGT CCATTGAAAT TTGAGGGCAG TGAAAGCTTA TGGAATAAAG ACCCGCTGAC 780 

CAGCGTTTCC TACCAGATAA ACTCCAAATC OGCTTTAAOG TGGCACCAAG CGAGGAAAAG 840 

CTGCCAACAA CAGAACGCTG AGCTCCTGAG CATCACAGAG ATACATGAGC AAACATACCT 900 

GACAGGATTA ACCAGTTCCT TGACCTCAGG ACTCTGGATT GGACTTAACA GTCTGAGCTT 960 

D CAACAGCGGT TGGCAGTGGA GTGACCGCAG TCCTTTCCGA TATTTGAACT GGTTACCAGG 1020 

AAGTCCATCA GCTGAACCTG GAAAAAGCTG TGTGTCACTA AATCCTGGAA AAAATGCTAA 1080 

ATGGGAAAAT CTGGAATGTG TTCAGAAACT GGGCTATATT TGCAAAAAGG GCAACACCAC 1140 

TTTAAATTCT TTTGTTATTC CCTCAGAAAG TGATGTGCCT ACTCACTGTC CTAGTCAGTG 1200 

GTGGCCGTAT GCCGGTCACT GTTACAAGAT TCACAGAGAT GAGAAAAAAA TCCAGAGGGA 1260 

1U TGCTCT GACC ACCTGCAGGA AGGAAGGCGG TGACCTCACA AGTATCCACA CCATCGAGGA 1320 

ATTGGACTTT ATTATCTCCC AGCTAGGATA TGAGCCAAAT GACGAATTGT GGATCGGCTT 1380 

AAATGACATT AAGATTCAAA TGTACTTTGA GTGGAGTGAT GGGACCCCTG TAACGTTTAC 1440 

CAAATGGCTT CGTGGAGAAC CAAGCCATGA AAACAACAGA CAGGAGGATT GTGTGGTGAT IS 00 

GAAAGGCAAG GATGGGTACT GGGCAGATCG GGGCTGTGAG TGGCCTCTTG GCTACATCTG IS 60 

ID CAAGATGAAA TCACGAAGCC AAGGTCCAGA AATAGTGGAA GTCGAAAAAG GCTGCAGGAA 1620 

AGGCTGGAAA AAACATCACT TTTACTGCTA TATGATTGGA CATACGCTTT CAACATTTGC 1680 

AGAAGCAAAC CAAACCTGTA ATAATGAGAA TGCTTATTTA ACAACTATTG AAGACAGATA 1740 

TGAACAAGCC TTCCTGACTA GTTTCGTTGG CTTAAGGCCT GAAAAATATT TCTGGACAGG 1800 

^ ACTTTCAGAT ATACAAACCA AAGGGACTTT TCAGTGGACC ATCGAGGAAG AGGTTCGGTT 1860 

ZU CACCCACTGG AATTCAGATA TGCCAGGGOG AAAGCCAGGG TGTGTTGCCA TGAGAACOGG 1920 

GATTGCAGGG GGCTTATGGG ATGTTTTGAA ATGTGATGAA AAGGCAAAAT TTGTGTGCAA 1980 

GCACTGGGCA GAAGGAGTAA CCCACCCACC GAAGCCCACG AOGACTCCCG AACCCAAATG 2040 

TCCGGAGGAT TGGGGCGCCA GCAGTAGAAC AAGCTTGTGT TTCAAGCTGT ATGCAAAAGG 2100 

ryr AAAACATGAG AAGAAAAOGT GGTTTGAATC TCGAGATTTT TGTCGAGCTC TGGGTGGAGA 2160 

ZD CTTAGCTAGC ATCAATAACA AAGAGGAACA GCAAACAATA TGGCGATTAA TAACAGCTAG 2220 

TGGAAGCTAC CACAAACTGT TTTGGTTGGG ATTGACATAT GGAAGCCCTT CAGAAGGTTT 2280 

TACTTGGAGT GATGGTTCTC CTGTTTCATA TGAAAACTGG GCTTATGGAG AACCTAATAA 2340 

TTATCAAAAT GTTGAATACT GTGGTGAGCT GAAAGGTGAC CCTACTATGT CTTGGAATGA 2400 

_ n TATTAATTGT GAACACCTTA ACAACTGGAT TTGCCAGATA CAAAAAGGAC AAACACCAAA 2460 

jV ACCTGAGCCA ACACCAGCTC CTCAAGACAA TCCACCAGTT ACTGAAGATG GGTGGGTTAT 2520 

TTACAAAGAC TACCAGTATT ATTTCAGCAA AGAGAAGGAA ACCATGGACA ATGCGCGAGC 2580 

GTTTTGCAAG AGGAATTTTG GTGATCTTGT TTCTATTCAA AGTGAAAGTG AAAAGAAGTT 2640 

TCTATGGAAA TATGTAAACA GAAATGATGC ACAGTCTGCA TATTTTATTG GTTTATTGAT 2700 

^ CAGCTTGGAT AAAAAGTTTG CTTGGATGGA TGGAAGCAAA GTGGATTACG TGTCTTGGGC 2760 

J J CACAGGTGAA CCCAATTTTG CAAATGAAGA TGAAAACTGT GTGACCATGT ATTCAAATTC 2820 

AGGGTTTTGG AATGACATTA ACTGTGGCTA TCCAAACGCC TTCATTTGCC AGCGACATAA 2880 

CAGTAGTATC AATGCTACCA CAGTTATGCC TACCATGCCC TCGGTCCCAT CAGGGTGCAA 2940 

GGAAGGTTGG AATTTCTACA GCAACAAGTG TTTCAAAATC TTTGGATTTA TGGAAGAAGA 3000 

A p. AAGAAAAAAT TGGCAAGAGG CACGAAAAGC TTGTATAGGC TTTGGAGGGA ATCTGGTCTC 3060 

4U CATACAAAAT GAAAAAGAGC AAGCATTTCT TACCTATCAC ATGAAGGACT CCACTTTCAG 3120 

TGCCTGGACT GGGCTGAATG ATGTCAATTC AGAACACACG TTCCTTTGGA CGGATGGACG 3180 

AGGAGTCCAT TACACAAACT GGGGGAAAGG TTACCCTGGT GGAAGAAGAA GCAGTCTTTC 3240 

TTATGAAGAT GCTGACTGTG TTGTTATTAT TGGAGGTGCA TCAAATGAAG CAGGAAAATG 3300 

GATGGATGAT ACCTGCGACA GTAAACGAGG CTACATATGC CAGACACGAT CCGACCCTTC 3360 

43 CTTGACTAAT CCTCCAGCAA CGATTCAAAC AGATGGCTTT GTTAAATATG GCAAAAGCAG 3420 

CTATTCACTC ATGAGACAAA AATTTCAATG GCATGAAGCG GAGACATACT GCAAGCTTCA 3480 

CAATTCCCTT ATAGCCAGCA TTCTGGATCC CTACAGTAAT GCATTTGCGT GGCTGCAGAT 3540 

GGAAACATCT AATGAAOGTG TGTGGATCGC CCTGAACAGT AACTTGACTG ATAATCAATA 3600 

CACTTGGACT GATAAGTGGA GGGTGAGGTA CACTAACTGG GCTGCTGATG AGCCCAAATT 3660 

GAAATCAGCA TGTGTTTATC TGGATCTTGA TGGCTACTGG AAGACAGCAC ATTGCAATGA 3720 

AAGTTTTTAC TTTCTCTGTA AAAGATCAGA TGAAATCCCT GCTACTGAAC CCCCACAACT 3780 

GCCTGGCAGA TGCCCGGAGT CAGATCACAC AGCATGGATT CCTTTCCATG GTCACTGTTA 3840 

CTATATTGAG TCCTCATATA CAAGAAACTG GGGCCAAGCT TCTCTGGAAT GTCTTCGAAT 3900 

. . GGGTTCCTCT CTGGTTTCCA TTGAAAGTGC TGCAGAATCC AGTTTTCTGT CATATCGGGT 3960 

J J TGAGCCACTT AAAAGTAAAA CCAATTTTTG GATAGGATTG TTCAGAAATG TTGAAGGGAC 4020 

GTGGCTGTGG ATAAATAACA GTCCGGTCTC CTTTGTCAAC TGGAACACAG GAGATCCCTC 4080 

TGGTGAACGG AATGATTGTG TAGCTTTACA TGCGTCTTCT GGGTTTTGGA GTAATATTCA 4140 

CTGTTCTTCC TACAAAGGAT ATATTTGTAA AAGACCAAAA ATTATTGATG CTAAACCTAC 4200 

TCATGAATTA CTTACAACAA AAGCTGACAC AAGGAAGATG GACCCTTCTA AACCGTCTTC 4260 

OU CAACGTGGCC GGAGTAGTCA TCATTGTGAT CCTCCTGATT TTAACGGGTG CTGGCCTTGC 4320 

CGCCTATTTC TTTTATAAGA AAAGACGTGT GCACCTACCT CAAGAGGGCG CCTTTGAAAA 4380 

CACTCTGTAT TTTAACAGTC AGTCAAGCCC AGGAACTAGT GATATGAAAG ATCTCGTGGG 4440 

CAATATTGAA CAGAATGAAC ACTCGGTCAT CTAGTACCTC AATGCGATTC TGAGATATTT 4500 

s-C GAATTTCATA AAATTGTAAC TGAAATTTAA AATTTTTAGT TCAATGTGAT TGTTTTCTTT 4560 

OD AAA ATGAGTA CTGAATTGTA CTGGTCTGTC CTTTTTTCCT TTGCCTAATT GAAGAAATAA 4620 

• TTGCTTGTTT TCTAGCCTGG CAAGATATTT TCATAAAAGA GGGATAACAA TGCTGATTAC 4680 

TACCTTTTAA AATATTTTAG ATAAATGCAC AGCACCACAG CACCACATCT AAGCATTAGT 4740 

GATGGGTAGC TGATGTCAGC TTCATGTGGA TTTTAAGCAC TCTAGAAACA ATGAAGCTTC 4800 

„ TTGGCATATT TTAAGGAGCT CCCAAAATGT GTTACCTATT AAATTGTAAC TCAGCAAGTA 4860 

JK) GAAG ACCATT TGAAAAGTCA GGTACAAATT TCCTCAAGTG GCATAAAAAT GTAGTCAGTT 4920 

TTCTCTTTTA CCAGTTTTTA TTTCCACTCC AATTATTTAG AACTTTATTT GTACATGTGC 4980 

AGAAGAATAA GGCAGCTGAG AATCTTGTTT CCCCCAAGAG AGTTTTACAG GCTGAGTGTT 5040 

GCAAATGTGT TCTTTGTCCT GTTATATGTA TATCAGGAAT ACAAGGATGT GAAATAAAAC 5100 

TGTAAATTTG CATAACTGGA TGTACTTAGA TAATGTGAAA TAAACATTAA AGACAAGGTC SI 60 

TATTTTTAAT AAAAAAAAAA AAAAA S185 



75 
80 



Seq ID NO: CSS DNA Sequence 

Nucleic Acid Accession ft: NM_024S74.2 

Coding sequence: 424.. 2130 " 

1 11 21 31 41 51 

I I I I I I 

AGTGCGACTA GCGGCGGCCG GCGAAGCCCG CGCCAGCCCC GCGGGTCGCT GCGGCATCCT 60 

CCCGCCGCCC AGCCCCCGCA CGCTCCCTGC AGTTTAAAAG GACCTCCCGC CGCTTCTCGG 120 
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Seq ID NO: C56 DNA Sequence 
Nucleic Acid Accession ft: BC034229.1 
jU Coding sequence: 373.. 1422 



55 
60 
65 
70 
75 
80 



CGCTGCCCCG GGATTCCCCA GCCCCGCGCQ GCTCCCTACT CCACTTCGCA GCAACTTCGG 180 

CGACCGCGOG CCGCCCGCCC TCGCCCGCCT TTGAAGTTTG CTGTGCCGAC CGCAAAGTTG 240 

GGACACTTCA GCGGATTGAA TTTTTCTCTT TTATCTGCCT CCGTCCCCGC CCTCCAGGCT 300 

TCTCGTTCCT GGATATTGGT GCTTAGCATC TTGGCAGGGT CCGGGGAOGT GGACTATTTC 360 

J GCACACCACA CCACGGGGAG GGATTTTTTT CTATTTTCCC TACGAAAAAC AGATCTTTTT 420 

AAGGATGGTG CTGCTCCACT GGTGCCTGCT GTGGCTCCTG TTTCCACTCA GCTCAAGGAC 480 

CCAGAAGTTA CCCACCCGGG ATGAGGAACT TTTTCAGATG CAGATCCGGG ACAAGGCATT 540 

TTTTCATGAT TCGTCAGTAA TTCCAGATGG AGCTGAAATT AGCAGTTATC TCTTTAGAGA 600 

TACACCTAAA AGGTATTTCT TTGTGGTTGA AGAAGACAAT ACTCCATTAT CAGTCACAGT 660 

iU GACGCCCTGT GATGCGCCTT TGGAGTGGAA GCTGAGCCTC CAGGAGCTGC CAGAGGACAG 720 

GAGCGGGGAA GGCTCAGGTG ATCTGGAACC TCTTGAGCAG CAGAAGCAGC AGATCATTAA 780 

TGAGGAAGGC ACTGAGTTAT TCTCCTACAA AGGCAATGAT GTTGAGTATT TTATATCGTC 840 

TAGTTCCCCA TCCGGTTTAT ATCAGTTGGA TCTTCTTTCA ACAGAGAAAG ACACACATTT 900 

CAAAGTATAT GCCACCACAA CTCCAGAATC TGATCAGCCA TACGCTGAGT TACCCTATGA 960 

I J CCCAAGAGTA GATGTGACCT CACTGGGGCG CACCACGGTC ACTTTGGCCT GGAAACCAAG 1020 

CCCCACTGCC TCTTTGCTGA AACAACCCAT TCAGTACTGT GTGGTCATCA ACAAAGAGCA 1080 

CAATTTCAAA AGTCTCTGTG CAGTGGAAGC AAAACTGAGT GCAGATGATG CTTTTATGAT 1140 

GGCACCGAAA CCTGGTCTGG ACTTCAGCCC CTTTGACTTT GCCCACTTTG GATTTCCTTC 1200 

TGATAATTCA GGTAAAGAAC GCAGTTTCCA GGCAAAGCCT TCTCCAAAAC TGGGGCGTCA 1260 

TGTCTACTCC AGGCCCAAGG TTGATATTCA GAAAATCTGC ATAGGAAACA AGAACATCTT 1320 

CACCGTCTCT GATCTGAAAC CCGACACGCA GTACTACTTT GACGTATTTG TGGTCAACAT 1380 

CAACAGCAAC ATGAGCACCG CTTATGTAGG TACCTTTGCC AGGACCAAGG AAGAAGCCAA 1440 

ACAGA AGACA GTCGAGCTCA AAGATGGGAA GATAACAGAT GTATTTGTTA AAAGGAAGGG 1500 

AGCAAAGTTT CTACGGTTTG CTCCAGTCTC TTCTCACCAA AAAGTCACCT TCTTTATTCA 1560 

CTCTTGTCTG GATGCTGTCC AAATCCAAGT GAGAAGAGAT GGGAAACTTC TTCTGTCTCA 1620 

GAATGTGGAA GGCATTCAGC AGTTTCAGCT TAGAGGAAAA CCTAAAGCTA AATACCTCGT 1680 

TCGACTGAAA GGAAACAAGA AAGGAGCATC TATGTTGAAA ATTCTAGCTA CCACAAGGCC 1740 

TACTAAGCAG TCATTTCCCT CTCTTCCTGA AGACACAAGA ATCAAAGCCT TTGACAAGCT 1800 

„ CCGTACCTGT TCCTCGGCCA CCGTGGCTTG GCTAGGCACT CAGGAAAGGA ACAAGTTTTG I860 

JU CATCTACAAA AAAGAAGTGG ATGATAACTA CAATGAAGAC CAGAAGAAAA GAGAGCAAAA 1920 

CCAATGTCTA GGACCAGATA TAAGGAAGAA GTCAGAAAAG GTCCTCTGTA AATATTTCCA 1980 

CAGTCAAAAC CTGCAGAAAG CAGTGACCAC AGAAACAATT AAAGGTCTTC AGCCTGGCAA 2040 

ATCTTACCTG CTGGATGTTT ATGTCATAGG ACATGGGGGG CACTCTGTAA AGTATCAGAG 2100 

TAAGGTTGTG AAAACTAGAA AGTTCTGTTA GTTACCTTCT TATAGAGATA TATTATGTAG 2160 

AACTCCAGGA GGGACATTAA ATCACTTTAA GTATAAACTG ACTACTCCCA CAGTTGAGAG 2220 

AAGTTGTGAC CTGTACTTGT ACTATGGAAG GAAGGATATC AACGTGTGTA TATTGATGTT 2280 

TATATAAGTA ACTCTTGAAG GAGACTTGTT CTAGCGTGCC CCATGGTACC TAGTGTGTGT 2340 

CTGATGCCGG TTGGTGTCAA AGATAGAGGG CTTCTTGAAG GAACTTGCCA TTCCTTGCTT 2400 

TGACCACTGC ATGAACTGCT TCTAAATTAT TTTATTACCT AAAAATTTAA AATATGCCAT 2460 

TCAT TGCA CA CACCCACAAA TGCAAATCAT TCCTCTCTAT AGATGCTAGG ATATATATAA 2520 

ATTATTTTAT AAATTCTTGT TTTAAATGTC AGTGTTTCTA TGATTGTAAA CTATTAAATT 2580 

CTTTTCCTAT TAAAGTACAG ATCTAATCTA AGTATTATTA AGTTGATAGC CCTCTAGTCA 2640 

GTTATATTGC TATTGTAAAT TCTTGTTTGT TGAGTAAAAT GTTTAAATAC TATATGTATC 2700 

TCATGTACAA AGTTGACATA CATTATATTC ATGTACATAA AATTAAAGAG ATTAGATTAT 2760 

ATACTGTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2808 



1 11 21 31 41 51 

1 I I I I I 

ATCCGGCGGT GGTGACGCGA GAGGCTGGGG TCTCCAGGAC CAACTCCTCT TCATCTTCGT 60 

CTTCCTCAGC CTGCTCAATG TGAAGCCCCT GATCATGATT CACTTCCACT TAATAAATAA 120 

AGTGTTTACA AATCAGAATA ACTTTTAGAC AATATTAAGG TGGTAATCAT GAACAGAAAA 180 

GATTTTGTAG TTCTTCCATG GGGAAAACCT GGAAATTCTG TAAAGCTAAA ATATAGCAAT 240 

GTAATAATTA AAACAAAAGT CTAAGATTTG AAGAGATAAT TTGCTTCAGG ATTTTGATGG 300 

AAGGCAAATG CTAACTTTAA AAACCAGATT TCGGAGAAGT ACAAAAGAAA TAGAAATGCT 360 

CAAGAACTGC GAATGGAGAA AGTACAGTTA GAGTTTGAGA ACCAAGAGAT GGAGAAGAAA 420 

CTGCAAGAAT TCCGATCCAC AAGAAACAAA GAAAAGGAAG ATAGAGAGTC AAGCGAGTAT 480 

TACTGGAAAT CTGGAAAAGT GGGCAAATTG GTCAATCAAT CATATATGAT GTCACAAAAT 540 

AAAGGAAATG TTGTTAAGTT TTCTGCTGGA AAAGTGAAAT TAAAATTGCT GAAGGAACAG 600 

ATTCAAGAGC CAGTGAAACC AACAGTTAAT TATAAAATGG CAAATTCTTC AGAATGTGAA 660 

AAACCCAAGA TAAATGGGAA AGTTTGTGGA CAGTGTGAGA ACAAAGCTGC TCTACTGGTA 720 

TGCCTTGAAT GTGGAGAAGA TTATTGTTCA GGATGCTTTG CTAATGTTCA CCAGAAAGGG 780 

GCACTAAAGC TCCACAGAAC AACTCTTTTG CAGGCAAGAT CTCAAATATT ATTCAATGTA 840 

TTGGATGTTG CCCATCAGTT TATAAAGGAT GTTAATCCAG ATGAACCCAA AGAGGAGAAT 900 

AATTCTACAA AGGAAACCAG TAAAATTCAA CATAAACCCA AATCTGTACT TCTCGAGAGG 960 

AGCAGCTCTG AGGTAGAAAT TACAACGATG AAAAGAGCAC AACGTACAAA ACCAAGAAAG 1020 

AGTCTGTTGT GTGAAGGGTC ATTCGATGAA GAAGCTTCTG CACAGTCCTT TCAGGAAGTG 1080 

TTAAGTCAAT GGAGAACCGG AAATCATGAT GACAACAAGA AACAGAATTT ACATGCAGCA 1140 

GTAAAAGACT CATTGGAAGA ATGCGAAGTA CAGACTAATC TGAAAATTTG GAGAGAACCA 1200 

CTTAATATTG AACTTAAAGA AGACATTCTA TCCTATATGG AAAAATTATG GCTTAAAAAA 1260 

CACAGGAGAA CTCCACAAGA GCAACTTTTT AAATGCTACC AGATACGTTC CCACATCCAC 1320 

ATGAAACCAC TGGTGATGCA CAGTGTTCTC AAAATGAAAA CGATGAAGAT AGTGATGGTG 1380 

AGGAGACCAA AGTACAACAC ACAGCTCTTT TATTGCCAGT AGAAACATTA AACATAGAGA 1440 

GACCTGAACC ATCTCTGAAG ATAGTCGAAC TGGATGATAC TTATGAAGAG GAATTTGAAG 1500 

AAG CAGAA AA TATTGTGCCT TACAAAGTTA AATTAGCTGA TGCAGACAGT CAACGAAGTT 1560 

GTGCTTTTCA TGATTGTCAG AAGAATAGCT TTCCATATGA AAATGGCATC CATCAACATC 1620 

ATGTTTTCGA TAAGGGAAAG AGAGACTTCT TAAATCTTTG TCTGAGAAAC AGCTCTACTT 1680 

ATTATAAAGA TAATTCAAAA GGAGAAACTT CAAACACAGA TTTTGACAAC ATCGTGGATC 1740 

CTGATGTGTA TTCTTCTGAC ATTGAAAAAA TTGAGGAAAG CACCTCCTTT GAAAGAAATT 1800 

TAAAGGAGAA AAATATAGGT TTAGAAAGTA ATCAAAAGTC TGATGATTCC TGTGTATCAC I860 
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TTGAAAGCAA 
TATCTCAAQA 
AAGAATCAAA 
CTATAACAAA 
TCAACTTACT 
CAGGTCAGAA 
AAAGCTCCAG 
CATCTAGAGC 
AGCTTTCCTT 
AAGTGCTGAG 
TCCCAGATTT 
CCTCACATGT 
AAATTCAGTC 
TCAACAAGCA 
ATTACTGTTT 
TTACCTATCC 
ACTAATAAAA 



GGACACTTTG 
CATCAAAGAA 
AACTACAAAG 
ACAATATCAA 
TCCTTCTCAT 
ATCACAGAGA 
TTGCCTTTCA 
TGCTTCTGAA 
AGATGACACT 
ATCTCTTGCA 
CAGCAGCCAA 
GAGGGGTCCC 
TTTGCTGTCA 
ACATGTCATC 
CCATTTTGTA 
AAGTGCTGGA 
GTAAAATTAT 



CTAGGTAGAG 
TCCTTGGAAT 
TCATCACTGT 
GGACTTGAGA 
CGTTTAGAAT 
CCTTCAACAG 
TCCTCTCATC 
ATTTCAGAAA 
ACTGATCAAC 
GATACTTCAG 
TCACTGAATA 
TGTGGAGTTG 
CTTTCTGAGA 
ACACTACCGT 
CCCAGAGTAA 
GATTTTGATT 
AAGTTCAAAA 



ATTTAGAAAA 
TGAGCAATCT 
TGTTACAAGA 
GATTCTTTAT 
GCAACAATTC 
CAAATTTTCC 
CTOGATCAAG 
TTGAATATAT 
ATACTTTAGA 
AAAAGCTTTA 
TAAGTCAGAT 
AGGAATTGAG 
GCAGTACAGA 
GGTCAAAGAG 
AGCAAACAAC 
ACTAATGTCT 
AAATTTTTAA 



AGCTCCCATT 
GTATAAGAGG 
AATAGCCTGC 
TTTTGATACA 
CAGTACTAGG 
ACTTTCCAAC 
AAGTGCAGCT 
TGATATTACT 
CAATTTGGAA 
CAGCTTAACC 
TTCCACAGAT 
CTGTTCTGGA 
TGAGGAGGAG 
TACTTAAAGA 
TGAGAAAAGT 
TTGATGTTTC 
AAAAAAAAAT 



GAGGAGAAAT 
CCAAGCTTTG 
AGAAGTAAGC 
AATGAAAGAC 
ATTACACTTG 
TCTGTTAAAG 
GCTCAATCAT 
GACCAGAATG 
AAAGAATTAC 
TCAGAAGAGT 
TTCCTTAAGA 
AGAGATACCA 
GAAGATTTTC 
TTATTTGTTC 
AACCAAGTGA 
AAGGCTACAA 
AAAAAA 



Seq ID NO: C57 UNA Sequence 

Nucleic Acid Accession #: NM_024687.l 

Coding sequence: 138.. 1706 "~ 



1 
I 

AAAAACATGA 
AATGCGAAGT 
AAGACATTCT 
AGCAACTTTT 
CACAGTGTTC 
ACACAGCTCT 
AGATAGTCGA 
CTTACAAAGT 
AGAAGAATAG 
AGAGAGACTT 
AAGGAGAAAC 
ACATTGAAAA 
GTTTAGAAAG 
TGCTAGGTAG 
AATCCTTGGA 
AGTCATCACT 
AAGGACTTGA 
ATCGTTTAGA 
TTCCAGACCA 
GTGCTCAGAG 
CTTCAACAGC 
CCTCTCATCC 
TTTCAGAAAT 
CTGATCAACA 
ATACTTCAGA 
CACTGAATAT 
GTGGAGTTGA 
TTTCTGAGAG 
CACTACCGTG 
CCAGA GTAAA 
ATTTTGATTA 
AGTTCAAAAA 



11 
I 

TGACAACAAG 
ACAGACTAAT 
ATCCTATATG 
TAAAATGCTA 
TCAAAATGAA 
TTTATTGCCA 
ACTGGATGAT 
TAAATTAGCT 
CTTTCCATAT 
CTTAAATCTT 
TTCAAACACA 
AATTGAGGAA 
TAATCAAAAG 
AGATTTAGAA 
ATTGAGCAAT 
GTTGTTACAA 
GAGATTCTTT 
ATGCAACAAT 
TAGCTTAAGT 
TCCATCATCA 
AAATTTTCCA 
TCGATCAAGA 
TGAATATATT 
TACTTTAGAC 
AAAGCTTTAC 
AAGTCAGATT 
GGAATTGAGC 
CAGTACAGAT 
GTCAAAGAGT 
GCAAACAACT 
CTAATGTCTT 
AAAAAAAAAA 



21 
I 

AAACAGAATT 
CTGAAAATTT 
GAAAAATTAT 
TCAGATAOGT 
AACGATGAAG 
GTAGAAACAT 
ACTTATGAAG 
GATGCAGACA 
GAAAATGGCA 
TGTCTGAGAA 
GATTTTGACA 
AGCACCTCCT 
TCTGATGATT 
AAAGCTCCCA 
CTGTATAAGA 
GAAATAGCCT 
ATTTTTGATA 
TCCAGTACTA 
GAATATGCTG 
AGTAGAAAAC 
CTTTCCAACT 
AGTGCAGCTG 
GATATTACTG 
AATTTGGAAA 
AGCTTAACCT 
TCCACAGATT 
TGTTCTGGAA 
GAGGAGGAGG 
ACTTAAAGAT 
GAGAAAAGTA 
TGATGTTTCA 
AAAA 



31 
I 

TACATGCAGC 
GGAGAGAACC 
GGCTTAAAAA 
TCCCACATCC 
ATAGTGATGG 
TAAACATAGA 
AGGAATTTGA 
GTCAACGAAG 
TCCATCAACA 
ACAGCTCTAC 
ACATCGTGGA 
TTGAAAGAAA 
CCTGTGTATC 
TTGAGGAGAA 
GGCCAAGCTT 
GCAGAAGTAA 
CAAATGAAAG 
GGATTACACT 
ATAATGCAAT 
AGCAAAAGAT 
CTGTTAAAGA 
CTCAATCATC 
ACCAGAATGA 
AAGAATTACA 
CAGAAGAGTT 
TCCTTAAGAC 
GAGATACCAA 
AAGATTTTCT 
TATTTCTTCA 
ACCAAGTGAT 
AGGCTACAAA 



41 
I 

AGTAAAAGAC 
ACTTAATATT 
ACACAGGAGA 
ACATGAAACC 
TGAGGAGACC 
GAGACCTGAA 
AGAAGCAGAA 
TTGTGCTTTT 
TCATGTTTTC 
TTATTATAAA 
TCCTGATGTG 
TTTAAAGGAG 
ACTTGAAAGC 
ATTATCTCAA 
TGAAGAATCA 
GCCTATAACA 
ACTCAACTTA 
TGCAGAAGAC 
TGTCTTGGGT 
GGGTCAGAAA 
AAGCTCCAGT 
ATCTAGAGCT 
GCTTTCCTTA 
AGTGCTGAGA 
CCCAGATTTC 
CTCACATGTG 
AATTCAGTCT 
CAACAAGCAA 
TTACTGTTTC 
TACCTATCCA 
CTAATAAAAG 



51 
I 

TCATTGGAAG 
GAACTTAAAG 
ACTCCACAAG 
ACTGGTGATG 
AAAGTACAAC 
CCATCTCTGA 
AATATTGTGC 
CATGATTGTC 
GATAAGGGAA 
GATAATTCAA 
TATTCTTCTG 
AAAAATATAG 
AAGGACACTT 
GACATCAAAG 
AAAACTACAA 
AAACAATATC 
CTTCCTTCTC 
AGAGAATGGA 
GTTCTGCAGG 
TCACAGAGAC 
TGCCTTTCAT 
GCTTCTGAAA 
GATGACACTA 
TCTCTTGCAG 
AGCAGCCAAT 
AGGGGTCCCT 
TTGCTGTCAC 
CATGTCATCA 
CATTTTGTAC 
AGTGCTGGAG 
TAAAATTATA 



Seq ID NO: CS8 DNA Sequence 

Nucleic Acid Accession #: NM 005408.1 

Coding sequence: 76.. 372 ~ 



I 

AAAAGGCOGG 
CTCTTAACCT 
GCTTTCAACC 
TTCACATTTA 
AGCAGGTGTC 
GACCCAAAGG 
CTGAAGACTT 
TCCATTCTCC 
AGGATGCATT 
TTGACTTGTT 
TTGAGCAAGT 
GGGTTTGTAT 
GATGCTGCAA 
TGTGGAAAGA 
AAAATCTCCA 



11 
I 

CGGAACAGCC 
TCAACATGAA 
CCCAGGGACT 
GCAGTAAGAA 
CCCAGAAGGC 
AGAAGTGGGT 
GAACTCTGCT 
TCTGGCCTCC 
CGGTTTTGTG 
GCTGGTTTGG 
AGGTTGCTGT 
TCGGTTCCCA 
TGTAGGAAGG 
TGAATGCAAT 
AAAAAAAAAA 



21 
I 

AGAGGAGCAG 
AGTCTCTGCA 
TGCTCAGCCA 
GATCTCCTTG 
TGTCATCTTC 
CCAGAATTAT 
ACCCCTACTG 
TCTTCTATGC 
ATTCAAAATG 
AGTTTATTTG 
CTCTAAGCCC 
GGGGTTGAGA 
AGAGCTCTTT 
AGTAGGACTG 



31 
I 

AGAGGCAAAG 
GTGCTTCTGT 
GATGCACTCA 
CAGAGGCTGA 
AGAACCAAAC 
ATGAAACACC 
AAATCAAGCT 
TTTGGAATAC 
TACTATGTGT 
AGTATTGCTG 
CCTTCCCTTC 
GCATGCCTGT 
GTGAATGTGA 
CTGACATTTT 



41 
I 

AAACATTGTG 
GCCTGCTGCT 
ACGTOCCATC 
AGAGCTATGT 
TGGGCAAGGA 
TGGGCCGGAA 
GGAGTACGTG 
TTCTACCATA 
TAAGTAATAT 
ATCTTTTCTA 
CACTATGAGC 
GGGAGTCATG 
GGTGTTGCTA 
GCAGAAAATA 



51 
I 

AAATCTCCAA 
CATGACAGCA 
TACTTGCTGC 
GATCACCACC 
GATCTGTGCT 
AGCTCACACC 
AAATGACTTT 
ATTTTCAAAT 
TGGCTATTAT 
AAGCAAGGCC 
TGCTGGCAGT 
GACATGAAGG 
AATATGTTAT 
CATTTTATTT 
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Seq ID NO: C59 DNA Sequence 

Nucleic Acid Accession # : AK097746.1 

Coding sequence: 185.. 2224 



1233 
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i 
I 

CTTTCATGAC 
CTTCACATAC 
ATTTTCCTTT 
ACTGATGTTT 
AACTTGGCTA 
CAGAGGACTC 
TAAAGAGCCA 
GAAAATAATA 
TGTAACTGAC 
TTACTTGGAT 
TATGGCCAGC 
TATTTCACTG 
AGAAGGAACT 
GGAAAAAATA 
GACAAGCTAT 
TAATGAACCT 
TTCTGATCCT 
TGATACAATT 
AGTGACAGAC 
GTACATAGTT 
TAAAGGCACT 
TGAGATATTT 
CCTCTTTGAG 
TACTGATCAG 
CGACATTGAA 
GTTAGTACAA 
GGACCTTGAA 
CAGTAGCTTA 
TAAGCCCCTG 
GTATAATTCA 
TTTAACTGGA 
ATATGAATTT 
TATCCACGGA 
ACAATATCCC 
ATCTOGGATT 
AGGAACTCTT 
AGACCAACCT 
CTAAATTGGA 



11 

I 

AGTAACAAAT 
AACTTATATT 
TTATTATGTG 
CTTTTAACTG 
CAGGACAAAA 
AGGCAACATT 
CATAATGCTA 
ATTCTTCGGT 
AAACTAGGGA 
TCAAATTGCA 
CTGCTGAAAT 
GGACAGGGAC 
TGGGTGTGCC 
TGTGAAGATT 
CCATCTTCAA 
CCCAOGGGTC 
GAGTTTTTCA 
CCATTTGAAG 
GATTGGGACA 
GAAAACCCTC 
TATGAGGACT 
GGATTACATG 

ATTCTGTTAG 
ATGGCACTAC 
GAAATGGAAA 
AAAGCTATTA 
CTTGTTGGAA 
GGAAGTTACA 
GGAAAACCTT 
GCTATGCAGA 
GAGGTTATCC 
CTGTATCTCG 
AAACTTCTGT 
ATAAAGTCGG 
TCCACTACGG 
ACTCGGCACT 
CAAATTTATA 



21 
I 

CCAAGATTTT 
GTAATATATG 
CCAATCTTCT 
GAGGAGTAAG 
GCTGGGAGGA 
TTTGTGAACA 
AATTTCCAGC 
GTTTAAGACC 
AAAAGTTTGT 
CCATTCOCTT 
TTGCAAATGA 
AAGGACCGAT 
TACAGAATTG 
TTACCTCTGA 
AATTCCCAGT 
TTCGGCTGAA 
AGGGATGCCG 
CTATATCTTA 
GACGTCTTCT 
ATTATAAGTT 
ACATTGAATT 
AAAACGTTGA 
TCACCCAGGG 
AAATTACCAA 
GGAAGTATCC 
GATTTAACAA 
AGGGTGTGGT 
AGGTTCCAGA 
TCACAGATTT 
GTGTGTTTTG 
ATTATGCCAG 
CATCTGATAC 
ATGGCGCACG 
TTGACCTGAT 
ATGCCTATGT 
GACATTCTAC 
GGATCAAGOG 
AAACATCCAA 



31 
I 

GGAAAAGOGC 
CCGATCACTA 
TCTGGCAAGG 
TCTTAAAAGT 
AATCTGTCGG 
TATATATGAA 
ACCAATGGAT 
TGATAAGATA 
AGAGCCTCCA 
AATTTTTGTT 
TAAATCTATG 
TGCAGCAAAA 
CCATCTTGCA 
AACCTGTAAC 
AACAATTCTA 
TCTCCTTCAA 
TGGAAAGGAA 
CCTGACTGGG 
ATTAACCATG 
TTCTCCCAGT 
CATTAAGAAA 
CATCTCCAAG 
AGGCTCCAAA 
AGATATCCTC 
TGTGAGATAT 
TTTAATTATA 
TGTGATGGAT 
AATATGGGCC 
CCTAGCCCGG 
GCTGTCAGGT 
AAAATATACC 
ATCTGACACA 
CTGGGACCGA 
GCCCATCATA 
CTGTCCCCTC 
TAACTTTGTC 
CGGGGTTGCT 
AAGTTT 



41 

I 

CTACGATATT 
TTTGAGAAGG 
AAAGAGATTG 
GCTGAGAAAA 
GCAAGTGAAT 
TGGCGAGAAA 
AAGAACCTAA 
ACCCCAGCTA 
CCATTTGATT 
CTATCTCCAG 
TCTGGAAATA 
ATGATTAAAG 
GTGT CCTGGA 
TCATCCTTTA 
CAGAATGGAG 
TCATATCTCA 
CTGTTATTTA 
GAGTGTAATT 
CTGGCTGACT 
GGAAACTATT 
CTTCCATTTA 
GATCTTCAAC 
CAGACAGGAG 
AACAAGCTCC 
GAAGAAAGCA 
ACTATAOGTA 
TCTGCATTGG 
AAACGTTCAT 
TTGAACTTTT 
TTCTTTTTCA 
ACCCCTATTG 
TCACCAGAAG 
GAAAGTGGAT 
TGGATAAAAC 
TACAAGACAA 
ATTGCAATGT 
TTGCTTTGTC 



51 
I 

TAAATGACCA 
ACAAGCTGTT 
AATACCAGGA 
ATCCTGATCC 
TTCCTGCCTT 
TCTATGACAG 
ATGAACTACA 
TAACAAACTA 
TGACAAAGAG 
GAGCAGATCC 
AGTTTCAAGC 
CAGCAATTGA 
TGCCCATGTT 
GGCTTTGGCT 
TAAAAATGAC 
CTGATCCAGT 
TCAATGAATA 
ATGGAGGAAG 
TTTATAATCT 
TTGCACCTCC 
CTCAACACCC 
AAACAAAAAC 
CCTCAGGAAG 
CTAGTGATTT 
TGAATACTGT 
ACACTCTACG 
AGGCACTCTC 
ACCCAAGCCT 
TACAGGACTG 
CTCAGGCCTT 
ATTTGCTAGG 
ATGGTGTTTA 
TGCTTGCTGA 
CAACTCAAAA 
GTGAACGTAA 
TGTTAAAAAC 
AGTTGGATGA 



Seq ID MO: C60 DNA Sequence 
Nucleic Acid Accession ft: J02761.1 
Coding sequence: 14.. 1159 



1 11 

I I 
GAATTCCGGT GCCATGGCTG 
GCTCTGTGGC CCAGGCACTG 
TGAGTTCTGG TGCCAAAGCC 
ACAGGAAGTC TGGGGACATG 
CCACATCCTT AACAAGATGG 
GGAGCAGGAG TGCAACGTCC 
TGACGACTAC TTCCCCCTGG 
CTGTATGCAC CTGGGCCTGT 
GTCAGACCCC CTGCCCAAAC 
CGTCCTCCCT GTGCTGCCCG 
CTCCGAGCAG CAATTCCCCA 
GCGGATCCAA GCCATGATTC 
CGTGGTACCT CTGGTGGCGG 
CCTGCTCGAC ACGCTGCTGG 
GTGCTCCATG GATGACAGCG 
CTCTGAGTGC CACCTCTGCA 
CATACCACAG GCAATGCTCC 
GCAATTTGTG GAGCAGCACA 
CCACACCACC TGCCAGGCCC 
CCACAGCCCC GACCTTTGAT 
GACGGGCTCT GGGACCATGG 
CCAGGCTGAA AAGAAGCCTC 
CACTTCCACT GGTGGACCAC 
ACCACAGTCT GACACCAGAG 
GCATCTGGAG GAGCTCTGCA 
AAAAACCAGC CACTGCTTTA 
CCCCATGCAC TCAAAGATTG 
AAAATGGGAA CATACAGAAC 
TCAAAAAATC AGCAATTCCC 
GGGATGGCGA CCGGGCAAGC 
TTGTTAAGAT ATAAAAAGGG 
GATTGAAAGT CCTAGGGTGA 
GGGTTGTGCA AGCTAGGGAC 
AGTCCTATTC TCTTTTAAAA 



GTTTCTTTTT 
TTCTATTTCT 
CCATTCCTGT 
AAAAAAAAAA 



31 

I 

GCTGCAGTGG 
CACCTCATCC 
ATTGCAGTGC 
TGACCTATGC 
CATTTTCCAG 
GCTGCTCATG 
CTTCCAGAAC 
GCAGCCAGAG 
CCCTCTGCCA 
GGOGAGGCCT 
CTATTGCTGG 
GCTAGCTGTG 
CCAGTGCCTG 
GCCCCAGCTG 
GTCGCCGACA 
CACCCAGGCC 
TGGCTCCTGG 
GCTGACCCTG 
TGGGACCATG 
CTGTCCAGCT 
CTTCCCCTGC 
CCGCCCTCCT 
CCCTGTGTCG 
TCCTCTCTGG 
CTACCACGAC 
GGTTGAAGCT 
CTACTTGCAA 
AGACATCAGA 
AAGGGTGGAC 
GAGATGCTCT 
GTCTTTCTGT 
GCTGTGATTT 
GTAATACAAT 
AAAAAAAACG 



41 



51 



CTGCTGCTGC 
TTGGCCTGTG 
AGAGCCCTAG 
CAAGAGTGTG 
GACACGATGA 
CCCCAGTGCA 
CAGACTGACT 
CCAGAGCAGG 
GACCCTCTGC 
GGGCCTCACA 
CTCTGCAGGG 
GCAGTGGCCC 
GCTGAGCGCT 
GTCTGCCGCC 
GGAGAATGGC 
GGGAACAGCA 
CTGGACAGGG 
GTGCCCAGGG 
TCCAGCCCTC 
GCAAAGGAAA 
TCCCTGGCCC 
CACCTCCCTT 
GCCTTGTCTG 
TGTGAGGCAC 
CTCCCAGGGC 
GAGCCCCGCC 
TTCAAAATTC 
AATTGTTAAG 
ACTGCACGCT 
GCTGCTTGAG 
AAGGTGGACT 
ATCTGCTGAA 
GTCTGCACCA 
GAATTC 
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TGCTGCCCAC 
CCCAGGGCCC 
GGCATTGCCT 
AGGACATCGT 
GGAAGTTCCT 
ACCAAGTGCT 
CAAACGGCAT 
AGCCAGGGAT 
TGGACAAGCT 
CACAGGATCT 
CTCTGATCAA 
AGGTGTGCGG 
ACTCCGTCAT 
TCGTCCTCCG 
TGCCGCGAGA 
GCGAGCAGGC 
AAAAGTGCAA 
GCTGGGATGC 
TCCAGTGTAT 
AGCCAAGTGA 
TCGCCAGCTG 
CCTCGGCAGT 
TCTCAGCTCA 
AGCGAGGGCA 
TGGGCTCAGG 
TCACACCCAC 
AGAAGAATAA 
TTAAGCTTTT 
CTGGCATGAT 
AGCTATTGCT 
TCCAGATTTT 
AGCTCAGCTG 
ATGCTAATAA 
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Seq ID NO: C61 DNA Sequence 



1234 



WO 03/042661 PCT/US02/36810 



Nucleic Acid Accession #: NM_139172.l 
Coding sequence: 19.. 552 
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i 
I 

GGGGTCTGGG 
CTGCAGGGCT 
GGTGACATAT 
TGCCTGCGTC 
AACGTGCGGA 
AGCTGCAGCA 
TTCCTGGCGG 
ACCAAGAAGA 
GTGGAGGGAG 
GAGGAGGATT 



11 
I 

GAGGTGACAT 

GGGACCGGGA 
TCCGGTGCTG 
GGAAGCACAT 
TCTGCTTGTT 
GTCCGTGTGA 
CGCCGTCCAC 
GCACCGAGGG 
AGGGGAGTCC 




31 
I 

GGGATCCCAG 
CAGGGATTCT 
GGCCAGGCGG 
GGGGTCTGCT 
GTCTGGACGT 
AAGOGCCGGG 
TCCOTCTCGC 
CCAGTCGCCC 
GAGGAGGGTG 
GCTCAATACA 



41 
I 

OGCTGGGCCT 
TCTACCCATG 
CCATCGATAG 
ACCACCAGCG 
GCAGCGGCCT 
ACGTGCTGCA 
TGCTCTCCAA 
TGTCCAAAGA 
AGGAGACAGA 
GATACGGTGG 



51 
I 

GCTCCTGCTG 
GAGCTGTGAG 
CCCCAACCTC 
TCCAGAOGAA 
CCTCCTCCTG 
TATGCCCGGT 
GCACCGAGGG 
GTCCAGGGAT 
GGGCGAGGAA 
ACG 



Seq ID NO: C62 DNA Sequence 
Nucleic Acid Accession 3 : NMJ)S4023 . 
Coding sequence: 98.-379 



1 
I 

GGGGACACTT 
ATTTTTAAAC 
GCTGGTGACC 
CCTTCCTGTT 
ATTAAAGCTT 
GAAGTGTGTA 
GCTATCACAC 
TGCTCCTATC 
GTAGTGACCC 
AAAAAAAAAA 



11 

I 

TGTATGGCAA 
TCCTGAAAAA 
ATCAGCCTTT 
GACAAGTTGG 
CTTCTGAAAA 
AATGAGCTGG 
TTGGTGTGAC 
CTCCCTGCCT 
GTGAAAAGGA 



21 
I 

GTGGAACCAC 
TATCCCAGAT 
GTAGTTACTC 
CACCTTTACC 
CTCTGGGCAT 
GACCAGAGGC 
ATCAAGATAA 
GAAACCTGTT 
CAAATAAAGC 



31 

I 

TGGCTTGGTG 
AACTGTCATG 
TGCTACTGCC 
TCTGGACAAC 
TTCTGTTGAG 
TTCTGAAGCT 
AGAGCGGAGG 
CTACCAATTA 
AATGAATACT 



41 

I 

GATTTTGCTA 
AAGCTGGTAA 
TTCCTCATCA 
ATTCTTCCCT 
CACCTTGTGG 
GTGAAGAAAC 
TGGATGGGGA 
TAGATCAAAT 
AAAAAAAAAA 



51 
1 

GATTTTTCTG 
CTATCTTCCT 
ACAAAGTGCC 
TTATGGATCC 
AGGGGCTAAG 
TGCTGGAGGC 
TGGAAGATGA 
GCCCTAAAAT 
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Seq ID NO: C63 DNA Sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..2874 

1 11 21 31 41 51 

1 1 I I I 1 

ATGCCCCTGT CCTATGCCTA TAAAAAOGCT GAGACCCTAG CAGGCAGACA CACAAGCAGC 
TGGATGTCGA GAGGAGCATA TCAGCGGAGG AACACACGGG CAGCTGGACG TCCAGAGGAA 
TGCACTGACA GAAACTGGCA TGCTGGCAGA ACACGTGGAA TTTGGCTGGG GCAGTTGGAG 
GAGAGATGTT CAGATGTGTT CGGAGTTTCT TTCTTCTGGT GGGTTCGTGG TCTCGCTGGC 
TCAGGAGCGA AGCTGCAGAC CTTCACGCCA GCCCAGGAAG GGGCTCCCAC AGTGCAGCGG 
CAGGCTGAAG CGCTCCTCAA GTGCCGCCAG AGTGGGCGTC CAGGCAGAGG AGGCGCCGAG 
AGCGAGCGAG CGAGGGATGC CAGCATGCTG TCACCTCTCA GTGCTGCCAT GOGAAACTAC 
CCAACGTCCT CTACCATCCC TCCAAGAAGA TCCTACTCTC CAACCGAAAT TGCTCACAAG 
AGTTACTCCT GCAGCCTTCC AGACATGAAA ATCTCCATGG CAGAATCTGG CCCCTCCTTG 
GATAGCCTTG ACATTCTGGA GGATGGCGAG TCTGGGTCAC CATTTCTTGT GACTCATTTG 
TACTTTCTGG GGGTTGTCAC CACTGGGATG GAACAACTAG ATTTTGAAAC AGGACCAAAC 
ATATTTGATT TGCAGATTTA TGTGAAGGAT GAGGTTGGTG TCACAGACCT GCAAGTCCTG 
ACTGTCCAGG TAACAGATGT GAACGAGCCA CCTCAGTTTC AAGGCAACTT GGCAGAAGAT 
CATCTCCGTG CAGACCAGCC ACATTTCAAT GCTCATAGTC ACACGTACGT GAGGGTAGTG 
GCTACTGCAT TGGCCAGGCA CAGGCTTAGA TCTAGCATTG GTTCCCCCTT CCTGGGCACC 
TTCTGTGTTG TGGTGGGCAT GCAGTATTTC CTGATTTCTC CCCCAAAGAG CTTCAGAATG 
TCTGCTAATG GCACCCTCTT CTCCACAACA GAATTGGACT TTGAAGCAGG ACACAGAAGT 
TTCCATCTCA TCGTGGAGGT GAGGGACAGT GGAGGCCTCA AAGCCTCCAC AGAGCTCCAG 
GTGAACATCG TGAACCTCAA CGACGAAGTC CCTCGCTTTA CCAGCCCGAC ACGAGTGTAC 
ACAGTCCTGG AGGAACTGAG TCCAGGAACC ATCGTGGCCA ATATCACAGC GGAGGATCCT 
GATGATGAAG GTTTTCCCAG CCACCTCCTC TACAGCATTA CCACTGTTAG CAAATATTTC 
ATGATAAATC AGTTGACTGG TACAATCCAA GTGGCCCAAA GGATAGACCG AGATGCAGGT 
GAATTGAGAC AAAATCCCAC CATTTCCCTG GAAGTTCTAG TGAAGGACAG ACCATATGGG 
GGTCAGGAGA ATCGCATCCA GATAACCTTC ATTGTGGAAG ACGTCAACGA CAATCCTGCC 
ACATGCCAAA AGTTCACCTT CAGATCCAGT CTCCACCCTG CTCTGTGCTC CAAGACGCTG 
ACCTGGATGG ATACCGTATT AGACTGTTTT CATGCTGCTG ATAAAGATAT ACCTGTGACT 
GGGCGATTTA CAAAAGAAAG AGGTTTAATT GGACTTACAG TTCCACATGG CTGGGGAAGC 
CTCACAATCA TGGCAGAAGG CAAGGAGGAG CAAGTCACAT CTTACATGGA TGGCAGCAGG 
CAAAGAGATA GAGCTTGTGT AGGGAAACTC CTCCTTATAA AGCCATCAGA TCTCATGAGA 
CTTAGTCACT ATCACGAGAA CAACTCAGGA AAGACTTGCC CCCATGATTC CATTTCCTCC 
TACCAGGTCC CTCCCACAAC ATGTAGGAAT TCAAGAATCC AGGCCACCAA CAACGAAGAC 
ACAAGCTCTG TCACTGTTAC TGTGAACATC CTTGAAGAAA ATGATGAAAA GCCAATTTGT 
ACTCCAAACT CTTATTTCCT GGCCCTCCCA GTGGATCTGA AAGTTGGCAC AAATATTCAG 
AATTTCAAGC TGACATGTAC CGACCTTGAT TCCAGCCCCA GATCTTTCCG TTATTCCATT 
GGCCCAGGTA ACGTCAACAA TCATTTCACC TTCTCTCCCA ATGCTGGTTC CAATGTCACA 
CGCCTGCTGC TTACATCTCG CTTTGACTAT GCTGGTGGGT TTGATAAGAT CTGGGACTAC 
AAGCTACTTG TCTACGTAAC TGATGACAAC TTGATGTCTG ACAGGAAGAA AGCGGAGGCT 
CTTGTTGAGA CAGGAACAGT GACACTGAGT ATTAAAGTCA TTCCCCACCC AACCACTATC 
ATCACCACGA CCCCCAGGCC CAGGGTCACC TATCAGGTCC TGAGGAAAAA CGTTTACTCT 
CCATCTGCAT GGTACGTGCC GTTTGTCATC ACTTTGGGCT CCATATTGCT TCTGGGTCTC 
CTCGTGTACC TGGTCGTCCT ATTGGCCAAA GCCATCCACA GACACTGCCC CTGCAAGACT 
GGGAAGAACA AGGAACCTCT GACAAAGAAA GGAGAAACGA AGACTGCAGA GAGAGACGTC 
GTGGTGGAAA CTATCCAGAT GAACACTATC TTTOATGGAG AAGCCATAGA TCCAGAGCCT 
GAGCAAGCTT CACTCGAGCT CTATGCCCTG CTGCCCAGCT GCTGCGACCC TAGTCCAGTA 
ACCCTAAGAA AGGTCCAGGT GTGTGGGGAG AGTGAAGAGA CCGGTCAGTG TTCCGGCCAC 
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1235 



WO 03/042661 



PCT/US02/36810 



ATCACACTTC CCGGCAAGAT TCCAGTOGAT GACCCAAGGA AACAGGAAAC AGGCCTGCAG 2760 

GGTGATTTCG AGGTCTGGAC TCTATGCCCC GCTGTGAAGG TGGTTGTAGG CAGCCCTCAA 2020 

GCTGAACGGT GCATTOGATT GGCTCTCAGT CTGAAAAAGT ACAGTTCTGA TTAA 2874 

5 Seq ID NO: C64 DNA Sequence 

Nucleic Acid Accession 8: XM_168S7l.l 

Coding sequence: 155.. 988 * 

in 1 11 21 31 41 51 

10 1 I | | II 

TACACAGTCC TGGAGGAACT GAGTCCAGGA ACCATCGTGG CCAATATCAC AGCGGAGGAT 60 

CCTGATGATG AAGGTTTTCC CAGCCACCTC CTCTACAGCA TTACCACTGT TAGCAAATAT 120 

TTCATGATAA ATCAGTTGAC TGGTACAATC CAAGTGGCCC AAAGGATAGA CCGAGATGCA 180 

GGTGAATTGA GACAAAATCC CACCATTTCC CTGGAAGTTC TAGTGAAGGA CAGACCATAT 240 

ID GGGGGTCAGG agaatcgcat CCAGATAACC TTCATTGTGG AAGACGTCAA CGACAATCCT 300 

GCCACATGCC AAAAGTTCAC CTTCAGCATT ATGGTGCCGG AAAGAACAGC CAAGGGGACG 360 

TTGCTTCTTG ACCTAAACAA GTTCTGCTTT GATGATGACA GTGAGGCACC AAACAACAGA 420 

TTCAACTTCA CCATGCCATC TGGAGTGGGG AGCGGCAGCA GATTTTTACA GGATCCAGCT 480 

GGCTCTGGGA AGATTGTGCT GATTGGTGAT CTAGACTACG AAAATCCAAG TAACCTAGCA 540 

ZU GCCGGCAATA AATATACGGT GATAATCCAG GTGCAGGATG TGGCCCCCCC TTACTATAAA 600 

AATAACGTCT ACGTTTATAT CCTAACAAGC CCAGAAAATG AGTTTCCTCT CATTTTTGAT 660 

AGGCCATCCT ATGTATTTGA TGTGTCAGAA AGAAGGCCCG CCCAGGGTCA CCTATCAGGT 720 

CCTGAGGAAA AACGTTTACT CTCCATCTGC ATGGTACGTG CCGTTTGTCA TCACTTTGGG 780 

CTCCATATTG CTTCTGGGTC TCCTCGTGTA CCTGGTCGTC CTATTGGCCA AAGCCATCCA 840 

Zt) CAGACACTGC CCCTGCAAGA CTGGGAAGAA CAAGGAACCT CTGACAAAGA AAGGAGAAAC 900 

GAAGACTGCA GAGAGAGACG TCGTGGTGGA AACTATCCAG ATGAACACTA TCTTTGATGG 960 

AGAAGCCATA GATCCAGTGA CCGGGGAAAC ATATGAATTC AACTCAAAAA CTGGAGCCAG 1020 

AAAGTGGAAA GATCCACTAA CCCAAATGCC AAAATCGAAA GAGTCCAGCC ACCAGGGAGC 1080 

aA TGCCCCACGC AGAGTCACTG CTGGGGAAGG GATGGGGTCA CTGAGAAGTG CCAACTGGGA 1140 

3U AGAAGATGAG CTGAGTGGCA AAGCGTGGGC TGAGGATGCT GGTCTGGGTT CCAGAAATGA 1200 

GGGTGGCAAG CTGGGCAACC CAAAGAACAG AAATCCAGCC TTCATGAACA GGGCTTACCC 1260 

CAAACCACAC CCAGGAAAGT AAACGGGGTC TAAGGAGGGG CCTGTCAATC ACTGAGATGC 1320 

TGCCTCACCC TAAATTCTAT GGGGATGGTG TGGGCATGGT GTAGGGGGGA AAATGTGGGC 1380 

TGAGGGGATT CAGACATCCA GGGTCAAACA TGGGATGTTT GACAAATTTT TAAACAAATA 1440 

3 J GAAAGGGGTT TGATCACATA GTTGCGTGTT CTGAAATGAT ACAGGAACAT TTTCTATCAG 1500 

ATTTCAGAAC TACCTGTGCT TCTGATAAGC AAGACTGTTA ACTTTGGGGT GTGGAATTGT 1560 

TGTGTTTCTT CTTTGCATTG ACTGCTAGGA AGCTCTATTC TGTTCACCAT AGAAAGTTTG 1620 

TAGGAATTCC TGACATAAAT AGTGAAGACT ATCCTTACAT CTGGTTTCCA CCTTATTTTC 1680 

AC . CTGCCCTCGT TTTAACATCA CCCAGATTTC TTCAGTTATA AATATGCCAT ACACCTTTGT 1740 

4U AAGTCACCTC AAATCTTCTT CAAAAGAAGC AGAACAGTGA AAAAAACAGA TGAGTAAGTT 1800 

AAGAGTTGGT CATCTGGAAA GAAGAAAACT CAGTAGGCAC CTTCTTTTGT TTTTTCTTGT 1860 

GGTGTCCGGA TCAGCATCCT GCATGTGAGA TTCATCCACG TTGTCCTGTC TAGCAGTAGT 1920 

TCAGTTCTCT TCATGGTTAT GTCTGGTTTC ATTCTATGAT TATATCACAA TTTATCTATT 1980 

CTACACTTGG GTGGCAGCTG CTTCAGATTT TTTACTTTTA AAAAATATAC TTAAAAGTGA 2040 

4D ACTACAGGCA GGGCATGATG GCTCATGCCT GTAATGCCAG CACTTTGCCA AGGTGGGCAG 2100 

ATCACCTAAG GTCAGGAGTT CAAGATCAGC CTGGCCTAGA TGGCAAAACC CTGTCTCTAC 2160 

TAAAAAATAC AAAAATTAGC TTGGTGTGGT GGTGGGCACA TGTAATCCCA GCTACTTGGG 2220 

AGGCTGAGGT AGGGAGAACT GCTTAAACCT GAGAGGTGGA GGTTACAGTG AGTTGAGATT 2280 

GTGCCACTGC ACTCTAGCCT GGGTGACAAA GCAAGACTCC ATCTCAGAAA AAAAAAATAA 2340 

AAGTGAATTA CAACACT 2357 
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Seq ID NO: C65 DNA Sequence 

Nucleic Acid Accession ft: NM_O05266.3 

Coding sequence: 122.. 1198 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGC CATTTTCAAA CAGTCCCTCC TGGGAGAACA CAGACAGGCA GAGGATTACA 60 

ACACAAGGCA GCAAGCACTG GGAGACGAAA GTTTTGGCAT CTGTTCCCTG GCTGTGCCAA 120 

GATGGGCGAT TGGAGCTTCC TGGGAAATTT CCTGGAGGAA GTACACAAGC ACTCGACCGT 180 

GGTAGGCAAG GTCTGGCTCA CTGTCCTCTT CATATTCCGT ATGCTCGTGC TGGGCACAGC 240 

TGCTGAGTCT TCCTGGGGGG ATGAGCAGGC TGATTTCCGG TGTGATACGA TTCAGCCTGG 300 

CTGCCAGAAT GTCTGCTAOG ACCAGGCTTT CCCCATCTCC CACATTCGCT ACTGGGTGCT 360 

GCAGATCATC TTCGTCTCCA CGCCCTCTCT GGTGTACATG GGCCACGCCA TGCACACTGT 420 

GCGCATGCAG GAGAAGCGCA AGCTACGGGA GGCCGAGAGG GCCAAAGAGG TCCGGGGCTC 480 

TGGCTCTTAC GAGTACCCGG TGGCAGAGAA GGCAGAACTG TCCTGCTGGG AGGAAGGGAA 540 

TGGAAGGATT GCCCTCCAGG GCACTCTGCT CAACACCTAT GTGTGCAGCA TCCTGATCCG 600 

CACCACCATG GAGGTGGGCT TCATTGTGGG CCAGTACTTC ATCTACGGAA TCTTCCTGAC 660 

CACCCTGCAT GTCTGCOGCA GGAGTCCCTG TCCCCACCCG GTCAACTGTT ACGTATCCCG 720 

GCCCACAGAG AAGAATGTCT TCATTGTCTT TATGCTGGCT GTGGCTGCAC TGTCCCTCCT 780 

CCTTAGCCTG GCTGAACTCT ACCACCTGGG CTGGAAGAAG ATCAGACAGC GATTTGTCAA 840 

ACGGCGGCAG CACATGGCTA AGTGCCAGCT TTCTGGCCCC TCTGTGGGCA TAGTCCAGAG 900 

CTGCACACCA CCCCCOGACT TTAATCAGTG CCTGGAGAAT GGCCCTGGGG GAAAATTCTT 960 

CAATCCCTTC AGCAATAATA TGGCCTCCCA ACAAAACACA GACAACCTGG TCACCGAGCA 1020 

AGTACGAGGT CAGGAGCAGA CTCCTGGGGA AGGTTTCATC CAGGTTCGTT ATGGCCAGAA 1080 

GCCTGAGGTG CCCAATGGAG TCTCACCAGG TCACCGCCTT CCCCATGGCT ATCATAGTGA 1140 

CAAGCGACGT CTTAGTAAGG CCAGCAGCAA GGCAAGGTCA GATGACCTAT CAGTGTGACC 1200 

CTCCTTTATG GGAGGATCAG GACCAGGTGG GAACAAAGGA GGCTCAGAGA GGAAAGACGT 1260 

GTCCCTTCTG AACTGATGCT TTCTCACTGT CATCACTGCT TGGCTCCTTT GAGCCCCGGG 1320 

TCTCAATGAC GTTGCTCATT AATTCTAGAA ACTATAACCA GGGCTCTGGG ATAGTAAGAG 1380 

AGGTGACAAC CCACCCAGAC TGCAGTTCCC TCCCCACCCT CTACCCAOTA TACGAAGCCT 1440 

TTCAGATTAC TCATGAAACA GGGTAGAGGG AAAGAAGGGA AGCATGGCAA AAGCTGGCCT 1500 

GGAAGGGATA GCCAGAGGGA TAGAATGACT CTCTCTCTAC ATACCAGCAG CATACCAAAT 1560 

GCGTTCTCTA AGTTCCTACC TCCTTGACCT GATCACCCTC CCTCCTCCAA GGAAGAGCTC 1620 
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AAAGTTCCCA GCCAATAGAC AGCATGAATC AAGGAACTTG CATTATATGT GCTCTTGAAT 1680 

CTGTTGTCTC CATGGACCAT TCCTCGGAGT AGTGGTGAGA TGGCCTTGGG TTGCCCTTGG 1740 

CTTCTCCTCC CTCTACTCAG CCTTAAAAAG GGCTTCTTGG AACTTTACCA GCAGCCTCAG 1800 

CTTTACAAAT GCCTTGGTAT GTACCTCTGG CAAATGCCCT GGTATGTACC TCTGGCAAAT 1860 

GCCCCACCTT GGTGATGTTG CAACCTTTCC TTCT6CTAGG GTGTACACCT AGCCTGTGCA 1920 

GGTGTCAGCC CTGCTAGGGA GTCACTGTAC ACACAAACTC TACTGGAATT CCTGCCAACA 1980 

TCTGTCACCC TGCAGCTCCT TTACAGTTCA ATCCAATGAT AGAAACCATC CCTTCCCTXT 2040 

CTCCCTTGGC TGTTCACCCA GCCATTCCCT GAAGGCCTTA CCAACAGGAA TATCCAAGAA 2100 

GCTGTTGTCC TCTCTCGAAC CCTGACCAGA TCATCAGCCA CTGAGGCCAG TGGAATTTCC 2160 

CCAGGCCTTG TTAAAAAAAA AAAAAAAAAA 2190 

Seq ID NO: C66 DNA Sequence 

Nucleic Acid Accession ftt NM_014459.2 

Coding sequence: 738.. 3407 



1 11 21 31 41 SI 

I I I I I I 

GTAGATGCAG TCCGCCGCCG CCGCTGCCTC AGCCAGCAAT GCAAGATTAG ATCTCTAAAT 60 

GCAGCAAAAC ACTGCCTGAA AACAGACOGG CCCGCGCAGC AAGCAGACAT TTCACGGTGC 120 

GCTGGGGAAG CTTCAAAATA TATCTGTGAC TCTGTCTTCG TTGCTCTTCA TCCCCATCAA 180 

TTTCATCACG GGAGGCGAGC AGCAAGTAAG AATTTCACTT TCGGATCTGC CTAGAGACAC 240 

ACCTCCCTGC TCCCTCCCCC ACTCGATGTG AAGAGTATTC CGGAGTCTCC GGGCGGGAGT 300 

AGATTTGCAG CACCCTAGCG GGAGCGAGGA AAACCTACTG ATTCTTTAGC TCATTATCAT 360 

CTCTCCCAGA CGAGATTTCC TTCTTATOGC CTGCCTCATC GCTCAAGTTT GAGCCTCCCG 420 

AAGTCCGGGC GGGAGAGACG AAACCCCTGG CTCACCCCCA GCCGCAGGAA GCCACCGCCT 480 

TGCTCCAAGC CCCTGCAGCT CTGCTGCACC GCAGCTTCTC ACCCAGTGCG GATGCTGTAG 540 

ATCAACAGGT TCAGGGAACT TGAGCAGAAT AAGGAGAGAC CACCGGGTGC CGCAGCTCGG 600 

GTGCAGAGGG AAAAAAGGAC CCATAGACTT GTGGCTCGCG TCGCGCGCGC AOGCTGCGCC 660 

AGGGCCCCAG GCTGGCGCGC ACTCCCTCTC TGGCTCCTCC AGTCCGATTG CTCCTGCCCC 720 

CACCTTACAG GTCTGGGATG TACCTTTCCA TCTGTTGCTG CTTTCTTCTA TGGGCCCCTG 780 

CCCTCACTCT CAAGAACCTC AACTACTCCG TGCCGGAGGA GCAAGGGGCC GGCACGGTGA 840 

TCGGGAACAT CGGCAGGGAT GCTCGACTGC AGCCTGGGCT TCCGCCTGCA GAGCGCGGCG 900 

GGGGAGGGCG CAGCAAGTCG GGTAGCTACC GGGTGCTGGA GAACTCCGCA CCGCACCTGC 960 

TGGACGTGGA CGCAGACAGC GGGCTCCTCT ACACCAAGCA GCGCATCGAC CGCGAGTCCC 1020 

TGTGCCGCCA CAATGCCAAG TGCCAGCTGT CCCTCGAGGT GTTCGCCAAC GACAAGGAGA 1080 

TCTGCATGAT CAAGGTAGAG ATCCAGGACA TCAACGACAA CGCGCCCTCC TTCTCCTOGG 1140 

ACCAGATCGA AATGGACATC TCGGAGAACG CTGCTCCGGG CACCCGCTTC CCCCTCACCA 1200 

GCGCACATGA CCCCGAOGCC GGCGAGAATG GGCTCCGCAC CTACCTGCTC ACGCGCGACG 1260 

ATCACGGCCT CTTTGGACTG GACGTTAAGT CCCGCGGCGA CGGCACCAAG TTCCCAGAAC 1320 

TGGTCATCCA GAAGGCTCTG GACCGCGAGC AACAGAATCA CCATACGCTC GTGCTGACTG 13 80 

CCCTGGACGG TGGCGAGCCT CCACGTTCCG CCACCGTACA GATCAACGTG AAGGTGATTG 1440 

ACTCCAACGA CAACAGCCCG GTCTTCGAGG OGCCATCCTA CTTGGTGGAA CTGCCCGAGA 1500 

ACGCTCCGCT GGGTACAGTG GTCATCGATC TGAACGCCAC CGACGCCGAT GAAGGTCCCA 1560 

ATGGTGAAGT GCTCTACTCT TTCAGCAGCT ACGTGCCTGA CCGCGTGCGG GAGCTCTTCT 1620 

CCATCGACCC CAAGACCGGC CTAATCOGTG TGAAGGGCAA TCTGGACTAT GAGGAAAACG 1680 

GGATGCTGGA GATTGACGTG CAGGCCCGAG ACCTGGGGCC TAACCCTATC CCAGCCCACT 1740 

GCAAAGTCAC GGTCAAGCTC ATCGACCGCA AGGACAATGC GCCGTCCATC GGTTTCGTCT 1800 

COGTGCGCCA GGGGGCGCTG AGCGAGGCCG CCCCTCCCGG CACCGTCATC GCCCTGGTGC 1860 

GGGTCACTGA CCGGGACTCT GGCAAGAACG GACAGCTGCA GTGTCGGGTC CTAGGCGGAG 1920 

GAGGGACGGG CGGCGGCGGG GGCCTGGGCG GGCCCGGGGG TTCCGTCCCC TTCAAGCTTG 1980 

AGGAGAACTA CGACAACTTC TACACGGTGG TGACTGACCG CCCGCTGGAC CGCGAGACAC 2040 

AAGACGAGTA CAACGTGACC ATCGTGGCGC GGGACGGGGG CTCTCCTCCC CTCAACTCCA 2100 

CCAAGTCGTT CGCGATCAAG ATTCTAGACG AGAACGACAA CCCGCCTCGG TTCACCAAAG 2160 

GGCTCTACGT GCTTCAGGTG CACGAGAACA ACATCCCGGG AGAGTACCTG GGCTCTGTGC 2220 

TCGCCCAGGA TCCCGACCTG GGCCAGAAGG GCACCGTATC CTACTCTATC CTGCCCTCGC 2280 

ACATCGGCGA CGTGTCTATC TACACCTATG TGTCTGTGAA TCCCACGAAC GGGGCCATCT 2340 

ACGCCCTGCG CTCCTTTAAC TTCGAGCAGA CCAAGGCTTT TGAGTTCAAG GTGCTTGCTA 2400 

AGGACTGGGG GGCGCCCGCG CACTTGGAGA GCAAOGCCAC GGTGAGGGTG ACAGTGCTAG 2460 

ACGTGAATGA CAACGCGCCA GTGATCGTGC TCCCCACGCT GCAGAACGAC ACGGCGGAGC 2520 

TGCAGGTGCC GCGCAACGCT GGCCTGGGCT ATCTGGTGAG CACTGTGCGC GCCCTAGACA 2580 

GCGACTTCGG CGAGAGCGGG CGTCTCACCT AOGAGATCGT GGACGGCAAC GACGACCACC 2640 

TGTTTGAGAT CGACCCGTCC AGCGGCGAGA TCCGCACGCT GCACCCTTTC TGGGAGGACG 2700 

TGACGCCCGT GGTGGAGCTG GTGGTGAAGG TGACCGACCA CGGCAAGCCT ACCCTGTCCG 2760 

CAGTGGCCAA GCTCATCATC GGCTOGGTGA GCGGATCCCT TCCCGAGGGG GTACCACGGG 2820 

TGAATGGCGA GCAGCACCAC TGGGACATGT CGCTGCCGCT CATCGTGACT CTGAGCACTA 2880 

TCTCCATCAT CCTCCTAGCG GCCATGATCA CCATCGCCGT CAAGTGCAAG OGCGAGAACA 2940 

AGGAGATCCG CACTTACAAC TGCCGCATCG COGAGTACAG CCACCCGCAG CTGGGTGGGG 3000 

GCAAGGGCAA GAAGAAGAAG ATCAACAAAA ATGATATCAT GCTGGTGCAG AGCGAAGTGG 3060 

AGGAGAGGAA CGCCATGAAC GTCATGAACG TGGTGAGCAG CCCCTCCCTG GCCACCTCCC 3120 

/U CCATGTACTT CGACTACCAG ACOCGCCTGC CCCTCAGCTC GCCCCGGTCG GAGGTGATGT 3180 

ATCTCAAACC GGCCTCCAAC AACCTGACTG TCCCTCAGGG GCACGCGGGC TGCCACACCA 3240 

GCTTCACCGG ACAAGGGACT AATGCAAGCG AGACCCCTGC CACTCGGATG TCCATAATTC 3300 

AGACAGACAA TTTTCCCGCA GAGCCCAATT ACATGGGCAG CAGGCAGCAG TTTGTTCAAA 3360 

GTATTTCAGT AGCTCCACGT TTAAGGACCC AGAAAGAGCC AGCCTGAGAG ACAGTGGGCA 3420 

CGGGGACAGT GATCAGGCTG ACAGTGACCA AGACACTAAC AAAGGCTCCT GCTGTGACAT 3480 

GTCTGTTAGG GAGGCACTCA AGATGAAAAC TACTTCAACT AAAAGCCAAC CACTTGAACA 3540 

AGAACCAGAA GAGTGTGTTA ATTGCACAGA TGAATGCCGA GTGCTTGGTC ATTCTGACAG 3600 
GTG CTGGA TG CCACAGTTCC CTGCAGCCAA TCAGGCTGAA AATGCAGATT ACCGCACAAA 3660 

TCTCTTTGTA CCTACAGTTG AAGCTAATGT TGAGACTGAG ACTTACGAAA CTGTGAATCC 3720 
CACTGGGAAA AAGACTTTTT GTACATTTGG AAAAGACAAG CGAGAGCACA CTATTCTCAT 3780 
TGCCAACGTT AAACCTTATT TAAAAGCCAA ACGTGCCCTG AGCOCTCTCC TCCAAGAGGT 3840 
CCCCTCAGCA TCAAGCAGCC CAACCAAGGC GTGCATCGAG CCTTGCACCT CAACAAAAGG 3900 
CTCOCTGGAT GGCTGTGAAG CAAAACCAGG AGCCCTGGCT GAAGCAAGCA GTCAGTACTT 3960 
GCCCACTGAC AGTCAATATC TGTCACCTAG TAAGCAACCA AGAGACCCTC CCTTCATGGC 4020 
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TTCCGATCAG ATGGCAAGGG TCTTTGCAGA TGTGCATTCC AGAGCCAGCC GGGATTCCAG 4080 

TGAGATGGGT GCTGTTCTTG AGCAGCTTGA CCACCCCAAC AGGGATCTGG OCAGAGAGTC 4140 

^TGGATGCA GAGGAAGTTG TGAGAGAAAT TGATAAGCTT TTGCAAGACT GCCGGGGAAA 4200 

5CGACCCTGTG GCTGTGAGAA AGTGAAAAAA GAAAAAAAAA AAGGCATTGG CATTTTCTTG 4260 

TCTCTTCTGT TGATTTAAAA ATGATCCCTC CTGGTGATAA CCCATTTTAC AGGGATGAAG 4320 

AAAGACCAAT GCTGC TTTA A GGCTTTTAGT GAACATCTGA AGTGCCCACA AGTATGTTCT 4380 

TTCCACTGCT GATTTCTTTT TCAGAGATAA CAATGGTTTC GTTTTGACCA AACTTGTATT 4440 

AGGACAGAAT TAATGATGCT TAAAGAGAAA AGAAAAAAAG AGAGAAGAAA AAGGAGAGAT 4500 

GAAAAAGGAG GATGAGGAGA AGAATTACCT TTTGACAATC TGTTAGGAAG GTATGCAGTG 4560 

TGAGAACTGA AGTATTTCTG ATCACTCTCA GACTGTCCTC CGTGATTTAT GCTGACTTAA 4620 

CTGTTTACCT ATAAACCCCA TACAAAGCAG GGTCATAATT TGTGATCTGT GGTGGATTTC 4680 

TAGCAGTCAT CACAGGCTTC TACTGAAAGT CCTGAAAAGA CCTTGCAGTA GTCCAAGCTA 4740 

CACCAAACAT TAACACATAT TTGTGGTAAA CATTTCTGTA TAAAGTTACC TGACACACAT 4800 

1 ATAAAGACAA GGAACATTCC ATATCATTAG TOGAAAACAA AAACAAAAAA AAAACCTTTG 4860 

ID GTCATTTGTA AGACATCTCA TGTCATATAA AAGTTAAATG TAAAAAGATA CAGTCCATTT 4920 

TGTCCTGCAC ACACGTAGAC TAATTCACGT CAAAAAAAAA AAAAAA 4966 

Seq ID NO: C67 DNA Sequence 
Nucleic Acid Accession ft: NM 005601.2 
2A) Coding sequence: 101.. 598 ~ 



25 
30 



1 11 21 31 41 51 

I I I I I I 

CCCAGGAGTC TGGGTGCACA GCCTCCTTCT CTCTGAGATT CAAGAGTCTG ATCAGCAGCC 60 

TCTTCCTCCT CCAGGACCCA GAAGCCCTGA GCTTATCCCC ATGGAGCTCT GCCGGTCCCT 120 

GGCCCTGCTG GGGGGCTCCC TGGGCCTGAT GTTCTGCCTG ATTGCTTTGA GCACCGATTT 180 

CTGGTTTGAG GCTGTGGGTC CCACCCACTC AGCTCACTCG GGCCTCTGGC CAACAGGGCA 240 

TGGGGACATC ATATCAGGCT ACATCCACGT GACGCAGACC TTCAGCATTA TGGCTGTTCT 300 

GTGGGCCCTG GTGTCCGTGA GCTTCCTGGT CCTGTCCTGC TTCCCCTCAC TGTTCCCCCC 360 

AGGCCACGGC CCGCTTGTCT CAACCACCGC AGCCTTTGCT GCAGCCATCT CCATGGTGGT 420 

GGCCATGGCG GTGTACACCA GCGAGCGGTG GGACCAGCCT CCACACCCCC AGATCCAGAC 480 

CTTCTTCTCC TGGTCCTTCT ACCTGGGCTG GGTCTCAGCT ATCCTCTTGC TCTGTACAGG S40 

TGCCCTGAGC CTGGGTGCTC ACTGTGGCGG TCCCCGTCCT GGCTATGAAA CCTTGTGAGC 600 

AGAAGGCAAG AGCGGCAAGA TGAGTTTTGA GCGTTGTATT CCAAAGGCCT CATCTGGAGC 660 

33 CTCGGGAAAG TCTGGTCCTA CATTTGCCCG CCCTTCCAGC CCTTCCCCAG CCCCTCCTCT 720 

TGTTTCTTCA TTCATTCAAC AAAATTTGGC TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA 780 

AAA 783 

40 Seq ID KO: C68 DNA Sequence 

Nucleic Acid Accession ft: NM_006433.2 
Coding sequence: 129.. 566 

AC 1 11 21 31 41 51 

45 I I I I I I 

GTATCTGTGG TAAACCCAGT GACACGGGGG AGATGACATA CAAAAAGGGC AGGACCTGAG 60 

AAAGATTAAG CTGCAGGCTC CCTGCCCATA AAACAGGGTG TGAAAGGCAT CTCAGCGGCT 120 

GCCCCACCAT GGCTACCTGG GCCCTCCTGC TCCTTGCAGC CATGCTCCTG GGCAACCCAG 180 

GTCTGGTCTT CTCTCGTCTG AGCCCTGAGT ACTACGACCT GGCAAGAGCC CACCTGCGTG 240 

ATGAGGAGAA ATCCTGCCCG TGCCTGGCCC AGGAGGGCCC CCAGGGTGAC CTGTTGACCA 300 

AAACACAGGA GCTGGGCCGT GACTACAGGA CCTGTCTGAC GATAGTCCAA AAACTGAAGA 360 

AGATGGTGGA TAAGCCCACC CAGAGAAGTG TTTCCAATGC TGCGACCCGG GTGTGTAGGA 420 

CGGGGAGGTC ACGATGGCGC GACGTCTGCA GAAATTTCAT GAGGAGGTAT CAGTCTAGAG 480 

TTACCCAGGG CCTCGTGGCC GGAGAAACTG CCCAGCAGAT CTGTGAGGAC CTCAGGTTGT 540 

GTATACCTTC TACAGGTCOC CTCTGAGCCC TCTCACCTTG TCCTGTGGAA GAAGCACAGG 600 

CTCCTGTCCT CAGATCCCGG GAACCTCAGC AACCTCTGCC GGCTCCTCGC TTCCTCGATC 660 

CAGAATCCAC TCTCCAGTCT CCCTCCCCTG ACTCCCTCTG CTGTCCTCCC CTCTCACGAG 720 

AATAAAGTGT CAAGCAAG 738 
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Seq ID NO: C69 DNA Sequence 

Nucleic Acid Accession ft: NM_002985.2 

Coding sequence: 69.. 344 



1 11 21 31 41 51 

I I 1 I I 1 

GCTGCAGAGG ATTCCTGCAG AGGATCAAGA CAGCACGTGG ACCTCGCACA GCCTCTCCCA 60 

CAGGTACCAT GAAGGTCTCC GCGGCAGCCC TCGCTGTCAT CCTCATTGCT ACTGCCCTCT 120 

GCGCTCCTGC ATCTGCCTCC CCATATTCCT OGGACACCAC ACCCTGCTGC TTTGCCTACA 180 

TTGCCCGCCC ACTGCCCOGT GCCCACATCA AGGAGTATTT CTACACCAGT GGCAAGTGCT 240 

CCAACCCAGC AGTCGTCTTT GTCACCCGAA AGAACCGCCA AGTGTGTGCC AACCCAGAGA 300 

AGAAATGGGT TCGGGAGTAC ATCAACTCTT TGGAGATGAG CTAGGATGGA GAGTCCTTGA 360 

ACCTGAACTT ACACAAATTT GCCTGTTTCT GCTTGCTCTT GTCCTAGCTT GGGAGGCTTC 420 

CCCTCACTAT CCTACCCCAC CCGCTCCTTG AAGGGCCCAG ATTCTACCAC ACAGCAGCAG 480 

/ 1> TTACAAAAAC CTTCCCCAGG CTGGACGTGG TGGCTCACGC CTGTAATCCC AGCACTTTGG 540 

GAGGCCAAGG TGGGTGGATC ACTTGAGGTC AGGAGTTOGA GACCAGCCTG GCCAACATGA 600 

TGAAACCCCA TCTCTACTAA AAATACAAAA AATTAGCCGG GCGTGGTAGC GGGCGCCTGT 660 

AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT GAACCCGGGA GGCGGAGCTT 720 

QA GCAGTGAGCC GAGATCGCGC CACTGCACTC CAGCCTGGGC GACAGAGCGA GACTCCGTCT 780 

oU CAAAAAAAAA AAAAAAAAAA AAAATACAAA AATTAGCCGG GCGTGGTGGC CCACGCCTGT 840 

AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA GGTGGAGGCT 900 

GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT GACAAAGTGA GACTCCGTCA 960 

CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA AGAG CT TCTG AGGCGCTGCT 1020 

TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCCAGGGCTC 1080 
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75 



TGTGACCAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGO AGGGGAGGAA CACTGCACTC 1140 

TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA AAAATCGGCT 1200 

TACCATTAAA GTTCTCAATG CAACCATAAA AAAAAAA 1237 

Seq ID NO: C70 DNA Sequence 

Nucleic Acid Accession #: NM_0221S4.2 

Coding sequence: 1381. .1722 ~ 

1 11 21 31 41 51 

I 1 I I I I 

AGTGTGGTTT TAGTTTTTCC TAAGAAGTGG CGT G GTTTGG GGCTTTATAT CCGGGAGGAG 60 

CATATGTACG CAAATCCTGG GGCGTTTGCA AACCCGGATC CGGGGCGTCT GGCCCCATGC 120 

CCGGCCGGGC GTTTGAGGGC TACTGCCACG CAGCGTTTCT GGAGCCTGCC GGCTGGTGCC 180 

CTGGTGGOCT TTATCTCTGT CCCCCTTTGT CCTCTTTATC TCAGGCTCTC CAGGAGGCCG 240 

GGGGGCCCAC TCCGCCTATC GCTCCCCTCG GCTACGCTGC CACTCCAATG CCCCGCAGGT 300 

CGCGAGCTGC TGTTCTTTCG AAGGCGCCGG AGAACCAGGG GCGTCCOGCG CCACCTCTGA 360 

CTCGGAGCAG CGCCGAGCAC TGACGCTCCC GCCCTTGGGC AAGGACGCCA GTGCGCCCGC 420 

GCGCGTCCCT CTGCGCGGCA GCCCGTOGCG GGCCCTCAAG GGGAAGCCCA GGCCAGGATG 480 

GCCCCGGGTC GOGCGGTGGC CGGGCTCCTG TTGCTGGCGG COGCCGGCCT CGGAGGAGTG 540 

GCGGAGGGGC CAGGGCTAGC CTTCAGCGAG GATGTGCTGA GCGTGTTCGG CGCGAATCTG 600 

AGCCTGTOGG CGGCGCAGCT OCAGCACTTG CTGGAGCAGA TGGGAGCCGC CTCCCGCGTG 660 

GGCGTCCCGG AGCCTGGCCA GCTGCACTTC AACCAGTGTT TAACTGCTGA AGAGATCTTT 720 

TCCCTTCATG GCTTTTCAAA TGCTACCCAA ATAACCAGCT CCAAATTCTC TGTCATCTGT 780 

CCAGCAGTCT TACAGCAATT GAACTTTCAC CCATGTGAGG ATOGGCCCAA GCACAAAACA 840 

AGACCAAGTC ATTCAGAAGT TTGGGGATAT GGATTCCTGT CAGTGACGAT TATTAATCTG 900 

GCATCTCTCC TCGGATTGAT TTTGACTCCA CTGATAAAGA AATCTTATTT CCCAAAGATT 960 

TTGACCTTTT TTGTGGGGCT GGCTATTGGG ACTCTTTTTT CAAATGCAAT TTTCCAACTT 1020 

ATTCCAGAGG CATTTGGATT TGATCCCAAA GTCGACAGTT ATGTTGAGAA GGCAGTTGCT 1080 

GTGTTTGGTG GATTTTACCT ACTTTTCTTT TTTGAAAGAA TGCTAAAGAT GTTATTAAAG 1140 

ACATATGGTC AGAATGGTCA TACCCACTTT GGAAATGATA , ACTTTGGTCC TCAAGAAAAA 1200 

ACTCATCAAC CTAAAGCATT ACCTGCCATC AATGGTGTGA CATGCTATGC AAATCCTGCT 1260 

GTCACAGAAG CTAATGGACA TATCCATTTT GATAATGTCA GTGTGGTATC TCTACAGGAT 1320 

GGAAAAAAAG AGCCAAGTTC ATGTACCTGT TTGAAGGGGC CCAAACTGTC AGAAATAGGG 1380 

ACGATTGCCT GGATGATAAC GCTCTGCGAT GCCCTCCACA ATTTCATCGA TGGCCTGGOG 1440 

ATTGGGGCTT CCTGCACCTT GTCTCTCCTT CAGGGACTCA GTACTTCCAT AGCAATCCTA 1500 

TGTGAGGAGT TTCCCCACGA GTTAGGAGAC TTTGTGATCC TACTCAATGC AGGGATGAGC 1560 

ACTCGACAAG CCTTGCTATT CAACTTCCTT TCTGCATGTT CCTGCTATGT TGGGCTAGCT 1620 

TTTGGCATTT TGGTGGGCAA CAATTTCGCT CCAAATATTA TATTTGCACT TGCTGGAGGC 1680 

ATGTTCCTCT ATATTTCTCT GGCAGATATG TTTCCAGAGA TGAATGATAT GCTGAGAGAA 1740 

AAGGTAACTG GAAGAAAAAC CGATTTCACC TTCTTCATGA TTCAGAATGC TGGAATGTTA 1800 

ACTGGATTCA CAGCCATTCT ACTCATTACC TTGTATGCAG GAGAAATCGA ATTGGAGTAA 1860 

TAGAAAATGG AAGATGGTGT TGTTAATAAA GGCATTTAAT AGATAAAAAC ATCTCCAAAA 1920 

AGGATTTTGA AGCTGATCCT ATTTAGTTAA AAAGATAATT TTGCTTTCAA CTGTAGGTCC 1980 

AGAAAACTAA TTATTGGCAT CAGTCTGTGA AATAGTCCAT TATTTGTTGT TAAAAATGCT 2040 

TC AAAAGG TT TTCAGTGTCA GTCTGAGATG CCTGGTATAT AGGAGCCTTT GGGAAATACT 2100 

TATTTTTCAG TATTCCATGC ATATTAGATA TCACCATGAA GCAAGAGACA TGCATTCTAT 2160 

AATCATGTAG ACACTCAGAC TCAGGGGAAA ATACAAGTTA TATCCTGAAA GCCTTTAAAA 2220 

CTCTATGGTA GGATCAAAGA TTCAAATGGT TTCAGAGAGG TTTTATTTCA ATTAATTTGT 2280 

TCTAGTGCTT TCAAGAGCAA GTACATCAAA ATGTAGAAGG TAAAATGTAT GCAACACTAA 2340 

TATAAATTAT TCCAAGTCTT TAAGGAGCCA AAGAAAAAAA AGATTTCTCA CAGCTTTTTG 2400 

TTCTGTTTTG TATTTCAATT AGGAACTTGC AGTATTATTT TGAAAACCAT TCTAAAATAA 2460 

TAGGAGTTAG GAAATAAATA AAGTTTTGCT AGCCCTGCTA AGTTCAGGCT TAGAGGCTTA 2520 

TCGCTAAGTN TAAACTTCAC CAGATTCCAC GAAAAGCTGG ATAGCTTTTT TTCTGACTTA 2580 

TGTTGTGGTT GCACCCCTCA CAAATGGCAG AACAGTATGT AAAGCTGGTA ACACCTCGGT 2640 

TTCAGTGCAC CATGTGTTTG CTTTGTQAAG GTGAAGAATA TGTTGGTTTA GAGAAAGAAA 2700 

TTGGATGTAA TTTTATGCAA TTTACTTTTA AAGACAAACA TAACTATTTA GCAGAGAATA 2760 

TTTTAATAAA TGCAAAACAA CAGCTGGACT GCTGTACATC AAGGACAGAT TAACTGGAAA 2820 

ACATATGTTC CTTATGTGTG ATTGAGAGCC ATTCAGAAAA GACTTCCTTT GTGTTCAGCC 2880 

TATA CTTTTC CATATGGTAT ACCTTGAAAA AAATTAGCAC ACCATGGTTA TTTTTCTACC 2940 

TTTTATAAAA GACAGAGCCT GTTTACTCAT TTAGAAGATA GAGAAAATTG GTCTAAAATT 3000 

GAACATCCTA GATTCACACT CCCAAGTCAC TTAAGGTGAT TTGATGGTGA GGAAAATGAT 3060 

TGACAAAGCC CAACAATGAT CTCAGGAATT ACATTTTCCA ACAGACCAAA AAATGTTTTC 3120 

ATGTAGCAGC AATGCAGATT TGGTGAATAT TTAATATATA TTTTAGTATG TATTTCACTT 3180 

TATGACTGAC AATTAAAAAA TATTGTTTGG CCAAATAGTA AACACCCTTT TGAAACCATG 3240 

AAAAAA 3246 

Seq ID NO: C71 DNA Sequence 

Nucleic Acid Accession 8i NMJJ04184.2 

Coding sequence: 188.. 1603 ~ 



1 11 21 31 41 51 

I 1 I I I I 

CGAAAAAAGA GGGGAAGAGT ATTAAAGACC ATTTCTGGCT GGGCAGGGCA CTCTCAGCAG 60 

CTCAACTGCC CAGCGTGACC AGTGGCCACC TCTGCAGTGT CTTCCACAAC CTGGTCTTGA 120 

CTCGTCTGCT GAACAAATCC TCTGACCTCA GGCCGGCTGT GAACGTAGTT CCTGAGAGAT 180 

AGCAAACATG CCCAACAGTG AGCCCGCATC TCTGCTGGAG CTGTTCAACA GCATCGCCAC 240 

0A ACAAGGGGAG CTCGTAAGGT CCCTCAAAGC GGGAAATGCG TCAAAGGATG AAATTGATTC 300 

OU TGCAGTAAAG ATGTTGGTGT CATTAAAAAT GAGCTACAAA GCTGCCGCGG GGGAGGATTA 360 

CAAGGCTGAC TGTCCTCCAG GGAACCCAGC ACCTACCAGT AATCATGGCC CAGATGCCAC 420 

AGAAGCTGAA GAGGATTTTG TGGACCCATG GACAGTACAG ACAAGCAGTG CAAAAGGCAT 480 

AGACTACGAT AAGCTCATTG TTCGGTTTGG AAGTAGTAAA ATTGACAAAG AGCTAATAAA 540 

CCGAATAGAG AGAGCCACCG GCCAAAGACC ACACCACTTC CTGOGCAGAG GCATCTTCTT 600 
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CTCACACAGA GATATGAATC AGOTTCTTGA TGCCTATGAA AATAAGAAGC CATTTTATCT 660 

GTACACGGGC CGGGGCCCCT CTTCTGAAGC AATGCATGTA GGTCACCTCA TTCCATTTAT 720 

TTTCACAAAG TGGCTCCAGG ATGTATTTAA CGTGCCCTTG GTCATCCAGA TGACGGATGA 780 

CGAGAAGTAT CTGTGGAAGG ACCTGACCCT GGACCAGGCC TATGGOGATG CTGTTGAGAA 840 

TGCCAAGGAC ATCATCGCCT GTGGCTTTGA CATCAACAAG ACTTTCATAT TCTCTGACCT 900 

GGACTACATG GGGATGAGCT CAGGTTTCTA CAAAAATGTG GTGAAGATTC AAAAGCATGT 960 

TACCTTCAAC CAAGTGAAAG GCATTTTCGG CTTCACTGAC AGCGACTGCA TTGGQAAGAT 1020 

CAGTTTTCCT GCCATCCAGG CTGCTCCCTC CTTCAGCAAC TCATTCCCAC AGATCTTCCG 1080 

AGACAGGAOG GATATCCAGT GCCTTATCCC ATGTGOCATT GACCAGGATC CTTACTTTAG 1140 

iU AATGACAAGG GACGTCGCCC CCAGGATCGG CTATCCTAAA CCAGCCCTGT TGCACTCCAC 1200 

CTTCTTCCCA GCCCTGCAGG GCGCCCAGAC CAAAATGAGT GCCAGCGACC CAAACTCCTC 1260 

CATCTTCCTC ACCGACACGG CCAAGCAGAT CAAAACCAAG GTCAATAAGC ATGCGTTTTC 1320 

TG GAGGG AGA GACACCATCG AGGAGCACAG GCAGTTTGGG GGCAACTGTG ATGTGGACGT 1380 

GTCTTTCATG TACCTGACCT TCTTCCTCGA GGACGACGAC AAGCTCGAGC AGATCAGGAA 1440 

GGATTACACC AGCGGAGCCA TGCTCACCGG TGAGCTCAAG AAGGCACTCA TAGAGGTTCT 1500 

GCAGCCCTTG ATCGCAGAGC ACCAGGCCCG GCGCAAGGAG GTCACGGATG AGATAGTGAA 1560 

AGAGTTCATG ACTCCCCGGA AGCTGTCCTT CGACTTTCAG TAGCACTCGT TTTACATATG 1620 

CTTATAAAAG AAGTGATGTA TCAGTAATGT ATCAATAATC CCAGCCCAGT CAAAGCACCG 1680 

CCACCTGTAG GCTTCTGTCT CATGGTAATT ACTGGGCCTG GCCTCTGTAA GCCTGTGTAT 1740 

GTTATCAATA CTGTTTCTTC CTGTGAGTTC CATTATTTCT ATCTCTTATG GGCAAAGCAT 1800 

TGTGGGTAAT TGGTGCTGGC TAACATTGCA TGGTCGGATA QAGAAGTCCA GCTGTGAGTC 1860 

TCTCCCCAAA GCAGCCCCAC AGTGGAGCCT TCGGCTGGAA GTCCATGGGC CACCCTGTTC 1920 

TTGTCCATGG AGGACTTCCG AGGGTTCCAA GTATACTCTT AAGACCCACT CTGTTTAAAA 1980 

ATATATATTC TATGTATGCG TATATGGAAT TGAAATGTCA TTATTGTAAC CTAGAAAGTG 2040 

Zj CTTTGAAATA TTGATGTGGG GAGGTTTATT GAGCACAAGA TGTATTTCAG CCCATGCCCC 2100 

C TCCCA AAAA GAAATTGATA AGTAAAAGCT TCGTTATACA TTTGACTAAG AAATCACCCA 2160 

GCTTTAAAGC TGCTTTTAAC AATGAAGATT GAACAGAGTT CAGCAATTTT GATTAAATTA 2220 

AGACTTGGGG GTGAAACTTT CCAGTTTACT GAACTCCAGA CCATGCATGT AGTCCACTCC 2280 

AGAAATCATG CTCGCTTCCC TTGGCACACC AGTGTTCTCC TGCCAAATGA CCCTAGACCC 2340 

JU TCTGTCCTGC AGAGTCAGGG TGGCTTTTCC CCTGACTGTG TCCGATGCCA AGGAGTCCTG 2400 

GCCTCCGCAG ATGCTTCATT TTGACCCTTG GCTGCAGTGG AAGTCAGCAC AGAGCAGTGC 2460 

CCTGGCTGTG TCCTGGACGG GTGGACTTAG CTAGGGAGAA AGTCGAGGCA GCAGCCCTCG 2520 

AGGCCCTCAC AGATGTCTAG GCAGGCCTCA TTTCATCACG CAGCATGTGC AGGCCTGGAA 2580 

GAGCAAAGCC AAATCTCAGG GAAGTCCTTG GTTGATGTAT CTGGGTCTCC TCTGGAGCAC 2640 

TCTGCCCTCC TGTCACCCAG TAGAGTAAAT AAACTTCCTT GGCTCCTAAA AAA 2693 
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Seq ID NO: C72 DNA Sequence 

Nucleic Acid Accession #: NM_004938.1 

Coding sequence: 337 -.4632 

1 11 21 31 41 51 

1 I I I I I 

CGGAGGACAG CCGGACCGAG CCAACGCCGG GGACTTTGTT CCCTCCACGG AGGGGACTCG 60 

GCAACTCGCA GCGGCAGGGT CTGGGGCCGG CGCCTGGGAG GGATCTGOGC CCCCCACTCA 120 

CTCCCTAGCT GTGTTCCOGC CGCCGCCCCG GCTAGTCTCC GGCGCTGGCG CCTATGGTCG 180 

GCCTCCGACA GCGCTCCGGA GGGACCGGGG GAGCTCCCAG GCGCCCGGGA CTGGAGACTG 240 

ATGCATGAGG GGCCTACGGA GGCGCAGGAO CGGTGGTGAT GGTCTGGGAA GCGGAGCTGA 300 

AGTCCCCTGG GCTTTGGTGA GGCGTGACAG TTTATCATGA CCGTGTTCAG GCAGGAAAAC 360 

GTGGATGATT ACTACGACAC CGGCGAGGAA CTTGGCAGTG GACAGTTTGC GGTTGTGAAG 420 

AAATGCCGTG AGAAAAGTAC CGGCCTCCAG TATGCCGCCA AATTCATCAA GAAAAGGAGG 480 

ACTAAGTCCA GCCGGCGGGG TGTGAGCCGC GAGGACATCG AGCGGGAGGT CAGCATCCTG 540 

AAGGAGATCC AGCACCCCAA TGTCATCACC CTGCACGAGG TCTATGAGAA CAAGAOGGAC 600 

GTCATCCTGA TCTTGGAACT CGTTGCAGGT GGOGAGCTGT TTGACTTCTT AGCTGAAAAG 660 

GAATCTTTAA CTGAAGAGGA AGCAACTGAA TTTCTCAAAC AAATTCTTAA TGGTGTTTAC 720 

TACCTGCACT CCCTTCAAAT CGCCCACTTT GATCTTAAGC CTGAGAACAT AATGCTTTTG 780 

GA TAGA AATG TCOCCAAACC TCGGATCAAG ATCATTGACT TTGGGTTGGC CCATAAAATT 840 

GACTTTGGAA ATGAATTTAA AAACATATTT GGGACTCCAG AGTTTGTCGC TCCTGAGATA 900 

GTCAACTATG AACCTCTTGG TCTTGAGGCA GATATGTGGA GTATCGGGGT AATAACCTAT 960 

ATCCTCCTAA GTGGGGCCTC CCCATTTCTT GGAGACACTA AGCAAGAAAC GTTAGCAAAT 1020 

GTATCCGCTG TCAACTACGA ATTTGAGGAT GAATACTTCA GTAATACCAG TGCCCTAGCC 1080 

AAAGATTTCA TAAGAAGACT TCTGGTCAAG GATCCAAAGA AGAGAATGAC AATTCAAGAT 1140 

AGTTTGCAGC ATCCCTGGAT CAAGCCTAAA GATACACAAC AGGCACTTAG TAGAAAAGCA 1200 

TCAGCAGTAA ACATGGAGAA ATTCAAGAAG TTTGCAGCCC GGAAAAAATG GAAACAATCC 1260 

GTTCGCTTGA TATCACTGTG CCAAAGATTA TCCAGGTCAT TCCTGTCCAG AAGTAACATG 1320 

AGTGTTGCCA GAAGCGATGA TACTCTGGAT GAGGAAGACT CCTTTGTGAT GAAAGCCATC 1380 

ATCCATGCCA TCAACGATGA CAATGTCCCA GGCCTGCAGC ACCTTCTGGG CTCATTATCC 1440 

AACTATGATG TTAACCAACC CAACAAGCAC GGGACACCTC CATTACTCAT TGCTGCTGGC 1500 

TGTGGGAATA TTCAAATACT ACAGTTGCTC ATTAAAAGAG GCTCGAGAAT CGATGTCCAG 1560 

GATAAGGGCG GGTCCAATGC OGTCTACTGG GCTGCTOGGC ATGGCCACGT CGATACCTTG 1620 

AAATTTCTCA GTGAGAACAA ATGCCCTTTG GATGTGAAAG ACAAGTCTGG AGAGATGGCC 1680 

CTCCACGTGG CAGCTOGCTA TGGCCATGCT GACGTGGCTC AAGTTACTTG TGCAGCTTCG 1740 

GCTCAAATCC CAATATCCAG GACAAAGGAA GAAGAAACCC CCCTGCACTG TGCTGCTTGG 1800 

CACGGCTATT ACTCTGTGGC CAAAGCCCTT TGTGAAGCCG GCTGTAACGT GAACATCAAG 1860 

AACCGAGAAG GAGAGACGCC CCTCCTGACA GCCTCTGCCA GGGGCTACCA CGACATCGTG 1920 

GAGTGTCTGG CCGAACATGG AGCCGACCTT AATGCTTGCG ACAAGGACGG ACACATTGCC 1980 

CTTCATCTGG CTGTAAGACG GTGTCAGATG GAGGTAATCA AGACTCTCCT CAGCCAAGGG 2040 

TGTTTCGTCG ATTATCAAGA CAGGCACGGC AATACTCCCC TCCATGTGGC ATGTAAAGAT 2100 

GGCAACATGC CTATCGTGGT GGCCCTCTGT GAAGCAAACT GCAATTTGGA CATCTCCAAC 2160 

AAGTATGGGC GAACGCCTCT GCACCTTGCG GCCAACAACG GAATCCTAGA CGTGGTCCGG 2220 

TATCTCTGTC TGATGGGAGC CAGCGTTGAG GCGCTGACCA CGGACGGAAA GACGGCAGAA 2260 

GATCTTGCTA GATCGGAACA GCACGAGCAC GTAGCAGGTC TCCTTGCAAG ACTTCGAAAG 2340 

GATACGCACC GAGGACTCTT CATCGAGCAG CTCOGACCCA CACAGAACCT GCAGCCAAGA 2400 

ATTAAGCTCA AGCTGTTTGG CCACTCGGGA TCCGGGAAAA CCACCCTTGT AGAATCTCTC 2460 

AAGTGTGGGC TGCTGAGGAG CTTTTTCAGA AGGCGTCGGC CCAGACTGTC TTCCACCAAC 2520 
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TCCAGCAGGT 
AACCTGTACC 
GGTCTTACCA 
GCCGATGACC 
GATTTCAGCG 
GCTGCAAATG 
ATCCAGCTGA 
GAACCCATAG 
CACGCTGACA 
ACATCGTTGC 
CTGTTTGTTC 
CTGCAAGAAA 
AAAATCATCT 
CTGCAGCAGT 
CTCAGGCGCA 
ACAGTTCAGG 
TTGCTGTCCG 
GACATCCAGC 
ATGGACATCT 
AAGACAGACA 
GGCGTGCGCA 
GTCCAGGTGA 
CGCCTGTGGG 
GTCAACCACG 
TGCTGCCPGC 
CCAGGGCTCC 
CCCGTCATGA 
CTGACCAACA 
CACGAOGTCT 
CTCACTCGGA 
TGCCTTCTOG 
GGGGCTCCCA 
TACCCTGAGA 
GCCGCAGACC 
CAGGAGGCCT 
GTATCCCGGT 
GGGTGATGTA 
GCCAGGGGGA 
CCGTCTCATT 
CTTTGGCCGC 
CTCCAGTACC 
TTTTTTAACT 
TGTATTTCTT 
GTAGGAGAAA 
GTGCCTTTTG 
AGGAAACGCT 
TGAGTTTTGT 
AGAAGGGAAT 
TTTCCGTTTG 
ATGCTTCATC 
ACCTATGTAA 
TGCATAGCAG 
TAAAAGCAGA 
TAAATTGATA 
GTTATATCAT 
TTTTTACTTA 
TAAACTGTTG 



TCCCACCTTC 
CAGGCTGCGA 
AAGGGATGCT 
AGTCCACCAA 
TGTGGGAGTT 
ATCCCACGTC 
ACCCAGTGAT 

TCATGAATGT 
TGAAAGAGAT 
TGGATGCTGG 
TACGAAGCCA 
CCACGCTGCC 
TTGTGTACGA 
TTGCTCAGCA 
ACGTGCTGCT 
TGGAGACCCC 
GCCTGGTGCC 
GCGCCCGGGA 
ACCTGCACCG 
TCGTGCCCGT 
ACCTGTGCCG 
TGAATGGCTG 
GCCAGGGCAT 
TGCTGGACTC 
TGACCGTGAA 
TCTACCAGCC 
CCATGGGGGG 
ACTCACAGGC 
GGAAACTGAG 
CCATGAACTT 
AGGATTTCCT 
GCACAGTGGG 
TrTTGCTGAA 
ATGCCTCGAG 
GAGGGCAGCC 
GCCCATCCTT 
TGCCACTCCT 
CCGTTGTCTG 
TTGAAAAGCT 
TTGCTTCTTA 
ATCATATTGA 
ATACGCTTTT 
CTCCCATGTA 
CAGAAGAGGG 
ACCTCTCTGT 
TTTGTATTGT 
CCTCCTCCAG 
CTTTTGTTCC 
ATTCCCTCTC 
GAATTCAGAA 
ATGGTTTCCA 
AAATAAAGGA 
GGAGGGAACA 
ATGTATATAT 
ATCTATAAAA 
CTCGTTAAAA 



ACCCCTGGCT 
GAACGTGAGT 
GGAGGTGTTT 
GGCCATCGAC 
CTCTGGAAAT 
AATCCATGTT 
TTTCTGGCTC 
CAAGCTGAAG 
TCCTCGACCG 
TAGGAACAGG 
GGCTTCTGGG 
GATTGTTTCG 
TTCCTGGAGG 
CGTGCAGGAC 
GCTCCACAGC 
CCTGGACCCC 
ACGGGCGCTG 
CGACAGCGAC 
CCTGAGCAGC 
CTCCTGGGCT 
GGAACACCTC 
GTGGATCCAC 
CAAGCTGGCC 
TGAGGTCCAG 
GGTGTGCAGC 
GCATTACCTG 
ACGGGACTTC 
GTACAAGGAA 
CAGCCTCGGC 
TCGCCTGCTG 
AGGCCTCCCT 
CCCCAGCCCC 
CACCCTCATG 
GGCATCCTCT 
CTGCAACAGC 
TCTGGCTTGG 
CCCTTTGGAG 
CCCTCCGGCT 
TGGATGGTCA 
AGTGTACCTC 
CTGATAATTT 
TTGTCCTTTA 
CTGTGTTATA 
TGGAATCCCA 
TGTGTTTGAA 
CCCTTGCTGT 
TTTCTGACAG 
GGTGATTTTA 
AATGTCAATG 
ATCTCAGGTA 
GACCCCTGAC 
CATTTAGATC 
AGCAAGTTTT 
TGTCCTAATT 
ATATATGCAC 
TGTCGTTAAA 
AAAAAAAAAA 



TCTAAGCCCA 
GTGAGGAGCC 
GTGGCCCOGA 
ATCCAGAACG 
CCTGTGTATT 
GTTGTCTTTA 
AGTTTCCTGA 
AACCCACTCC 
GCTGGAGGCG 
TTTGGAAATG 
TCAAAGGACA 
GTCTGTCCTC 
AAGCTCAATO 
CAGCTGAACC 
ACAGGCGAGA 
CGCTGGCTCT 
CACCACTACC 
GTGGAGGAGC 
GGGACCATGG 
GATGAGGAGG 
ACCCCCTTCC 
CAGCAAAGCA 
AACCGTGGGG 
GTCCGTGGCC 
ACCATTGAGA 
AGCCCCCAGC 
TTCCGGGCAC 
AGCTTCAGCA 
ATGGACATCC 
GACCCGCCCG 
GACCTCGTGG 
CTCCACGCCC 
TCCAAACTGA 
GTGTTCAAAA 
GGCACCTCTT 
ACAGGGTCTG 
ATGCTGAGGG 
TGACCTGTTT 
TTGCAGTTTA 
CTCTCAGTGT 
TGCTGGAATT 
AAAAAGAAAA 
CCATTTCCTC 
CTGTATGATT 
ATCATCGGAG 
ATGCTGATCA 
TTTTTCTGTT 
TGATCAGTGT 
TGAACGTCCA 
GAAGGTTGAC 
TCATCATTTG 
CTGGTTTCAT 
CTTCCATGAT 
CTTCTGTCCT 
TATGTATATA 
AAGTTGTTTG 



CAGTCTCAGT 
GCAGCATGAT 
CCCACCACCC 
CTTATTTGAA 
TCTGCTGTTA 
GTCTAGAAGA 
AGTCCCTTGT 
AAGTTGTCCT 
AGTTTGGATA 
ATCTTCACAT 
TGAAGGTACT 
CCATGACTCA 
GACCCAACCA 
CCCTGGCCAG 
TCAACATCAT 
GCACAAACGT 
GGGGCCGCTA 
TGCTGCAGAT 
TGGACGTCCC 
ACGAGGTGAT 
CATGTGGCAT 
CAGAGGGCGA 
CCGAGCTGCT 
TGGAGAOGGA 
ACGTCATGGC 
AGCTGCGGGA 
AGACTCTGAA 
GCATCATGTG 
ATGCATCAGA 
ACCCCCTGGG 
CAAAGTACAA 
TGCTGCGGGA 
GGGAGCTGGG 
TCAACCTGGA 
ACAATTCCAT 
TTTGGACTGC 
TGTTTCTTCC 
CTCTGCCGCT 
AGAGCAGAAC 
TTTGGACTCC 
CCTAACTTTT 
GTGCATATTT 
AGCTTATCTC 
TATAAACAGA 
TCAGCCAGGA 
TCGCCAGAGG 
TTGTTTGGCA 
TGTTGCTCTA 
CATGAAACCT 
ACAGTTGTAQ 
TGGCAGTCCC 
AACTTCCTGT 
TTTAAATTGT 
GAGAAGCATG 
CATATATATT 
TTTTTTTCTT 



GAGCATCAAC 
GTTCGAGOCG 
GCACTGCTCG 
TGGAGTTGGC 
TGACTATTTT 
GCCCTATGAG 
CCCAGTTGAA 
GGTGGCCACC 
TGACAAAGAC 
TTCAAATAAG 
TCGAAATCAT 
CCTGTGTGAG 
GCTGATGTCG 
CGAGGAGGAC 
GCAAAGTGAA 
CCTGGGGAAG 
CACCGTGGAG 
CCTCGATGCC 
AGCCCTGATC 
GGTGTATGGT 
CTTTCACAAG 
CGCGGACATC 
GGTGCTGCTG 
GAAGATCAAG 
CACCACGCTG 
GCACCATGAG 
GGAAACCTCA 
CTTCGGGTGT 
CCTGAACCTC 
GAAGGACTGG 
CACCAATAAC 
ATGGACCACC 
TCGCCGGGAT 
TGGCAATGGC 
TAGCTCTGTT 
AGAACCAAGG 
TGCACCCACA 
ACCTCCCTCC 
AGATCTTTTA 
ATCTCTCATC 
CAATGACATT 
ATCCAAAATG 
TTTTATATTT 
CAATATGTGA 
GCTGTCACCA 
TGCTTCACCC 
AGGAAAGGGG 
GGAAGACATT 
ACACACTGTC 
GGTTACAGAG 
TTATAATTGG 
ACTTGAAGTC 
GATOGAGTTT 
TAATGTTAAT 
AATACTGGTA 
TTTTTATAAA 



Seq ID NO: C73 DNA Sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222.. 1898 



GGCTGCCCGA 
GGCTTTTGTT 
CGGGACCTTG 
AGAGGCGCGG 
GCTGGTGGCT 
GCAAGAGCCG 
GCGACGTGCC 
CCTGCTGCAC 
CCGCGCTCCG 
TCGATGACCA 
CCGGCGCCTT 
AGCTGCGCCT 
GGGCCCGCCT 
ACTACCTGGA 
GAGAGCTGCG 
TGGGCGTGGC 
CGAGAGCTGT 
CCTGCCCTGA 
ACGCCGAGTG 
CATOGGGTGT 



11 

I 

GCGAGCGTTC 
GTCTCCGCCT 
GCTCTGCCCT 
GCGGGTGGCC 
GCTATGTGCG 
GAGCTGCGGC 
CCAGGCGGAG 
CAGCGAGATG 
GGACAGCAGC 
CTTCCAGCAC 
CGGAGAGCTG 
GTACTACOGC 
GCTCGAGCGC 
CTGCCTGGGC 
CCTGCGGGCC 
CAGCGACGTG 
CATC3AAGCTG 
CTATTGCCGA 
GAGGAACCTC 
GGAGAGTGTC 



21 
I 

GGACCTCGCA 
CCTCGGCCGC 
TCGCGGGCGG 
GGGGGCGCCG 
GCCGCAGCGC 
GAGGTCCGCC 
ATCTCGGGTG 
GAGGAGAACC 
CGCGTCCTGC 
CTGCTGAACG 
TACACGCAGA 
GGTGCCAACC 
CTCTTCAAGC 
AAGCAGGCCG 
ACCCGTGCCT 
GTCCGGAAAG 
GTCTACTGTG 
AATGTGCTCA 
CTGGACTCCA 
ATCGGCAGCG 



31 
I 

CCCOGCGCGC 
CGCCGCCTCT 
GAACTGCGCA 
CCGGCCCCGC 
TGGTCGCCTG 
AGATCTACGG 
AGCACCTGCG 
TGGCCAACCG 
AGGCCATGCT 
ACTCGGAGCG 
ACGCGAGGGC 
TGCACCTGGA 
AGCTGCACCC 
AGGCGCTGCG 
TCGTGGCTGC 
TGGCTCAGGT 
CTCACTGCCT 
AGGGCTGCCT 
TGGTGCTCAT 
TGCACACGTG 



41 

I 

CCCGCGCCGC 
GGACCGCGAG 
GGACCCGGCC 
CATGGAGCTC 
OGCCCGCGGG 
AGCCAAGGGC 
GATCTGTCCC 
CAGCCATGCC 
TGCCACCCAG 
GACGCTGCAG 
CTTCCGGGAC 
GGAGACGCTG 
CCAGCTGCTG 
GCCCTTCGGG 
TCGCTCCTTT 
CCCCCTGGGC 
GGGAGTCCCC 
TGCCAACCAG 
CACCGACAAG 
GCTGGCGGAG 



SI 
i 

CGCCGCCGCC 
CCGCGCGCGC 
AGGATCCGAG 
CGGGCCCGAG 
GACCCGGCCA 
TTCAGCCTGA 
CAGGGCTACA 
GAGCTGGAGA 
CTGCGCAGCT 
GCCACCTTCC 
CTGTACTCAG 
GCCGAGTTCT 
CTGCCTGATG 
GAGGCCCCGA 
GTGCAGGGCC 
CCGGAGTGCT 
GGCGCCAGGC 
GCCGACCTGG 
TTCTGGGGTA 
GCCATCAACG 



2S80 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
S160 
5220 
S280 
5340 
S400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5910 



60 

120 

180 

240 

300 

360 

420 

480 

S40 

600 

660 

720 

7B0 

840 

900 

960 

1020 

1080 

1140 

1200 
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CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT G6GCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GGGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCOGG TACCTCCCCG 1SO0 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGAOGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC COCOGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCOGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

15 GTCCCAGCCC CAGGCCTGGC CTOGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG COCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCAGGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GAOGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660 

TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 3692 
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seq id NO: C74 dna Sequence 

Nucleic Acid Accession 8: BC030205.1 

Coding sequence « 45 . . 878 



-~ 1 11 21 31 41 51 

50 | | | 1 | | 

GTGAGCAGCC CCTAACAGGC TGTTACTTCA CTACAACTGA CGATATGATC ATCTTAATTT 60 

ACTTATTTCT CTTGCTATGG GAAGACACTC AAGGATGGGG ATTCAAGGAT GGAATTTTTC 120 

ATAACTCCAT ATGGCTTGAA CGAGCAGCCG GTGTGTACCA CAGAGAAGCA CGGTCTGGCA 180 

AATACAAGCT CACCTACGCA GAAGCTAAGG CGGTGTGTGA ATTTGAAGGC GGCCATCTCG 240 

CAACTTACAA GCAGCTAGAG GCAGCCAGAA AAATTGGATT TCATGTCTGT GCTGCTGGAT 300 

GGATGGCTAA GGGCAGAGTT GGATACCCCA TTGTGAAGCC AGGGCCCAAC TGTGGATTTG 360 

GAAAAACTGG CATTATTGAT TATGGAATCC GTCTCAATAG GAGTGAAAGA TGGGATGCCT 420 

ATTGCTACAA CCCACACGCA AAGGAGTGTG GTGGCGTCTT TACAGATCCA AAGCAAATTT 480 

TTAAATCTCC AGGCTTCCCA AATGAGTACG AAGATAACCA AATCTGCTAC TGGCACATTA S40 

GACTCAAGTA TGGTCAGCGT ATTCACCTGA GTTTTTTAGA TTTTGACCTT GAAGATGACC 600 

CAGGTTGCTT GGCTGATTAT GTTGAAATAT ATGACAGTTA CGATGATGTC CATGGCTTTG 660 

TGGGAAGATA CTGTGGAGAT GAJGCTTCCAG ATGACATCAT CAGTACAGGA AATGTCATGA 720 

CCTTGAAGTT TCTAAGTGAT GCXTCAGTGA CAGCTGGAGG TTTCCAAATC AAATATGTTG 780 

CAATGGATCC TGTATCCAAA TCCAGTCAAG GAAAAAATAC AAGTACTACT TCTACTGGAA 840 

ATAAAAACTT TTTAGCTGGA AGATTTAGCC ACTTATAAAA AAAAAAAAAG GATGATCAAA 900 

ACACACAGTG TTTATGTTGG AATCTTTTGG AACTCCTTTG ATCTCACTGT TATTATTAAC 960 

ATTTATTTAT TATTTTTCTA AATGTGAAAG CAATACATAA TTTAGGGAAA ATTGGAAAAT 1020 

ATAGGAAACT TTAAACGAGA AAATGAAACC TCTCATAATC CCACTGCATA GAAATAACAA 1080 

GCGTTAACAT TTTCATATTT TTTTCTTTCA GTCATTTTTC TATTTGTGGT ATATGTATAT 1140 

ATGTACCTAT ATGTATTTGC ATTTGAAATT TTGGAATCCT GCTCTATGTA CAGTTTTGTA 1200 

TTATACTTTT TAAATCTTGA ACTTTATAAA CATTTTCTGA AATCATTGAT TATTCTACAA 1260 

AAACATGATT TTAAACAGCT GTAAAATATT CTATGATATG AATGTTTTAT GCATTATTTA 1320 

AGCCTGTCTC TATTGTTGGA ATTTCAGGTC ATTTTCATAA ATATTGTTGC AATAAATATC 1380 

CTTGAACACA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1430 
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Seq ID NO: C75 DNA Sequence 

Nucleic Acid Accession ft: NM_0019B2.1 

Coding sequence: 199.. 4227 

1 11 21 31 41 51 

I I I I 1 I 

CTCTCACACA CACACACCCC TCCCCTGCCA TCCCTCCCCG GACTCOGGCT CCGGCTCCGA 60 

TTGCAATTTG CAACCTCCGC TGCCGTCGCC GCAGCAGCCA CCAATTCGCC AGOGGTTCAG 120 

1242 



WO 03/042661 



GTGGCTCTTG CCTCGATGTC CTAGCCTAGG GGCCCCCGGG CCGGACTTGG CTGGGCTCCC 180 

TTCACCCTCT GCGGAGTCAT GAGGGCGAAC GACGCTCTGC AGGTGCTGGG CTTGCTTTTC 240 

AGCCTGGCCC GGGGCTCCGA GGTGGGCAAC TCTCAGGCAG TGTGTCCTGC GACTCTGAAT 300 

GGCCTGAGTG TGACCGGCGA TGCTGAGAAC CAATACCAGA CACTGTACAA GCTCTACGAG 360 

AGGTGTGAGG TGGTGATGGG GAACCTTGAG ATTGTGCTCA OGGGACACAA TGCCGACCTC 420 

TCCTTCCTGC AGTGGATTCG AGAAGTGACA GGCTATGTCC TCGTGGCCAT GAATGAATTC 480 

TCTACTCTAC CATTGCCCAA CCTCCGCGTG GTGCGAGGGA CCCAGGTCTA CGATGGGAAG 540 

TTTGCCATCT TCGTCATGTT GAACTATAAC ACCAACTCCA GCCACGCTCT GCGCCAGCTC 600 

CGCTTGACTC AGCTCACCGA GATTCTGTCA GGG GG TGTTT ATATTGAGAA GAACGATAAG 660 

CTTTGTCACA TGGACACAAT TGACTGGAGG GACATCGTGA GGGACCGAGA TGCTGAGATA 720 

GTGGTGAAGG ACAATGGCAG AAGCTGTCCC CCCTGTCATG AGGTTTGCAA GGGGOQATGC 780 

TGGGGTCCTG GATCAGAAGA CTGCCAGACA TTGACCAAGA CCATCTGTGC TCCTCAGTGT 840 

AATGGTCACT GCTTTGGGCC CAACCCCAAC CAGTGCTGCC ATGATGAGTG TGCCGGGGGC 900 

TGCTCAGGCC CTCAGGACAC AGACTGCTTT GCCTGCCGGC ACTTCAATGA CAGTGGAGCC 960 

TGTGTACCTC GCTGTCCACA GCCTCTTGTC TACAACAAGC TAACTTTCCA GCTGGAACCC 1020 

AATCCCCACA CCAAGTATCA GTATGGAGGA GTTTGTGTAG CCAGCTGTCC CCATAACTTT 1080 

GTGGTGGATC AAACATCCTG TGTCAGGGCC TGTCCTCCTG ACAAGATGGA AGTAGATAAA 1140 

AATGGGCTCA AGATGTGTGA GCCTTGTGGG GGACTATGTC CCAAAGCCTG TGAGGGAACA 1200 

GGCTCTGGGA GCCGCTTCCA GACTGTGGAC TCGAGCAACA TTGATGGATT TGTGAACTGC 1260 

ACCAAGATCC TGGGCAACCT GGACTTTCTG ATCACCGGCC TCAATGGAGA CCCCTGGCAC 1320 

AAGATCCCTG CCCTGGACCC AGAGAAGCTC AATGTCTTCC GGACAGTACG GGAGATCACA 1380 

GGTTACCTGA ACATCCAGTC CTGGCCGCCC CACATGCACA ACTTCAGTGT TTTTTCCAAT 1440 

TTGACAACCA TTGGAGGCAG AAGCCTCTAC AACCGGGGCT TCTCATTGTT GATCATGAAG 1500 

AACTTGAATG TCACATCTCT GGGCTTCCGA TCCCTGAAGG AAATTAGTGC TGGGCGTATC 1560 

TATATAAGTG CCAATAGGCA GCTCTGCTAC CACCACTCTT TGAACTGGAC CAAGGTGCTT 1620 

CGGGGGCCTA CGGAAGAGCG ACTAGACATC AAGCATAATC GGCCGCGCAG AGACTGCGTG 1680 

GCAGAGGGCA AAGTGTGTGA CCCACTGTGC TCCTCTGGGG GATGCTGGGG CCCAGGCCCT 1740 

GGTCAGTGCT TGTCCTGTCG AAATTATAGC CGAGGAGGTG TCTGTGTGAC CCACTGCAAC 1800 

TTTCTGAATG GGGAGCCTCG AGAATTTGCC CATGAGGCCG AATGCTTCTC CTGCCACCCG 1860 

GAATGCCAAC CCATGGGGGG CACTGCCACA TGCAATGGCT OGGGCTCTGA TACTTGTGCT 1920 

CAATGTGCCC ATTTTCGAGA TGGGCCCCAC TGTGTGAGCA GCTGCCCCCA TGGAGTCCTA 1980 

GGTGCCAAGG GCCCAATCTA CAAGTACCCA GATGTTCAGA ATGAATGTCG GCCCTGCCAT 2040 

GAGAACTGCA CCCAGGGGTG TAAAGGACCA GAGCTTCAAG ACTGTTTAGG ACAAACACTG 2100 

GTGCTGATCG GCAAAACCCA TCTGACAATG GCTTTGACAG TGATAGCAGG ATTGGTAGTG 2160 

ATTTTCATGA TGCTGGGCGG CACTTTTCTC TACTGGCGTG GGCGCCGGAT TCAGAATAAA 2220 

AGGGCTATGA GGCGATACTT GGAACGGGGT GAGAGCATAG AGCCTCTGGA CCCCAGTGAG 2280 

AAGGCTAACA AAGTCTTGGC CAGAATCTTC AAAGAGACAG AGCTAAGGAA GCTTAAAGTG 2340 

CTTGGCTCGG GTGTCTTTGG AACTGTGCAC AAAGGAGTGT GGATCCCTGA GGGTGAATCA 2400 

ATCAAGATTC CAGTCTGCAT TAAAGTCATT GAGGACAAGA GTGGAOGGCA GAGTTTTCAA 2460 

GCTGTGACAG ATCATATGCT GGCCATTGGC AGCCTGGACC ATGCCCACAT TGTAAGGCTG 2520 

CTGGGACTAT GCCCAGGGTC ATCTCTGCAG CTTGTCACTC AATATTTGCC TCTGGGTTCT 2580 

CTGCTGGATC ATGTGAGACA ACACCGGGGG GCACTGGGGC CACAGCTGCT GCTCAACTGG 2640 

GGAGTACAAA TTGCCAAGGG AATGTACTAC CTTGAGGAAC ATGGTATGGT GCATAGAAAC 2700 

CTGGCTGCCC GAAACGTGCT ACTCAAGTCA CCCAGTCAGG TTCAGGTGGC AGATTTTGGT 2760 

GTGGCTGACC TGCTGCCTCC TGATGATAAG CAGCTGCTAT ACAGTGAGGC CAAGACTCCA 2820 

ATTAAGTGGA TGGCCCTTGA GAGTATCCAC TTTGGGAAAT ACACACACCA GAGTGATGTC 2880 

TGGAGCTATG GTGTGACAGT TTGGGAGTTG ATGACCTTCG GGGCAGAGCC CTATGCAGGG 2940 

CTACGATTGG CTGAAGTACC AGACCTGCTA GAGAAGGGGG AGCGGTTGGC ACAGCCCCAG 3000 

ATCTGGACAA TTGATGTCTA CATGGTGATG GTCAAGTGTT GGATGATTGA TGAGAACATT 3060 

CGCCCAACCT TTAAAGAACT AGCCAATGAG TTCACCAGGA TGGCCCGAGA CCCACCACGG 3120 

TATCTGGTCA TAAAGAGAGA GAGTGGGCCT GGAATAGCCC CTGGGCCAGA GCCCCATGGT 3180 

CTGACAAACA AGAAGCTAGA GGAAGTAGAG CTGGAGCCAG AACTAGACCT AGACCTAGAC 3240 

TTGGAAGCAG AGGAGGACAA CCTGGCAACC ACCACACTGG GCTCCGCCCT CAGCCTACCA 3300 

GTTGGAACAC TTAATCGGCC ACGTGGGAGC CAGAGCCTTT TAAGTCCATC ATCTGGATAC 3360 

ATGCCCATGA ACCAGGGTAA TCTTGGGGGG TCTTGCCAGG AGTCTGCAGT TTCTGGGAGC 3420 

AGTGAACGGT GCCCCCGTCC AGTCTCTCTA CACCCAATGC CACGGGGATG CCTGGCATCA 3480 

GAGTCATCAG AGGGGCATGT AACAGGCTCT GAGGCTGAGC TCCAGGAGAA AGTGTCAATG 3540 

TGTAGAAGCC GGAGCAGGAG CCGGAGCCCA CGGCCACGCG GAGATAGCGC CTACCATTCC 3600 

CAGCGCCACA GTCTGCTGAC TCCTGTTACC CCACTCTCCC CACCCGGGTT AGAGGAAGAG 3660 

GATGTCAACG GTTATGTCAT GCCAGATACA CACCTCAAAG GTACTCCCTC CTCCCGGGAA 3720 

GG CACCCTTT CTTCAGTGGG TCTCAGTTCT GTCCTGGGTA CTGAAGAAGA AGATGAAGAT 3780 

GAGGAGTATG AATACATGAA CCGGAGGAGA AGGCACAGTC CACCTCATCC CCCTAGGCCA 3840 

AGTTCCCTTG AGGAGCTGGG TTATGAGTAC ATGGATGTGG GGTCAGACCT CAGTGCCTCT 3900 

CTGGGCAGCA CACAGAGTTG CCCACTCCAC CCTGTACCCA TCATGCCCAC TGCAGGCACA 3960 

ACTCCAGATG AAGACTATGA ATATATGAAT CGGCAAOGAG ATGGAGGTGG TCCTGGGGGT 4020 

GATTATGCAG CCATGGGGGC CTGCCCAGCA TCTGAGCAAG GGTATGAAGA GATGAGAGCT 4080 

TTTCAGGGGC CTGGACATCA GGCCCCCCAT GTCCATTATG CCCGCCTAAA AACTCTACGT 4140 

AGCTTAGAGG CTACAGACTC TGCCTTTGAT AACCCTGATT ACTGGCATAG CAGGCTTTTC 4200 

CCCAAGGCTA ATGCCCAGAG AACGTAACTC CTGCTCCCTG TGGCACTCAG GGAGCATTTA 4260 

ATGGCAGCTA GTGCCTTTAG AGGGTACCGT CTTCTCCCTA TTCCCTCTCT CTCCCAGGTC 4320 

CCAGCCCCTT TTCCCCAGTC CCAGACAATT CCATTCAATC TTTGGAGGCT TTTAAACATT 4380 

TTGACACAAA ATTCTTATGG TATGTAGCCA GCTGTGCACT TTCTTCTCTT TCCCAACCCC 4440 

AGGAAAGGTT TTCCTTATTT T OT G T QCTTT CCCAGTCCCA TTCCTCAGCT TCTTCACAGG 4500 

CACTCCTGGA GATATGAAGG ATTACTCTCC ATATCCCTTC CTCTCAGGCT CTTGACTACT 4560 

TGGAACTAGG CTCTTATGTG TGCCTTT GT T TCCCATCAGA CTGTCAAGAA GAGGAAAGGG 4620 

AGGAAACCTA GCAGAGGAAA GTGTAATTTT GGTTTATGAC TCTTAACCCC CTAGAAAGAC 4680 

AGAAGCTTAA AATCTGTGAA GAAAGAGGTT AGGAGTAGAT ATTGATTACT ATCATAATTC 4740 

AGCACTTAAC TATGAGCCAG GCATCATACT AAACTTCACC TACATTATCT CACTTAGTCC 4800 

TTTATCATCC TTAAAACAAT TCTGTGACAT ACATATTATC TCATTTTACA CAAAGGGAAG 4860 

TCGGGCATGG TGGCTCATGC CTGTAATCTC AGCACTTTGG GAGGCTGAGG CAGAAGGATT 4920 

ACCTGAGGCA AGGAGTTTGA GACCAGCTTA GCCAACATAG TAAGACCOCC ATCTC 4975 



Seq ID NO: C76 DNA Sequence 



WO 03/042661 



PCT/US02/36810 



15 



Nucleic Acid Accession 8: NM_001216.1 
Coding sequence: 43.. 1422 

- 1 11 21 31 41 SI 

5 I I | } | I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 ' 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 

1U AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC 7ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTQC GCGGGCOGCT TCCAGTCCCC GGTGGATATC S40 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACOGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

1\J GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 12 00 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 

ACCAGCGTCG GGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

JU TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA AGTGTCCTGT CCTGCTCATT 1500 

ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 1552 

Seq ID NO j C77 DNA Sequence 
Nucleic Acid Accession #: NM_004207.1 
30 Coding sequence: 63.. 1460 
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1 11 21 31 41 51 

I I I I I I 

GGCGAGAGGC GGGCTGAGGC GGCCCAGCGG CGGCAGGTGA GGCGGAACCA ACCCTCCTGG 60 

CCATGGGAGG GGCCGTGGTG GACGAGGGCC CCACAGGCGT CAAGGCCCCT GACGGCGGCT 120 

GGGGCTGGGC CGTGCTCTTC GGCTGTTTCG TCATCACTGG CTTCTCCTAC GCCTTCCCCA 180 

AGGCCGTCAG TGTCTTCTTC AAGGAGCTCA TACAGGAGTT TGGGATCGGC TACAGCGACA 240 

CAGCCTGGAT CTCCTCCATC CTGCTGGCCA TGCTCTACGG GACAGGTCCG CTCTGCAGTG 300 

TGTGCGTGAA CCGCTTTGGC TGCCGGCCCG TCAT6CTTGT GGGGGGTCTC TTTGCGTCGC 360 

TGGGCATGGT GGCTGCGTCC TTTTGCCGGA GCATCATCCA GGTCTACCTC ACCACTGGGG 420 

TCATCACGGG GTTGGGTTTG GCACTCAACT TCCAGCCCTC GCTCATCATG CTGAACCGCT 480 

ACTTCAGCAA GCGGCGCCCC ATGGCCAACG GGCTGGCGGC AGCAGGTAGC CCTGTCTTCC 540 

TGTGTGCCCT GAGCCCGCTG GGGCAGCTGC TGCAGGACCG CTACGGCTGG CGGGGCGGCT 600 

TCCTCATCCT GGGOGGCCTG CTGCTCAACT GCTGCGTGTG TGCCGCACTC ATGAGGCCCC 660 

TGGTGGTCAC GGCCCAGCCG GGCTCGGGGC CGCCGCGACC CTCCCGGCGC CTGCTAGACC 720 

TGAGCGTCTT CCGGGACCGC GGCTTTGTGC TTTACGCCGT GGCCGCCTCG GTCATGGTGC 780 

TGGGGCTCTT CGTCOCGCCC GTGTTCGTGG TGAGCTACGC CAAGGACCTG GGCGTGCCCG 840 

ACACCAAGGC CGCCTTCCTG CTCACCATCC TGGGCTTCAT TGACATCTTC GCGOGGCCGG 900 

CCGCGGGCTT CGTGGCGGGG CTTGGGAAGG TGCGGCCCTA CTCCGTCTAC CTCTTCAGCT 960 

TCTCCATGTT CTTCAACGGC CTCGCGGACC TGGCGGGCTC TACGGCGGGC GACTACGGCG 1020 

GCCTCGTGGT CTTCTGCATC TTCTTTGGCA TCTCCTAOGG CATGGTGGGG GCCCTGCAGT 1080 

TCGAGGTGCT CATGGCCATC GTGGGCACCC ACAAGTTCTC CAGTGCCATT GGCCTGGTGC 1140 

TGCTGATGGA GGCGGTGGCC GTGCTCGTCG GGCCCCCTTC GGGAGGCAAA CTCCTGGATG 1200 

CGACCCACGT CTACATGTAC GTGTTCATCC TGGCGGGGGC CGAGGTGCTC ACCTCCTCCC 1260 

TGATTTTGCT GCTGGGCAAC TTCTTCTGCA TTAGGAAGAA GCCCAAAGAG CCACAGCCTG 1320 

AGGTGGCGGC CGCGGAGGAG GAGAAGCTCC ACAAGCCTCC TGCAGACTCG GGGGTGGACT 1380 

TGCGGGAGGT GGAGCATTTC CTGAAGGCTG AGCCTGAGAA AAACGGGGAG GTGGTTCACA 1440 

CCCCGGAAAC AAGTGTCTGA GTGGCTGGGC GGGGCCGGCA GGCACAGGGA GGAGGTACAG 1500 

AAGCCGGCAA CGCTTGCTAT TTATTTTACA AACTGGACTG GCTCAGGCAO GGCCAOGGCT 1560 

GGGCTCCAGC TGCCGGCCCA GCGGATCGTC GCCCGATCAG TGTTTTGAGG GGGAAGGTGG 1620 

CGGGGTGGGA ACCGTGTCAT TCCAGAGTGG ATCT G CG G TG AAGCCAAGCC GCAAGGTTAC 1680 

AAGGCATCCT CACCAGGGGC CCCGCCTGCT GCTCCCAGGT GGCCTGCGGC CACTGCTATG 1740 

CTCAAGGACC TGGAAACCCA TGCTTCGAGA CAACGTGACT TTAATGGGAG GGTGGGTGGG 1800 

CCGCAGACAG GCTGGCAGGG CAGGTGCTGC GTGGGGCCCT CTCCAGCCCG TCCTACCCTG 1860 

GGCTCACATG GGGCCTGTGC CCACCCCTCT TGAGTGTCTT GGGGACAGCT CTTTCCACCC 1920 

CTGGAAGATG GAAATAAACC TGCGTGTGGG TGGAGTGTTC TCGTGCCGAA TTCAAAAAGC 1980 

TT 1982 



Seq ID NO: C78 DNA Sequence 

Nucleic Acid Accession #: NMJJ00358.1 

Coding sequence: 48.. 2099 



0 « 1 11 21 31 41 51 

GCTTGCCCGT CGGTCGCTAG CTCGCTCGGT GCGCGTCGTC COGCTCCATG GCGCTCTTCG 60 

TGCGGCTGCT GGCTCTCGCC CTGGCTCTGG CCCTGGGCCC CGCCGCGACC CTGGCGGGTC 120 

CCGCCAAGTC GCCCTACCAG CTGGTGCTGC AGCACAGCAG GCTCCGGGGC CGCCAGCACG 180 

GCCCCAACGT GTGTGCTGTG CAGAAGGTTA TTGGCACTAA TAGGAAGTAC TTCACCAACT 240 

1244 



WO 03/042661 



PCT/US02/36810 



GCAAGCAGTG GTACCAAAGG AAAATCTGTG GCAAATCAAC AGTCATCAGC TACGAGTGCT 300 

GTCCTGGATA TGAAAAGC5TC CCTGGGGAGA AGGGCTGTCC AGCAGCCCTA CCACTCTCAA 360 

ACCTTTACGA GACCCTGGGA GTCGTTGGAT CCACCACCAC TCAGCTGTAC ACGGACCGCA 420 

r CGGAGAAGCT GAGGCCTGAG ATGGAGGGGC CCGGCAQCTT CACCATCTTC GCCCCTAGCA 480 

ACGAGGCCTG GGCCTCCTTG CCAGCTGAAG TGCTGGACTC CCTGGTCAGC AATGTCAACA 540 

TTGAGCTGCT CAATGCCCTC CGCTACCATA TGGTGGGCAG GCGAGTCCTG ACTGATGAGC 600 

TGAAACACGG CATGACCCTC ACCTCTATGT ACCAGAATTC CAACATCCAG ATCCACCACT 660 

ATCCTAATGG GATTGTAACT GTGAACTGTG CCCGGCTCCT GAAAGCCGAC CACCATGCAA 720 

1 O CCAACGGGGT GGTGCACCTC ATCGATAAGG TCATCTCCAC CATCACCAAC AACATCCAGC 780 

IU AGATCATTGA GATCGAGGAC ACCTTTGAGA CCCTTCGGGC TGCTGTX3GCT GCATCAGGGC 840 

TCAACACGAT GCTTGAAGGT AACGGCCAGT ACACGCTTTT GGCCCCGACC AATGAGGCCT 900 

TCGAGAAGAT CCCTAGTGAG ACTTTGAACC GTATCCTGGG CGACCCAGAA GCCCTGAGAG 960 

ACCTGCTGAA CAACCACATC TTGAAGTCAG CTATGTGTGC TGAAGCCATC GTTGCGGGGC 1020 

TGTCTGTAGA GACCCTGGAG GGCACGACAC TGGAGGTGGG CTGCAGOGGG GACATGCTCA 1080 

ID CTATCAACGG GAAGGCGATC ATCTCCAATA AAGACATCCT AGCCACCAAC GGGGTGATCC 1140 

ACTACATTGA TGAGCTACTC ATCCCAGACT CAGCCAAGAC ACTATTTGAA TTGGCTGCAG 1200 

AGTCTGATGT GTCCACAGCC ATTGACCTTT TCAGACAAGC CGGCCTCGGC AATCATCTCT 1260 

CTGGAAGTGA GCGGTTGACC CTCCTGGCTC CCCTGAATTC TGTATTCAAA GATGGAACCC 1320 

CTCCAATTGA TGCCCATACA AGGAATTTGC TTCGGAACCA CATAATTAAA GACCAGCTGG 1380 

CCTCTAAGTA TCTGTACCAT GGACAGACCC TGGAAACTCT GGGCGGCAAA AAACTGAGAG 1440 

TTTTTGTTTA TCGTAATAGC CTCTGCATTG AGAACAGCTG CATCGCGGCC CACGACAAGA 1500 

GGGGGAGGTA CGGGACCCTG TTCACGATGG ACCGGGTGCT GACCCCCCCA ATGGGGACTG 1560 

TCATGGATGT CCTGAAGGGA GACAATCGCT TTAGCATGCT GGTAGCTGCC ATCCAGTCTG 1620 

- - CAGGACTGAC GGAGACCCTC AACCGGGAAG GAGTCTACAC AGTCTTTGCT CCCACAAATG 1680 

ZJ AAGCCTTCCG AGCCCTGCCA CCAAGAGAAC GGAGCAGACT CTTGGGAGAT GCCAAGGAAC 1740 

TTGCCAACAT CCTGAAATAC CACATTGGTG ATGAAATCCT GGTTAGCGGA GGCATCGGGG 1800 

CCCTGGTGCG GCTAAAGTCT CTCCAAGGTG ACAAGCTGGA AGTCAGCTTG AAAAACAATG 1860 

TGGTGAGTGT CAACAAGGAG CCTGTTGCCG AGCCTGACAT CATGGCCACA AATGGCGTGG 1920 

TCCATGTCAT CACCAATGTT CTGCAGCCTC CAGCCAACAG ACCTCAGGAA AGAGGGGATG 1980 

AACTTGCAGA CTCTGCGCTT GAGATCTTCA AACAAGCATC AGCGTTTTCC AGGGCTTCCC 2040 

AGAGGTCTGT GCGACTAGCC CCTGTCTATC AAAAGTTATT AGAGAGGATG AAGCATTAGC 2100 

TTGAAGCACT ACAGGAGGAA TGCACCACGG CAGCTCTCCG CCAATTTCTC TCAGATTTCC 2160 

ACAGAGACTG TTTGAATGTT TTCAAAACCA AGTATCACAC TTTAATGTAC ATGGGCCGCA 2220 

CCATAATGAG ATGTGAGCCT TGTGCATGTG GGGGAGGAGG GAGAGAGATG TACTTTTTAA 2280 

ATCATGTTCC CCCTAAACAT GGCTGTTAAC CCACTGCATG CAGAAACTTG GATGTCACTG 2340 

CCTGACATTC ACTTCCAGAG AGGACCTATC CCAAATGTGG AATTGACTGC CTATGCCAAG 2400 

TCCCTGGAAA AGGAGCTTCA GTATTGTGGG GCTCATAAAA CATGAATCAA GCAATCCAGC 2460 

CTCATGGGAA GTCCTGGCAC AGTTTTTGTA AAGCCCTTGC ACAGCTGGAG AAATGGCATC 2520 

ask ATTATAAGCT ATGAGTTGAA ATGTTCTGTC AAATGTGTCT CACATCTACA CGTGGCTTGG 2S80 
W AGGCTTTTAT GGGGCCCTGT CCAGGTAGAA AAGAAATGGT ATGTAGAGCT TAGATTTCCC 



35 



2640 



TATTGTGACA GAGCCATGGT GTGTTTGTAA TAATAAAACC AAAGAAACAT A 2691 

Seq ID KO; C79 DNA Sequence 
4 J Nucleic Acid Accession #: NM 006536.2 
Coding sequence: 109.. 2940 *~ 

} il 21 31 41 51 

CA I i I II | 

JU ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

rr ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

OJ ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

/a TTCCTACTGA ATGATAACTT AACAGCTGGC TAOGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

OU GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

rc AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

OJ CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

t\J CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

qc CTCGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

fD TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

ofl GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

OU TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 
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GTTATAAAAA 
TATAGCTTGA 
CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 
CATACTTTAA 
ATAAATATCC 
CATACTAACA 
ATACAGATAA 
CCTTACACTT 
GCAAAGGGAA 
AATAGCCCCA 
TCATTTAGTT 
TTTACATGAA 
CTTGCTATTT 
TTTCACTGTA 
TTTATGACAA 
TTTCTAAGTT 
TACCTAGGAA 



ATGATGGAAT 
AAGTGCATGT 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 
GCAGGAAAAA 
AAAGTGTCTT 
AAGTCAAATT 
GATTTTTACA 
TGGCTATGAA 
GGGTAAAGTC 
AGCAGAGAAA 
ACTTTGATTA 
GATCATGCTA 
TGTTATATAT 
AGAGGTAACC 
AGGTCTATTG 
TATTGCCTTG 
A 



TTACTCGAGG 
CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 
GAGAQCAGAC 
CCTTCTTAGA 
AACATCAAAA 
TGGTAGATCA 
CAAATAATAA 
GGACCAGTGT 
AGGAGGGTAG 
ATTTTTCTTT 
TATTTTATAT 
ATTTCAGATG 
TTTAACAATA 
AATTTATTTG 
GGTTATTATG 



TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGGCTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 
AAGAAAGAGA 
TATAAGACCC 
CTGTATTAAA 
ACAATTCTTT 
AAATTATTCT 
CAAGGAAAGT 
GTCTGCATTA 
TCTCCTTATC 
ATGTAGCCCC 
ACATCTCCCT 
TGGGTATTAC 
TNTGTAAGTT 
GAATGATAGT 



CCTTTGCTGC 
GCACCCCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 
ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCOJ 



AAATGGTAGA 
CCACTCTATT 
TCAGATGAAT 
TAGCOGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 
ATT ATT AT AA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGOCT 



Seq ID NO: C80 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..1413 



i 

ATGAAGTTTC 
AGCTCTACAA 
TATGGCCTTG 
AAGGAAAAAA 
ACATCTACCC 
AGGGAAATGC 
TACACACCTG 
TGGAGTAATG 
GTGGTTTTTG 
CTAGCCCATG 
GAATTCTGGA 
GGCCATTCCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAGACA 
ATTTCTTCCT 
AGAAATCAAG 
GAGCCAAATT 
GATGCAGCTG 
TGGAGGTATG 
AACTTCCAAG 
TATTTCTTCC 
ACACTGAAAA 



11 

I 

TTCTAATACT 
GCCTGGAAAA 
AGATAAACAA 
TCCAAGAAAT 
TGGAGATGAT 
CAGGGGGGCC 
ACATGAACCG 
TTACCCCCTT 
CCCGTG GAGC 
CTTTTGGACC 
CTACACATTC 
TAGGTCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGAG 
GGTTCTTCTG 
TATGGCCAAC 
TTTTTCTTTT 
ATCCCAAGAG 
TTTTTAACCC 
ATGAAAGGAG 
GAATCGGGCC 
AAGGATCTAA 
GCAATAGCTG 



21 
I 

GCTCCTGCAG 
AAATAATGTG 
ACTTCCAGTG 
GCAGCACTTC 
GCACGCACCT 
CGTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTCGCCTC 
GAACCAACGC 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAAGATGAC 
CATACATTCT 
ACGTXTTTAT 
ACAGATGATG 
TAAAATTGAT 
CCAATTTGAA 
GTTT G GTTGT 



31 
I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
CGATGTGGAG 
AAACATTATA 
GACTACGCAA 
AAGATTAACA 
TTCCATGCTT 
ATTGGAGGGG 
AACTTGTTCC 
GATCCAAAGG 
TCTGCTGATG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
GGCATTGAAG 
AAATACTGGT 
TTTGGTTTTC 
AGGACCTACT 
GACCCTGGTT 
GCAGTCTTCT 
TATGACTTCC 
TGA 



41 
I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACCGG 
TCCCCGATGT 
TCACCTACAG 
TCCGGAAAGC 
CAGGCATGGC 
TTGATGGCAA 
ATGCACATTT 
TCACTGCTGT 
CCGTAATGTT 
ACATACGTGG 
CTGACAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAATTAGCAA 
CTAACTTTGT 
TCTTTGTAGA 
ATCCCAAACT 
ACTCTAAAAA 
TACTCCAACG 



51 
I 

TCCCCTGAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 
TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
CGATGAGGAC 
TCACGAGATT 
CCCCACCTAC 
CATTCAGTCC 
AGAACCAGCT 
GATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 
TAACCAGTAT 
GATTACCAAG 
CAAATACTAC 
TATCACCAAA 



Seq ID NO: C81 DNA Sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 1..1413 



ATGAAGTTTC 
AGCTCTACAA 
TATGGCCTTG 
AAGGAAAAAA 
ACATCTACCC 
AGGGAAATGC 
TACACACCTG 
TGGAGTAATG 
GTGG TTTTT G 
CTAGCCCATG 
GAATTCTGGA 
GGCCATTCCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAGACA 
ATTTCTTCCT 
AGAAATCAAG 
GAGCCAAATT 



11 

I 

TTCTAATACT 
GCCTGGAAAA 
AGATAAACAA 
TCCAAGAAAT 
TGGAGATGAT 
CAGGGGGGCC 
ACATGAACCG 
TTACCCCCTT 
CCCGTGGAGC 
CTTTTGGACC 
CTACACATTC 
TAGGTCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGAG 
GGTTCTTCTG 
TATGGCCAAC 
TTTTT CT T TT 
ATCCCAAGAG 



21 

I 

GCTCCTGCAG 
AAATAATGTG 
ACTTCCAGTG 
GCAGCACTTC 
GCACGCACCT 
CGTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTCGCCTC 
GAACCAACGC 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAAGATGAC 
CATACATTCT 



31 
I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
CGATGTGGAG 
AAACATTATA 
GACTACGCAA 
AAGATTAACA 
TTCCATGCTT 
ATTGGAGGGG 
AACTTGTTCC 
GATCCAAAGG 
TCTGCTGATG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
GGCATTGAAG 
AAATACTGGT 
TTTGGTTTTC 



41 
I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACCGG 
TCCCCGATGT 
TCACCTACAG 
TCCGGAAAGC 
CAGGCATGGC 
TTGATGGCAA 
ATGCACATTT 
TCACTGCTGT 
CCGTAATGTT 
ACATACGTGG 
CTGACAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAATTAGCAA 
CTAACTTTGT 



51 
I 

TCCCCTGAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 
TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
CGATGAGGAC 
TCACGCCATT 
CCCCACCTAC 
CATTCAGTCC 
AGAACCAGCT 
GATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3671 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1413 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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GATGCAGCTG TTTTTAACCC ACOTTTTTAT AGQACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG OACCCT GOT T ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGOGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATGACCAAA 1380 

J ACACTGAAAA GCAATAGCTG G T TT GG TTGT TGA 1413 
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Seq ID NO: C82 DMA Sequence 

Nucleic Acid Accession 8: NM_0069S2.l 

Coding sequence: 11.. 793 



1 11 21 31 41 SI 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT G0GGAGT6CA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

2U TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCOGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGOGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 

AATTGAATAT TAAGAA 796 

Seq ID NO: C83 DNA Sequence 
Nucleic Acid Accession #: NM_001793.2 
3U Coding sequence: 71.. 2560 

1 11 21 31 41 51 

I I I I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGGG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACOGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACAOGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA COGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA AIGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTOGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CAGGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTOCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCAOCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TOGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCOGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCOGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGOGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACT T GGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 
TTTTTTTAAT GCTATCTTCA AAAOGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 
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10 



TGGATCTCTG CGTTTTTATA CTQAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

TAAAGAAACT TTTCCCAGAA AAAAA 3205 

Seq ID NO: C84 UNA Sequence 
Nucleic Acid Accession fh NM_005629.1 
Coding sequence: 639.. 2546 

} 11 - 21 31 41 SI 

I I I I I I 

TAGTCGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAOAGCG GCT6CA6CCG 60 

CCGCCGCCGG GAAGGAGAGG GOGAGGCGCG CCCGAGCCGC CGCCGCCGCC GCCACCGCCG 120 

CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCCGGC 180 

- _ CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 

10 CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTGGGGTCG GGCGAGGAGC CGCCGCAGCC 300 

GCCGCCGCCC GAGCGGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 360 

GCCG GGCC CC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 

CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC GCCCGCGCCC CGCACCCCGC 480 

CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 540 

ZU CCTCGGGGCC CTCCCCGGTG COGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 

GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGOGCCGAGA 660 

ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 720 

ACGGGGCCCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 



80 



rye CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 840 

ZD TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 

GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 

TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 102 

CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 108*, 

q/\ ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCAOGCTGC 1140 

J\) CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 1200 

35 



AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 

CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 

CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 

TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 

CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 

GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 1560 

ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 

GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 

AC . ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 

W CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 

CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 

TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 

GCCTCCTCGA CCTCCTCCCG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 

CCCTCTGTTG TGCCCTCTGC TTTGTCATOG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 

ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 

CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 

CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 

cr\ ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 

JVJ CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 

CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 

CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 

TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 

GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 

3 J CACTTTTGGG GTCTG CCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 

CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 

f-r>. CTCTGCAGCA CACCCGTGGG TGACCOCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 

OU TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 

CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GGGTGAGTAC GGAGAGTATA 3180 

£c TATAGATCTC T ATCT CTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 

CO GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 

TTTCT AAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 

GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 

„ GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCOCAC 3S40 

/U GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCOGTC CTGGGTGTCT 3600 

GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 

GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 

CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

/J GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 

AAAACATGTC ATTTTCC 3917 



Seq ID NO: CSS DNA Sequence 

Nucleic Acid Accession 8: NM_006516.1 

Coding sequence: 180.. 1658 

1 11 21 31 41 51 

I I I I I I 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 
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GTCAGAGTCG 
CGCACGCCCG 
TGGAGCCCAG 
TTGGCTCCCT 
AGGAGTTCTA 
TCACCACGCT 
TCTCTGTGGG 
TGCTGGCCTT 
TGCTGATCCT 
CCATGTATGT 
AGCTGGGCAT 
GCAACAAGGA 
GCATCGTGCT 
AGAACCGGGC 
TGCAGGAGAT 
AGCTGTTCCG 
CCCAGCAGCT 
CGGGGGTGCA 
CTGTCGTGTC 
TCGCTGGCAT 
TACCCTGGAT 
TGGGTCCTGG 
CAGCTGCCAT 
GCTTCCAGTA 
TGGTTCTGTT 
ATGAGATCGC 
AGCTGTTCCA 
GCCTGCTCCC 
AACCTGACAG 
CCAGAAGAAT 
AAATCTATTC 
ATATCAGCCT 
GAGGGTGGAG 
CTGGACCTAT 
GAGGTGGCTA 
CATTAGGATT 
CCTGAGACCA 
GCCGGGTTCT 
GGGAGCCTGC 
TGCAAGATAT 
ATATCTGGAC 
TATAAATGGC 
TTTGGATGGG 
GACTCAGGAT 
TTTGATCCCT 
ATCACATATT 
AGGCTTGAAA 



CAGTGGGAGT 
TCGCCACCCG 
CAGCAAGAAG 
GCAGTTTGGC 
CAACCAGACA 
CTGGTCCCTC 
CCTTTTCGTT 

GGGCGGCTTC 
GGGTGAAGTG 
CGTCGTCGGC 
CCTGTGGCCC 
GCCCTTCTGC 
CAAGAGTGTG 
GAAGGAAGAG 
CTCCCCCGCC 
GTCTGGCATC 
GCAGCCTGTG 
GCTGTTTGTG 
GGCGGGTTGT 
GTCCTATCTG 
CCCCATCCCA 
TGCCGTTGCA 
TGTGGAGCAA 
CTTCATCTTC 
TTCCGGCTTC 
TCCCCTGGGG 
AGCAGCCCTA 
ATGTCAGCCG 
ATTCAGGACT 
AGACAAGCAA 
GAGTCTCCTG 
ACTAAGCCCT 
GTCCTAAGGA 
TGGCCACCCG 
TGCCCCTTCC 
GTTGGGAGCA 
AGTCTCCTTT 
AAACTCACTG 
TTATATATAT 
AAGCCAACTT 
TGGTTTTTAG 
AGTGAGACAG 
CCAGTCCCTT 
GTTACCCAGA 
TGATAGTTGG 
TCGCATTATT 



CCCCGGACCG 
CGTACCOGGC 
CTGACGGGTC 
TACAACACTG 
TGGGTCCACC 
TCAGTGGCCA 
AACCGCTTTG 
GTGCTCATGG 
ATCATCGGTG 
TCACCCACAG 
ATCCTCATCG 
CTGCTGCTGA 
CCCGAGAGTC 
CTAAAGAAGC 
AGTCGGCAGA 
TACCGCCAGC 
AAOGCTGTCT 
TATGCCACCA 
GTGGAGCGAG 
GCCATACTCA 
AGCATCGTGG 
TGGTTCATCG 
GGCTTCTCCA 
CTGTGTGGTC 
ACCTACTTCA 
CGGCAGGGGG 
GCTGATTCCC 
AGGATCTCTC 
AGCCGGGCCT 
TAACGGCTCC 
CAGGTTTTAT 
TGCCCACATC 
GTCGAGACAC 
CACACTAATC 
TTCTGCTGGC 
CATCTCTTCC 
CTGGAGTGCA 
GCACTGAGGG 
CTCAAGAAGA 
TTTTGGTTGT 
GTAAATACAC 
AAACATGGTT 
AAGTAAGTGG 
ACACGTACCT 
GAATATATAC 
TGTTCAAAAA 
TTQAATGTGA 



GAGCAOGAGC 

GCCTCATGCT 
GAGTCATCAA 
GCTATGGGGA 
TCTTTTCTGT 
GCCGGCGGAA 
GCTTCTCGAA 
TGTACTGCGG 
CCTTTCGTGG 
CCCAGGTGTT 
GCATCATCTT 
CCCGCTTCCT 
TGCGCGGGAC 
TGATGCGGGA 
CCATCCTCAT 
TCTATTACTC 

CAGGCCGGCG 
TGACCATCGC 
CCATCTTTGG 
TGGCTGAACT 
ACTGGACCTC 
CCTAOGTCTT 
AAGTTCCTGA 
GAGCCAGCCA 
AAGTGTGAGT 
AGGAGCACAG 
GGGGCTCCTT 
AGGATTTTAA 
AATTTTTTTA 
CCAGGCTTCA 
TTGCCTTCTT 
GAACTATGAA 
CTGGATCTCC 
TACCCAACCA 
GGGAGGAGAG 
CCACACTATT 
CATGGAGACT 
CAATATTAAA 
CACCTCACTC 
TTGAAATGCT 
GGTTGCAACC 
CTCATCAGTG 
ATTCTTTATC 
AACACTAGTT 
AGGGAA 



CTGAGCGGGA 
CCACCAGCGC 
GGCTGTGGGA 
TGCCCCCCAG 
GAGCATCCTG 
TGGGGGCATG 
TTCAATGCTG 
ACTGGGCAAG 
CCTGACCACA 
GGCCCTGGGC 
CGGCCTGGAC 
CATCCCGGCC 
GCTCATCAAC 
AGCTGACGTG 
GAAGAAGGTC 
CGCTGTGGTG 
CACGAGCATC 
TATCGTCAAC 
GACCCTGCAC 
GCTAGCACTG 
CTTTGTGGCC 
CTTCAGCCAG 
AAATTTCATT 
CATCATCTTC 
GACTAAAGGC 
AAGTGATAAG 
CGCCCCAGAT 
GCAGCTGGAT 
TCTCCAGCCA 
CAAAAGCAAG 
TTACTGATTT 
CCCTGAATGG 
CACCCAGCTA 
CTACAAAGCT 
CCACTCTAGG 
CTCAAATTAA 
GGGAAGGGCC 
ACCATGAGAA 
CCTGCCCTGT 
TACAGACACT 
CTGTTACTTA 
TGTGGATTGA 
ACTGCAACGG 
TCCTCTTGCT 
TTGACATTCA 
TTGTGCCAGC 



GAGCGCCGCT 

AGCGCTGCCA 

GGAGCAGTGC 

AAGGTGATCG 

CCCACCACGC 

ATTGGCTCCT 

ATGATGAACC 

TCCTTTGAGA 

GGCTTCGTGC 

ACCCTGCACC 

TCCATCATGG 

CTGCTGCAGT 

CGCAACGAGG 

ACCCATGACC 

ACCATCCTGG 

CTGCAGCTGT 

TTCGAGAAGG 

AOGGCCTTCA 

CTCATAGGCC 

CTGGAGCAGC 

TTCTTTGAAG 

GGTCCACGTC 

GTGGGCATGT 

ACTGTGCTCC 

CGGACCTTCG 

ACACCOGAGG 

CACCAGCCCG 

GAGACTTCCA 

GCAATGATGT 

ACTGTTGCTC 
TGTTATTTTT 

TTCCATGCCT 
ATCTGTAGGG 
TCTATCCCAG 
GGTCAGGCTC 
TCTTTCTTTA 
AGTCTGGGCT 
GAGGGCCTGT 
TGTGTATAGA 
AAGTTATAGT 
CCTAAACAGA 
GGGTAGGAGG 
CTTAGACTTC 
CAAAAATCTG 
AGGCATTTCT 
CGTGATGCTC 



Seq ID NOr C86 DNA Sequence 

Nucleic Acid Accession #; XM_035292.2 

Coding sequence t 53.. 1576 



1 
I 

GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGGGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATOGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GAOGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 
CCCCCAGGAG 



11 
I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTOG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATGAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCOGTCT 
AGCGGGCTGC 
CAGGGCATCT 
ACATAGCCAG 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 
GAGGCC3AGT 



31 

I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGGGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 
GGCTGCOGGA 



41 

I 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 
GGAGCATGC 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
6CTCGCCQGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
OGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



120 

180 

240 

300 

360 

420 

460 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2856 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1609 



Seq ID NO: C87 DNA Sequence 

Nucleic Acid Accession #t NM_00S268.1 
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Coding sequence: 168.. 989 

1 11 21 31 41 51 

I I I I I I 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC OGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTrTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTQGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCOG6C AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGOGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACGGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATG6G GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 

GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 1243 

Seq ID NO: C88 DNA Sequence 
Nucleic Acid Accession #: NMJ)05130 
Coding sequence : 98 . . 802 



1 11 21 31 41 51 

I I I I 1 I 

CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTGGGATT GCACTGGATC 60 

CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120 

GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA AAAAAGTGAA 180 

GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC ACTCTGGGCA ACACCCAGAT 240 

TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 300 

CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360 

GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420 

TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 480 

ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 540 

TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600 

GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 660 

. GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720 

4D GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780 

AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGATGTCA 840 

TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 900 

TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960 

TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 1020 

TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA 1080 

GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 1140 

TTTTTCAAAA AAAAAAAAAA AAA 1163 

Seq ID NO: C89 DNA Sequence 
J J Nucleic Acid Accession 8: BC022S42 
Coding sequence : 2 74 92 7 

1 11 21 31 41 51 

11)111 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGOGGG CGGCCAGGCC 60 

CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 120 

TGGOGGCTCG GGTGGCGGOG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 180 

GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTOACGCC GGCATAAGGG 240 

CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 300 

TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 360 

TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 420 

AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 480 

ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 540 

GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 600 

AAGCCTOGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 660 

TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 720 

TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 780 

CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 840 

TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 900 

TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 960 

TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 1020 

TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 1080 

CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 1140 

Q TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 1200 

OU GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 1260 

CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACAOCIGTA ATCCCAGCAC 1320 

TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 1380 

ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 1440 

CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 1500 
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GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AAOGAGACCC TGTCTCCAAA 1S60 

GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 1620 

ATCTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 1680 

CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 1740 

ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TOAATATATG AATTG6CAAA 1800 

GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 1860 

TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 1906 

Seq ID NO: C90 UNA Sequence 
Nucleic Acid Accession 8: NM_004994 
Coding sequence: 20. .2143 ~ 



1 11 21 31 41 51 

I I I I I I 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTGGCGGA 540 

GCACGGAGAC GGGTATCCCT TGGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGOGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGOGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCOCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

A TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

4U AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 18O0 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 

TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 2334 



Seq ID NO: C91 Sequence 

Nucleic Acid Accession #: NM_000213 

Coding sequence: 188.5656 



^ 1 11 21 31 41 51 

60 | | | | | | 

GCGCTGCCCG CCTOGTCCCC ACCCCCCAAC CCCCCGCGCC CGCCCTCGGA CAGTCCCTGC 60 

TCGCCCGCGC GCTGCAGCCC CATCTCCTAG CGGCAGCCCA GGCGCGGAGG GAGCGAGTCC 120 

GCCCCGAGGT AGGTCCAGGA CGGGCGCACA GCAGCAGCCG AGGCTGGCCG GGAGAGGGAG 180 

GAAGAGGATG GCAGGGCCAC GCCCCAGCCC ATGGGCCAGG CTGCTCCTGG CAGCCTTGAT 240 

CAGCGTCAGC CTCTCTGGGA CCTTGGCAAA CCGCTGCAAG AAGGCCCCAG TGAAGAGCTG 300 

CACGGAGTGT GTCCGTGTGG ATAAGGACTG CGCCTACTGC ACAGACGAGA TGTTCAGGGA 360 

CCGGCGCTGC AACACCCAGG CGGAGCTGCT GGCCGCGGGC TGCCAGCGGG AGAGCATCGT 420 

GGTCATGGAG AGCAGCTTCC AAATCACAGA GGAGACCCAG ATTGACACCA CCCTGCGGCG 480 

_ A CAGCCAGATG TCCCCCCAAG GCCTGCGGGT CCGTCTGCGG CCCGGTGAGG AGCGGCATTT 540 

70 TGAGCTGGAG GTGTTTGAGC CACTGGAGAG CCCCGTGGAC CTGTACATCC TCATGGACTT 600 

CTCCAACTCC ATGTCCGATG ATCTGGACAA CCTCAAGAAG ATGGGGCAGA ACCTGGCTCG 660 

GGTCCTGAGC CAGCTCACCA GCGACTACAC TATTGGATTT GGCAAGTTTG TGGACAAAGT 720 

CAGCGTCCCG CAGACGGACA TGAGGCCTGA GAAGCTGAAG GAGCCCTGGC CCAACAGTGA 780 

CCCCCCCTTC TCCTTCAAGA ACGTCATCAG CCTGACAGAA GATGTGGATG AGTTCCGGAA 840 

TAAACTGCAG GGAGAGCGGA TCTCAGGCAA CCTGGATGCT CCTGAGGGCG GCTTCGATGC 900 

CATCCTGCAG ACAGCTGTGT GCACGAGGGA CATTGGCTGG CGCCCGGACA GCACCCACCT 960 

GCTGGTCTTC TCCACCGAGT CAGCCTTCCA CTATGAGGCT GATGGCGCCA ACGTGCTGGC 1020 

TGGCATCATG AGCCGCAACG ATGAACGGTG CCACCTGGAC ACCACGGGCA CCTACACCCA 10 B0 

OA GTACAGGACA CAGGACTACC CGTOGGTGCC CAC C CTGGTG CGCCTGCTCG CCAAGCACAA 1140 

OU CATCATCCCC ATCTTTGCTG TCACCAACTA CTCCTATAGC TACTACGAGA AGCTTCACAC 1200 

CTATTTCCCT GTCTCCTCAC TGGGGGTGCT GCAGGAGGAC TCGTCCAACA TCGTGGAGCT 1260 

GCTGGAGGAG GCCTTCAATC GGATCCGCTC CAACCTGGAC ATCCGGGCCC TAGACAGCCC 1320 

CCGAG6CCTT OGGACAGAGG TCACCTCCAA GATGTTCCAG AAGACGAGGA CTGGGTCCTT 1380 

TCACATCCGG CGGGGGGAAG TGGGTATATA CCAGGTGCAG CTGCGGGCCC TTGAGCACGT 1440 
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GGATGGQACG CACGTGTGCC AGCTGCCGGA GGACCAGAAG GGCAACATCC ATCTGAAACC 1500 

TTCCTTCTCC GACGGCCTCA AGATGGACGC GGGCATCATC TGTGATGTGT GCACCTGCGA 1S60 

GCTGCAAAAA GAGGTGCGGT CAGCTCGCTG CAGCTTCAAC GGAGACTTCG TGTGCGGACA 1620 

GTGTGTGTGC AGCGAGGGCT GGAGTGGCCA GACCTGCAAC TGCTCCACCG GCTCTCTGAG 1680 

TGACATTCAG CCCTGCCTGC GGGAGGGCGA GGACAAGCOG TGCTCCGGCC GTGGGGAGTG 1740 

CCAGTGCGGG CACTGTGTGT GCTACGGCGA AGGCCGCTAC GAGGGTCAGT TCTGCGAGTA 1800 

TGACAACTTC CAGTGTCCCC GCACTTCCGG GTTCCTGTGC AATGACCGAG GACGCTGCTC 1860 

CATGGGCCAG TGTGTGTGTG AGCCTGGTTG GACAGGCCCA AGCTGTGACT GTCCCCTCAG 1920 

CAATGCCACC TGCATCGACA GCAATGGGGG CATCTGTAAT GGACGTGGCC ACTGTGAGTG 1980 

TGGCOGCTGC CACTGCCACC AGCAGTCGCT CTACACGGAC ACCATCTGCG AGATCAACTA 2040 

CTCGGCGATC CACCCGGGCC TCTGCGAGGA CCTACGCTCC TGCGTGCAGT GCCAGGCGTG 2100 

GGGCACCGGC GAGAAGAAGG GGCGCACGTG TGAGGAATGC AACTTCAAGG TCAAGATGGT 2160 

GGACGAGCTT AAGAGAGCCG AGGAGGTGGT GGTGCGCTGC TCCTTCCGGG ACGAGGATGA 2220 

CGACTGCACC TACAGCTACA CCATGGAAGG TGACGGCGCC CCTGGGCCCA ACAGCACTGT 2280 

CCTGGTGCAC AAGAAGAAGG ACTGCCCTCC GGGCTCCTTC TGGTGGCTCA TCCCCCTGCT 2340 

CCTCCTCCTC CTGCCGCTCC TGGCCCTGCT ACTGCTGCTA TGCTGGAAGT ACTGTGCCTG 2400 

CTGCAAGGCC TGCCTGGCAC TTCTCCCGTG CTGCAACCGA GGTCACATGG TGGGCTTTAA 2460 

GGAAGACCAC TACATGCTGC GGGAGAACCT GATGGCCTCT GACCACTTGG ACACGCCCAT 2520 

GCTGCGCAGC GGGAACCTCA AGGGCCGTGA CGTGGTCCGC TGGAAGGTCA CCAACAACAT 2580 

GCAGCGGCCT GGCTTTGCCA CTCATGCCGC CAGCATCAAC CCCACAGAGC TGGTGCCCTA 2640 

CGGGCTGTCC TTGCGCCTGG CCCGCCTTTG CACCGAGAAC CTGCTGAAGC CTGACACTCG 2700 

GGAGTGCGCC CAGCTGCGCC AGGAGGTGGA GGAGAACCTG AACGAGGTCT ACAGGCAGAT 2760 

CTCCGGTGTA CACAAGCTCC AGCAGACCAA GTTCCGGCAG CAGCCCAATG CCGGGAAAAA 2820 

GCAAGACCAC ACCATTGTGG ACACAGTGCT GATGGCGCCC CGCTCGGCCA AGCCGGCCCT 2880 

GCTGAAGCTT ACAGAGAAGC AGGTGGAACA GAGGGCCTTC CACGACCTCA AGGTGGCCCC 2940 

CGGCTACTAC ACCCTCACTG CAGACCAGGA CGCCCGGGGC ATGGTGGAGT TCCAGGAGGG 3000 

CGTGGAGCTG GTGGACGTAC GGGTGCCCCT CTTTATCCGG CCTGAGGATG ACGACGAGAA 3060 

GCAGCTGCTG GTGGAGGCCA TCGACGTGCC CGCAGGCACT GCCACCCTCG GCCGCCGCCT 3120 

GGTAAACATC ACCATCATCA AGGAGCAAGC CAGAGACGTG GTGTCCTTTG AGCAGCCTGA 3180 

GTTCTCGGTC AGCCGCGGGG ACCAGGTGGC CCGCATCCCT GTCATCCGGC GTGTCCTGGA 3240 

CGGCGGGAAG TCCCAGGTCT CCTACCGCAC ACAGGATGGC ACCGCGCAGG GCAACCGGGA 3300 

CTACATCCCC GTGGAGGGTG AGCTGCTGTT CCAGCCTGGG GAGGCCTGGA AAGAGCTGCA 3360 

GGTGAAGCTC CTGGAGCTGC AAGAAGTTGA CTCCCTCCTG CGGGGCCGCC AGGTCCGCCG 3420 

TTTCCACGTC CAGCTCAGCA ACCCTAAGTT TGGGGCCCAC CTGGGCCAGC CCCACTCCAC 3480 

CACCATCATC ATCAGGGACC CAGATGAACT GGACCGGAGC TTCACGAGTC AGATGTTGTC 3540 

ATCACAGCCA CCCCCTCAOG GCGACCTGGG CGCCCCGCAG AACCCCAATG CTAAGGCCGC 3600 

TGGGTCCAGG AAGATCCATT TCAACTGGCT GCCCCCTTCT GGCAAGCCAA TGGGGTACAG 3660 

GGTAAAGTAC TGGATTCAGG GCGACTCCGA ATCCGAAGCC CACCTGCTCG ACAGCAAGGT 3720 

GCCCTCAGTG GAGCTCACCA ACCTGTACCC GTATTGCGAC TATGAGATGA AGGTGTGCGC 3780 

CTACGGGGCT CAGGGCGAGG GACCCTACAG CTCCCTGGTG TCCTGCCGCA CCCACCAGGA 3840 

AGTGCCCAGC GAGCCAGGGC GTCTGGCCTT CAATGTCGTC TCCTCCACGG TGACCCAGCT 3900 

GAGCTGGGCT GAGCCGGCTG AGACCAACGG TGAGATCACA GCCTACGAGG TCTGCTATGG 3960 

CCTGGTCAAC GATGACAACC GACCTATTGG GCCCATGAAG AAAGTGCTGG TTGACAACCC 4020 

TAAGAACCGG ATGCTGCTTA TTGAGAACCT TCGGGAGTCC CAGCCCTACC GCTACACGGT 4080 

GAAGGCGCGC AACGGGGCCG GCTGGGGGCC TGAGCGGGAG GCCATCATCA ACCTGGCCAC 4140 

CCAGCCCAAG AGGCCCATGT CCATCCCCAT CATCCCTGAC ATCCCTATCG TGGACGCCCA 4200 

GAGCGGGGAG GACTACGACA GCTTCCTTAT GTACAGCGAT GACGTTCTAC GCTCTCCATC 4260 

GGGCAGCCAG AGGCCCAGCG TCTCCGATGA CACTGGCTGC GGCTGGAAGT TCGAGCCCCT 4320 

GCTGGGGGAG GAGCTGGACC TGCGGCGCGT CACGTGGCGG CTGCCCCCGG AGCTCATCCC 4380 

GCGCCTGTCG GCCAGCAGCG GGCGCTCCTC CGACGCCGAG GCCCCCACGG CCCCCCGGAC 4440 

GACGGCGGCG CGGGCGGGAA GGGCGGCAGC CGTGCCCCGC AGTGCGACAC CCGGGCCCCC 4500 

CGGAGAGCAC CTGGTGAATG GCCGGATGGA CTTTGCCTTC CCGGGCAGCA CCAACTCCCT 4560 

GCACAGGATG ACCACGACCA GTGCTGCTGC CTATGGCACC CACCTGAGCC CACACGTGCC 4620 

CCACCGCGTG CTAAGCACAT CCTCCACCCT CACACGGGAC TACAACTCAC TGACCCGCTC 4680 

AGAACACTCA CACTCGACCA CACTGCCCAG GGACTACTCC ACCCTCACCT CCGTCTCCTC 4740 

CCACGACTCT CGCCTGACTG CTGGTGTGCC CGACACGCCC ACCCGCCTGG TGTTCTCTGC 4800 

CCTGGGGCCC ACATCTCTCA GAGTGAGCTG GCAGGAGCCG CGGTGCGAGC GGCCGCTGCA 4860 

GGGCTACAGT GTGGAGTACC AGCTGCTGAA CGGCGGTGAG CTGCATCGGC TCAACATCCC 4920 

CAACCCTGCC CAGACCTCGG TGGTGGTGGA AGACCTCCTG COCAACCACT CCTACGTGTT 4980 

CCGCGTGCGG GCCCAGAGCC AGGAAGGCTG GGGCCGAGAG CGTGAGGGTG TCATCACCAT 5040 

TGAATCCCAG GTGCACCCGC AGAGCCCACT GTGTCCCCTG CCAGGCTCCG CCTTCACTTT 5100 

GAGCACTCCC AGTGCCCCAG GCCOGCTGGT GTTCACTGCC CTGAGCCCAG ACTCGCTGCA 5160 

GCTGAGCTGG GAGCGGCCAC GGAGGCCCAA TGGGGATATC GTCGGCTACC TGGTGACCTG 5220 

TGAGATGGCC CAAGGAGGAG GGCCAGCCAC CGCATTCCGG GTGGATGGAG ACAGCCCCGA 5280 

GAGCCGGCTG ACCGTGCCGG GCCTCAGCGA GAACGTGCCC TACAAGTTCA AGGTGCAGGC 5340 

CAGGACCACT GAGGGCTTCG GGCCAGAGCG CGAGGGCATC ATCACCATAG AGTCCCAGGA 5400 

TGGAGGACCC TTCCCGCAGC TGGGCAGCCG TGCCGGGCTC TTCCAGCACC CGCTGCAAAG 5460 

CGAGTACAGC AGCATCACCA CCACCCACAC CAGCGCCACC GAGCCCTTCC TAGTGGATGG 5520 

GCTGACCCTG GGGGCCCAGC ACCTGGAGGC AGGCGGCTCC CTCACCCGGC ATGTGACCCA 5580 

GGAGTTTGTG AGCCGGACAC TGACCACCAG CGGAACCCTT AGCACCCACA TGGACCAACA 5640 

GTTCTTCCAA ACTTGACCGC ACCCTGCCCC ACCCCCGCCA TGTCCCACTA GGCGTCCTCC 5700 

CGACTCCTCT CCCGGAGCCT CCTCAGCTAC TCCATCCTTG CACCCCTGGG GGCCCAGCCC 5760 

ACCCGCATGC ACAGAGCAGG GGCTAGGTGT CTCCTGGGAG GCATGAAGGG GGCAAGGTCC S820 

GTCCTCTGTG GGCCCAAACC TATTTGTAAC CAAAGAGCTG GGAGCAGCAC AAGGACCCAQ 5880 

CCTTTGTTCT GCACTTAATA AATGGTTTTG CTACTGCTAA AAAAAAAAAA AAAAAAAAAA 5940 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 5994 

Seq ID NO: C92 DNA Sequence 
Nucleic Acid Accession #i KM_02391S 
Coding sequence: 250.. 1326 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 
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TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCAOGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 

TTCATTATCC TTAAAAAAAA AA 1402 

Seq ID NO: C93 SNA Sequence 

Nucleic Acid Accession #j NM_020789.1 

Coding sequence: 2 08.. 3 6 99 



00 1 11 21 31 41 51 

30 | | | | | | 

GGCACGAGGG TGGAGCCGAG CGGTGCGGAG CAGATCTGGT GGTTCTCCGG AGAGCAGCTT 60 

CCTTGGGTGT TACATGAGCC AAGCCCTCAC TGTACAGAAG AGTGAGAGCT GAAACCTGTT 120 

CCCTGAGCTG ATCAGAAGGA CATCCCTTGG CCCCTCCATC TGGGCTCCTG TGGATAGGAG 180 

GGGCTGGGTG AGCAGGCCAG CTGGGCTATG GTGTGGTGCC TCGGCCTGGC CGTCCTCAGC 240 

CTGGTCATCA GCCAGGGGGC TGACGGTCGA GGGAAGCCTG AGGTGGTATC GGTGGTGGGC 300 

CGGGCTGAGG AGAGTGTGGT GCTGGGCTGT GACCTGCTGC CCCCGGCCGG CCGGCCCCCC 360 

CTGCATGTCA TCGAGTGGCT GCGCTTTGGA TTCCTGCTTC CCATCTTCAT CCAGTTCGGC 420 

CTCTACTCTC CCCGAATTGA CCCTGATTAC GTGGGACGAG TCOGGCTGCA GAAGGGGGCC 480 

TCTCTCCAGA TTGAGGGTCT CCGGGTGGAA GACCAGGGCT GGTACGAGTG CCGCGTGTTC 540 

TTCCTGGACC AGCACATCCC TGAAGACGAT TTTGCTAACG GCTCCTGGGT GCATCTGACA 600 

GTCAATTCAC CCCCTCAATT CCAGGAGACA CCTCCTGCTG TGTTGGAAGT GCAGGAACTG 660 

GAGCCTGTGA CCCTGCGTTG TGTGGCCCGT GGCAGCCCCC TGCCTCATGT GACGTGGAAG 720 

CTCCGAGGAA AGGACCTTGG CCAGGGCCAG GGCCAGGTGC AAGTGCAGAA CGGGACGCTG 780 

CGGATCCGCC GGGTAGAGCG AGGCAGCTCT GGGGTCTACA CCTGCCAAGC CTCCAGCACT 840 

GAGGGCAGCG CCACCCACGC CACCCAGCTG CTAGTGCTAG GACCCCCAGT CATCGTGGTG 900 

CCCCCCAAGA ACAGCACAGT CAATGCCTCC CAGGATGTTT CATTGGCCTG CCATGCTGAG 960 

GCATACCCTG CTAACCTCAC CTACAGCTGG TTCCAGGACA ACATCAATGT CTTCCACATT 1020 

AGCCGCCTGC AGCCCCGGGT GCAGATCCTG GTGGACGGGA GCCTGCGGCT GCTGGCCACC 1080 

CAGCCTGATG ATGCCGGCTG CTACACCTGT GTGCCCAGCA ATGGCCTCCT GCATCCACCC 1140 

TCAGCCTCTG CCTACCTCAC TGTGCTCTGC ATGCCGGGGG TGATCCGCTG CCCGGTTCGT 1200 

GCCAACCCCC CACTGCTCTT TGTCAGCTGG ACCAAGGATG GAAAGGCCCT GCAGCTGGAC 1260 

AAGTTCCCTG GCTGGTCCCA GGGCACAGAA GGCTCACTGA TCATCGCCCT GGGGAACGAG 1320 

GATGCCCTGG GAGAATACTC CTGCACCCCC TACAACAGTC TTGGTACCGC CGGGCCCTCT 1380 

CCTGTGACCC GCGTGCTGCT CAAGGCTCCC CCAGCTTTTA TAGAGCGGCC CAAGGAAGAA 1440 

TATTTCCAAG AAGTAGGGCG GGAGCTGCTC ATCCCCTGCT CCGCCCAAGG GGACCCTCCT 1S00 

CCTGTTGTCT CTTGGACCAA GGTGGGCCGG GGGCTGCAAG GCCAGGCCCA GGTGGACAGC 1560 

AACAGCAGCC TCATCCTGCG ACCATTGACC AAGGAGGCCC ACGGGCACTG GGAATGCAGT 1620 

GCCAGCAATG CTGTGGCCCG AGTGGCCACC TCCACGAACG TCTACGTGCT GGGCACTAGC 1680 

CCTCATGTTG TCACCAATGT GTCCGTGGTG GCTTTGCCCA AGGGTGCCAA TGTCTCCTGG 1740 

GAGCCTGGCT TTGATGGTGG TTATCTGCAG AGATTCAGTG TCTGGTACAC CCCACTGGCC 1800 

AAGCGTCCTG ACCGAATGCA CCATGACTGG GTGTCCTTGG CAGTGCCTGT GGGGGCTGCT 1860 

CACCTCCTAG TGCCAGGGCT GCAGCCCCAC ACCCAGTACC AGTTCAGCGT GCTAGCTCAG 1920 

AACAAGCTGG GGAGTGGTCC CTTCAGCGAA ATCGTCTTGT CTGCTCCGGA AGGGCTTCCT 1980 

ACCACGCCAG CTGCACCCGG GCTTCCCCCA ACAGAGATAC CGCCTCCCCT GTCCCCTCCG 2040 

CGGGGTCTGG TGGCAGTGAG GACACCCCGG GGGGTACTCC TGCATTGGGA TCCCCCAGAG 2100 

CTGGTCCCTA AGAGACTGGA TGGCTACGTC TTGGAAGGCC GGCAAGGCTC CCAGGGCTGG 2160 

GAGGTGCTGG ACCCGGCTGT GGCAGGCACA GAAACAGAGC TGCTGGTGCC AGGCCTCATC 2220 

AAGGATGTTC TCTACGAGTT CCGCCTCGTG GCCTTCGCGG GCAGCTTCGT CAGCGACCCC 2280 

AGCAACACGG CCAACGTCTC CACTTCCGGT CTGGAGGTCT ACCCTTCGCG CACGCAGCTG 2340 

CCGGGCCTCC TGCCTCAGCC CGTGCTGGCC GGCGTGGTGG GCGGAGTCTG CTTTCTGGGA 2400 

GTGGCCGTCC TTGTGAGCAT CCTGGCCGGC TGCCTCCTGA ACCGGCGCAG GGCTGCCCGC 2460 

CGCCGCCGCA AGCGCCTCCG CCAAGATCCA CCTCTTATCT TCTCTCCGAC CGGGAAGTCA 2520 

GCTGCACCCT CTGCTCTGGG CTCAGGCAGT CCTGACAGCG TGGCGAAGCT GAAGCTCCAG 2580 

GGATCCCCAG TCCCCAGCCT GCGCCAGAGT CTGCTCTGGG GGGATCCTGC CGGAACTCCC 2640 

AGCCCCCACC CGGATCCTCC ATCTAGCCGG GGACCCTTAC CTCTGGAGCC CATTTGCCGG 2700 

GGCCCAGACG GGCGCTTTGT GATGGGGCCC ACTGTGGCGG CCCCCCAGGA AAGGTCAGGC 2760 

CGGGAGCAGG CAGAACCTCG GACTCCAGCC CAGCGTCTGG CCCGGTCCTT TGACTGTAGC 2820 

AGCAGCAGCC CCAGTGGGGC ACCCCAGOCC CTCTGCATTG AAGACATCAG CCCTGTGGCA 2880 

CCCCCTCCAG CAGCCCCACC CAGTCCCTTG CCAGGTCCTG GACCCCTGCT CCAGTACCTG 2940 

AGCCTGCCCT TCTTCCGAGA GATGAATGTG GATGGGGACT GGCCCCCGCT TGAGGAGCCC 3000 

AGCCCTGCTG CACCCCCAGA TTACATGGAT ACCCGGCGCT GTCCCACCTC ATCTTTCCTT 3060 

CGTTCTCCAG AAACCCCTCC TGTATCCCCC AGGGAATCAC TTCCTGGGGC TGTGGTAGGG 3120 

GCTGGGGCCA CTGCAGAGCC CCCTTACACA GCCCTGGCTG ACTGGACACT GAGGGAGCGG 3180 

CTGCTGCCAG GCCTTCTCCC TGCTGCCCCT CGAGGCAGCC TCACCAQCCA GAGCAGCGGG 3240 
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CGAGGCAGCG CTTOGTTCCT GC6GCCCCCC TCCACAGCCC CCTCTGCAGG AGGCAGCTAC 3300 

CTCAGCCCTG CTCCAGGAGA CACCAGCAGC TGGGCCAGTG GCCCTSAGAG ATGGCCCCGA 3360 

AGGGAGCATG TGGTGACAGT CAGCAAGAGG AGGAACACAT CTGTGGACGA GAACTATGAG 3420 

TGGGACTCAG AATTCCCTGG GGACATGGAA TTGCTGGAGA CTTTGCACCT GGGCTTGGCC 3480 

J AGCTCCOGGC TCAGACCTGA AGCTGAGACA GAGCTAGGTG TGAAGACTCC AGAGGAGGGC 3540 

TGCCTCCTGA ACACTGCCCA TGTTACTGGC CCTGAGGCCC GCTGTGCTGC CCTTOGGGAG 3600 

GAATTCCTGG CCTTCCGCCG CCGCCGAGAT GCTACTAGGG CTCGGCTACC AGCCTATOGA 3660 

CAGCCAGTCC CCCACCCCGA ACAGGCCACT CTGCTGTGAA CATCCCTAAT GTGAGGCTGT 3720 

GAAAAGGCAT ATGGACCTGC AAAGGAGGCC CCCAACCAGA CAGACCTAGT TTCAAACGAG 3780 

GGCACTGCCC CTGCCTGCCC CTTTGGTGCC CAGGCACAGA CCCTGATAGT GGGTTTGGGT 3840 

CACCTTGGTA TGGAATGTAT GTGCTGACCC CCTAGGTGAG TCTGGGGATT GGAACAGGGA 3900 

TCTTAGGTCT GCCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTGT GTGTGTGTGT 3960 

GTGTGTGAAG TTTTTTACAG GTGAATAAAC AAAGTTTGAA AGAAAAAAAA AAAAAAAAAA 4020 

j AAAA 4024 

Seq ID NO: C94 DNA Sequence 
Nucleic Acid Accession ft: NM_006875 
Coding sequence: 186.. 1190 

20 i 11 21 31 41 51 

I I I I I I 

GAATTCGGCA CGAGCGCGCG GOGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60 

CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120 

0 CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180 

ZJ CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240 

CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 300 

GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 420 

„ CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480 

3U TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540 

CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600 

CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720 

^ AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 780 

GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020 

AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080 

4U AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260 

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320 

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380 

CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440 

TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680 

J\) TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800 

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920 

- - CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980 

J2> TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040 

CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 2088 

Seq ID NO: C95 DNA Sequence 
- A Nucleic Acid Accession tii NMJJ02S10.1 
0U Coding sequence? 92.. 1774 

1 11 21 31 41 51 

I I I I I I 

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180 

CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 

GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTQTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 1080 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTOCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 
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GCCATGGCCT 
GGAGGTCTGT 
CCCTGTGGAT 
GAOGTACTGT 
GATTTCTGTT 
CTCCGTTGGC 
CAAGGAATAC 
TGTCTTTCTC 
ACTCAAAAAC 
TTTTCAGTGC 
TTATTGTTAA 
TTTAGAGATG 
AAAGCAACTT 
TAACTAGTAG 
AGCCTAACCC 
ATGCATAAAG 
TTCAATACAC 
GTGCACACTT 
TGACAACCTA 
GGACATTTAG 
ATTTCCAAAT 
AGATGAGGTC 
ACTAAAACCA 
AAGTGTGGGA 



GAAAGCTCCC 
ACCATCATTT 
GTGGATGAGA 
GTGAACCTCA 
CCTGACAGAG 
TGCTTGGCCA 
AACCCAATAG 
AACCGTGCAA 
CAAGAATTTA 
CATTGATGTG 
ATAGATATTG 
GGGAGAGGGA 
AGCAAGGCT7 
GATAGAAACA 
AGGTTAACTG 
CCAATGTAGT 
ACTCATGAAC 
GCTAGACTCA 
CTTTGCTTGG 
TTAGTGCTTT 
TTTTGTATAG 
CCTGGTTTTT 
TCTACTATAT 
AGAGACAAAA 



TAATAGACTT 
CTGACOCCAC 

CCCTGGGGGA 
ACCCAGCCTC 
TATTTGTCAC 
AAAATAGTCC 
AAGCCGTGTT 
AAGGAGTTTC 
AGATGTGCTG 
TGGTTTGGGG 
TTATACTGCA 
CTTTTCATTA 
CTGTGTCCCG 
CAAGAAGAGG 
CCAGTTTCTA 
TCCTGATGGA 
GAAAAAATAC 
CTGAGTGAAG 
TTATATACCA 
TCGCTGCACA 
CATGGCAACT 
GTTAGACATG 



TGTCGTGAGC 
CTGCGAGATC 
GACTGTGAGA 
TGACACAAGC 
GCCTTTAAGG 
TGTGATCTCC 
TGGGAATGTG 
CTTCCCGGGA 
TTAAATTTCG 
GAGTGGCTAT 
AAGTTGAATT 
GGCAGCTTCA 
TTTTTTATGT 
AGAGTAAGGA 
CGGGATACTT 
AGATCATGTT 
ACAATAACAG 
TACTCTCATA 
GAATGATATT 
GGCATGATGC 
TATTTGAAAT 
TGATCAGTAA 
ACATTCTTTT 



TGCCAAGGGA 
ACCCAGAACA 
CGAACCTTCA 
CTGGCTCTCA 
ATGGCAAACA 
CTCTTGGTGT 
GTCAGAAGCA 
AACCAGGAAA 
ACCTTGTTTC 
TAACCTTTTT 
TTTTATAGGT 
GCCATGTTGT 
TTCACTTATA 
GAGAAGCTAC 
TCAGCTTTCC 
CCAAGCTAAC 
GCCCAAGCCT 
AATGGGTGGG 
CATATATTCA 
TGAGTGACAC 
CATATATTAA 
GGATTTCACC 
TCTCTCCTTC 



GCATTCCCAC 
CAGTCTGCAG 
ATGGGTCTGG 
CGAGCACCCT 
GTGCCCTGAT 
ACAAAAAACA 
AAGGCCTGAG 
AGGATCCGCT 
TGAAGCTCAC 
TTCCTAAAGA 
TAAATGTCAT 
GAAACTGATA 
AAGTCTTAGG 
TATTGATTAG 
ATGTAACTGT 
TGAATCCCAC 
GTGGTATGAT 
AGTATTTTGG 
TTTATTCCAT 
TCTTGTGTAT 
GACTTTCCAA 
TCTGTTTGTA 
CTGAAAAATA 



Seq ID NO: C96 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1-4247 



11 



21 



ATGCGAATCC 
TGGGCTAATG 
ACAGGAGCAC 
CAATCTCCTA 
AAATTTCAGG 
ACAGTGGAAA 
TTTAAAGCAA 
GAGCATAGTT 
GACCGATTTT 
ATTTTGTTTG 
GAAAGTGTTA 
CTGCCAAACT 
ACAGACACAG 
GCTGTTTTTT 
TTACAAAACA 
ACTGGAAAGG 
GACCCAGAGA 
ACCATGATTG 
CATGAATTTT 
AATATGAGTT 
AGCGACCAAC 
TTAATTGGAA 
GCTATTGTGA 
ATTTCTACCA 
CGATCCCCAA 
AATTCCACTT 
CAGACTGTGA 
GGCTCTAAAA 
AATACAGTTT 
ACTGGAGCTG 
GAGAACATAT 
GTCCTTATAC 
GAATCACTAA 
ACAGCACAGC 
ATACGTGTTG 
CAGGGTCCCT 
ACTGAGGTAA 
GTCAACGTGG 
CATGAGTCTC 
CTTGTGATCG 
TACTGGAGGA 
GTTATATCCA 
ATAAAGCACT 
TTTGAGGAAG 
CCAGACAACA 
AAGCTAGCAC 
GTTGATGGCT 
GCTGAAGATT 
AACCTCGTGG 
GAGTACGGGA 
AGGAATTTTA 
GGACGTGTGG 
TCCCTGCCAG 



TAAAGCGTTT CCTCGCTTGC 
GATACTACAG ACAACAGAGA 
TGAATCAAAA AAATTGGGGA 
TCAATATTGA TGAAGATCTT 
GTTGGGATAA AACATCATTG 
TTAATCTCAC TAATGACTAC 
GCAAGATAAC TTTTCACTGG 
TAGAAGGACA AAAATTTCCA 
CAAGTTTTGA GGAAGCAGTC 
AGGTTGGGAC AGAAGAAAAT 
GTCGTTTTGG GAAGCAGGCT 
CAACTGACAA GTATTACATT 
TTGACTGGAT TGTTTTTAAA 
GTGAAGTTCT TACAATGCAA 
ATTTTCGAGA GCAACAGTAC 
AAGAGATTCA TGAAGCAGTT 
ATTATACCAG CCTTCTTGTT 
AGAAGTTTGC AGTTTTGTAC 
TGACAGATGG CTATCAAGAC 
ATGTTCTTCA GATAGTAGCC 
TGATTGTCGA CATGCCTACT 
CTGAAGAAAT AATCAAGGAG 
ATCCTGGTAG AGACAGTGCT 
CAACACACTA CAATCGCATA 
CAAGAGGAAG TGAATTCTCT 
CCCAACCAGT CACTAAATTA 
CTGAACTGCC ACCTCACACT 
CTGTTCTTAG ATCTCCACAT 
CTATAACAGA ATATGAGGAG 
AAGATTCTTC AGGCTCCAGT 
CCCAAGGGTA TATATTTTCC 
CAGAATCTGC TAGAAATGCT 
AGGATCCTTC TATGGAGGGA 
CCGATGTTGG ATCAGGCAGA 
ATGAATCTGA GAAGACAACC 
CAGTTACAGA TCTGGAAATG 
CACCTCATGC TTTTACCCCA 
TATACTCGCA GACAACCCAA 
GTATTGGTCT AGCTGAGGGG 
TGTCAGCCCT GACTTTTATC 
AATGCTTCCA GACTGCACAC 
CACCTCCAAC ACCTATCTTT 
TTCCAAAGCA TGTTGCAGAT 
TGCAGAGCTG TACTGTTGAC 
AGCACAAOAA TCGATACATA 
AGCTTGCTGA AAAGGATGGC 
ACAACAGACC AAAAGCTTAT 
TCTGGAGAAT GATATGGGAA 
AGAAAGGAAG GAGAAAATGT 
ACTTTCTGGT CACTCAGAAG 
CTCTAAGAAA CACAAAAATA 
TCACACAGTA TCACTACACG 
TGCTGACCTT TGTGAGAAAG 



31 
I 

ATTCAGCTCC 
AAACTTGTTG 
AAGAAATATC 
ACACAAGTAA 
GAAAACACAT 
CGTGTCAGCG 
GGAAAATGCA 
CTTGAGATGC 
AAAGGAAAAG 
TTGGATTTCA 
GCTTTAGATC 
TACAATGGCT 
GATACAGTTA 
CAATCTGGTT 
AAGTTCTCTA 
TGTAGTTCAG 
ACATGGGAAA 
CAGCAGTTGG 
TTGGGTGCTA 
ATATGCACTA 
GATAATCCTG 
GAGGAAGAGG 
ACAAACCAAA 
GGGACGAAAT 
GGAAAGGGTG 
GCCACAGAAA 
GTGGAAGGTA 
ATGAACTTGT 
GAGAGTTTAT 
CCCGCAACTT 
TCCGAAAACC 
TCCGAAGATT 
AATGTGTGGT 
GAGAGCTTTC 
AAGTCCTTTT 
CCACATTATT 
TCCTCCAGAC 
CCGGTATACA 
TTGGAATCOG 
TGTCTAGTGG 
TTTTACTTAG 
CCAATTTCAG 
TTACATGCAA 
TTAGGTATTA 
AATATCGTTG 
AAACTGACTG 
ATTGCTGCCC 
CATAATGTGG 
GATCAGTACT 
AGTGTGCAAG 
AAAAAGGGCT 
CAGTGGCCTG 
GCAGCCTATG 



41 
I 

TCTGTGTTTG 
AAGAGATTGG 
CAACATGTAA 
ATGTGAATCT 
TCATTCATAA 
GAGGAGTTTC 
ATATGTCATC 
AAATCTACTG 
GGAAGTTAAG 
AAGCGATTAT 
CATTCATACT 
CATTGACATC 
GCATCTCTGA 
ATGTCATGCT 
GACAGGTGTT 
AACCAGAAAA 
GACCTCGAGT 
ATGGAGAGGA 
TTCTCAATAA 
ATGGCTTATA 
AACTTGATCT 
GAAAACACAT 
TCAGGAAAAA 
ACAATGAAGC 
ATGTTCCCAA 
AAGATATTTC 
CTTCAGCCTC 
CGGGGACTGC 
TGACCAGTTT 
CTGCTATCCC 
CAGAGACAAT 
CAACTTCATC 
TTCCTAGCTC 
TCCAGACTAA 
CTGCAGGCCC 
CTACCTTTGC 
AACAGGATTT 
ATGAGGCCAG 
AGAAGAAGGC 
TTCTTGTGGG 
AGGACAGTAC 
ATGATGTCGG 
GTAGTGGGTT 
CAGCAGACAG 
OCTATGATCA 
ATTATATCAA 
AAGGCCCACT 
AAGTTATTGT 
GGCCTGCCGA 
TGCTTGCCTA 
CCCAGAAAGG 
ACATGGGAGT 
CCAAGCGCCA 



51 
I 

CCGCCTGGAT 
CTGGTCCTAT 
TAGOCCAAAA 
TAAGAAACTT 
CACTGGGAAA 
AGAAATGGTG 
TGATGGATCA 
CTTTGATGCA 
AGCTTTATCC 
TGATGGAGTC 
GTTGAACCTT 
TCCTCCCTGC 
AAGCCAGTTG 
GATGGACTAC 
TTCCTCATAC 
TGTTCAGGCT 
CGTTTATGAT 
CCAAACCAAG 
TTTGCTACCC 
TGGAAAATAC 
TTTCCCTGAA 
TGAAGAAGGC 
GGAACCCCAG 
CAAGACTAAC 
TACATCTTTA 
CTTGACTTCT 
TTTAAATGAT 
AGAATCCTTA 
CAAGCTTGAT 
ATTCATCTCT 
AACATATGAT 
AGGTTCAGAA 
TACAGACATA 
TTACACTGAG 
AGTGATGTCA 
CTACTTCCCA 
GGTCTCCACG 
TAATAGTAGC 
AGTTATACCC 
TATTCTCATC 
ATCCCCTAGA 
AGCAATTCCA 
TACTGAAGAA 
CTCCAACCAC 
TAGCAGGGTT 
TGCCAATTAT 
GAAATCCACA 
CATGATAACA 
TGGGAGTGAG 
TTATACTGTG 
AAGACCCAGT 
ACCAGAGTAC 
TGCAGTGGGG 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2669 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 
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CCTGTTGTCG 
AGTATGTTGC 
ATCCGTTCAC 
ACACTGGTTG 
TATGTTAATG 
CAGCTCCTGA 
AACAGGGAAA 
TCATCCCTGA 
CAGAGCAATG 
AGGATGATAT 
GCAGAAGATG 
AAGGTCACTC 
CAGGACTTTA 
TGTCCTAAAT 
ATAAAAGAAG 
GTGACGGCAG 
TCCGTGGATG 
GACATTGAGC 
GAAGAGAATC 
GCTGAGAGCT 



TCCACTGCAG 
AGCAGATTCA 
AAAGAAATTA 
AGGCCATACT 
CACTCCTCAT 
GCCAGTCAAA 
AGAATCGAAC 
GTGGAGAAGG 
AATTCATCAT 
GGGACCATAA 
AATTTGTTTA 
TTATGGCTGA 
TCTTAGAAGC 
GGCCAAATCC 
AAGCTGCCAA 
GAACTTTCTG 
TTTACCAGGT 
AGTATCAGTT 
CATCCACCTC 
TAGAGTCTTT 



TGCTGGAGTT 
ACAOGAAGGA 
TTTGGTACAA 
TAGTAAAGAA 
TCCTGGACCA 
TATACAGCAG 
TTCTTCTATC 
CACAGACTAC 
TACCCAGCAC 
TGCCCAACTG 
CTGGCCAAAT 
AGAACACAAA 
TACACAGGAT 
AGATAGCCCC 
TAGGGATGGG 
TGCTCTGACA 
AGCCAAGATG 
TCTCTACAAA 
TCTGGACAGT 
AGTTTAA 



GGAAGAACAG 
ACTGTCAACA 
ACTGAGGAGC 
ACTGAGGTGC 
GCAGGCAAAA 
AGTGACTATT 
ATCCCTGTGG 
ATCAATGCCT 
CCTCTCCTTC 
GTGGTTATGA 
AAAGATGAGC 
TGTCTATCTA 
GATTATGTAC 
ATTAGTAAAA 
CCTATGATTG 
ACCCTTATGC 
ATCAATCTGA 
GTGATCCTCA 
AATGGTGCAG 



GCACATATAT 
TATTTGGCTT 
AATATGTCTT 
TGGACAGTCA 
CAAAGCTAGA 
CTGCAGCCCT 
AAAGATCAAG 
CCTATATCAT 
ATACCATCAA 
TTCCTGATGG 
CTATAAATTG 
ATGAGGAAAA 
TTGAAGTGAG 
CTTTTGAACT 
TTCATGATGA 
ACCAACTAGA 
TGAGGCCAGG 
GCCTTGTGAG 
CATTGCCTGA 



TGTGCTAGAC 
CTTAAAACAC 
CATTCATGAT 
TATTCATGCC 
GAAACAATTC 
AAAGCAATGC 
GGTTGGCATT 
GGGCTATTAC 
GGATTTCTGG 
CCAAAACATG 
TGAGAGCTTT 
ACTTATAATT 
GCACTTTCAG 
TATAAGTGTT 
GCATGGAGGA 
AAAAGAAAAT 
AGTCTTTGCT 
CACAAGGCAG 
TGGAAATATA 



Seq ID NO: C97 DNA Sequence 
Nucleic Acid Accession #x XM_031379 
Coding sequence: 148.. 7095 
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3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4347 



31 41 51 

I i 1 ( I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA S40 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAG^CGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTOGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTOA AGAAGGOGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

A ACTT GTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACOCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCAOCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 
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ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTTi ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG' 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG S640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GAGAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCA CCTC TC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 
TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 
TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 
GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 
AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 
AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 
TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 
GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 
TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 
GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 
AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 7944 

Seq ID NO: C98 DMA Sequence 
Nucleic Acid Accession #: NM_002851 
Coding sequence : 77 . . 4518 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCOA TCTATACACT GGAGGATTAA AACAAACAAA 60 
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CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG COGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTQTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

5 AAATATCCAA GATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGOGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

10 GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

15 ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

20 CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGOAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

25 AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

30 AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

35 GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA OGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAG ATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

40 GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AG CTGAGG GG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

45 TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

50 TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGAOCCAG TGGAOGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

55 GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACAOGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

60 GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

65 TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGT CACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC GAGATAGCGC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

70 GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACXAG AAAA AQAAA A TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

_ c CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

75 TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

80 CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 
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ATTGTTTAOT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT S160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 

AAAAAAAAAA AAAAAAAAAA AAAAAAA 5367 

Seq ID NO: C99 DNA Sequence 

Nucleic Acid Accession tt: Eos sequence 

Coding sequence: 501.. 45X4 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCOCTCC CTCTCCACTC TGAQAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACOG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG A6ATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TOGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGOTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCAGAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGGAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTOCTGAGCC 3660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 

TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

TAGAAGCTAC ACAGGATOAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 
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ACCAGGTAGC 
ATCAGTTTCT 
CCACCTCTCT 
AGTCTTTAGT 
TCCTAAAATT 
ACAGTAACTT 
TTTTTGCAAG 
ATTTCTAAGA 
TTATAGAGGT 
CTGTATTTGT 
AATAAAACAC 
GAAATAATCT 
TTATAATTGT 
TTTAGTTTAA 
TGTGTTACCT 
AAATACCTTC 
ATGGTTTTTA 
AAAAAAAAAA 



CAAGATGATC 
CTACAAAGTG 
GGACAGTAAT 
TTAACACAGA 
AGGCAGGAAA 
TCATGACATA 
ACTTGTAATT 
ATGGAATTGT 
TAGGAATTCC 
AGCAATTATC 
TCTTCCATAT 
GTTACTTATT 
AGATTTTTAT 
TGAOGTAGTT 
AAGTCATTAA 
ATTTTGAAAG 
TCCAAGGAAT 
AAAAAAAAAA 



AATCTGATGA 
ATCCTCAGCC 
GGTGCAGCAT 
AAGGGGTGGG 
ATCAGTCTAG 
GGATTCTGCC 
TACTTATTAT 
GGTATTTTTT 
AAACTACAGA 
AGGTTTGCTA 
GATATTCAAC 
GTAAATACTG 
ATTTTACTAC 
CATTAGCTGG 
CTTTGTTTCA 
AAGTTTTTAT 
TGCAAAAATA 
AAA 



GGCCAGGAGT 
TTGTGAGCAC 
TGCCTGATGG 
GGGACTCACA 
TTCTGTTATC 
GCCAAATTTA 
GTTTGAACTA 
TCTGTATTGA 
AAATGTTTGT 
GAAATATAAC 
ATTTTACAAC 
CCCTAGTGTC 
TGAGTCAAGT 
TCTTACTCTA 
GCATGTAATT 
GAGAATAACA 
AATATAAATA 



CTTTGCTGAC 
AAGGCAGGAA 
AAATATAGCT 
TCTGAGCATT 
TGTTGATTTC 
TATCATTAAC 
AAATGATTGA 
TTTTAACAGA 
TTTTAGTGTC 
TTTTAATACA 
TGCAGTATTC 
TCCATGGACC 
TTTCTAGTTC 
CCAGTTTTCT 
TTAACTTTTG 
CCTTACCAAA 
TTGCCATTAA 



ATTGAGCAGT 
GAGAATCCAT 
GAGAGCTTAG 
GTTTTCCTCT 
CCATCACCTG 
AATGTGTGCC 
ATTTTACAGT 
AAATTTCAAT 
AAATTTTTAG 
GTAGCCTGTA 
ACCTAAAGTA 
AAATTTATAT 
TGTGTAATTG 
GACATTGTAT 
TGGAAAATAG 
CATTGTTCAA 
AAAAAAAAAA 



Seq ID NO: CI 00 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148.. 4362 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4600 
4860 
4920 
4980 
5040 
5100 
5160 
S220 
5280 
5340 
5363 



1 * 11 21 31 41 51 

I I I I I 1 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATQA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATJA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGOGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT QAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTTGT GAAGTTCTTA CAATGCAACA 960 

ATCTGGTTAT GTCATGCTGA TGGACTACTT ACAAAACAAT TTTCGAGAGC AACAGTACAA 1020 

GTTCTCTAGA CAGGTGTTTT CCTCATACAC TGGAAAGGAA GAGATTCATG AAGCAGTTTG 1080 

TAGTTCAGAA CCAGAAAATG TTCAGGCTGA CCCAGAGAAT TATACCAGCC TTCTTGTTAC 1140 

ATGGGAAAGA CCTCGAGTCG TTTATGATAC CATGATTGAG AAGTTTGCAG TTTTGTACCA 1200 

GCAGTTGGAT GGAGAGGACC AAACCAAGCA TGAATTTTTG ACAGATGGCT ATCAAGACTT 1260 

GGGTGCTATT CTCAATAATT TGCTACCCAA TATGAGTTAT GTTCTTCAGA TAGTAGCCAT 1320 

ATGCACTAAT GGCTTATATG GAAAATACAG CGACCAACTG ATTGTCGACA TGCCTACTGA 1380 

TAATCCTGAA CTTGATCTTT TCCCTGAATT AATTGGAACT GAAGAAATAA TCAAGGAGGA 1440 

GGAAGAGGGA AAAGACATTG AAGAAGGCGC TATTGTGAAT CCTGGTAGAG ACAGTGCTAC 1500 

AAACCAAATC AGGAAAAAGG AACCCCAGAT TTCTACCACA ACACACTACA ATCGCATAGG 1560 

GACGAAATAC AATGAAGCCA AGACTAACCG ATCCCCAACA AGAGGAAGTG AATTCTCTGG 1620 

AAAGGGTGAT GTTCCCAATA CATCTTTAAA TTCCACTTCC CAACCAGTCA CTAAATTAGC 1680 

CACAGAAAAA GATATTTCCT TGACTTCTCA GACTGTGACT GAACTGCCAC CTCACACTGT 1740 

GGAAGGTACT TCAGCCTCTT TAAATGATGG CTCTAAAACT GTTCTTAGAT CTCCACATAT 1800 

GAACTTGTCG GGGACTGCAG AATCCTTAAA TACAGTTTCT ATAACAGAAT ATGAGGAGGA I860 

GAGTTTATTG ACCAGTTTCA AGCTTGATAC TGGAGCTGAA GATTCTTCAG GCTCCAGTCC 1920 

CGCAACTTCT GCTATCCCAT TCATCTCTGA GAACATATCC CAAGGGTATA TATTTTCCTC 1980 

CGAAAACCCA GAGACAATAA CATATGATGT CCTTATACCA GAATCTGCTA GAAATGCTTC 2040 

CGAAGATTCA ACTTCATCAG GTTCAGAAGA ATCACTAAAG GATCCTTCTA TGGAGGGAAA 2100 

TGTGTGGTTT CCTAGCTCTA CAGACATAAC AGCACAGCCC GATGTTGGAT CAGGCAGAGA 2160 

GAGCTTTCTC CAGACTAATT ACACTGAGAT ACGTGTTGAT GAATCTGAGA AGACAACCAA 2220 

GTCCTTTTCT GCAGGCCCAG TGATGTCACA GGGTCCCTCA GTTACAGATC TGOAAATGCC 2280 

ACATTATTCT ACCTTTGCCT ACTTCCCAAC TGAGGTAACA CCTCATGCTT TTACCCCATC 2340 

CTCCAGACAA CAGGATTTGG TCTCCACGGT CAACGTGGTA TACTCGCAGA CAACCCAACC 2400 

GGTATACAAT GAGGCCAGTA ATAGTAGCCA TGAGTCTCGT ATTGGTCTAG CTGAGGGGTT 2460 

GGAATCCGAG AAGAAGGCAG TTATACCCCT TGTGATCGTG TCAGCCCTGA CTTTTATCTG 2520 

TCTAGTGGTT CTTGTGGGTA TTCTCATCTA CTGGAGGAAA TGCTTCCAGA CTGCACACTT 2580 

TTACTTAGAG GACAGTACAT CCCCTAGAGT TATATCCACA CCTCCAACAC CTATCTTTCC 2640 

AATTTCAGAT GATGTCGGAG CAATTCCAAT AAAGCACTTT CCAAAGCATG TTGCAGATTT 2700 

ACATGCAAGT AGTGGGTTTA CTGAAGAATT TGAGACACTG AAAGAGTTTT ACCAGGAAGT 2760 

GCAGAGCTGT ACTGTTGACT TAGGTATTAC AGCAGACAGC TCCAACCACC CAGACAACAA 2820 

GCACAAGAAT CGATACATAA ATATCGTTGC CTATGATCAT AGCAGGGTTA AGCTAGCACA 2880 

GCTTGCTGAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 
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GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA '3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATT T TT TT C TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT S220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 

AAAAAAAAAA AAAAAAAAAA A 5481 

Seg ID NO: C101 DMA Sequence 

Nucleic Acid Accession # : Eos sequence 

Coding sequence: 1..3340 



1 11 21 31 41 51 

I I I I I I 

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AGGCCAGTAA TAGTAGCCAT 1260 

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG GAATCCGAGA AGAAGGCAGT TATACGCCTT 1320 

GTGATCGTGT CAGCGCTGAC TTTTATCTGT CTAGTGGTTC TTGTGGGTAT TCTCATCTAC 1380 

TGGAGGAAAT GCTTCCAGAC TGCACACTTT TACTTAGAGG ACAGTACATC CCCTAGAGTT 1440 

ATATCCACAC CTCCAACACC TATCTTTCCA ATTTCAGATG ATGTCGGAGC AATTCCAATA 1500 

AAGCACTTTC CAAAGCATGT TGCAGATTTA CATGCAAGTA GTGGGTTTAC TGAAGAATTT 1560 

GAGACACTGA AAGAGTTTTA CCAGGAAGTG CAGAGCTGTA CTGTTGACTT AGGTATTACA 1620 

GCAGACAGCT CCAACCACCC AGACAACAAG CACAAGAATC GATACATAAA TATCGTTGCC 1680 

TATGATCATA GCAGGGTTAA GCTAGCACAG CTTGCTGAAA AGGATGGCAA ACTGACTGAT 1740 

TATATCAATG CCAATTATGT TGATGGCTAC AACAGACCAA AAGCTTATAT TGCTGCCCAA 1800 

GGCCCACTGA AATCCACAGC TGAAGATTTC TGGAGAATGA TATGGGAACA TAATGTGGAA 1860 

GTTATTGTCA TGATAACAAA CCTCGTGGAG AAAGGAAGGA GAAAATGTGA TCAGTACTGG 1920 

CCTGCCGATG GGAGTGAGGA GTACGGGAAC TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG 1980 

CTTGCCTATT ATACTGTGAG GAATTTTACT CTAAGAAACA CAAAAATAAA AAAGGGCTCC 2040 

CAGAAAGGAA GACCCAGTGG ACGTGTGGTC ACACAGTATC ACTACACGCA GTGGCCTGAC 2100 

ATGGGAGTAC CAGAGTACTC CCTGCCAGTG CTGACCTTTG TGAGAAAGGC AGCCTATGCC 2160 

AAGCGCCATG CAGTGGGGCC TGTTGTCGTC CACTGCAGTG CTGGAGTTGG AAGAACAGGC 2220 

ACATATATTG TGCTAGACAG TATGTTGCAG CAGATTCAAC ACGAAGGAAC TGTCAACATA 2280 

TTTGGCTTCT TAAAACACAT CCGTTCACAA AGAAATTATT TGGTACAAAC TGAGGAGCAA 2340 

TATGTCTTCA TTCATGATAC ACTGGTTGAG GCCATACTTA GTAAAGAAAC TGAGGTGCTG 2400 

GACAGTCATA TTCATGCCTA TGTTAATGCA CTCCTCATTC CTGGACCAGC AGGCAAAACA 2460 

AAGCTAGAGA AACAATTCCA GCTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 2S20 

GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 2580 

AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 2640 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TATATCATGG 
ACCATCAAGG 
CCTGATGGCC 
ATAAATTGTG 
GAGGAAAAAC 
GAAGTGAGGC 
TTTGAACTTA 
CATGATGAGC 
CAACTAGAAA 
AGGCCAGGAG 
CTTGTGAGCA 
TTGCCTGATG 



GCTATTACCA 
ATTTCTGGAG 
AAAACATGGC 
AGAGCTTTAA 
TTATAATTCA 
ACTTTCAGTG 
TAAGTGTTAT 
ATGGAGGAGT 
AAGAAAATTC 
TCTTTGCTGA 
CAAGGCAGGA 
GAAATATAGC 



GAGCAATGAA 
GATGATATGG 
AGAAGATGAA 
GGTCACTCTT 
GGACTTTATC 
TCCTAAATGG 
AAAAGAAGAA 
GACGGCAGGA 
CGTGGATGTT 
CATTGAGCAG 
AGAGAATCCA 
TGAGAGCTTA 



TTCATCATTA 
GACCATAATG 
TTTGTTTACT 
ATGGCTGAAG 
TTAGAAGCTA 
CCAAATCCAG 
GCTGCCAATA 
ACTTTCTGTG 
TACCAGGTAG 
TATCAGTTTC 
TCTACCTCTC 
GAGTCTTTAG 



CTCAGCACCC 
CCCAACT6GT 
GGCCAAATAA 
AACACAAATG 
CACAGGATGA 
ATAGCCCCAT 
GGGATGGGCC 
CTCTGACAAC 
CCAAGATGAT 
TCTACAAAGT 
TGGACAGTAA 



TCTCCTTCAT 
GGTTATGATT 
AGATGAGCCT 
TCTATCTAAT 
TTATGTACTT 
TAGTAAAACT 
TATGATTGTT 
CCTTATGCAC 
CAATCTGATG 
GATCCTCAGC 
TGGTGCAGCA 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3340 



Seq ID NO: C102 DMA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . .4480 



11 



21 



31 



41 



51 



ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGTT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680 

AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 

ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 

GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860 

GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 

GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 

ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 

ACTGAGGTAA CACCTCATGC TTTTATCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 

CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 

CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 

TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 

GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 

ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580 

TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640 

ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 

GCCCATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760 

GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 

CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 

GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 

TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 

GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 

TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 

GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 

GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360 
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGGGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGAGGC 3S40 
ACAATCTCGG CTCACTGCAA CCTTCCTCTC CCTGGCTTAA CTGATCCTCC TACCTCAGCC 3600 
TCCCGAGTGG CTAGGACTAT ACTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 3660 
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 3720 
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 3780 
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CCCAGCACCC TCTCCTTCAT 3840 
ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 3900 
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CCTGATGGCC 
ATAAATTGTG 
GAGGAAAAAC 
GAAGTGAGGC 
TTTGAACTTA 
CATGATGAGC 
CAACTAGAAA 
AGGCCAGGAG 
CTTGTGAGCA 
TTGCCTGATG 



AAAACATGGC 
AGAGCTTTAA 
TTATAATTCA 
ACTTTCAGTG 
TAAGTGTTAT 
ATGGAGGAGT 
AAGAAAATTC 
TCTTTGCTGA 
CAAGGCAGGA 
GAAATATAGC 



AGAAGATGAA 
GGTCACTCTT 
GGACTTTATC 
TCCTAAATGG 
AAAAGAAGAA 
GACGGCAGGA 
CGTGGATGTT 
CATTGAGCAG 
AGAGAATCCA 
TGAGAGCTTA 



TTTGTTTACT 
ATGGCTGAAG 
TTAGAAGCTA 
CCAAATCCAG 
GCTGCCAATA 
ACTTTCTGTG 
TACCAGGTAG 
TATCAGTTTC 
TCCACCTCTC 
GAGTCTTTAG 



GGCCAAATAA 
AACACAAATG 
CACAGGATGA 
ATAGCCCCAT 
GGGATGGGCC 
CTCTGACAAC 
CCAAGATGAT 
TCTACAAAGT 
TGGACAGTAA 



AGATGAGCCT 
TCTATCTAAT 
TTATGTACTT 
TAGTAAAACT 
TATGATTGTT 
CCTTATGCAC 
CAATCTGATG 
GATCCTCAGC 
TGGTGCAGCA 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4480 



Seq ID NO: CI 03 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . - 4220 

1 11 21 31 41 51 

I I I I 1 I 

ATGCGAATCC TAAAGCGTTT CCTCOCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAACACAT TGAAGAAGGC 1320 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680 

AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 

ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 

GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860 

GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 

GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 

ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 

ACTGAGGTAA CACCTCATGC TTTTACCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 

CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 

CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 

TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 

GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTOGG AGCAATTCCA 2520 

ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580 
TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640 
ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 
GCCTATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760 
GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 
CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 
GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 
TGGCCTGCCG ATGGGAGTGA GGAGTAOGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 
GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 
TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 
GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACAOQAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360 
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGCTCCTG AGCCAGTCAA ATATACAGCA GAGTGACTAT 3540 
TCTGCAGCCC TAAAGCAATG CAACAGGGAA AAGAATCGAA CTTCTTCTAT CATCCCTGTG 3600 
GAAAGATCAA GGGTTGGCAT TTCATCCCTG AGTGGAGAAG GCACAGACTA CATCAA TGCC 3660 
TCCTATATCA TGGGCTATTA CCAGAGCAAT GAATTCATCA TTACCCAGCA CCCTCTCCTT 3720 
CATACCATCA AGGATTTCTG GAGGATGATA TGGGACCATA ATGCCCAACT GGTGGTTATG 3780 
ATTCCTGATG GCCAAAACAT GGCAGAAGAT GAATTTGTTT ACTGGCCAAA TAAAGATGAG 3840 
CCTATAAATT GTGAGAGCTT TAAGGTCACT CTTATGGCTG AAGAACACAA ATGTCTATCT 3900 
AATGAGGAAA AACTTATAAT TCAGGACTTT ATCTTAGAAG CTACACAGGC ATGGAGGAGT 3960 
GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC 4020 
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CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA 4080 

CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA 4140 

AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC 4200 

TGAGAGCTTA GAGTCTTTAG 4220 

Seq ID NO: CI 04 DNA Sequence 

Nucleic Acid Accession tt: XM__002914.6 

Coding sequencer 1..4314 

1 11 21 31 41 51 

I I I I I I 

ATGAAGGATA TCGACATAGG AAAAGAGTAT ATCATCCCCA GTCCTGGGTA TAGAAGTGTG 60 

AGGGAGAGAA CCAGCACTTC TGGGACGCAC AGAGACCGTG AAGATTCCAA GTTCAGGAGA 120 

ACTCGACCGT TGGAATGCCA AGATGCCTTG GAAACAGCAG CCCGAGCCGA GGGCCTCTCT 180 

CTTGATGCCT CCATGCATTC TCAGCTCAGA ATCCTGGATG AGGAGCATCC CAAGGGAAAG 240 

TACCATCATG GCTTGAGTGC TCTGAAGCCC ATCCGGACTA CTTCCAAACA CCAGCACCCA 300 

GTGGACAATG CTGGGCTTTT TTCCTGTATG ACTTTTTCGT GGCTTTCTTC TCTGGCCCGT 360 

GTGGCCCACA AGAAGGGGGA GCTCTCAATG GAAGACGTGT GGTCTCTGTC CAAGCACGAG 420 

TCTTCTGACG TGAACTGCAG AAGACTAGAG AGACTGTGGC AAGAAGAGCT GAATGAAGTT 480 

GGGCCAGACG CTGCTTCCCT GOGAAGGGTT GTGTGGATCT TCTGCCGCAC CAGGCTCATC 540 

CTGTCCATCG TGTGCCTGAT GATCACGCAG CTGGCTGGCT TCAGTGGACC AGCCTTCATG 600 

GTGAAACACC TCTTGGAGTA TACCCAGGCA ACAGAGTCTA ACCTGCAGTA CAGCTTGTTG 660 

TTAGTGCTGG GCCTCCTCCT GACGGAAATC GTGCGGTCTT GGTCGCTTGC ACTGACTTGG 720 

GCATTGAATT ACCGAACCGG TGTCCGCTTG CGGGGGGCCA TCCTAACCAT GGCATTTAAG 780 

AAGATCCTTA AGTTAAAGAA CATTAAAGAG AAATCCCTGG GTGAGCTCAT CAACATTTGC 840 

TCCAACGATG GGCAGAGAAT GTTTGAGGCA GCAGCCGTTG GCAGCCTGCT GGCTGGAGGA 900 

CCCGTTGTTG CCATCTTAGG CATGATTTAT AATGTAATTA TTCTGGGACC AACAGGCTTC 960 

CTGGGATCAG CTGTTTTTAT CCTCTTTTAC CCAGCAATGA TGTTTGCATC ACGGCTCACA 1020 

GCATATTTCA GGAGAAAATG CGTGGCCGCC ACGGATGAAC GTGTCCAGAA GATGAATGAA 1080 

GTTCTTACTT ACATTAAATT TATCAAAATG TATGCCTGGG TCAAAGCATT TTCTCAGAGT 1140 

GTTCAAAAAA TCCGCGAGGA GGAGCGTCGG ATATTGGAAA AAGCTGGGTA CTTCCAGAGC 1200 

ATCACTGTGG GTGTGGCTCC CATTGTGGTG GTGATTGCCA GCGTGGTGAC CTTCTCTGTT 1260 

CATATGACCC TGGGCTTCGA TCTGACAGCA GCACAGGCTT TCACAGTGGT GACAGTCTTC 1320 

AATTCCATGA CTTTTGCTTT GAAAGTAACA CCGTTTTCAG TAAAGTCCCT CTCAGAAGCC 1380 

TCAGTGGCTG TTGACAGATT TAAGAGTTTG TTTCTAATGG AAGAGGTTCA CATGATAAAG 1440 

AACAAACCAG CCAGTCCTCA CATCAAGATA GAGATGAAAA ATGCCACCTT GGCATGGGAC 1500 

TCCTCCCACT CCAGTATCCA GAACTCGCCC AAGCTGACCC CCAAAATGAA AAAAGACAAG 1560 

AGGGCTTCCA GGGGCAAGAA AGAGAAGGTG AGGCAGCTGC AGCGCACTGA GCATCAGGCG 1620 

GTGCTGGCAG AGCAGAAAGG CCACCTCCTC CTGGACAGTG ACGAGCGGCC CAGTCCCGAA 1680 

GAGGAAGAAG GCAAGCACAT CCACCTGGGC CACCTGCGCT TACAGAGGAC ACTGCACAGC 1740 

ATCGATCTGG AGATCCAAGA GGGTAAACTG GTTGGAATCT GCGGCAGTGT GGGAAGTGGA 1800 

AAAACCTCTC TCATTTCAGC CATTTTAGGC CAGATGACGC TTCTAGAGGG CAGCATTGCA 1860 

ATCAGTGGAA CCTTCGCTTA TGTGGCCCAG CAGGCCTGGA TCCTCAATGC TACTCTGAGA 1920 

GACAACATCC TGTTTGGGAA GGAATATGAT GAAGAAAGAT ACAACTCTGT GCTGAACAGC 1980 

TGCTGCCTGA GGCCTGACCT GGCCATTCTT CCCAGCAGCG ACCTGAOGGA GATTGGAGAG 2040 

CGAGGAGCCA ACCTGAGCGG TGGGCAGCGC CAGAGGATCA GCCTTGCCCG GGCCTTGTAT 2100 

AGTGACAGGA GCATCTACAT CCTGGACGAC OCCCTCAGTG CCTTAGATGC CCATGTGGGC 2160 

AACCACATCT TCAATAGTGC TATCCGGAAA CATCTCAAGT CCAAGACAGT TCTGTTTGTT 2220 

ACCCACCAGT TACAGTACCT GGTTGACTGT GATGAAGTGA TCTTCATGAA AGAGGGCTGT 2280 

ATTACGGAAA GAGGCACCCA TGAGGAACTG ATGAATTTAA ATGGTGACTA TGCTACCATT 2340 

TTTAATAACC TGTTGCTGGG AGAGACACCG CCAGTTGAGA TCAATTCAAA AAAGGAAACC 2400 

AGTGGTTCAC AGAAGAAGTC ACAAGACAAG GGTCCTAAAA CAGGATCAGT AAAGAAGGAA 2460 

AAAGCAGTAA AGCCAGAGGA AGGGCAGCTT GTGCAGCTGG AAGAAAAAGG GCAGGGTTCA 2520 

GTGCCCTGGT CAGTATATGG TGTCTACATC CAGGCTGCTG GGGGCCCCTT GGCATTCCTG 2580 

GTTATTATGG CCCTTTTCAT GCTGAATGTA GGCAGCACCG CCTTCAGCAC CTGGTGGTTG 2640 

AGTTACTGGA TCAAGCAAGG AAGCGGGAAC ACCACTGTGA CTCGAGGGAA CGAGACCTCG 2700 

GTGAGTGACA GCATGAAGGA CAATCCTCAT ATGCAGTACT ATGCCAGCAT CTACGCOCTC 2760 

TCCATGGCAG TCATGCTGAT CCTGAAAGCC ATTCGAGGAG TTGTCTTTGT CAAGGGCACG 2820 

CTGCGAGCTT CCTCCCGGCT GCATGACGAG CTTTTCCGAA GGATCCTTCG AAGCCCTATG 2880 

AAGTTTTTTG ACACGACCCC CACAGGGAGG ATTCTCAACA GGTTTTCCAA AGACATGGAT 2940 

GAAGTTGACG TGCGGCTGCC GTTCCAGGCC GAGATGTTCA TCCAGAACGT TATCCTGGTG 3000 

TTCTTCTGTG TGGGAATGAT CGCAGGAGTC TTCCCGTGGT TCCTTGTGGC AGTGGGGCCC 3060 

CTTGTCATCC TCTTTTCAGT CCTGCACATT GTCTCCAGGG TCCTGATTCG GGAGCTGAAG 3120 

CGTCTGGACA ATATCACGCA GTCACCTTTC CTCTCCCACA TCACGTCCAG CATACAGGGC 3180 

CTTGCCACCA TCCACGCCTA CAATAAAGGG CAGGAGTTTC TGCACAGATA CCAGGAGCTG 3240 

CTGGATGACA ACCAAGCTCC TTTTTTTTTG TTTACGTGTG CGATGOGGTG GCTGGCTGTG 33 00 

CGGCTGGACC TCATCAGCAT CGCCCTCATC ACCACCACGG GGCTGATGAT CGTTCTTATG 33 SO 

CACGGGCAGA TTCCCCCAGC CTATGCGGGT CTCGCCATCT CTTATGCTGT CCAGTTAACG 3420 

GGGCTGTTCC AGTTTACGGT CAGACTGGCA TCTGAGACAG AAGCTCGATT CACCTOGGTG 3480 

CAGAGGATCA ATCACTACAT TAAGACTCTG TCCTTGGAAG CACCTGCCAG AATTAAGAAC 3540 

AAGGCTCCCT CCCCTGACTG GCCCCAGGAG GGAGAGGTGA CCTTTGAGAA CGCAGAGATG 3600 

AGGTACCGAG AAAACCTCCC TCTCGTCCTA AAGAAAGTAT CCTTCACGAT CAAACCTAAA 3660 

GAGAAGATTG GCATTGTGGG GCGGACAGGA TCAGGGAAGT CCTCGCTGGG GATGGCCCTC 3720 

TTCCGTCTGG TGGAGTTATC TGGAGGCTGC ATCAAGATTG ATGGAGTGAG AATCAGTGAT 3780 

ATTGGCCTTG CCGACCTCCG AAGCAAACTC TCTATCATTC CTCAAGAGCC GGTGCTGTTC 3840 

AGTGGCACTG TCAGATCAAA TTTGGACCCC TTCAACCAGT ACACTGAAGA CCAGATTTGG 3900 

GATGCCCTGG AGAGGACACA CATGAAAGAA TGTATTGCTC AGCTACCTCT GAAACTTGAA 3960 

TCTGAAGTGA TGGAGAATGG GGATAACTTC TCAGTGGGGG AACGGCAGCT CTTGTGCATA 4020 

GCTAGAGCCC TGCTCCGCCA CTGTAAGATT CTGATTTTAG ATGAAGCCAC AGCTGCCATG 4080 

GACACAGAGA CAGACTTATT GATTCAAGAG ACCATCCGAG AAGCATTTGC AGACTGTACC 4140 

ATGCTGACCA TTGCCCATCG CCTGCACACG GTTCTAGGCT CCGATAGGAT TATGGTGCTG 4200 
GCCCAGGGAC AGGTGGTGGA GTTTGACACC CCATCGGTCC TTCTGTCCAA CGACAGTTCC 4260 
CGATTCTATG CCATGTTTGC TGCTGCAGAG AACAAGGTCG CTGTCAAGGG CTAG 4314 
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Seq ID NO: CIO 5 DNA Sequence 

Nucleic Acid Accession #: NM_00S688.1 

Coding sequence: 126.. 4439 

5 1 11 21 31 41 51 

I I I I 1 1 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT OCCCAGTCCT GGGTATAGAA 1B0 

10 GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATCCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

15 CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

20 TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

25 GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

30 CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA G T TTG TTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

35 ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC I860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GAOGCTTCTA GAGGGCAGCA 1980 

40 TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

45 TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

50 AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAAOGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

55 CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

60 GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTOGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3460 

65 TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAAOGCAG 3720 

A AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

70 CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

75 TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAAOGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GOCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGA G TTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

80 GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 
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GGGAACOGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTQG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCAOG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGOGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGOGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGO CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTOGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 

AAAAAAAAAA AAAAAAAA 5838 

Seq ID NO: C106 DNA Sequence 
Nucleic Acid Accession ft: NM_005562 
Coding sequence: 90.. 3671 

1 11 21 31 41 51 

I I I I I I 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACXX3 GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCGAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATOGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG QAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACGGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGOO GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTOATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2B20 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAAOG 3360 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 
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AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 34 BO 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

- AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

!> AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

If! ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

1U ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GT TCTG GACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGOGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
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15 
20 
25 



30 
35 
40 
45 
50 
55 
60 



Seq ID NO: C107 DNA Sequence 
Nucleic Acid Accession 8: NM_021101 
Coding sequence: 221.. 856 

1 11 21 31 41 51 

I I I I I I 

GAATTCGGCA CGAGGCCTCG TGCCGGGGAG CAACCTCAGC TTCTAGTATC CAGACTCCAG 60 

CGCCGCCCCG GGCGCGGACC CCAACCCCGA CCCAGAGCTT CTCCAGCGGC GGCGCAGCGA 120 

GCAGGGCTCC CCGCCTTAAC TTCCTCCGCG GGGCCCAGCC ACCTTCGGGA GTCCGGGTTG 180 

CCCACCTGCA AACTCTCCGC CTTCTGCACC TGCCACCCCT GAGCCAGCGC GGGCGCCGGA 240 

GCGAGTCATG 6CCAACGCGG GGCTGCAGCT GTTGGGCTTC ATTCTCGCCT TCCTGGGATG 300 

GATCGGCGCC ATCGTCAGCA CTGCCCTGCC CCAGTGGAGG ATTTACTCCT ATGCCGGCGA 360 

CAACATCGTG ACCGCCCAGG CCATGTACGA GGGGCTGTGG ATGTCCTGCG TGTCGCAGAG 420 

CACCGGGCAG ATCCAGTGCA AAGTCTTTGA CTCCTTGCTG AATCTGAGCA GCACATTGCA 480 

AGCAACCCGT GCCTTGATGG TGGTTGGCAT CCTCCTGGGA GTGATAGCAA TCTTTGTGGC 540 

CACCGTTGGC ATGAAGTGTA TGAAGTGCTT GGAAGACGAT GAGGTGCAGA AGATGAGGAT 600 

GGCTGTCATT GGGGGCGCGA TATTTCTTCT TGCAGGTCTG 6CTATTTTAG TTGCCACAGC 660 

ATGGTATGGC AATAGAATCG TTCAAGAATT CTATGACCCT ATGACCCCAG TCAATGCCAG 720 

GTACGAATTT GGTCAGGCTC TCTTCACTGG CTGGGCTGCT GCTTCTCTCT GCCTTCTGGG 780 

AGGTGCCCTA CTTTGCTGTT CCTGTCCCCG AAAAACAACC TCTTACCCAA CACCAAGGCC 840 

CTATCCAAAA CCTGCACCTT CCAGCGGGAA AGACTACGTG TGACACAGAG GCAAAAGGAG 900" 

AAAATCATGT TGAAACAAAC CGAAAATGGA CATTGAGATA CTATCATTAA CATTAGGACC 960 

TTAGAATTTT GGGTATTGTA ATCTAAAGTA TGTTATTACA AAACAAACAA ACAAACAAAA 1020 

AACCCATGTG TTAAAATACT CAGTGCTAAA CATGGCTTAA TCTTATTTTA TCTTCTTTCC 1080 

TCAATATAGG AGGGAAGATT TTTCCATTTG TATTAGTGCT TCCCATTGAG TAATCATACT 1140 

CAAATGGGGG AAGGGGTGCT CCTTAAATAT ATATAGATAT GTATATATAC ATGTTTTTCT 1200 

ATTAAAAATA GCCAGTAAAA AAAAAAAAAA AAAAAAA 1237 

Seq ID NO: C10 8 DNA Sequence 
Nucleic Acid Accession #: AF508964.1 
Coding sequence: 98.. 1531 

s c 1 11 21 31 41 SI 

"5 | | | | | | 

CAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGTGGG GGAAAGCGGC OGGTGCAGCG 60 

CGGGGACAGG CACTCGGGCT GGCACTGGCT GCTAGGGATG TCGTCCTGGA TAAGGTGGCA 120 

TGGACCCGCC ATGGCGCGGC TCTGGGGCTT CTGCTGGCTG GTTGTGGGCT TCTGGAGGGC 180 

CGCTTTCGCC TGTCCCACGT CCTGCAAATG CAGTGCCTCT CGGATCTGGT GCAGCGACCC 240 

TTCTCCTGGC ATCGTGGCAT TTCCGAGATT GGAGCCTAAC AGTGTAGATC CTGAGAACAT 300 

CACCGAAATT TTCATCGCAA ACCAGAAAAG GTTAGAAATC ATCAAGGAAG ATGATGTTGA 360 

AGCTTATGTG GGACTGAGAA ATCTGACAAT TGTGGATTCT GGATTAAAAT TTGTGGCTCA 420 

TAAAGCATTT CTGAAAAACA GCAACCTGCA GCACATCAAT TTTACCCGAA ACAAACTGAC 480 

GAGTTTGTCT AGGAAACATT TCCGTCACCT TGACTTGTCT GAACTGATCC TGGTGGGCAA 540 

TCCATTTACA TGCTCCTGTG ACATTATGTG GATCAAGACT CTCCAAGAGG CTAAATCCAG 600 

TCCAGACACT CAGGATTTGT ACTGCCTGAA TGAAAGCAGC AAGAATATTC CCCTGGCAAA 660 

CCTGCAGATA CCCAATTGTG GTTTGCCATC TGCAAATCTG GCCGCACCTA ACCTCACTGT 720 

GGAGGAAGGA AAGTCTATCA CATTATCCTG TAGTGTGGCA GGTGATCCGG TTCCTAATAT 780 
GTATTGGGAT GTTGGTAACC TGOPTTTCCAA ACATATGAAT GAAACAAGCC ACACACAGGG 840 
CTCCTTAAGG ATAACTAACA TTTCATCCGA TGACAGTGGG AAGCAGATCT CTTGTGTGGC 900 
GGAAAATCTT GTAGGAGAAG ATCAAGATTC TGTCAACCTC ACTGTGCATT TTGCACCAAC 960 
TATCACATTT CTCGAATCTC CAACCTCAGA CCACCACTGG TGCATTCCAT TCACTGTGAA 1020 
AGGCAACCCC AAACCAGCGC TTCAGTGGTT CTATAACGGG GCAATATTGA ATGAGTCCAA 1080 
ATACATCTGT ACTAAAATAC ATGTTACCAA TCACACGGAG TACCACGGCT GCCTCCAGCT 1140 
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GGATAATCCC ACTCACATGA ACAATGGGGA CTACACTCTA ATAGCCAAGA ATGAGTATGG 1200 

GAAGGATGAG AAACAGATTT CTGCTCACTT CATGGGCTGG CCTGGAATTG ACGATGGTGC 1260 

AAACCCAAAT TATCCTGATG TAATTTATGA AGATTATGQA ACTGCAGCGA ATGACATOGG 1320 

GGACACCACG AACAGAAGTA ATGAAATCCC TTCCACAGAC GTCACTGATA AAACCGGTCG 1380 

GGAACATCTC TCGGTCTATG CTGTGGTGGT GATTGCGTCT GTGGTGGGAT TTTGCCTTTT 1440 

GGTAATGCTG TTTCTGCTTA AGTTGGCAAG ACACTCCAAG TTTGGCATGA AAGGTTTTGT 1500 

TTTGTTTCAT AAGATCCCAC TGGATGGGTA GCTGAAATAA AGGAAAAGAC AGAGAAAGGG 1560 

GCTGTGGTGC TTGTTGGTTG ATGCTGCCAT GTAAGCTGGA CTCCTGGGAC TGCTGTTGGC 1620 

TTATCCOGGG AAGTGCTGCT TATCTGGGGT TTTCTGGTAG ATGTGGGCGG TGTTTGGAGG 1680 

CTGTACTATA TGAAGCCTGC ATATACTGTG AGCTGTGATT GGGGAACACC AATGCAGAGG 1740 

TAACTCTCAG GCAGCTAAGC AGCACCTCAA GAAAACATGT TAAATTAATG CTTCTCTTCT 1800 

TACAGTAGTT CAAATACAAA ACTGAAATGA AATCCCATTG GATTGTACTT CTCTTCTGAA 1860 

AAGTGTGCTT TTTGACCCTA CTGGACATTT ATTGACTTAA TTGCTTCTGT TTATTAAAAT 1920 

TGACCTGCAA AGTTAAAAAA AAATTAAAGT TGAGAACAGG TATAAGTGCA CACTGAATAG 1980 

TCTAATCTAC ATGTAACACA TATTTTAGTA TGATTTTCTA TACTCTAATC AGCACTGAAT 2040 

TCAGAGGGTT TGACTTTTTC ATCTATAACA CAGTGACTAA AAGAGTTAAG GGTATATATA 2100 

CCATCACTTT GGGACTTGGT AGTATTATTA AAAGGTTATT TCCTTCACTG TCAATAAAAG 2160 

TCCAAATGTT TAGCTTAGGT CTGAGAGTCA AACAATGTTA AGGATTGTCT TAAAGTTCCT 2220 

TAGCCAGCAA AACAAAACAA AACAAAACAA ACAAATGAAA AACGTTTAAA AAGAAGAAGA 2280 

AGAAAAAAAA CAAGAACAAG CAGCAACAGC TGTTTTGTTG GGGCTATAGA TTTAAGTTAG 2340 

GCATAGTCAA TTTCAGAATA ACTAAGAGTG GAATATATGC ATATGGTGAA ATTATAACCT 2400 

TGCCCTTTTT TATTTGCCCT CTGCGATCCA CCTGGCTTTT TAGAAGTCTG CCGAGTGAGA 2460 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 2520 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 2580 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 2640 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 2700 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATGC CCCCTGAGTC TGACCCATGG 2760 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 2820 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CCACCCCACC CACCCACCCT 2880 

GTTCCTTTTT TATCAGGAGG ACTTCAGAGC CAGGCCTGCA GCATTTTGTT TGAAAACACA 2940 

ATCAGCTCTG ACAGTTAGAC ATGCACACAG ACGCCATAGC TGGATTGGAA ACATTGATGT 3000 

TTTAAAAATT TATTTTTTTT GGAAATAGTT GCACAAATGC TGCAATTTAG CTTTAAGGTT 3060 

CTATAGATTT TTAACTAGTC CAACACAGTC AGAAACATTG TTTTGAATCC TCTGTAAACC 3120 

AAGGCATTAA TCTTAATAAA CCAGGATCCA TTTAGGTACC ACTTGATATA AAAAGGATAT 3180 

CCATAATGAA TATTTTATAC TGCATCCTTT ACATTAGCCA CTAAATACGT TATTGCTTGA 3240 

TGAAGACCTT TCACAGAATC CTATGGATTG CAGCATTTCA CTTGGCTACT TCATACCCAT 3300 

GCCTTAAAGA GGGGCAGTTT CTCAAAAGCA GAAACATGCC GCCAGTTCTC AAGTTTTCCT 3360 

CCTAACTCCA TTTGAATGTA AGGGCAGCTG GCCCCCAATG TGGGGAGGTC CGAACATTTT 3420 

CTGAATTCCC ATTTTCTTGT TCGOGGCTAA ATGACAGTTT CTGTCATTAC TTAGATTCCG 3480 

ATCTTTCCCA AAGGTGTTGA TTTACAAAGA GGCCAGCTAA TAGCAGAAAT CATGACCCTG 3540 

AAAGAGAGAT GAAATTCAAG CTGTGAGCCA GGCAGGAGCT CAGTATGGCA AAGGTTCTTG 3600 

AGAATCAGCC ATTTGGTACA AAAAAGATTT TTAAAGCTTT TATGTTATAC CATGGAGCCA 3660 

TAGAAAGGCT ATGGATTGTT TAAGAACTAT TTTAAAGTGT TCCAGACCCA AAAAGGAAAA 3720 

ATAAAAAAAA AGGAATATTT GTACCCAACA GCTAGAAGGA TTGCAAGGTA GATTTTTGTT 3780 

TTAAAATGGA GAGAAGTGGA CAGATAAGGC CATTTAATAT ATCAAAGATC AGTTGACATC 3840 

TCTAGGGAAT GATGAAACAG CAGGCTATTA GAAAATTATT TCATATAGTT CTCGTGTTCT 3900 

TTTCTTTTTT TTAATCCCTG AAGGGAGATC AGTAACATAG CTTCTCTTTT CTGTACTCTA 3960 

GACCACCCCT TTTCATCATT TTGCTTTTTA TGTCTCCCAT AAGAAATGTG CTTTTTAGAG 4020 

CTTCCTAATG CATGTGTTGC ATTATTGCAG CATTAGAAAA GGAGAGGTAG CATTTTTGCT 4080 

GAAATCGGGC CTGTCACTCT CCAATAAAGG TTCTGGCACT TCAATGCCAG GCAGGTCTCC 4140 

TAAATGAACA GAATGATCTG TGTGAGCCGA TGCCTGCCCT TCCAGAGGGG CCACTGTCCC 4200 

CAGCCGCAGC CAACTGTGTC CCACAGGAAT GGGAGCCTAG GTTTCCAAAT CTTGTGATTC 4260 

TTTAGGAGAA ACATGAAACC TGGATTTCGT GTGAAATGTC CCGATTGTTA AAAAGTTGGC 4320 

TCAATTATTT TTAAAACATT TTGTAAGCCA ACAAAAGTCT GTGGGCTGCC AGTTTATTAC 4380 

TTTTGTCTTA AAACATGATC ATTGTTCTCT CACGGTATCC TTCTGTCTTC CCGTTGCAAA 4440 

TTCACTTTTC TTTCTTCCTG ACATTGCCAT TGAGGGCTTT GTTACCACAA GCTAAGAAAC 4500 

TGAGTTTAAC AGCCCAGTTA TCTGCAACAT GTCAATTAOC TTTGCTCCTC TCCTGTGATT 4560 

COCACCATGC TGTGACCCTC AGCTGTCTCC CTTTGCTGGG AATTCTGCAC CAATGTCTTC 4620 

CCCTCAACCC ATTCCCTGGT TGGTCCTACT CCCGTGTGGC CAGAGACATC CTAGCAAATC 4680 

CTTCCTCCTA TTATATCTGA CACTAATTTC TTTTCAACAG CGCTCATGTC TCTTGGCOCA 4740 

GTCAGGTGCT GCCAGGTTTA GATAGGAAAG TACATGTCCC ATTTTCATGG GTGCCCTTAA 4800 

TGTGGTCCAC GTCCTATATC TTATTATATT TACTCATGGC TCAATGGGGG CCTCCAGAGA 4860 

CCCTCTCAGG CTGCTGAGCT AGACTAAGGA ATGCATCCAC TGTCATCACA TGAGACACTG 4920 

ACTCTGTGAC GACAAAAGTA CAAACAGTCT GAGGCTAAGA AAGGTTCATC TCACAACAGG 4980 

AAAAACAAAT CTCAACACAC ATTAGAGATA ATTGATTCAG GGGTTTTCTC TCCCAGTCTC 5040 

CCAGCAGGGA CTGATTTCAT TTCTGACCCA CTAGGTTTTC TTTCCAGAAA TAGGTAGCAA 5100 

GGACAAGAAC TAAACAATCC CAGCCCCACC CAGCAACACA GAACACAGGA GTTTGCTTTT 5160 

GGCTTCTCAC TCTCCAAGTA ACCCTGAATT AGGCCCAGAA TGGCTGAGGC TTGGAGCATC 5220 

TCCTCAGACA GAGCAGAGGC GACACCTCTT CAGGGGTGTG TGGAGTAAAT AGCTCGAAGA 5280 

GCTGAAGACA GAAAACCAGT TTCACGCCAG GTGCGAGAGA GAGCATAATG GAGGGAAGCC 5340 

CGCTTTCTCT CTCCTCTTCT TTTCTCTTTA TTTCTTTAGA GCACTTGACT TTTTTTTCTC 5400 

TCTCTCTCTA GTATTCTAAA CTGACCCCAT GACCAACTGA GAATTTATTT TTGTTTCATT 5460 

GGTTGTTTCA CAGAATTAGA ACACACACGA CTTTTTATTC CTCCATTGCA AAATGGAATC 5520 

AAGATACTAC ACAAGACCTG TGCTTTCTTC CTTTGCATGA TTTACACCTC CGCCTGTTTT 5580 

GGTGCTAGCT GTCTAGAACT TCTCTCTTGG TTTGAATCTG ATTCCTTCAC ACTACACTAG 5640 

AAGTTTATTT CATCTTGTTT TGTCTAGACT CCAGATACAG AGGGACAGCT GGACTGAGGA 5700 

CAAGCAATTC CATCTAGCAT AGGGTCTCTC AGGGTTGGTG CATCCAGCCA CATGGGCAGG 5760 

GCCAGTCACA TCTAGTCTAT GTCCCCAGAG CCCTTGGAGT TGCGCAGCTT AGCTGACTTG 5820 

ACTCCAAGGA AATTAGTACA GAAGTAACCA CTCTATTAAG TGTGTTCTGC TATGTTCACA 5880 

TGCCTGTAGT ACCTGCAAAC CATGCCAGGT TCATCTAAAG ACATAGGGGA AGATTAAGGA 5940 

CTCTTTTGGA CAGACCATGA ATTGAATTTG CTGCCAGGTG CTGCCA6ACT GAATTTGGCT 6000 

GACAGAACTC CCAGCCCAGG AAAGTTCCAT GACAATGACT GTCGCAGAAG GAAATTTCCC 6060 

ACTAAAGTCA GTCCATTTTC AAGTTTTGGT CTTCAGAGAC AAAAGAACGT CCCAGCCACC 6120 

TGATTTTGAT GGTGAGGTAA CTCTAAGTTG AATTCAGGCT AGTGTTGCAG TATAGCTTTG 6180 
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GCATGTTCAT GAGTGAGCAC CCAGAATGTG TTGAACCAAC CCCCAOCCCT AACTACTGAC 6240 

TAT GACTGCA GTGGGTTTTT ATGGGGAAAA AAAGTGTGAA AAGCAAAAAG AAAGGAACAG 6300 

AGATTTTTTA TCACCTTTAT TGTAAGACAG TCCATTTATG AATTGAGTAT AAACACATAC 6360 

AAAGTAACAA GAGATTCCTA AGAAACGCAA ATCCTTGAGT TTCACGCACT TCATGTTCAA 6420 

CCATTTGCTG TAATCCAGAG GCAGCCTGTG AATCATTCTC ATGCCCTGTT Tl'lTTTTT TT 6480 

TTTTCC TATA ATGTTCTGGG TTTAAAAGCC ATCTTTTCCA CATTTTCTGT AAATAATGGA 6540 

TAATCATTTT AAAAAGTTTT ATTTTTAGTG CTGTTTTAAC AATGTAGATA GATCATAAAT 6600 

GTACTTGCTG AATTCAATCA TTTTTAACAA GCCAATAAAG TTTGATAATT CAAAAAAAAA 6660 

AAAAAAAAAA AAAAAAAAA 6679 

Seq ID NO: C109 DNA Sequence 
Nucleic Acid Accession #t NM_006180,l 
Coding sequence : 352 . . 2820 

15 1 ii 21 31 41 51 
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CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGOGC GCOGGGCCAT GCAGCGACGG CCGCCGOGGA GCTCCGAGCA GOGGTAGCGC 120 

^ CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

ZD TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GAT CCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

JU ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT' CCTGTAGTGT GGCAGGTGAT 1020 

~c CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

J J AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG • 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

Ap. TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

AS. GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

4!> GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

_ - CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCQ TGCTGATGGC TGAGGGCAAC 2280 

3D CC6CCCACGG AACTGAOGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTOGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

OU TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA OGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGC CCTTTT C CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

OD CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

rjft ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

t\) CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TT TATTA TTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

_ AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

O ATTTATTATG AACCGCAATA TGGQAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 

TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 3707 

80 seq ID NO: C110 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 939.-2372 
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AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA 06GTTTGTCA CCCGACCCTC COGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCOGAGGCTC TGCCOGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCAOGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACOGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGOGT OGAACGGAGG AACGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480 

CGCTCGCGCT CTACGCGCTC AGTCCCOGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540 

GGGGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC OGGTCGGTGC CCGGCGGGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTOG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCGCTTTOGC CTGTCCCAOG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

1\J CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA IS 00 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACAOGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAQATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAAOGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 

TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AQAAACATGC CGCCAGTTCT CAAGTTTTCC 4200 

TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CCGAACATTT 4260 

TCTGAATTCC CATTTTCTTG TTOGGGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 4380 

GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 4440 

GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 

ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 

AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 

TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4680 

CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 4711 
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Coding sequence: 1..1746 

1270 



25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



WO 03/042661 PCT/US02/36810 



1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

0 GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCOCAGGT GGAGTGCACC 120 

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCOGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

1 n GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

1U TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

10 GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCOC CGGGTGCCTT CAACGGGCTA 1020 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

„ CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 

ZD TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCGA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

^ TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

J\J ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 
TGTTAA 
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GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGOGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

. TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

4j GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC GGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGOGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

£ 0 TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTAOG GCTCTGAAGT 1140 

OU CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

- AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

OD TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCOGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

/U AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT I860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

/J CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATA TTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 

ATCAATAAAA TGTGATTTTT CTGA 2304 
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A^ A ^ TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60 
C ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 
^S^ AC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 24 0 
rAirJiJ!^ CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300 
^iT^t ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360 
1 0 S^^ AC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420 
IH™^ ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480 
GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT S40 
AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600 
^CCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660 
1 5 EIS5iS2 CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720 
55^1^ AAAC <3CATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAOAGAAATT 780 
r** 0 ™ T AGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840 
nS^ CTA E A jS GATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CTGGTATCAA 900 
^ATC^A CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960 
20 ^11^ AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020 
^S^^ ^ AGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TtSgcATGG 1080 
r^IS^ GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140 
S^^II^ AGAAATTGTT CAGCTGTATT ATGCTGTCQT ATGGCTCCAC TGCAGAAAGC 1200 
2^™™* AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260 
25 TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320 

n^^S 2^1°^ GAAACAGTQA CTATGCAATA GCCAGATTTA AOTTCCTCTC 1380 

SI^SS ^T rTIA ATAGCTACGC TTGTACAGTA 1440 

^£2^ AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 1500 

2!3ESI^G CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560 

OA TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620 

30 AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA ^S^S ^80 

IS^SS A ^ TCCTGG ^C^TCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740 

^ A J^ TACRT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800 

t^S^S ?!SH CACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860 

IS l^I^SS. ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920 

TaSSSIS n^^S GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980 

TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040 

J^TTGA TA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100 

T^CTGAAAAG GCACAGCTTA CTGAAACAAA TCCAGGTATC AAGTGCTTGG ACTCCATGTG 2160 

40 5!5 TTTCCCG GAAC5GAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220 

4U AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280 

CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT A^AG^GG^ 2340 

TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

nl^^S JAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

^tH^l GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 2520 

^IZ^t ^ T ™ UA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

^^t™ AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

ACTCTTCTAT ^AAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

aS^™? TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820 

^^AGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 

AA j^ AGGAQ AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

JSSSSS 2^?** CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCOGT TCACCGAACT TCAAATTGAT 3060 

<<Z ^™5 rAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

' 7^™™. GTCTTTATCA TTGGATCTTT TGCATGCTTT aK££ 3^80 

A ^5™ TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

60 SSSSSS AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

ctS^ SIE^ TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

^J^I? ^fHSl ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

I AA Z^Sr A ^E ACT<S ■TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

65 5^^— GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATOTTGTC 3720 

^ AAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTAWLCAT ATTAAGATAA TGGTTAAATG OGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

70 SSESS ISS AACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

l^Z^Tl t* 3 ™*™** GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

75 AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTAtS 4320 

^GA^S^ Z™**™ ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

JSSEJS S^^^ GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

80 S^f^ A r A !, CTI ^ GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

OU CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTOC ATATTtSt AAAATTCTTT till 

CTTTOaAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

^^S: TitHS 01 ^ TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTCCCCCTCA 4920 
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GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA S040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

J CAGAGGCAAA TCAACCCTAG GAAATACTTQ CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC AT GT TGGT GT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

1U CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

A ATCA TGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT S700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 

GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 5855 

Seq ID NO: CI 14 dna Sequence 
Nucleic Acid Accession #: XM_087461.l 
ZVJ Coding sequence: 236.. 1138 

1 11 21 31 41 51 

I I I I I I 

CCGGCGCCGG GGGCGGCGGG GAACCCCAAA CGCAACCGGG TCTGGAGGGA TCCCCGCGCC 60 

GAGCCAGCCG CCGTCACCGC CTCCGCGCOG CCCCTGCGGG CTTGGCAGGC GCCCGGCGCG 120 

CCCGCACTGC GCCCGGCCGC CGGCTCCCGC GGTCCCACCG TGAGCTCGCC GGCCCGTCGC 180 

CCGCTCGCCA TGCAACCGCC GCCGGCCTCG CGCGCGTAGG CGCCCGCCGC AGGCCATGCT 240 

GCCCCTGCTC GCCGCGCTCC TGGCCGCCGC CTGCCCGCTG CCGCCCGTCC GCGGCGGGGC 300 

CGCGGACGCG CCCGGCCTCC TCGGGGTGCC CTCCAATGCT TCAGTCAACG CGTCCTCCGC 360 

GGCGAQCCCA TCGCCCCGCG GCTGCTGGCC TCGGCGGCCC CCGGGCCCCC CGAGOGCCCG 420 

GGCCCGGAGG AGGCGGCGGC GGCGGOGGCG CCTGTGCAAC ATCAGCGTGC AGOGGCAGAT 480 

GCTGAGCTCG CTGCTCGTGC GCTGGGGCCG CCCGCGGGGC TTCCAGTGCG ACCTACTGCT 540 

CTTCTCCACC AACGCGCACG GCCGOGCTTT CTTCGCOGCC GCCTTCCACC GCGTCGGGCC 600 

GCCGCTGCTC ATCGAGCACC TGGGGCTGGC GGCGGGCGGC GCGCAGCAGG ACCTGCGCCT 660 

CTGCGTGGGC TGCGGCTGGG TGCGCGGTCG CCGCACCGGC CGCCTCCGGC CCGCCGCCGC 720 

CCCCAGCGCC GCCGCCGCCA CCGCCGGGGC GCCCACOGCG CTGCCAGCCT ACCCCGCGGC 780 

CGAGCCGCCC GGGCCGCTGT GGCTGCAGGG CGAGCCGCTG CATTTCTGCT GCCTAGACTT 840 

CAGCCTGGAG GAGCTGCAGG GCGAGCCGGG CTGGCGGCTG AACCGTAAGC CCATTGAGTC 900 

CACGCTGGTG GCCTGCTTCA TGACCCTGGT CATCGTGGTG TGGAGCGTGG CCGCCCTCAT 960 

CTGGCCGGTG CCCATCATCG CCGGCTTCCT GCCCAACGGC ATGGAACAGC GCCGGACCAC 1020 

CGCCAGCACC ACCGCAGCCA CCCCCGCCGC AGTGCCCGCA GGGACCACCG CAGCCGCCGC 1080 

CGCCGCCGCC GCTGCCGCCG CCGCCGCGGC CGTCACTTCG GGGGTGGCGA CCAAGTGACC 1140 

CGCTCCGCTC CTCCCTGTGT CCGTCCTGTG TCCGCGCGOG CGGGTGCCTT TCCCGCCGGA 1200 

GACTCGGCCG GTGTGCTTCG TGCTGTAGTT ATCGTTAGTT CCTCTTCCCG AGATGGGGCC 1260 

GCCGAGAGAC CCCAGCGCCT TTGAAAAGCA AGGTTTGTGC TGCGCTTCCA GTTCCGAAAA 1320 

GCAGATGTTT AAGCCCTTGG ACTGAGGGTG GGATCGCAGC TCCGAAGACG GAGAGGAGGG 1380 

AAATGGGGCC CTTTCCCCTC TATTGCATCC CCCTGCCCGA CTCCTTCCCC GCACCCACGT 1440 

GCCCTAGATT CATGGCAGAA AATGACCAAA TCCTGTGTAT TTGTTTTATA TATTTAATAA 1500 

CTGTTTTAAA TGAAAGTTTT AGTAAAAAAA ATACAAAACA AAAAGATTAA ATTGCTATTG 1560 

CTGTAGTAAG AGAAGCTCTT TGTATCTGAA CATAGTTGTA TTTGAAATTT GTGGTTTTTT 1620 

AATTTATTTA AAATTGGGGG GAGGGCATGG GAAGGATTTA ACACCGATAT ATTGTTACCG 1680 

CTGAAAATGA ACTTTATGAA CCTTTTCCAA GTTGATCTAT CCAGTGACGT GGCCTGGTGG 1740 

GCGTTTCTTC TTGTACTTAT GTGGTTTTTT GGCTTTTAAT ACAGACATTT TCCTCC 1796 

55 Seq ID NO: C115 DNA Sequence 

Nucleic Acid Accession ft: XM_051522.4 
Coding sequence: 127., 1215 "~ 

y- n 1 11 21 31 41 51 

60 | | | | | i 

ACAGTTGTTG CAAAGTGCTC AGCACTAAGG GAGCCAGCGC ACAGCACAGC CAGGAAGGOG 60 

AGOGAGCCCA GCCAGCCCAG CCAGCCCAGC CAGCCCGGAG GTCATTTGAT TGCCCGCCTC 120 

AGAACGATGG ATCTGCATCT CTTCGACTAC TCAGAGCCAG GGAACTTCTC GGACATCAGC 180 

TGGCCATGCA ACAGCAGCGA CTGCATCGTG GTGGACACGG TGATGTGTCC CAACATGCCC 240 

AACAAAAGCG TCCTGCTCTA CACGCTCTCC TTCATTTACA TTTTCATCTT OGTCATCGGC 300 

ATGATTGCCA ACTCCGTGGT GGTCTGGGTG AATATCCAGG CCAAGACCAC AGGCTATGAC 360 

ACGCACTGCT ACATCTTGAA CCTGGCCATT GCCGACCTGT GGGTTGTCCT CACCATCCCA 420 

GTCTGGGTGG TCAGTCTOGT GCAGCACAAC CAGTGGCCCA TGGGCGAGCT CACGTGCAAA 480 

GTCACACACC TCATCTTCTC CATCAACCTC TTCGGCAGCA TTTTCTTCCT CACGTGCATG 540 

AGCGTGGACC GCTACCTCTC CATCACCTAC TTCACCAACA CCCCCAGCAG CAGGAAGAAG 600 

ATGGTAOGCC GTGTCGTCTG CATCCTGGTG TGGCTGCTGG CCTTCTGCGT GTCTCTGCCT 660 

GACACCTACT ACCTGAAGAC CGTCACGTCT GCGTCCAACA ATGAGACCTA CTGCCGGTCC 720 

TTCTACCCCG AGCACAGCAT CAAGGAGTGG CTGATCGGCA TGGAGCTGGT CTCCGTTGTC 780 

TTGGGCTTTG CCGTTCCCTT CTCCATTATC GCTGTCTTCT ACTTCCTGCT GGCCAGAGCC 840 
ATCTCGGCGT CCAGTGACCA GGAGAAGCAC AGCAGCCGGA AGATCATCTT CTOCTACGTG 900 

GTGGTCTTCC TTGTCTGCTG GCTGCCCTAC CAOGTGGCGG TGCTGCTGGA CATCTTCTCC 960 

ATCCTGCACT ACATCCCTTT CACCTGCCGG CTGGAGCACG CCCTCTTCAC GGCCCTGCAT 1020 

GTCACACAGT GCCTGTCGCT GGTGCACTGC TGCGTCAACC CTGTCCTCTA CAGCTTCATC 1080 

AATCGCAACT ACAGGTACGA GCTGATGAAG GCCTTCATCT TCAAGTACTC GGCCAAAACA 1140 

GGGCTCACCA AGCTCATCGA TGCCTCCAGA GTCTCAGAGA CGGAGTACTC TGCCTTGGAG 1200 
CAGAGCACCA AATGATCTGC CCTGGAGAGG CTCTGGGACG GGTTTACTTG TTTTTGAACA 1260 
GGGTGATGGG CCCTATGGTT TTCTAGAGCA AAGCAAAGTA GCTTCGGGTC TTGATGCTTG 1320 
AGTAGAGTGA AGAGGGGAGC ACGTGCCCCC TGCATCCATT CTCTCTTTCT CTTGATGACG 1380 
CAGCTGTCAT TTGGCTGTGC GTGCTGACAG TTTTGCAACA GGCAGAGCTG TGTCGCACAG 1440 
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CAGTGCTGTG 
TTTCTGTGTT 
TCTCACTATT 
TGGGAGGCAT 
AGTTTTGACT 
ATATAAATAT 
ATCTGTGTGG 
TTGTGACATT 
TGCAAATCTG 
ATTTTTTTTT 



CGTCAGAGOC 
TCCTGAATTT 
GGTGTACCTT 
AGTGCTGACA 
AAGGATGACA 
ATAAATATAT 
TGTTTTGTAC 
AATAGTATTG 
CACACACAAC 
AATAAAGATT 



AGCTGAGGAC 
TTTATATGGT 
ATAAATGTAT 
TATATTCAOA 
CTAATTGTTA 
GCGAGTCTTG 
CGGCACGGGA 
TAAAGTTACA 
GAACAGTTGC 
TTT G TTT CC r 



AGGCTTGCCT 
GATTTC5TATT 
TTGAAAGTTA 
GTGTTGTAGT 
GCTGTTTTGA 
GCTGAAATGT 
TATGGAACGA 
TTTTAAAATA 
ATTTCAGAGA 



GGACTTCTGT 
TAAATTTTAA 
AATATATTTT 
TTTAAGGTTA 
AATTATATAT 
TTTATTTACC 
AAACTGCTTT 
AACAAAAAAC 
GTTCTCTCAA 



AAGATAGGAT 
GACTTTATTT 
AAATATTGTT 
GOGTGACTTC 
ATATAAATAT 
ATAGTTTTAT 
GTAATGCAGT 
TGTTCTGGAC 
TTTGTAAGTT 



Seq ID NO: CI 16 DNA Sequence 

Nucleic Acid Accession 8: NMJ>00350.1 

Coding sequence: 82.. 6903 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2010 



1 11 21 31 41 51 

1 1 I I I I 

CTGGCTCTTA ACGGCGTTTA TGTCCTTTGC TGTCTGAGGG GCCTCAGCTC TGACCAATCT 
GGTCTTOGTG TGGTCATTAG CATGGGCTTC GTGAGACAGA TACAGCTTTT GCTCTGGAAG 
AACTGGACCC TGCGGAAAAG GCAAAAGATT CGCTTTGTGG TGGAACTCGT GTGGCCTTTA 
TCTTTATTTC TGGTCTTGAT CTGGTTAAGG AATGCCAACC CGCTCTACAG CCATCATGAA 
TGCCATTTCC CCAACAAGGC GATGCCCTCA GCAGGAATGC TGCCGTGGCT CCAGGGGATC 
TTCTGCAATG TGAACAATCC CTGTTTTCAA AGCCCCACCC CAGGAGAATC TCCTGGAATT 
GTGTCAAACT ATAACAACTC CATCTTGGCA AGGGTATATC GAGATTTTCA AGAACTCCTC 
ATGAATGCAC CAGAGAGCCA GCACCTTGGC CGTATTTGGA CAGAGCTACA CATCTTGTCC 
CAATTCATGG ACACCCTCCG GACTCACCCG GAGAGAATTG CAGGAAGAGG AATACGAATA 
AGGGATATCT TGAAAGATGA AGAAACACTG ACACTATTTC TCATTAAAAA CATCGGCCTG 
TCTGACTCAG TGGTCTACCT TCTGATCAAC TCTCAAGTCC GTCCAGAGCA GTTCGCTCAT 
GGAGTCCCGG ACCTGGCGCT GAAGGACATC GCCTGCAGCG AGGCCCTCCT GGAGOGCTTC 
ATCATCTTCA GCCAGAGACG CGGGGCAAAG ACGGTGOGCT ATGCCCTGTG CTCCCTCTCC 
CAGGGCACCC TACAGTGGAT AGAAGACACT CTGTATGCCA ACGTGGACTT CTTCAAGCTC 
TTCCGTGTGC TTCCCACACT CCTAGACAGC CGTTCTCAAG GTATCAATCT GAGATCTTGG 
GGAGGAATAT TATCTGATAT GTCACCAAGA ATTCAAGAGT TTATCCATCG GCCGAGTATG 
CAGGACTTGC TGTGGGTGAC CAGGCCCCTC ATGCAGAATG GTGGTCCAGA GACCTTTACA 
AAGCTGATGG GCATCCTGTC TGACCTCCTG TGTGGCTACC CCGAGGGAGG TGGCTCTCGG 
GTGCTCTCCT TCAACTGGTA TGAAGACAAT AACTATAAGG CCTTTCTGGG GATTGACTCC 
ACAAGGAAGG ATCCTATCTA TTCTTATGAC AGAAGAACAA CATCCTTTTG TAATGCATTG 
ATCCAGAGCC TGGAGTCAAA TCCTTTAACC AAAATCGCTT GGAGGGCGGC AAAGCCTTTG 
CTGATGGGAA AAATCCTGTA CACTCCTGAT TCACCTGCAG CACGAAGGAT ACTGAAGAAT 
GCCAACTCAA CTTTTGAAGA ACTGGAACAC GTTAGGAAGT TGGTCAAAGC CTGGGAAGAA 
GTAGGGCCCC AGATCTGGTA CTTCTTTGAC AACAGCACAC AGATGAACAT GATCAGAGAT 
ACCCTGGGGA ACCCAACAGT AAAAGACTTT TTGAATAGGC AGCTTGGTGA AGAAGGTATT 
ACTGCTGAAG CCATCCTAAA CTTCCTCTAC AAGGGCCCTC GGGAAAGCCA GGCTGACGAC 
ATGGCCAACT TCGACTGGAG GGACATATTT AACATCACTG ATOGCACCCT CCGCCTGGTC 
AATCAATACC TGGAGTGCTT GGTCCTGGAT AAGTTTGAAA GCTACAATGA TGAAACTCAG 
CTCACCCAAC GTGCCCTCTC TCTACTGGAG GAAAACATGT TCTGGGCCGG AGTGGTATTC 
CCTGACATGT ATCCCTGGAC CAGCTCTCTA CCACCCCACG TGAAGTATAA GATCCGAATG 
GACATAGACG TGGTGGAGAA AACCAATAAG ATTAAAGACA GGTATTGGGA TTCTGGTCCC 
AGAGCTGATC CCGTGGAAGA TTTCCGGTAC ATCTGGGGCG GGTTTGCCTA TCTGCAGGAC 
ATGGTTGAAC AGGGGATCAC AAGGAGCCAG GTGCAGGCGG AGGCTCCAGT TGGAATCTAC 
CTCCAGCAGA TGCCCTACCC CTGCTTCGTG GACGATTCTT TCATGATCAT CCTGAACCGC 
TGTTTCCCTA TCTTCATGGT GCTGGCATGG ATCTACTCTG TCTCCATGAC TGTGAAGAGC 
ATCGTCTTGG AGAAGGAGTT GOGACTGAAG GAGACCTTGA AAAATCAGGG TGTCTCCAAT 
GCAGTGATTT GGTGTACCTG GTTCCTGGAC AGCTTCTCCA TCATGTCGAT GAGCATCTTC 
CTCCTGACGA TATTCATCAT GCATGGAAGA ATCCTACATT ACAGCGACCC ATTCATCCTC 
TTCCTGTTCT TGTTGGCTTT CTCCACTGCC ACCATCATGC TGTGCTTTCT GCTCAGCACC 
TTCTTCTCCA AGGCCAGTCT GGCAGCAGCC TGTAGTGGTG TCATCTATTT CACCCTCTAC 
CTGCCACACA TCCTGTGCTT CGCCTGGCAG GACCGCATGA CCGCTGAGCT GAAGAAGGCT 
GTGAGCTTAC TGTCTCCGGT GGCATTTGGA TTTGGCACTG AGTACCTGGT TCGCTTTGAA 
GAGCAAGGCC TGGGGCTGCA GTGGAGCAAC ATCGGGAACA GTCCCACGGA AGGGGACGAA 
TTCAGCTTCC TGCTGTCCAT GCAGATGATG CTCCTTGATG CTGCGTGCTA TGGCTTACTC 
GCTTGGTACC TTGATCAGGT GTTTCCAGGA GACTATGGAA CCCCACTTCC TTGGTACTTT 
CTTCTACAAG AGTCGTATTG GCTTAGCGGT GAAGGGTGTT CAACCAGAGA AGAAAGAGCC 
CTGGAAAAGA CCGAGCCCCT AACAGAGGAA ACGGAGGATC CAGAGCACCC AGAAGGAATA 
CACGACTCCT TCTTTGAACG TGAGCATCCA GGGTGGGTTC CTGGGGTATG CGTGAAGAAT 
CTGGTAAAGA TTTTTGAGCC CTGTGGCCGG CCAGCTGTGG ACOGTCTGAA CATCACCTTC 
TACGAGAACC AGATCACOGC ATTCCTGGGC CACAATGGAG CTGGGAAAAC CACCACCTTG 
TCCATCCTGA CGGGTCTGTT GCCACCAACC TCTGGGACTG TGCTCGTTGG GGGAAGGGAC 
ATTGAAACCA GCCTGGATGC AGTCCGGCAG AGCCTTGGCA TGTGTCCACA GCACAACATC 
CTGTTCCACC ACCTCACGGT GGCTGAGCAC ATGCTGTTCT ATGCCCAGCT GAAAGGAAAG 
TCCCAGGAGG AGGCCCAGCT GGAGATGGAA GCCATGTTGG AGGACACAGG OCTCCACCAC 
AAGCGGAATG AAGAGGCTCA GGACCTATCA GGTGGCATGC AGAGAAAGCT GTCGGTTGCC 
ATTGCCTTTG TGGGAGATGC CAAGGTGGTG ATTCTGGACG AACCCACCTC TGGGGTGGAC 
CCTTACTCGA GACGCTCAAT CTGGGATCTG CTCCTGAAGT ATCGCTCAGG CAGAACCATC 
ATCATGCCCA CTCACCACAT GGACQAGGCC GACCACCAAG GGGACCGCAT TGCCATCATT 
GCCCAGGGAA GGCTCTACTG CTCAGGCACC CCACTCTTCC TGAAGAACTG CTTTGGCACA 
GGCTTGTACT TAACCTTGGT GCGCAAGATG AAAAACATCC AGAGCCAAAG GAAAGGCAGT 
GAGGGGACCT GCAGCTGCTC GTCTAAGGGT TTCTCCACCA CGTGTCCAGC CCACGTCGAT 
GACCTAACTC CAGAACAAGT CCTGGATGGG GATGTAAATG AGCTGATGGA TGTAGTTCTC 
CACCATGTTC CAGAGGCAAA GCTGGTGGAG TGCATTGGTC AAGAACTTAT CTTCCTTCTT 
CCAAATAAGA ACTTCAAGCA CAGAGCATAT GCCAGCCTTT TCAGAGAGCT GGAGGAGACG 
CTGGCTGACC TTGGTCTCAG CAGTTTTGGA ATTTCTGACA CTCCCCTGGA AGAGATTTTT 
CTGAAGGTCA CGGAGGATTC TGATTCAGGA CCTCTGTTTG CGGGTGGCGC TCAGCAGAAA 
AGAGAAAACG TCAACCCCCG ACACCOCTGC TTGGGTCCCA GAGAGAAGGC TGGACAGACA 
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^TGTCTG CTCCCCAGGG GCGCCGGCTG CTCACCCAGA GGGCCAGCCT 
CCCCCAGAGC CAGAGTGCCC AGGCCCGCAG CTCAACACGG GQACACAGCT GGTCCTCCAG 
CATGTCCAGG OGCTGCTGGT CAAGAGATTC CAACACACCA TCCGCAGCCA CAAGGACTTC 
^E^T ^CTACCTTT GTGTTTTTGG CTCTGATGCT TTCTATTGTT 
r^^Z TTGGQGAATA CCCOGCTTTG ACCCTTCACC CCTGGATATA TGGGCAGCAG 
CTCCTGAATA IS^^ TGAACCAGG ° AGTQAGCAGT TCACGGTACT TGCAGACGTC 
AGCCAGGCTT TGGCAACCGC TGCCTGAAGG AAGGGTGGCT TCCGGAGTAC 
CCCTGTGGCA ACTCAACACC CTGOAAGACT CCTTCTGTGT CCCCAAACAT CACCCAGCTO 
J^ES^ ^TGGAC ACAGGTCAAC CCTTCACCAT StSaG^ SfcS 
S^SS ?5 ATGCTGCC AGAGTGCCCC GAGGGTGCOQ GGGGCCTCCC GCCCOOCCAG 
i^^^AGC GCAGCAOGGA AATTCTACAA GACCTGAOGG ACAGGAACAT CTcSaCTTC 
^IS^ ^ TCCTGC TCrTATAAGA AGCAGCTTAA AGAGCAAATT CTGGGTCAAT 
JJiSSSS ATGGAGGAAT TTCCATTGGA GGAAAGCTCC CAGTCGTCCC CATCACGGGG 
GAAGCACTTG TTGGGTTTTT AAGCGACCTT GGCCGGATCA TGAATGTGAG CGGGGGCCCT 
™^ AGAAATACCT GATTTCCTTA AACATCTAgI « 
AACATTAAGG TGTGGTTTAA TAACAAAGGC TGGCATGCCC TGGTCAGCTT TCTCAATGTG 
CCATC1TACG GGCCAGCCTG CCTAAGGACA GGAGCCCCGA GGAGTATGGA 
^^Si TTAGCCAACC CCTGAACCTG ACCAAGGAGC AGCTCTCAGA GATTACAGTG 
Sa^™ ATCTGOGTGA TTTTCTCCAT GTCClS 

TTGTCCTTTA TTTGATCCAG GAGCGGGTGA ACAAATCCAA GCACCTCCAG 
HSISSS QAGTGAGCCC CACCACCTAC TGGGTGACCA ACTTCCTCTG GGACATCATG 
^1^2 ™ GTGCTGG GCTGGTGGTG GGCATCTTCA TCGGGTTTCA GAAGAAAGCC 
™2£2£ CAGAAAACCT TCCTGCCCTT GTGGCACTGC TCCTGCTGTA TGGATGGGCG 
SSJS^ ISiI GTACCC AGCATCCTTC CTGTTTGATG TCCCCAGCAC AGCCTATGTG 
GCTTTATCTT GTGCTAATCT GTTCATCGGC ATCAACAGCA GTGCTATTAC CTTCATCTTG 
GAATTATTTG ATAATAACOG GACGCTGCTC AGGTTCAACG CCGTGCTGAG GAAGCTGCTC 
CCCACTTCTC CCTGGGCCGG GGCCTCATTG ACCTlS 
TCTATGCC0G GTTTGGTGAG GAGCACTCTG CAAATCCGTT Ct*CTCGGAC 
CTGATTGGGA AGAACCTGTT TGCCATGGTG GTGGAAGGGG TGGTGTACTT CCTCCTGACC 
CTGCTGGTCC AGCGCCACTT CTTCCTCTCC CAATGGATTG CCGAGCCCAC TAAGGAGCCC 
A ™™I? AAGATGATGA TGTGGCTGAA GAAAGACAAA GAATTATTAC TGGTGGAAAT 
AAAACTGACA TCTTAAGGCT ACATGAACTA ACCAAGATTT ATCTGGGCAC CTCCAGCCCA 
^J^S 5? CraTCTGT *3GAGTTOGC CCTGGAGAGT GCTTTGGCCT CCTGGGAGTG 
JSZSS"* CACATTCAAQ ATGCTCACTC GGGACACCAC AGTGACCTCA 
GGGGATGCCA COGTAGCAGG CAAGAGTATT TTAACCAATA TTTCTGAAGT CCATCAAAAT 
ATGGGCTACT GTCCTCAGTT TGATGCAATC GATGAGCTGC TCACAGGACG AGAACATCTT 
1%^*™ CCCGGCTTCG AGGTGTACCA GCAGAAGAAA TCGAAAAGGT TGCAAACTGG 
2££EiJ£ ^ GGGCCT ^CTGTCTAC GCOGACTGCC TGGCTGGCAC GTACAGTGGG 
GGCAACAAGC GGAAACTCTC CACAGCCATC GCACTCATTG GCTGOCCACC GCTGGTGCTG 
CCACCACAGG GATQGACCCC CAGGCACGCC GCATGCTGTG GAACGTCATC 
^r^S l^^ 5 GAGGGCTGTG GTCCTCACAT CCCACAGCAT GGAAGAATGT 
^ CCCGGCT GGCCATCATG GTAAAGGGOG OCTTTCGATG TATGGGCACC 
ATTCAGCATC TCAAGTCCAA ATTTGGAGAT GGCTATATCG TCACAATGAA GATCAAATCC 
ACCTGCrTCC TGACCTGAAC CCTGTGGAGC AGTTCTTCCA GGGGAACTTC 
CCAGGCAGTG TGCAGAGGGA GAGGCACTAC AACATGCTCC AGTTCCAGGT CTCCTCCTCC 
TCCCTGGOGA GGATCTTCCA GCTCCTCCTC TCCCACAAGG ACAGCCTGCT CATCGAGGAG 
TACTCAGTCA CACAGACCAC ACTGGACCAG GTGTTTGTAA ATTTTGCTAA ACAGCAGACT 
GAAAOTCATG ACCTCCCTCT GCACCCTCGA GCTGCTGGAG CCAGTCGACA AGCcSSS 
TGATCTTTCA CACCGCTCGT TCCTGCAGCC AGAAAGGAAC TCTGGGCAGC TGGAGGCGCA 
GGAGCCTGTG CCCATATGGT CATCCAAATG GACTGGCCCA GGGTAAATGA CCCCACTCCA 
GCAGAAAACA AACACACGAG GAGCATGCAG CGAATTCAGA AAGAGGTCTT TCAGAAGGAA 
ACCGAAACTG ACTTGCTCAC CTGGAACACC TGATGGTGAA ACCAAACAAA TACAAAATCC 
TTCTOAGAC CCCAGAACTA GAAACCCCGG GCCATCCCAC TAGCAGCTTT ScISS 
TTGCTCTCAT TTCAAGCAGA TCTGCTTTTC TGCATGTTTG TCTGTGTGTC TGOGTTGTGT 
GTGATTTTCA TGGAAAAATA AAATGCAAAT GCACTCATCA CAAAAAAAAA AAAAAAAA 

Seq ID NO: C117 DNA Sequence 

Kucleic Acid Accession Ut HM_006671.2 

Coding sequence: 138.. 1820 



I 

GGCACGAGGC 
GCCCAGACCC 
AGCAGGCCAG 
GGCGGAATGG 
TCTTCTTGAG 
AGCTCCTGAT 
CCGGACTTGC 
ACTACCTGTG 
ACCCAGGCAG 
CAGCCGATGC 
CATTCAAACA 
AGGAGGCCCC 
TGCAGAACTT 
GCACCAGCGA 
TGCTGGGCCG 
AGTCGGTCAT 
TCATTGCGGG 
TCTACTCAGT 
TCTACTTCTT 
T6CTCATCGC 
TGCTGGAGAA 
CCATCAACAT 



11 
I 

TGGTGTTTAG 
TGTGCCCCCG 
GCTCACCATG 
ACTCCTCATC 
GACCOGGCGC 
GAGGATGCTG 
CTCCCTGGAT 
GACCACCTTC 
CGCGGCCCAG 
CCTGTTGGAC 
GTACCGCACC 
TCCTCGGCGG 
CGCCCTGGAC 
TGGCATGAAT 
CATGGGTGAC 
GAAGATOGTG 
TAAGATCCTG 
CACCGTGGTG 
CATCACCAAG 
GCTGGCCACC 
CAACCACATC 
GGACGGCACT 



21 

I 

CAACTCCGAC 
GCCGGGCTCT 
GTGCCGCATG 
CTGTCTGTGC 
CTCTCAOCAC 
AAGATGATGA 
GCCAAGACCT 
ATGGCTGTCA 
AAGGAGACCA 
CTCATCCGGA 
AAGACCACCC 
ATCCTCATCT 
CTGACCCCGC 
GTGCTGGGCA 
AGOGGGGCCC 
GCGGTGGCTG 
GAGATGGACG 
TGCGGGCTGG 
AAGAATCCCA 
TCCTCCAGCT 
GACCGGCGCA 
GCGCTCTACG 



31 
I 

CACCTGCCTG 
CATGCGTGGA 
OCATCTTGGC 
TGTCTGTCAT 
AGGAAATTAG 
TCCTGCCACT 
CTAGCCGCCT 
TCGTGGGCAT 
OGGAGCAGAG 
ACATGTTCCC 
CAGTTGTCAA 
ACGGGGTCCA 
CGCCCGAGGT 
TCGTCTTCTT 
CCCTGGTCAG 
TGTGGTATTT 
ACCCCAGGGC 
TGCTCCACGG 
TCGTCTTCAT 
CAGCCACACT 
TCGCTCGCTT 
AGGCTGTGGC 



41 
I 

CTGAGGGGCT 
ATGGTGCTGT 
AOGGGGGAGG 
OGTGGGCTGC 
TTACTTCCAG 
GGTGGTCTCC 
GGGCGTCCTC 
CTTCATGGTC 
TGGGAAGCCC 
AGCCAACCTA 
GTCCCCCAAG 
GGAGGAGAAT 
CGTTTACAAG 
CTCTGCCACC 
CTTCTGCCAG 
CCCCTTCGGC 
OGTCGGCAAG 
GCTCTTTATC 
COGCGGCATC 
GCOCATCACC 
CGTGCTGCGC 
CGCCATCTTC 



51 
I 

AGAGCCCTCA 
GOCCCTTGCC 
GACGTGTGCA 
CTCCTCGGCT 
TTCCCTGGAG 
AGCTTGATGT 
ACCGTGGCGT 
TCCATCATCC 
ATCATGAGCT 
GTAGAAGCCA 
GTGGCACCAG 
GGCTCCCATG 
TCAGAGCCGG 
ATGGGCATCA 
TGCCTCAATG 
ATTGTGTTCC 
AAGCTGGGCT 
CTGCCCCTGC 
CTGCAGGCTC 
TTCAAGTGCC 
GTGGGTGCCA 
ATCGCCCAGG 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

S040 

5100 

S160 

5220 

5280 

5340 

S400 

5460 

5S20 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7318 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



TCAACAACTA 
CCAGCATTGG 
CCTCCGTGGG 
ACCGTTTCOG 
ATATATGTCG 
CCAAGCCAGT 
TAGCCGAGGC 
TGGAGGGGGA 
AGCTGGAGAC 
GGGTCCTGAG 
CACACATTCT 
CAGAGAAGGG 
CTGAGGCCCC 
TATCCGGCCC 
AAGGTCTCAC 
GGAGGACAGG 
TCAGCCACAC 
GGCCACAGTG 
GGAGACAGAT 
ATGGAATAGG 
AGCTGCCQAT 
A6GTTTT CTA 
TTTTCAAAAA 



CGAGCTGGAC 
GGCAGCTGGC 
ACTGOCCACC 
CACCATGATT 
GAAGGATTTT 
GAGCCTCCAG 
CTCCGAGCTC 
TGAGGAGCTG 
CAATGTCT6A 
GCAGGAACTC 
TCTCACCCTT 
AAAGGCTGCA 
GCTGTGACAG 
CACTTTCTGO 
GTTAAGGTCA 
GCAGCCTGCA 
AGTTCTCCCC 
CGTCCAGTCT 
GTCTTTACTA 
TAGGCAAGCC 
GGGGCCCTGG 
TTCCTTTCTG 
AAAAAAAAAA 



TTTGGCCAGA 
ATCCCCCAGG 
GATGACATCA 
AAOGTGCTGG 
GCCOGGGACA 
GAGATCGTGG 
ACCCTGGGCC 
CCCGCTGOGA 
GCCTGCGGAG 
GACTCTCCAA 
GAGAGGCTGG 
TGGGGGAGCC 
TTTCCCOGGT 
ATQACAGACT 
AGACACTAAC 
GCTGTGTCCA 
AAGGGGAGCA 
CCTGTCCTGC 
GAGCTGAAAG 
AGGACTCCGG 
TGTCAGAACT 
TGTTTGCAAA 
AAA 



TCATCACCAT 
CCGGCCTCGT 
CCCTCATCAT 
GTGATGCGCT 
CAGGCACCGA 
CAGCCCAGCA 
CCACCTGCCC 
GTCTGAACCA 
CTGCAGGGGC 
CCCTCCTGAG 
AATTAACCCC 
CCATCCAGGG 
GTGAGCCCGG 
TGAGGCTCTG 
TCAAATCTTT 
GGCCCAGGCC 
GCCCAGGGCC 
CACGTGTCTT 
GCCCCCTTGA 
GAAGGAGGTG 
CCCCAAAGGC 
TTCAGTGTTA 



Seq ID NO: CI 18 DNA Sequence 
Nucleic Acid Accession #: NM_005689 
Coding sequence: 278.. 2806 



CAGTATCACA 
CACCATGGTC 
TGCCGTTGAC 
GGCAGCGGGG 
GAAACTGCTG 
GAATGGCTGT 
CCACCACGTC 
CTGCACCATC 
AGGCGAGGCC 
CAGCCGGCAG 
GCTTGACGGA 
AGTGATGGGC 
TGAGGGCGGC 
AGAGCTGAAA 
CAAGCCCCGC 
CCACCCCATA 
AAGCCCCGCT 
TTGCAAAGCT 
CACATCCAGG 
CAGCCAGGAT 
CTGTGCGTCC 
ACTAAATAAA 



GCCACTGCAG 
ATCGTGCTCA 
TGGGCTCTGG 
ATCATGGOCC 
CCCTGOGAGA 
GTGAAGAGTG 
CCCGTTCAAG 
CAGATCAGCG 
TCCAGGGGCA 
GGGCCAGGAT 
AAATGTATCT 
CCGGCATTGC 
AGGCAGGGGT 
ACACTTGTCC 
CTCTCCTCTT 
ACAGGTGGCC 
GCCTTCCCCA 
CCTTGGATGT 
CCAACCTCCC 
GCTCTGGTGG 
AAGTGGAGTC 
GGTATTTTGT 



1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
2663 



1 11 21 31 41 51 

I I I I I I 

GGGCCTGCAG TTGGCAGAAG GGTCCCGGGC CCAGAGCCAG CGGGGCCGTG CTGAGACGGC 60 

GTACGTGCCC TGCGTGAGTG CGTGGCGGCG GCGCGTGCGC TAGGGGAGTG GGCGGTGAGG 120 

CCTGGTCCAC GTGCGTCCCT TCCCGGGACC CCCGCAGCTT GGCGCCCAGC GGCTACGTGA 180 

GCCAAGGCAC CCGGATGTCC GCGCCCCTCT CCGAGTGACA AGTCCCGGCC TCCGGTCCCG 240 

CAGTGCCCGC AGCCTCGGCC GGCGTCCACG CATTGCCATG GTGACTGTGG GCAACTACTG 300 

CGAGGCCGAA GGGCCCGTGG GTCCGGCCTG GATGCAGGAT GGCCTGAGTC CCTGCTTCTT 360 

CTTCACGCTC GTGCCCTCGA CGCGGATGGC TCTAGGGACT CTGGCCTTGG TGCTGGCTCT 420 

TCCCTGCAGA CGCCGGGAGC GGCCOGCTGG TGCTGATTCG CTGTCTTGGG GGGCCGGCCC 480 

TCGCATCTCT CCCTACGTGC TGCAGCTGCT TCTGGCCACA CTTCAGGCGG CGCTGCCCCT 540 

GGCCGGCCTG GCTGGCCGGG TGGGCACTGC CCGGGGGGCC CCACTGCCAA GCTATCTACT 600 

TCTGGCCTCC GTGCTGGAGA GTCTGGCCGG CGCCTGTGGC CTGTGGCTGC TTGTCGTGGA 660 

GCGGAGCCAG GCACGGCAGC GTCTGGCAAT GGGCATCTGG ATCAAGTTCA GGCACAGCCC 720 

TGGTCTCCTG CTCCTCTGGA CTGTGGCGTT TGCAGCTGAG AACTTGGCCC TGGTGTCTTG 780 

GAACAGCCCA CAGTGGTGGT GGGCAAGGGC AGACTTGGGC CAACAGGTTC AGTTTAGCCT 840 

GTGGGTGCTG CGGTATGTGG TCTCTGGAGG GCTGTTTGTC CTGGGTCTCT GGGCCCCTGG 900 

ACTTCGTCCC CAGTCCTATA CATTGCAGGT TCATGAAGAG GACCAAGATG TGGAAAGGAG 960 

CCAGGTTCGG TCAGCAGCCC AACAGTCTAC CTGGCGAGAT TTTGGCAGGA AGCTCCGCCT 1020 

CCTGAGTGGC TACCTGTGGC CTCGAGGGAG TCCAGCTCTG CAGCTGGTGG TGCTCATCTG 1080 

CCTGGGGCTC ATGGGTTTGG AACGGGCACT CAATGTGTTG GTGCCTATAT TCTATAGGAA 1140 

CATTGTGAAC TTGCTGACTG AGAAGGCACC TTGGAACTCT CTGGCCTGGA CTGTTACCAG 1200 

TTACGTCTTC CTCAAGTTCC TCCAGGGGGG TGGCACTGGC AGTACAGGCT TCGTGAGCAA 1260 

CCTGCGCACC TTCCTGTGGA TCCGGGTGCA GCAGTTCACG TCTCGGCGGG TGGAGCTGCT 1320 

CATCTTCTCC CACCTGCACG AGCTCTCACT GCGCTGGCAC CTGGGGCGCC GCACAGGGGA 13 B0 

GGTGCTGCGG ATCGOGGATC GGGGCACATC CAGTGTCACA GGGCTGCTCA GCTACCTGGT 1440 

GTTCAATGTC ATCCCCACGC TGGCOGACAT CATCATTGGC ATCATCTACT TCAGCATGTT 1500 

CTTCAACGCC TGGTTTGGCC TCATTGTGTT CCTGTGCATG AGTCTTTACC TCACCCTGAC 1560 

CATTGTGGTC ACTGAGTGGA GAACCAAGTT TCGTCGTGCT ATGAACACAC AGGAGAACGC 1620 

TACCCGGGCA CGAGCAGTGG ACTCTCTGCT AAACTTCGAG ACGGTGAAGT ATTACAACGC 1680 

CGAGAGTTAC GAAGTGGAAC GCTATCGAGA GGCCATCATC AAATATCAGG GTTTGGAGTG 1740 

GAAGTCGAGC GCTTCACTGG TTTTACTAAA TCAGACCCAG AACCTGGTGA TTGGGCTCGG 1800 

GCTCCTCGCC GGCTCCCTGC TTTGCGCATA CTTTGTCACT GAGCAGAAGC TACAGGTTGG 1860 

GGACTATGTG CTCTTTGGCA CCTACATTAT CCAGCTGTAC ATGCCCCTCA ATTGGTTTGG 1920 

CACCTACTAC AGGATGATCC AGAOCAACTT CATTGACATG GAGAACATGT TTGACTTGCT 1980 

GAAAGAGGAG ACAGAAGTGA AGGACCTTCC TGGAGCAGGG CCCCTTCGCT TTCAGAAGGG 2040 

CCGTATTGAG TTTGAGAACG TGCACTTCAG CTATGCCGAT GGGCGGGAGA CTCTGCAGGA 2100 

CGTGTCTTTC ACTGTGATGC CTGGACAGAC ACTTGCCCTG GTGGGCCCAT CTGGGGCAGG 2160 

GAAGAGCACA ATTTTGCGCC TGCTGTTTCG CTTCTACGAC ATCAGCTCTG GCTGCATCCG 2220 

AATAGATGGG CAGGACATTT CACAGGTGAC CCAGGCCTCT CTCCGGTCTC ACATTGGAGT 2280 

TGTGCCCCAA GACACTGTCC TCTTTAATGA CACCATCGCC GACAATATCC GTTAOGGCCG 2340 

TGTCACAGCT GGGAATGATG AGGTGGAGGC TGCTGCTCAG GCTGCAGGCA TCCATGATGC 2400 

CATTATGGCT TTCCCTGAAG GGTACAGGAC ACAGGTGGGC GAGCGGGGAC TGAAGCTGAG 2460 

CGGCGGGGAG AAGCAGOGCG TCGCCATTGC CCGCACCATC CTCAAGGCTC CGGGCATCAT 2520 

TCTGCTGGAT GAGGCAAOGT CAGCGCTGGA TACATCTAAT GAGAGGGCCA TCCAGGCTTC 2580 

TCTGGCCAAA GTCTGTGCCA ACCGCACCAC CATCGTAGTG GCACACAGGC TCTCAACTGT 2640 

GGTCAATGCT GACCAGATCC TOGTCATCAA GGATGGCTGC ATCGTGGAGA GGGGAOGACA 2700 

CGAGGCTCTG TTGTCCCGAG GTGGGGTGTA TGCT6ACATG TGGCAGCTGC AGCAGGGACA 2760 

GGAAGAAAOC TCTGAAGACA CTAAGCCTCA GACCATGGAA CGGTGACAAA AGTTTGGCCA 2820 

CTTCCCTCTC AAAQACTAAC CCAGAAGGGA ATAAGATGTG TCTCCTTTCC CTGGCTTATT 2880 

TCATCCTGGT CTTGGGGTAT GGTGCTAGCT ATGGTAAGGG AAAGGGACCT TTCCGAAAAA 2940 

CATCTTTTGG GGAAATAAAA ATGTGGACTG TGAAAAAAAA AAAAAAAAAA AAA 2993 

Seq ZD NO: C119 DNA Sequence 
Nucleic Acid Accession ft: NMJ>00676 
Coding sequence: 333.. 1331 
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Seq ID NO: C120 DNA Sequence 
Nucleic Acid Accession #: NM_052932 
Coding sequence: 217.. 786 



} 11 21 31 41 51 

I i I I ! I 

. GGGCAATTTG TTAGTTATCC GCCGCCACCA AGACGCGGCA CGGOGCCTGG ACCGGAGGGG 60 

3 CCCCGCGCGG GCGCGAACTT TGGGCTOGGG CGAGTGGGTG GTGCTCCGCC CAGCCCGAGA 120 

CGGGCGG6CG CGCGGGCCAA TGGGTGCCGC CTCTTGGCCG CGGGGGOCCC CGACCCGTGG 180 

GTCCCGGCCA CCAGCGCCCC AGCCCCGAGG CTCAGAAGCG GCAGGCGGAG GCGCGGTCCG 240 

GGCGCTATGG CCATGCCCGG CGGGTCTCAC GCGGCTGCCC CTCGCCCGGC GCGCCTTCGG 300 

1 s\ TAGGGGGCGC CCGGGGCCCA GCTGGCCCGG CCATGCTGCT GGAGACACAG GACGCGCTGT 360 

1U ACGTGGCGCT GGAGCTGGTC ATCGCCGCGC TTTCGGTGGC GGGCAACGTG CTGGTGTGCG 420 

CCGCGGTGGG CACGGCGAAC ACTCTGCAGA CGCCCACCAA CTACTTCCTG GTGTCCCTGG 480 

CTGCGGCCGA CGTGGCCGTG GGGCTCTTCG CCATCCCCTT TGCCATCACC ATCAGCCTGG 540 

GCTTCTGCAC TGACTTCTAC GGCTGCCTCT TCCTCGCCTG CTTCGTGCTG GTGCTCACGC 600 

1 - AGAGCTCCAT CTTCAGCCTT CTGGCCGTGG CAGTCGACAG ATACCTGGCC ATCTGTGTCC 660 

13 CGCTCAGGTA TAAAAGTTTG GTCACGGGGA CCCGAGCAAG AGGGGTCATT GCTGTCCTCT 720 

GGGTCCTTGC CTTTGGCATC GGATTGACTC CATTCCTGGG GTGGAACAGT AAAGACAGTG 780 

CCACCAACAA CTGCACAGAA CCCTGGGATG GAACCACGAA TGAAAGCTGC TGCCTTGTGA 840 

AGTGTCTCTT TGAGAATGTG GTCCCCATGA GCTACATGGT ATATTTCAAT TTCTTTGGGT 900 

GTGTTCTGCC CCCACTGCTT ATAATGCTGG TGATCTACAT TAAGATCTTC CTGGTGGCCT 960 

l\) GCAGGCAGCT TCAGCGCACT GAGCTGATGG ACCACTCGAG GACCACCCTC CAGCGGGAGA 1020 

TCCATGCAGC CAAGTCACTG GCCATGATTG TGGGGATTTT TGCCCTGTGC TGGTTACCTG 1080 

TGCATGCTGT TAACTGTGTC ACTCTTTTCC AGCCAGCTCA GGGTAAAAAT AAGCCCAAGT 1140 

GGGCAATGAA TATGGCCATT CTTCTGTCAC ATGCCAATTC AGTTGTCAAT CCCATTGTCT 1200 

- ATGCTTACCG GAACCGAGAC TTCCGCTACA CTTTTCACAA AATTATCTCC AGGTATCTTC 1260 

ZJ TCTGCCAAGC AGATGTCAAG AGTGGGAATG GTCAGGCTGG GGTACAGCCT GCTCTCGGTG 1320 

TGGGCCTATG ATCTAGGCTC TCGCCTCTTC CAGGAGAAGA TACAAATCCA CAAGAAACAA 1380 

AGAGGACACG GCTGGTTTTC ATTGTGAAAG ATAGCTACAC CTCACAAGGA AATGGACTGC 1440 

CTCTCTTGAG CACTTCCCTG GAGCTACCAC GTATCTAGCT AATATGTATG TGTCAGTAGT 1500 

„ AGGCTCCAAG GATTGACAAA TATATTTATG ATCTATTCAG CTG CTTTT AC TGTGTGGATT IS 60 

J\J ATGCCAACAG CTTGAATGGA TTCTAACAGA CTCTTTTGTT TTTAAAAGTC TGCCTTGTTT 1620 

ATGGTGGAAA ATTACTGAAA CTATTTTACT GTGAAACAGT GTGAACTATT ATAATGCAAA 1680 

TACTTTTTAA CTTAGAGGCA ATGGAAAAAT AAAAGTTGAC TGTACTAAAA ATG 1733 



1 11 21 31 41 51 

/in 1 I I I ! I 

HU CCCAGCCCGG CCCCGCCGCC CCGGCTGCGC ACGCGACGCC CCCTCCAGGC CCCGCTCCTG 60 

CGCCCTATTT GGTCATTCGG GGGGCAAG0G GCGGGAGGGG AAACGTGCGC GGCCGAAGGG 120 

GAAGCGGAGC CGGCGCCGGC TGCGCAGAGG AGCCGCTCTC GCCGCCGCCA CCTCGGCTGG 180 

GAGCCCACGA GGCTGCCGCA TCCTGCCCTC GGAACAATGG GACTCGGCGC GCGAGGTGCT 240 

A r TGGGCCGCGC TGCTCCTGGG GAGGCTGCAG GTGCTAGGGC TGCTGGGGGC CGCCCATGAA 300 

AGCGCAGCCA TGGCGGAGAC TCTCCAACAT GTGCCTTCTG ACCATACAAA TGAAACTTCC 360 

AACAGTACTG TGAAACCACC AACTTCAGTT GCCTCAGACT CCAGTAATAC AACGGTCACC 420 

ACCATGAAAC CTACAGCGGC ATCTAATACA ACAACACCAG GGATGGTCTC AACAAATATG 480 

ACTTCTACCA CCTTAAAGTC TACACCGAAA ACAACAAGTG TTTCACAGAA CACATCTCAG 540 

ATATCAACAT CCACAATGAC CGTAACCCAC AATAGTTCAG TGACATCTGC TGCTTCATCA 600 

JU GTAACAATCA CAACAACTAT GCATTCTGAA GCAAAGAAAG GATCAAAATT TGATACTGGG 660 

AGCTTTGTTG GTGGTATTGT ATTAACGCTG GGAGTTTTAT CTATTCTTTA CATTGGATGC 720 

AAAATGTATT ACTCAAGAAG AGGCATTCGG TATCGAACCA TAGATGAACA TGATGCCATC 780 

ATTTAAGGAA ATCCATGGAC CAAGGATGGA ATACAGATTG ATGCTGCCCT ATCAATTAAT 840 

rr TTTGGTTTAT TAATAGTTTA AAACAATATT CTCTTTTTGA AAATAGTATA AACAGGCCAT 900 

DO GCATATAATG TACAGTGTAT TACGTAAATA TGTAAAGATT CTTCAAGGTA ACAAGGGTTT 960 

GGGTTTTGAA ATAAACATCT GGATCTTATA GACCGTTCAT ACAATGGTTT TAGCAAGTTC 1020 

ATAGTAAGAC AAACAAGTCC TATCTTTTTT TTTTTGGCTG GGGTGGGGGC ATTGGTCACA 1080 

TATGACCAGT AATTGAAAGA CGTCATCACT GAAAGACAGA ATGCCATCTG GGCATACAAA 1140 

TAAGAAGTTT GTCACAGCAC TCAGGATTTT GGGTATCTTT TGTAGCTCAC ATAAAGAACT 1200 

OU TCAGTGCTTT TCAGAGCTGG ATATATCTTA ATTACTAATG CCACACAGAA ATTATACAAT 1260 

CAAACTAGAT CTGAAGCATA ATTTAAGAAA AACATCAACA TTTTTTGTGC TTTAAACTGT 1320 

AGTAGTTGGT CTAGAAACAA AATACTCCAA GAAAAAGAAA ATTTTCAAAT AAAACCCAAA 1380 

ATAATAGCTT TGCTTAGCCC TGTTAGGGAT CCATTGGAGC ATTAAGGAGC ACATATTTTT 1440 

- - ATTAACTTCT TTTGAGCTTT CAATGTTGAT GTAATTTTTG TTCTCTGTGT AATTTAGGTA 1500 

OJ AACTGCAGTG TTTAACATAA TAATGTTTTA AAGACTTAGT TCTCAGTATT AAATAATCCT 1560 

GGCATTATAG GGAAAAAACC TCCTAGAAQT TAGATTATTT GCTACTGTGA GAATATTGTC 1620 

ACCACTGGAA GTT ACTT TAO TTCATTTAAT TTTAATTTTA TATTTTGTGA ATATTTTAAG 1680 

AACTGTAGAG CTGCTTTCAA TATCTAGAAA TTTTTAATTG AGTGTAAACA CACCTAACTT 1740 

„ TAAGAAAAAG AACCGCTTGT ATGATTTTCA AAAGAACATT TAGAATTCTA TAGAGTCAAA 1800 

/V ACTATAGCGT AATGCTGTGT TTATTAAGCC AGGGATTGTG GGACTTCCCC CAGGCAACTA 1860 

AACCTGCAGG ATGAAAA TGC TATATTTTCT TTCATGCACT GTCGATATTA CTCAGATTTG 1920 

GGGAAATGAC ATTTTTATAC TAAAACAAAC ACCAAAATAT TTTAGAATAA ATTCTTAGAA 19 BO 

AGTTTTGAGA GGAATTTTTA GAGAGGACAT TTCCTCCTTC CTGATTTGGA TATTCCCTCA 2040 

AATCCCTCCT CTTACTCCAT GCTGAAGGAG AAGTACTCTC AGATGCATTA TGTTAATGGA 2100 

/D GAGAAAAAGC ACAGTATTGT AGAGACACCA ATATTAGCTA ATGTATTTTG GAGTGTTTTC 2160 

CATTTTACAG TTTATATTCC AGCACTCAAA ACTCAGGGTC AAGTTTTAAC AAAAGAGGTA 2220 

TGTAGTCACA GTAAATACTA AGATGGCATT TCTATCTCAG AGGGCCAAAG TGAATCACAC 2280 

CAGTTTCTGA AGGTCCTAAA AATAGCTCAG ATGTCCTAAT GAACATGCAC CTACATTTAA 2340 

Cfk TAGGAG TACA ATAAAACTGT TGTCAGCTTT TGTTTTACAG AGAACGCTAG ATATTAAGAA 2400 

OV TTTTGAAATG GATCATTTCT ACTTGCTGTG CATTTTAACC AATAATCTGA TGAATATAGA 2460 

AAAAAATGAT CCAAAATATG GATATGATTG GATGTATGTA ACACATACAT GGAGTATGGA 2520 

GGAAATTTTC TGAAA AATAC ATTTAGATTA GTTTAGTTTG AAGGAGAGGT GGGCTGATGG 2580 

CTGAGTTGTA TGTTACTAAC TTGGCCCTGA CT GG TTGT GC AACCATTGCT TCATTTCTTT 2640 

GCAAAATGTA GTTAAGATAT ACTTTATTCT AATGAAGGCC TTTTAAATTT GTCCACTGCA 2700 
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TTCTTGGTAT TTCACTACTT CAAGTCAGTC AGAACTTOGT AGACCGACCT GAAGTTTCTT 2760 

TTTGAATACT TGTTTCTTTA GCACTTTGAA GATAGAAAAA CCACTTTTTA AGTACTAAGT 2820 

CATCATTTGC CTTGAAAGTT TCCTCTGCAT TGGGTTTGAA GTAGTTTAGT TATGTCTTTT 2880 

< TCTCTGTATG TAAGTAGTAT AATTTGTTAC TTTCAAATAC CCGTACTTTG AATGTAGGTT 2940 

J TTTTTGTTGT TGTTATCTAT AAAAATTGAG GOAAATGGTT ATGCAAAAAA ATATTTTGCT 3000 

TTGGACCATA TTTCTTAAGC ATAAAAAAAT GCTCA OTTTT GCTTGCATTC CTTGAGAATG 3060 

TATTTATCTG AAGATCAAAA CAAACAATCC AGATGTATAA GTACTAGGCA GAAGCCAATT 3120 

TTAAAATTTC CTTGAATAAT CCATGAAAGG AATAATTCAA ATACAGATAA ACAGAGTTGG 3180 

CAGTATATTA TAGTGATAAT TTTGTATTTT CAAMAAAAAA AAAGTTAAAC TCTTCTTTTC 3240 

IV TTTTTATTAT AATGACCAGC TTTTGGTATT TCATTGTTAC CAAGTTCTAT TTTTAGATAA 3300 

AATTGTTCTC CTTCTAAAAA AAAAAAAAAA AAAAAAAA 3338 

Seq ID NO: C121 DNA Sequence 
1 Nucleic Acid Accession #: NM_004195 

1 J Coding sequence : 1 . . 726 

1 11 21 31 41 51 

I I i I I I 

ATGGCACAGC ACGGGGCGAT GGGCGCGTTT CGGGCCCTGT GCGGCCTGGC GCTGCTGTGC 60 

ZU GCGCTCAGCC TGGGTCAGCG CCCCACCGGG GGTCCOGGGT GCGGCCCTGG GCGCCTCCTG 120 

CTTGGGACGG GAACGGACGC GCGCTGCTGC ' CGGGTTCACA CGACGCGCTG CTGCCGCGAT 180 

TACCCGGGCG AGGAGTGCTG TTCCGAGTGG GACTGCATGT GTGTCCAGCC TGAATTCCAC 240 

TGOGGAGACC CTTGCTGCAC GACCTGCCGG CACCACCCTT GTCCCCCAGG CCAGGGGGTA 300 

rye CAGTCCCAGG GGAAATTCAG TTTTGGCTTC CAGTGTATCG ACTGTGCCTC GGGGACCTTC 360 

ZD TCOGGGGGCC ACGAAGGCCA CTGCAAACCT TGGACAGACT GCACCCAGTT CGGGTTTCTC 420 

ACTGTGTTCC CTGGGAACAA GACCCACAAC GCTGTGTGCG TCCCAGGGTC CCOGCCGGCA 480 

GAGCCGCTTO GGTGGCTGAC CGTOGTCCTC CTGGCCGTGG CCGCCTGCGT CCTCCTCCTG S40 

ACCTCGGCCC AGCTTGGACT GCACATCTGG CAGCTGAGGA GTCAGTGCAT GTGGCCCCGA 600 

GAGACCCAGC TGCTGCTGGA GGTGCCGCCG TCGACCGAAG ACGCCAGAAG CTGCCAGTTC 660 

3\J CCCGAGGAAG AGCGGGGCGA GCGATCGGCA GAGGAGAAGG GGCGGCTGGG AGACCTGTGG 720 

GTGTGA 726 

Seq ID NO: C122 DNA Sequence 
_ _ Nucleic Acid Accession #: AK091896.1 
3 J Coding sequence: 2 8.. 15 72 

1 11 21 31 41 51 

I I I I I I 

A ft AGATCCGCGA GCCCGTCAGC CTGCGCCATG GGCTGCGACG GCCGCGTGTC GGGGCTGCTC 60 

*t\J CGCCGCAACC TGCAGCCCAC GCTCACCTAC TGGAGCGTCT TCTTCAGCTT CGGCCTGTGC 120 

ATCGCCTTCC TGGGGCCCAC GCTGCTGGAC CTGOGCTGTC AGACGCACAG CTCGCTGCCC 180 

CAGATCTCCT GGGTCTTCTT CTCGCAGCAG CTCTGCCTCC TGCTGGGCAG CGCCCTCGGG 240 

GGCGTCTTCA AAAGGACCCT GGCCCAGTCA CTATGGGCCC TGTTCACCTC CTCTCTGGCC 300 

- _ ATCTCCCTGG TGTTTGCCGT CATCCCCTTC TGCCGCGACG TGAAGGTGCT GGCCTCAGTC 360 
HO ATGGCGCTGG CGGGCTTGGC CATGGGCTGC ATCGACACCG TGGCCAACAT GCAGCTGGTA 420 

AGGATGTACC AGAAGGACTC GGCCGTCTTC CTCCAGGTGC TCCATTTCTT CGTGGGCTTT 480 

GGTGCTCTGC TGAGCCCCCT TATTGCTGAC CCTTTCCTGT CTGAGGCCAA CTGCTTGCCT 540 

GCCAATAGCA CGGCCAACAC CACCTCCCGA GGCCACCTGT TCCATGTCTC CAGGGTGCTG 600 

Cf\ GGCCAGCACC ACGTAGATGC CAAGCCTTGG TCCAACCAGA CGTTCCCAGG GCTGACTCCA 660 

3U AAGGACGGGG CAGGGACCCG AGTGTCCTAT GCCTTCTGGA TCATGGCCCT CATCGATCTT 720 

CCAGTGCCCA TGGCTGTGCT GATGCTGCTG TCCAAGGAGC GGCTGCTGAC CTGCTGTCCC 780 

CAGAGGAGGC CCCTGCTTCT GTCTGCTGAT GAGCTTGCCT TGGAGACACA GCCTCCTGAG 840 

AAGGAAGATG CCTCCTCACT GCCCCCAAAG TTTCAGTCAC ACCTAGGGCA TGAGGACCTG 900 

rr TTCAGCTGCT GCCAAAGGAA GAACCTCAGA GGAGCCCCTT ATTCCTTCTT TGCCATCCAC 960 

0-> ATCACGGGCG CCCTGGTACT GTTCATGACG GATGGGTTGA CGGGTGCCTA TTCCGCCTTC 1020 

GTGTACAGCT ATGCTGTGGA GAAGCCCCTG TCTGTGGGAC ACAAGGTGGC TGGCTACCTC 1080 

CCCAGCCTCT TCTGGGGCTT CATCACACTG GGCOGGCTCC TCTCCATTCC CATATCCTCA 1140 

AGAATGAAGC CGGCCACCAT GGTTTTCATC AACGTGGTTG GCGTGGTGGT GACGTTCCTG 1200 

s-r\ GTGCTGCTTA TTTTCTCCTA CAACGTCGTC TTCCTGTTCG TGGGGACGGC AAGCCTGGGC 1260 

OU CTGTTTCTCA GCAGCACCTT CCCCAGCATG CTGGCCTACA CGGAGGACTC GCTGCAGTAC 1320 

AAAGGCTGTG CAACCACAGT GCTGGTGACA GGGGCAGGAG TTGGCGAGAT GGTGCTGCAG 1380 

AT GCTGG TTG GTTCGATATT CCAGGCTCAG GGCAGCTATA GTTTCCTGGT CTGTGGCGTG 1440 

ATCTTTGGTT GTCTGGCTTT TACCTTCTAT ATCTTGCTCC TGTTTTTCCA CAGGATGCAC 1500 

OCTGGACTCC CATCAGTTCC TACCCAAGAC AGATCAATTG GAATGGAAAA CTCTGAGTGC 1560 

OD TACCAGAGGT AAAACTGGGT GAAGAAGGCA AGAGAAGACT TTCAGCCTCT TGATCACCAG 1620 

CACGACCATA CTGTTTCAGA AAGCTGGGTO GTGGTGGAGG CGCTCTCTCA ATGGCTATTC 1680 

AAGTCTTCTC CACTAAAACT TGGTTGGGTA GAGGAAATTA AATTGAGTCC TGGTACCTGG 1740 

TCAAAATCAT TAGAAGTTTA CCTGGCTTCT CAAGTTATCT TCTTCCCTGG TTCAGACTGT 1800 

TGGTAAGAGC TGTCCAGATA CCCAGATGGG AAGGAAGGAG ACAGCCGCGC GCTTCACTCC 1860 

/U ATTTGTCACC TCATGCATGG ACCATACTCT GGGTTTGAGA TCATTCTTCA TTGAAGTTTG 1920 

TAAAAATAGG TTGAAATTGT AAAGCTCCAT GATCACTGCT ATATGTAGAT ATATTTCAAT 1980 

TTAAGCAAAA CAAGCTGCAA GTTATTCCCT GGCATGCTCA AAGGATTTTC GTGCTTTTCA 2040 

CTTAATAGTC CAAAGTCTCT TAAATTCCTG CTGCAGATAT CAATAGCTTA TCTATATTCT 2100 

- CAAACACCAA AAGGAAAAGT TGAATCTTGC TCTCTTTGGT ATACTAATGT AGTGGTATGC 2160 
/J TAAGCTGGCT CATACCAACT TAGAAAAGCT GATTGTAAAA TTTTCATTTT GACAGCTGGT 2220 

TATTAAATGC AGCCATTATT AAAAATCAAA TCATACAAAC TTATAATTAA ATCAATTACA 2280 

TTTAAAACAA AGGTAATAAA TATTCAAAGC ATATCACTTC CT 2322 
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Seq ID NO: C123 DNA Sequence 
Nucleic Acid Accession 8: NM_002203.2 
Coding sequence: 43.. 3588 ~ 

1 11 21 31 41 
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CTGCAAACCC AGOGCAACTA CGGTCCCCGG GTCAGACCCA GGATGGGGCC AGAACGGACA 60 

GGGGCCGCGC CGCTGCOGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 

TG TTTGG CCT ACAATGTTGG TCTCCCAGAA GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

CAGTTTGGGT ATGCAGTGCA GCACTTTATA AATCCAAAAG GCAACTGQTT ACTGGTTGGT 240 

J TCACCCTGGA GTGGCTTTCC TGAGAACCGA ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

CTATCCACTG CCACATGTGA AAAACTAAAT TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

ACTGAGATGA AAACCAACAT GAGCCTCGGC TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

GGTTTTCTCA CATGTGGTCC TCTGTGGGCA CAGCAATGTG GGAATCAGTA TTACACAACG 480 

1 yN GGTGTGTGTT CTGACATCAG TCCTGATTTT CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540 

1U CAGCCCTGCC CTTCCCTCAT AGATGTTGTG GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 

CCTTGGGATG CAGTAAAGAA TTTTTTGGAA AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

ACAAAGACAC AGGTGGGGTT AATTCAGTAT GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

AACACATATA AAACCAAAGA AGAAATGATT GTAGCAACAT CCCAGACATC CCAATATGGT 780 

GGGGACCTCA CAAACACATT CGGAGCAATT CAATATGCAA GAAAATATGC CTATTCAGCA 840 

I J GCTTCTGGTG GGCGAOGAAG TGCTACGAAA GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

CATGATGGTT CAATGTTGAA AGCTGTGATT GATCAATGCA ACCATGACAA TATACTGAGG 960 

TTTGGCATAG CAGTTCTTGG GTACTTAAAC AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

AAAGAAATAA AAGCGATCGC TAGTATTCCA ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

ryp. GAAGCAGCTC TACTAGAAAA GGCTGGGACA TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

ZK) ACTGTTCAAG GAGGAGACAA CTTTCAGATG GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

TACTCTTCTC AAAATGATAT TCTGATGCTG GGTGCAGTGG GAGCTTTTGG CTG6AGTGGG 1260 

ACCATTGTCC AGAAGACATC TCATGGCCAT TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

ATTCTGCAGG ACAGAAATCA CAGTTCATAT TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 

„ GGAGAAAGCA CTCACTTTGT TGCTGGTGCT CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440 

£J CTATATAGTG TGAATGAGAA TGGCAATATC ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

GACGTGCTCT TGGTAGGTGC ACCAATGTAC ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

GTCTACCTGT TTACTATCAA AAAGGGCATT TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GAGGGCATTG AAAACACTCG ATTTGGTTCA GCAATTGCAG CTCTTTCAGA CATCAACATG 1740 

J\) GATGGCTTTA ATGATGTGAT TGTTGGTTCA CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

TACATTTACA ATGGTCATCA GGGCACTATC CGCACAAAGT ATTCCCAGAA AATCTTGGGA I860 

TC CGA TGGAG CCTTTAGGAG CCATCTCCAG TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

GATTTAAATG GGGATTCCAT CACCGATGTG TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

~r C TCTG GTCAC AAAGTATTGC TGATGTAGCT ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 

3 J ACTTTGGTCA ACAAGAATGC TCAGATAATT CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

CCTACTAAGC AAAACAATCA AGTGGCCATT GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTTCATCCA GAGTAACCTC CAGGGGGTTA TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220 

AAGAATATGG TAGTAAATCA AGCACAGAGT TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 

A n. CCCTCTGATG TTGTCAACTC TTTGGATTTG CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

4U ACTAGCCCTG CCCTTGAAGC CTATTCTGAG ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

AAAGACTGTG GTGAGGATGG ACTTTGCATT TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

CCAGCTGCTC AAGAACAACC CTTTATTGTC AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

GTAACACTGA AAAATAAAAG GGAAAGTGCA TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

GAAAACTTGT TTTTTGCATC ATTCTCCCTA CCGGTTGATG GGACAGAAGT AACATGCCAG 2640 

GTGGCTGCAT CTCAOAAGTC TGTTGCCTGC GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

CAACAGGTGA CTTTTACTAT TAACTTTGAC TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

AACCTCAAAA TTCCTCTCCT GTATGATGCT GAAATTCACT TAACAAGATC TACCAACATA 2880 

rr\ AATTTTTATG AAATCTCTTC GGATGGGAAT GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

OK) GTTGGTCCAA AATTCATCTT CTCCCTGAAG GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

GCAACTGTAA TCATCCACAT CCCTCAGTAT ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ACTGGGGTGC AAACAGACAA GGCTGGTGAC ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

AAAATAGGAC AAACATCTTC TTCTGTATCT TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

cr GAA7TGAACT GCAG AACTG C TTCCTGTAGT AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

J J ATGAAAGGAG AATACTTTGT TAATGTGACT ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 

TCAACGTTCC AGACAGTACA GCTAAOGGCA GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATATATGTGA TTGAAGATAA CACTGTTACG ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 

AAAGCCGAAG TACCAACAGG AGTTATAATA GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

TTAGCTCTGG TTGCAATTTT ATGGAAGCTC GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

OU ACCAAAAATC CAGATGAGAT TGATGAGACC ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

ACCTGCAGTG GGAACCGGCA GCATCCCAGC CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

TTTTTAAATC CCATATTTTT TTTATCATGT CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

AAAACTGCAG GTCAGTTTGG ATGAAGAAAT TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

GTAGGGAAAT AATAGGGAAA ATACCTATTT TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

Oj aactggctgg cccagagttt acattctaat ttgcattgtg TCAGAAACAT GAAATGCTTC 3900 

CAAGCATGAC AACTTTTAAA GAAAAATATG ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

TCTCTTTAAA ATATTTGTCT TTAAACAGCA ACTACAGAAG TGQAAGTGCT TGATATGTAA 4020 

GTACTTCCAC TTGTGTATAT TTTAATGAAT ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

ACAGGTTTTT TCAATTTATG CTGCTCATCC AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

/U TAATTTTATT TATAAACTAG GTAAAATTTG TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

CTTCCACACC CCATCTTGCT CTAATGATCA AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

TACCTCCTAT ATGTCCATTT AAGTTAGGAG AGGGGGOGAT ATAGAGACTA AGGCACAAAA 4320 

TTTTGTTTAA AACTCAGAAT ATAACATTTA TGTAAAATCC CATCTGCTAG AAGCCCATCC 4380 

qc TGTGCCAGAG GAAGGAAAAG GAGGAAATTT CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

I J CTTCTAGGAT TTGTTTGGCT GACTGGCAGT AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4S00 

TTCTTTGGCA ACCTTCCTCC TCCCTTACTG AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

TATTATAGAA GCCCTCTACA GCCTGACTTT CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

TTACCCCTCA TCCAAAGTTC CCACTCCTTC AGGACAGCTG CTGTGCATTA GATATTAGGG 4680 

GGGAAAGTCA T CTGTTT AAT TTACACACTT GCATGAATTA CTGTATATAA ACTCCTTAAC 4740 

TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 4800 

CTAAATGTTG GAATGTTATG GGATGTAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 4860 

CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCAGGTGC ACCTTCTGTG 4920 

GCTGTCTTGT TTCTGAAGTA CTTTTTCTTC CACAAGAGTG AATTTGACCT AGGCAAGTTT 4980 

GTTCAAAAGG TAGATCCTGA GATGATTTGG TCAGATTGGG ATAAGGCCCA GCAATCTGCA 5040 
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TTTTAACAAG CACCCCAGTC ACTAGGATGC AGATGGACCA CACTTTGAGA AACACCACCC 5100 

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAGAAA SI 60 

CTTAGATTAA AATTCACAGA CACTACATAT CTAAAGCTTT GACAAGTCCT TGACCTCTAT S220 

AAACTTCAGA GTCCTCATTA TAAAATQGGA AGACTGAGCT GGAGTTCAGC AGTGATGCTT 52 80 

J TTTAGTTTTA AAAGTCTATG ATCTGATCTG GACTTCCTAT AATACAAATA CACAATCCTC S340 

CAAGAATTTG ACTTGGAAAA G 5361 

Seq ID NO: C124 DNA Sequence 
A Nucleic Acid Accession NMJJ31460 • 
1U Coding sequence « 103.. 1101 

1 11 21 31 41 SI 

I I I I I I 

AGCAGGCGTT TGCGAGAGGA GATACGAGCT GGACGCCTGG CCCTTCCCTC CCACCGGGTC 60 

ID CTAGTCCACC GCTCCCGGCG CCGGCTCCCC GCCTCTCCCG CTATGTACCG ACCGCGAGCC 120 

CGGGCGGCTC CCQAGGGCAG GGTCCGGGGC TGCGCGGTGC CCAGCACCGT GCTCCTGCTG 180 

CTCGCCTACC TGGCTTACCT GGCGCTGGGC ACCGGCGTGT TCTGGACGCT GGAGGGCCGC 240 

GCGGCGCAGG ACTCCAGCCG CAGCTTCCAG CGOGACAAGT GGGAGCTGTT GCAGAACTTC 300 

ACGTGTCTGG ACOGCCCGGC GCTGGACTCG CTGATCCGGG ATGTCGTCCA AGCATACAAA 360 

AACTOAGOCA GCCTCCTCAG CAACACCACC AGCATGGGGC GCTGGGAGCT GGTGGGCTCC 420 

TTCTTCTTTT CTGTGTCCAC CATCACCACC ATTGGCTATG GCAACCTGAG CCCCAACACG 480 

ATGGCTGCCC GCCTCTTCTG CATCTTCTTT GCCCTTGTGG GGATCCCACT CAACCTCGTG 540 

GTGCTCAACC GACTGGGGCA TCTCATGCAG CAGGGAGTAA ACCACTGGGC CAGCAGGCTG 600 

0 GGGGGCACCT GGCAGGATCC TGACAAGGCG CGGTGGCTGG CGGGCTCTGG CGCCCTCCTC 660 

iO TCGGGCCTCC TGCTCTTCCT GCTGCTGCCA CCGCTGCTCT TCTCCCACAT GGAGGGCTGG 720 

AGCTACACAG AGGGCTTCTA CTTCGCCTTC ATCACCCTCA GCACCGTGGG CTTCGGCGAC 780 

TACGTGATTG GAATGAACCC CTCCCAGAGG TACCCACTGT GGTACAAGAA CATGGTGTCC 840 

CTGTGGATCC TCTTTGGGAT GGCATGGCTG GCCTTGATCA TCAAACTCAT CCTCTCCCAG 900 

CTGGAGACGC CAGGGAGGGT ATGTTCCTGC TGCCACCACA GCTCTAAGGA AGACTTCAAG 960 

JU TCCCAAAGCT GGAGACAGGG ACCTGACCGG GAGCCAGAGT CCCACTCCCC ACAGCAAGGA 1020 

TGCTATCCAG AGGGACCCAT GGGAATCATA CAGCATCTGG AACCTTCTGC TCAOGCTGCA 1080 

GGCTGTGGCA AGGACAGCTA GTTATACTCC ATTCTTTGGT CGTCGTCCTC GGTAGCAAGA 1140 

CCCCTGATTT TAAGCTTTGC ACATGTCCAC CCAAACTAAA GACTACATTT TCCATCCACC 1200 

^ CTAGAGGCTG GGTGCAGCTA TATGATTAAT TCTGCCCAAT AGGGTATACA GAGACATGTC 1260 

CTGGGTGACA TGGGATGTGA CTTTCGGGTG TCGGGGCAGC ATGCCCTTCT CCCCCACTTC 1320 

CTTACTTTAG CGGGCTGCAA TGCCGCCGAT ATGATGGCTG GGAGCTCTGG CAGCCATACG 1380 

GCACCATGAA GTAGCGGCAA TGTTTGAGCG GCACAATTAG ATAGGAAGAG TCTGGATCTC 1440 

TGATGATCAC AGAGCCATCC TAACAAACGG AATATCACCG ACCCTCCTTT ATGTGAGAGA 1500 

GAAATAAACA TCTATGAAA 1519 
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Seq ID NO: C125 DNA Sequence 
Nucleic Acid Accession ih NM_004154 
Coding sequence : 3 0 9 . . 12 95 



1 U 21 31 41 SI 

I I I I 1 I 

AAGGACAGAG GAGGGGCCCT TCCTGTCAGC TGGCTGGGAG CAGAGGTGGC TTTGTCTTTT 60 

CGGAAGAACT GGTTCTGTGG AATTTGTGCT TATTTCCCAT CAAGGATCAA GGACCTGCTC 120 

TGGGGCTACC TCAGGGCCCC ACAGGATGAG GGGCTGGTTT TCAGATGAGT TTTCTGCTTG 180 

CCTGTCATCT GGATAGTGTC TAAAAATTTG CAAACTGCCT TCTTGTCAGT GTCTTGCTCA 240 

TTCTTCATGA CACTCCTGAT ATGTCTCTCA GTTTCCTCAT CTGCTGCCTC TCCAGACTTC 300 

TGCCAGAACA TTGCACGCGA CAGTTTCAGG CACAGAACTG ACTGGCAGCA GGOGCTGCTC 360 

CACGAGTGGG AATTTGCTCC AGCACTTCAC GGACTGCAAG CGAGGCACTT GCTAACTCTT 420 

GGATAACAAG ACCTCTGCCA GAAGAACCAT GGCTTTGGAA GGCGGAGTTC AGGCTGAGGA 480 

GATGGGTGCG GTCCTCAGTG AGCCCCTGCC TCCCTGAACA TAGGAAACCC ACCTGGGCAG S40 

CCATGGAATG GGACAATGGC ACAGGCCAGG CTCTGGGCTT GCCACCCACC ACCTGTGTCT 600 

ACCGCGAGAA CTTCAAGCAA CTGCTGCTGC CACCTGTGTA TTCGGCGGTG CTGGCGGCTG 660 

GCCTGCCGCT GAACATCTGT GTCATTACCC AGATCTGCAC GTCCCGCCGG GCCCTGACCC 720 

GCACGGCCGT GTACACCCTA AACCTTGCTC TGGCTGACCT GCTATATGCC TGCTCCCTGC 780 

CCCTGCTCAT CTACAACTAT GCCCAAGGTG ATCACTGGCC CTTTGGCGAC TTCGCCTGCC 840 

GCCTGGTCCG CTTCCTCTTC TATGCCAACC TGCACGGCAG CATCCTCTTC CTCACCTGCA 900 

TCAGCTTCCA GCGCTACCTG GGCATCTGCC ACCCGCTGQC CCCCTGGCAC AAACGTGGGG 960 

GCCGCCGGGC TGCCTGGCTA GTGTGTGTAG CCGTGTGGCT GGCCGTGACA ACCCAGTGCC 1020 

TGCCCACAGC CATCTTCGCT GCCACAGGCA TCCAGCGTAA CCGCACTGTC TGCTATGACC 1080 

TCAGCCCGCC TGCCCTGGCC ACCCACTATA TGCCCTATGG CATGGCTCTC ACTGTCATCG 1140 

GCTTCCTGCT GCCCTTTGCT GCCCTGCTGG CCTGCTACTG TCTCCTGGCC TGCCGCCTGT 1200 

GCCGCCAGGA TGGCCCGGCA GAGCCTGTGG CCCAGGAGCG GCGTGGCAAG GCGGCCCGCA 1260 

TGGCCGTGGT GGTGGCTGCT GCCTTTGCCA TCAGCTTCCT GCCTTTTCAC ATCACCAAGA 1320 

CAGCCTACCT GGCAGTGCGC TCGACGCCGG GCGTCCCCTG CACTGTATTG GAGGCCTTTG 1380 

/U CAGCGGCCTA CAAAGGCACG CGGCCGTTTG CCAGTGCCAA CAGCGTGCTG GACCCCATCC 1440 

TCTTCTACTT CACCCAGAAG AAGTTCCGCC GGCGACCACA TGAGCTCCTA CAGAAACTCA 1S00 

CAGCCAAATO GCAQAGGCAG GGTCGCTGAG TCCTCCAGGT CCTGGGCAGC CTTCATATTT 1560 

GCCATTGTGT CCGGGGCACC AGGAGCCCCA CCAACCCCAA ACCATGCGGA GAATTAGAGT 1620 

TCAGCTCAGC TGGGCATGGA GTTAAGATCC CTCACAGGAC CCAGAAGCTC ACCAAAAACT 1680 

ATTTCTTCAG CCCCTTCTCT GGCCCAGACC CTGTGGGCAT GGAGATGGAC AGACCTGGGC 1740 

CTGGCTCTTG AGAGGTCCCA GTCAGCCATG GAGAGCTGGG GAAACCACAT TAAGGTGCTC 1800 

ACAAAAATAC AGTGTGACGT GTACTGTCAA AA 1832 



_ A Seq ID NO: C126 DNA Sequence 
oU Nucleic Acid Accession ft: NM_007197 
Coding sequence: 16.. 1763 

1 11 21 31 41 SI 
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ACACGTCCAA CGCCAGCATG CAGCGCCCGG GCCCCCGCCT GTGGCTGGTC CTGCAGGTGA 60 

TGGGCTCGTG CGCCGCCATC AGCTCCATGG ACATGGAGCG CCCGGGOGAC GGCAAATGCC 120 

AGCCCATCGA GATCCCGATG TGCAAGGACA TCGGCTACAA CATGACTCGT ATGCCCAACC 180 

TGATGGGCCA CGAGAACCAG C60GAGGCAG CCATCCAGTT GCACGAGTTC GOGCOGCTGG 240 

D TGGAGTACGG CTGCCAOGGC CACCTCCGCT TCTTCCTGTG CTCGCTGTAC GOGCOSATGT 300 

GCACOGAGCA GGTCTCTACC CCCATCCCOG CCTGCOGGGT CATGTGCGAG CAGGCOCGGC 360 

TCAAGTGCTC CCCGATTATG GAGCAGTTCA ACTTCAAGTG GCCCGACTCC CTGGACTGCC 420 

GGAAACTCCC CAACAAGAAC GACCCCAACT ACCTGTGCAT GGAGGCGCCC AACAACGGCT 4 BO 

CGGACGAGCC CACCCGGGGC TCGGGCCTGT TCCCGCCGCT GTTCCGGCCG CAGCGGCCCC 540 

1U ACAGCGCGCA GGAGCACCCG CTGAAGGACG GGGGCCCCGG GCGCGGCGGC TGCGACAACC 600 

CGGGCAAGTT CCACCACGTG GAGAAGAGCG CGTCGTGCGC GCOGCTCTGC ACGCCCGGOG 660 

TGGACGTGTA CTGGAGCCGC GAGGACAAGC GCTTCGCAGT GGTCTGGCTG GCCATCTGGG 720 

CGGTGCTGTG CTTCTTCTCC AGCGCCTTCA CCGTGCTCAC CTTCCTCATC GACCCGGCCC 780 

GCTTCCGCTA CCCCGAGCGC CCCATCATCT TCCTCTCCAT GTGCTACTGC GTCTACTCCG 840 

TGGGCTACCT CATCCGCCTC TTCGCCGGCG CCGAGAGCAT CGCCTGCGAC CGGGACAGCG 900 

GCCAGCTCTA TGTCATCCAG GAGGGACTGG AGAGCACCGG CTGCACGCTG GTCTTCCTGG 960 

TCCTCTACTA CTTCGGCATG GCCAGCTCGC TGTGGTGGGT GGTCCTCACG CTCACCTGGT 1020 

TCCTGGCCGC CGGCAAGAAG TGGGGCCACG AGGCCATCGA AGCCAACAGC AGCTACTTCC 1080 

ACCTGGCAGC CTGGGCCATC CCGGCGGTGA AGACCATCCT GATCCTGGTC ATGCGCAGGG 1140 

TGGCGGGGGA CGAGCTCACC GGGGTCTGCT ACGTGGGCAG CATGGACGTC AACGCGCTCA 1200 

CCGGCTTCGT GCTCATTCCC CTGGCCTGCT ACCTGGTCAT CGGCACGTCC TTCATCCTCT 1260 

CGGGCTTCGT GGCCCTGTTC CACATCCGGA GGGTGATGAA GACGGGCGGC GAGAACACGG 1320 

ACAAGCTGGA GAAGCTCATG GTGCGTATCG GGCTCTTCTC TGTGCTGTAC ACCGTGCCGG 1380 

CCACCTGTGT GATCGCCTGC TACTTTTACG AACGCCTCAA CATGGATTAC TGGAAGATCC 1440 

TGGCGGCGCA GCACAAGTGC AAAATGAACA ACCAGACTAA AACGCTGGAC TGCCTGATGG 1500 

CCGCCTCCAT CCCCGCCGTG GAGATCTTCA TGGTGAAGAT CTTTATGCTG CTGGTGGTGG 1560 

GGATCACCAG CGGGATGTGG ATTTGGACCT CCAAGACTCT GCAGTCCTGG CAGCAGGTGT 1620 

GCAGCOGTAG GTTAAAGAAG AAGAGCCGGA GAAAACCGGC CAGCGTGATC ACCAGCGGTG 1680 

GGATTTACAA AAAAGCCCAG CATCCCCAGA AAACTCACCA CGGGAAATAT GAGATCCCTG 1740 

CCCAGTOGCC CACCTGCGTG TGAACAGGGC TGGAGGGAAG GGCACAGGGG CGCCCGGAGC 1800 

TAAGATGTGG TGCTTTTCTT GGTTGTGTTT TTCTTTCTTC TTCTTCTTTT TTTTTTTTTT 1860 

ATAAAAGCAA AAGAGAAATA CATAAAAAAG TGTTTACCCT GAAATTCAGG ATGCTGTGAT 1920 

ACACTGAAAG GAAAAATGTA CTTAAAGGGT TTTGlTrrr GT TTTGGTTTTC CAGCGAAGGG 1980 

AAGCTCCTOC AGTGAAGTAG CCTCTTGTGT AACTAATTTG TGGTAAAGTA GTTGATTCAG 2040 

CCCTCAGAAG AAAACTTTTG TTTAGAGCCC TCCGTAAATA TACATCTGTG TATTTGAGTT 2100 

GGCTTTGCTA CCCATTTACA AATAAGAGGA CAGATAACTG CTTTGCAAAT TCAAGAGCCT 2160 

CCCCTGGGTT AACAAATGAG CCATCCCCAG GGCCCACCCC CAGGAAGGCC ACAGTGCTGG 2220 

GCGGCATCCC TGCAGAGGAA AGACAGGACC CGGGGCCCGC CTCACACCCC AGTGGATTTG 2280 

GAGTTGCTTA AAATAGACTC TGGCCTTCAC CAATAGTCTC TCTGCAAGAC AGAAACCTCC 2340 

ATCAAACCTC ACATTTGTGA ACTCAAACGA TGTGCAATAC ATTTTTTTCT CTTTCCTTGA 2400 

AAATAAAAAG AGAAACAAGT ATTTTGCTAT ATATAAAGAC AACAAAAGAA ATCTCCTAAC 2460 

AAAAGAACTA AGAGGCCCAG CCCTCAGAAA CCCTTCAGTG CTACATTTTG TGGCTTTTTA 2520 

ATGGAAACCA AGCCAATGTT ATAGACGTTT GGACTGATTT GTGGAAAGGA GGGGGGAAGA 2580 

GGGAGAAGGA TCATTCAAAA GTTACCCAAA GGGCTTATTG ACTCTTTCTA TTGTTAAACA 2640 

AATGATTTCC ACAAACAGAT CAGGAAGCAC TAGGTTGGCA GAGACACTTT GTCTAGTGTA 2700 

TTCTCTTCAC AGTGCCAGGA AAGAGTGGTT TCTGCGTGTG TATATTTGTA ATATATGATA 2760 

TTTTTCATGC TCCACTATTT TATTAAAAAT AAAATATGTT CTTTAAAAAA A 2811 
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Seq ID NO: C127 DNA Sequence 
jU Nucleic Acid Accession #: NM 005761.1 
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Coding sequence: 250.. 4956 

1 11 21 31 41 51 

I I I I I I 

GCGAGGAGGA AACGGTGCCG GAGCGCGCAG GGCTTGCTGC CGCCACCGCG GCTGCACAGG 60 

CTGCCGGAGC GAGCCTGCCG OGCGCCGCOC TCCOCGCTCT CCTTCCTGGG CGAGCTGCGG 120 

GGATGGGGCG GCCGOGGGAG CCCGAGCGCG CGCAGGAACC GCCGCCGCCG CCGCCOGOGT 180 

CTCCGTTGCC GCGCGCCTGA GCCGCCGTOG CCGCCGCGCG CCCTGCCCGG GGGCGGCCCC 240 

CCCAGCCCCA TGGAGGTCTC CCGGAGGAAG GCGCCGCCGC GCCCCCCGCG CCCCGCAGCG 300 

CCACTGCCCC TGCTOGCCTA TCTGCTGGCA CTGGCGGCTC COGGCCGGGG CGCGGACGAG 360 

CCCGTGTGGC GGTCGGAGCA AGCCATCGGA GCCATCGCGG CGAGCCAGGA GGACGGCGTG 420 

TTTGTGGCGA GCGGCAGCTG CCTGGACCAG CTGGACTACA GCCTGGAGCA CAGCCTCTCG 480 

OGCCTGTACC GGGACCAAGC GGGCAACTGC ACAGAGCCGG TCTCGCTGGC GCCCCCCGCG 540 

CGGCCCCGGC CCGGGAGCAG CTTCAGCAAG CTGCTGCTGC CCTACCGCGA GGGGGCGGCC 600 

GGCCTCGGGG GGCTGCTGCT CACCGGCTGG ACCTTCGACC GGGGOGCCTG CGAGCTGCGG 660 

CCCCTGGGCA ACCTGAGCCG CAACTCCCTG CGCAACGGCA CCGAGGTGGT GTCGTGCCAC 720 

CCGCAGGGCT CGACGGCCGG CGTGGTGTAC CGCGCGGGCC GGAACAACOG CTGGTACCTG 780 

GCGGTGGCCG CCACCTACGT GCTGCCTGAG CCGGAGACGG CGAGCCGCTG CAACCCCGCG 840 

GCATCCGACC ACGACACGGC CATCGOGCTC AAGGACACGG AGGGGCGCAG CCTGGCCAOG 900 

CAGGAGCTGG GGCGCCTCAA GCTGTGCGAG GGCGCGGGCA GCCTGCACTT CGTGGACGCC 960 

TTTCTCTGGA ACGGCAGCAT CTACTTCCCC TACTACCCCT ACAACTATAC GAGCGGCGCT 1020 

GCCACCGGCT GGCCCAGCAT GGCGCGCATC GCGCAGAGCA CCGAGGTGCT GTTCCAGGGC 1080 

CAGGCATCCC TCGACTGCGG CCACGGCCAC CCCGACGGCC GCCGCCTGCT CCTCTCCTCC 1140 

AGCCTAGTGG AGGCCCTGGA CGTCTGGGCG GGAGTGTTCA GCGCGGCCGC TGGAGAGGGC 1200 

CAGGAGCGGC GCTCCCCCAC CACCACGGCG CTCTGCCTCT TCAGAATGAG TGAGATCCAG 1260 

GCGCGCGCCA AGAGGGTCAG CTGGGACTTC AAGAOGGCCG AGAGCCACTG CAAAGAAGGG 1320 

GATCAACCTG AAAGAGTCCA ACCAATCGCA TCATCTACCT TGATCCATTC CGACCTGACA 1380 

TCCGTTTATG GCACCGTGGT AATGAACAGG ACTGTTTTAT TCTTGGGGAC TGGAGATGGC 1440 

CAGTTACTTA AGGTTATTCT TGGTGAGAAT TTGACTTCAA ATTGTCCAGA GGTTATCTAT 1500 

GAAATTAAAG AAGAGACACC TGTTTTCTAC AAACTCGTTC CTGATCCTGT GAAGAATATC IS 60 

TACATTTATC TAACAGCTGG GAAAGAGGTG AGGAGAATTC GTGTTGCAAA CTGCAATAAA 1620 

CATAAATCCT GTTCGGAGTG TTTAACAGCC ACAGACCCTC ACTGCGGTTG GTGCCATTCG 1680 

CTACAAAGGT GCACTTTTCA AGGAGATTGT GTACATTCAG AGAACTTAGA AAACTGGCTG 1740 

GATATTTCGT CTGGAGCAAA AAAGTGCCCT AAAATTCAGA TAATTCGAAG CAGTAAAGAA 1800 
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AAGACTACAG TGACTATGGT GGGAAGCTTC TCTCCAAGAC ACTCAAAGTG CATGGTQAAG 1860 

AATGTGGACT CTAGCAGGGA GCTCTGCCAG AATAAAAGTC AGCCCAACCG GACCTGCACC 1920 

TGTAGCATCC CAACCAGAGC AACCTACAAA GATGTTTCAG TTGTCAACGT GATGTTCTCC 1980 ■ 

TTOGGTTCTT GGAATTTATC AQACAGATTC AACTTTACCA ACTGCTCATC ATTAAAAGAA 2040 

5 TGCCCAGCAT GCGTAGAAAC TGGCTGCGCG TGGTGTAAAA GTGCAAGAAG GTGTATCCAC 2100 

CCCTTCACAG CTTGCGACCC TTCTGATTAT GAGAGAAACC AGGAACAGTG TCCAGTGGCT 2160 

GTCGAGAAGA CATCAGGAGG AGGAAGACCC AAGGAGAACA AGGGGAACAG AACCAACCAG 2220 

GCTTTACAGG TCTTCTACAT TAAGTCCATT GAGCCACAGA AAGTATCGAC ATTAGGGAAA 2280 

AGCAACGTGA TAGTAACGGG AGCAAACTTT ACCCGGGCAT CGAACATCAC AATGATCCTG 2340 

1U AAAGGAACCA GTACCTGTGA TAAGGATGTG ATACAGGTTA GCCATGTGCT AAATGACACC 2400 

CACATGAAAT TCTCTCTTCC ATCAAGCCGG AAAGAAATGA AGGATGTGTG TATCCAGTTT 2460 

GATGGTGGGA ACTGCTCTTC TGTGGGATCC TTATCCTACA TTGCTCTGCC ACATTGTTCC 2520 

CTTATATTTC CTGCTACCAC CTGGATCAGT GGTGGTCAAA ATATAACCAT GATGGGCAGA 2580 

AATTTTGATG TAATTGACAA CTTAATCATT TCACATGAAT TAAAAGGAAA CATAAATGTC 2640 

15 TCTGAATATT GTGTGGCGAC TTACTGCGGG TTTTTAGCCC CCAGTTTAAA GAGTTCAAAA 2700 

GTGGGCACGA ATGTCACTGT GAAGCTGAGA GTACAAGACA CCTACTTGGA TTGTGGAACC 2760 

CTGCAGTATC GGGAGGACCC CAGATTCACG GGGTATCGGG TGGAATCCGA GGTGGACACA 2820 

GAACTGGAAG TGAAAATTCA AAAAGAAAAT GACAACTTCA ATATTTGCAA AAAAGACATT 2880 

GAAATTACTC TCTTCCATGG GGAAAATGGG CAATTAAATT GCAGTTTTGA AAATATTACT 2940 

2U AGAAATCAAG ATCTTACCAC CATCCTTTGC AAAATTAAAG GCATCAAGAC TGCAAGCACC 3000 

ATTGCCAACT CTTCTAAGAA AGTTCGGGTC AAGCTGGGAA ACCTGGAGCT CTACGTCGAG 3060 

CAGGAGTCAG TTCCTTCCAC ATGGTATTTT CTGATTGTGC TCCCTGTCTT GCTAGTGATT 3120 

GTCATTTTTG CGGCCGTGGG GGTGACCAGG CACAAATCGA AGGAGCTGAG TCGCAAACAG 3180 

0 AGTCAACAAC TAGAATTGCT GGAAAGOGAG CTCOGGAAAG AGATAOGTGA CGGCTTTGCT 3240 

2.J GAGCTGCAGA TGGATAAATT GGATGTGGTT GATAGTTTTG GAACTGTTCC CTTCCTTGAC 3300 

TACAAACATT TTGCTCTGAG AACTTTCTTC CCTGAGTCAG GTGGCTTCAC CCACATCTTC 3360 

ACTGAAGATA TGCATAACAG AGACGCCAAC GACAAGAATG AAAGTCTCAC AGCTTTGGAT 3420 

GCCCTAATCT GTAATAAAAG CTTTCTTGTT ACTGTCATCC ACACCCTTGA AAAGCAGAAG 3480 

AACTTTTCTG TGAAGGACAG GTGTCTGTTT GCCTCCTTCC TAACCATTGC ACTGCAAACC 3540 

iU AAGCTGGTCT ACCTGACCAG CATCCTAGAG GTGCTGACCA GGGACTTGAT GGAACAGTGT 3600 

AGTAACATGC AGCGGAAACT CATGCTGAGA CGCACGGAGT CCGTCGTCGA AAAACTCCTC 3660 

ACAAACTGGA TGTCCGTCTG CCTTTCTGGA TTTCTCCGGG AGACTGTCGG AGAGCCCTTC 3720 

TATTTGCTGG TGAOGACTCT GAACCAGAAA ATTAACAAGG GTCOCGTGGA TGTAATCACT 3780 

TGCAAAGCCC TGTACACACT TAATGAAGAC TGGCTGTTGT GGCAGGTTCC GGAATTCAGT 3840 

35 ACTGTGGCAT TAAACGTOGT CTTTGAAAAA ATOCOGGAAA ACGAGAGTGC AGATGTCTGT 3900 

CGGAATATTT CAGTCAATGT TCTCGACTGT GACACCATTG GCCAAGCCAA AGAAAAGATT 3960 

TTCCAAGCAT TCTTAAGCAA AAATGGCTCT CCTTATGGAC TTCAGCTTAA TGAAATTGGT 4020 

CTTGAGCTTC AAATGGGCAC ACGACAGAAA GAACTTCTGG ACATCGACAG TTCCTCGGTG 4080 

ATTCTTGAAG ATGGAATCAC CAAGCTAAAC ACCATTGGCC ACTATGAGAT ATCAAATGGA 4140 

4U TCCACTATAA AAGTCTTTAA GAAGATAGCA AATTTTACTT CAGATGTGGA GTACTCGGAT 4200 

GACCACTGCC ATTTGAfTTT ACCAGATTCG GAAGCATTCC AAGATGTGCA AGGAAAGAGA 4260 

CATCGAGGGA AGCACAAGTT CAAAGTAAAA GAAATGTATC TGACAAAGCT GCTGTCGACC 4320 

AAGGTGGCAA TTCATTCTGT GCTTGAAAAA CTTTTTAGAA GCATTTGGAG TTTACCCAAC 4380 

AGCAGAGCTC CATTTGCTAT AAAATACTTT TTTGACTTTT TGGACGCCCA GGCTGAAAAC 4440 

45 AAAAAAATCA CAGATOCTGA CGTCGTACAT ATTTGGAAAA CAAACAGCCT TCCTCTTCGC 4500 

TTCTGGGTAA ACATCCTGAA GAACCCTCAG TTTGTCTTTG ACATTAAGAA GACACCACAT 4560 

ATAGAOGGCT GTTTGTCAGT GATTGCCCAG GCATTCATGG ATGCATTTTC TCTCACAGAG 4620 

CAGCAACTAG GGAAGGAAGC ACCAACTAAT AAGCTTCTCT ATGCCAAGGA TATCCCAACC 4680 

TACAAAGAAG AAGTAAAATC TTATTACAAA GCAATCAGGG ATTTGCCTCC ATTGTCATCC 4740 

5U TCAGAAATGG AAGAATTTTT AACTCAGGAA TCTAAGAAAC ATGAAAATGA ATTTAATGAA 4800 

GAAGTGGCCT TGACAGAAAT TTACAAATAC ATCGTAAAAT ATTTTGATGA GATTCTAAAT 4860 

AAACTAGAAA GAGAACGAGG GCTGGAAGAA GCTCAGAAAC AACTCTTGCA TGTAAAAGTC 4920 

TTATTTGATG AAAAGAAGAA ATGCAAGTGG ATGTAAGCAC TCTGGGGCCT GGCTTAATCT 4980 

GGCAAAGTTC TTCAGACGAC TTGGQ AGCA A AATGGCTGCT TGAGCTACTC TGTGTCGTTA 5040 

55 ATTTGTTGTT TGCACATAGG TTCCACTTTG GGCACTGTCT TTTTAAGAGA CCAAGGCACA 5100 

TGCACAGCTT TTAGAAAGCA A 5121 

Seq ID NO j Cl 28 DNA Sequence 
Nucleic Acid Accession 8: NM_002185.1 
OU Coding sequencer 23.. 1402 

1 11 21 31 41 51 

I I I I I I 

CTCTCTCTCT ATCTCTCTCA GAATGACAAT TCTAGGTACA ACTTTTGGCA TGGTTTTTTC 60 

TTTACTTCAA GTCGTTTCTG GAGAAAGTGG CTATGCTCAA AATGGAGACT TGGAAGATGC 120 

AGAACTGGAT GACTACTCAT TCTCATGCTA TAGCCAGTTG GAAGTGAATG GATCGCAGCA 180 

TTCACTGACC TGTGCTTTTG AGGACCCAGA TGTCAACACC ACCAATCTGG AATTTGAAAT 240 

ATGTGGGGCC CTCGTGGAGG TAAAGTGCCT GAATTTCAGG AAACTACAAG AGATATATTT 300 

CATCGAGACA AAGAAATTCT TACTGATTGG AAAGAGCAAT ATATGTGTGA AGGTTGGAGA 360 

AAAGAGTCTA ACCTGCAAAA AAATAGACCT AAOCACTATA GTTAAACCTG AGGCTCCTTT 420 

TGACCTGAGT GTCATCTATC GGGAAGGAGC CAATGACTTT GTGGTGACAT TTAATACATC 480 

ACACTTGCAA AAGAAGTATG TAAAAGTTTT AATGCATGAT GTAGCTTACC GCCAGGAAAA 540 

GGATGAAAAC AAATGGACGC ATGTGAATTT ATCCAGCACA AAGCTGACAC TCCTGCAGAG 600 

AAAGCTCCAA CCGGCAGGAA TGTATGAGAT TAAAGTTOGA TCCATCCCTG ATCACTATTT 660 

TAAAGGCTTC TGGAGTGAAT GGAGTCCAAG TTATTACTTC AGAACTCCAG AGATCAATAA 720 

TAGCTCAGGG GAGATGGATC CTATCTTACT AACCATCAGC ATTTTGAGTT TTTTCTCTGT 780 

CGCTCTGTTG GTCATCTTGG CCTGTGTGTT ATGGAAAAAA AGGATTAAGC CTATCGTATG 840 

GCCCAGTCTC CCCGATCATA AGAAOACTCT GGAACATCTT TGTAAGAAAC CAAGAAAAAA 900 

TTTAAATGTG AGTTTCAATC CTGAAAGTTT CCTGGACTGC CAGATTCATA GGGTGGATGA 960 

CATTCAAGCT AGAGATGAAG TGGAAGGTTT TCTGCAAGAT ACGTTTCCTC AGCAACTAGA 1020 

AGAATCTGAG AAGCAGAGGC TTGGAGGGGA TGTGCAGAGC CCCAACTGCC CATCTGAGGA 1080 

TGTAGTCGTC ACTCGAGAAA GCTTTGGAAG AGATTGATCC CTCACATGCC TGGCTGGGAA 1140 

TGTCAGTGCA TGTGACGCCC CTATTCTCTC CTCTTCCAGG TCCCTAGACT GCAGGGAGAG 1200 

TGGCAAGAAT GGGCCTCATG TGTACCAGGA CCTCCTGCTT AGCCTTGGGA CTACAAACAG 1260 
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CACGCTGCCC CCTCCATTTT CTCTCCAATC TGGAATCCTG ACATTGAACC CAGTTGCTCA 1320 

GGGTCAGCCC ATTCTTACTT CCCTGGGATC AAATCAAGAA GAAGCATATG TCACCATGTC 1380 

CAGCTTCTAC CAAAACCAGT GAAGTGTAAG AAACCCAGAC TGAACTTACC GTGAGCGACA 1440 

AAGATGATTT AAAAGGGAAO TCTAGAGTTC CTAGTCTCCC TCACAGCACA GAGAAGACAA 1500 

5 AATTAGCAAA ACCCCACTAC ACAGTC7GCA AGATTCTGAA ACATTGCTTT GACCACTCTT 1560 

CCTGAGTTCA GTGGCACTCA ACATGAGTCA AGAGCATCCT GCTTCTACCA TGTGGATTTG 1620 

GTCACAAGGT TTAAGGTGAC CCAATGATTC AGCTATTT 1658 

Seq ID NO; C129 dna Sequence 
10 Nucleic Acid Accession #: NMJJ02722.1 
Coding sequence: IS.. 302 

1 11 21 31 41 51 

I I 1 I I I 

ACTCTGGACT CCGGATGGCT GCCGCACGCC TCTGCCTCTC CCTGCTGCTC CTGTCCACCT 60 

GCGTGGCTCT GTTACTACAG CCACTGCTGG GTGCCCAGGG AGCCCCACTG GAGCCAGTGT 120 

ACCCAGGGGA CAATGCCACA CCAGAGCAGA TGGCCCAGTA TGCAGCTGAT CTCCGTAGAT 180 

ACATCAACAT GCTGACCAGG CCTAGGTATG GGAAAAGACA CAAAGAGGAC ACGCTGGCCT 240 

TCTCGGAGTG GGGGTCCCCG CATGCTGCTG TCCCCAQGGA GCTCAGCCCG CTGGACTTAT 300 

AATGCCACCT TCTGTCTCCT ACGACTCCAT GAGCAGCGCC AGCCCAGCTC TCCCCTCTGC 360 

ACCCTTGGCT CTGGCCAAAG CTTGCTCCCT GCTCCCACAC AGGCTCAATA AAGCAAGTCA 420 

AAGCC 425 

Seq ID NO: CI 30 DNA Sequence 
Nucleic Acid Accession #: NM_032545-1 
Coding sequence: 4 7.. 7 18 

1 11 21 31 41 51 

I I I I i I 

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240 

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540 

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 720 

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 

TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840 

AAAAAA 846 

Seq ID NO: C131 DNA Sequence 
Nucleic Acid Accession ft: NM_006533.1 
Coding sequence: 72.. 46 7 

1 11 21 31 41 51 

I I I I I I 

AGGGAGAGAG GGAGGGGAGQ AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTCTCTTGCT 60 

CACAGTCCAC GATGGCCCGG TC0CTGGTG7 GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120 

TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 180 

CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTGAGGAC TACATGGCCC 240 

CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300 

AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG 360 

CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG 420 

GCAAAGTCGA TGTGAAGACA GACAAATGGG ATTTCTACTG CCAGTGAGCT CAGCCTACCG 480 

CTGGCCCTGC CGTTTCCCCT CCTTGGGTTT ATGCAAATAC AATCAGCCCA GTGCAAAC 538 

Seq ID NOt C132 DNA Sequence 
Nucleic Acid Accession ft: AB064272 
Coding sequence: 1..708 

1 11 21 31 41 51 

I I I I I I ' 

ATGACACAAG TCACAGAAAA GTCCACAGAA CACCCAGAAA AGACCACGTC AACCACAGAG 60 

AAAACCACAA GAACCCCAGA AAAGCCTACG CTATACTCAG AGAAGACCAT ATGCACCAAA 120 

GGGAAAAACA CACCAGTCCC AGAAAAGCCT ACAGAAAACC TGGGGAACAC CACACTGACC 180 

ACTGAGACCA TAAAAGCCCC AGTAAAGTCC ACAGAAAACC CAGAAAAAAC AGCAGCAGTC 240 

ACAAAGACTA TAAAACCTTC AGTCAAGGTC ACAGGAGACA AATCTCTCAC TACTACCTCT 300 

TCTCATCTAA ATAAAACTGA AGTTACTCAT CAGGTGCCCA CTGGTTCTTT CACCCTCATT 360 

ACATCTAGAA CGAAGCTGAG TTCTATCACA TCAGAAGCCA CAGGAAACGA GAGCCATCCA 420 

TACCTCAATA AAGATGGCTC ACAGAAAGGT ATCCACGCTG GACAGATGGG AGAGAATGAT 480 

TCATTCCCTG CATGGGCCAT AGTTATTGTG GTCCTGGTGG CTGTGATTCT CCTCCTGGTG 540 

TTCCTTGGCC TGATCTTCTT GGTCTCCTAT ATGATGCGGA CACGCCGCAC ACTAACCCAG 600 

AACACCCAGT ACAATGATGC AGAGGATGAG GGTGGCCCCA ATTCCTAOCC GGTCTACCTG 660 

ATGGAGCAGC AGAATCTTGG CATGGGCCAG ATCCCTTCCC CACGGTGA 708 
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Seq ID NO: C133 DNA Sequence 

Nucleic Acid Accession ft: NN_080B70.1 

Coding sequence: 3.. 710 
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Seq ID MO: CI 3 4 DMA Sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1.. 10674 



1 
I 

ATGTGGCCTC 
TTTCAGCAGA 
GGGGCCCCCG 
GTGGAGCGGC 
CGCCTGGAGC 
AGCGAGCTCA 
CGCGTGGCCA 
TCCACCCGCC 
TCCTACCQAG 
CTTCATGCTA 



TTTGGCATAT 
GAGCACTGTT 
CATGAAGATC 
TGTGATGAAG 
GGCCATTTTG 
ACAGCTTGCC 
ATTCCATGTC 
GTCTGCAGAG 
CTGAAGCCTC 
GCCTGTGGGG 
CTACCCAATG 
CTCCGCCAGC 
ACATGTTTGG 
CAAGGAAACA 
TTTCAGATGC 
TTTGGGACGA 
ATGCTGAGAT 
6AGGTGGAGG 
CAAGATTCTG 
GTGTCAGTCC 
GCTATCGTAT 
AAGGTTATTG 
GTCTCGGAGA 
GCTGAATTGG 
ATAGTACAGT 
GTCATAAAAG 
ACTCCAGATA 
GAAGGGTCTA 
ACCACTGAAT 
GAGATGTTCT 
GCATTTGAGA 



11 

I 

GCCTGGCCTT 
TGTCCCCGTC 
GGAGTATCCC 
TGGGCCAGGC 
TTGTCTTCCT 
TGTTCGTCCG 
TCGTGACCTT 
GCGCGCGCCA 
GT6GCGGCAC 
GAGAAAACTC 
CTAGACCAAT 
GGCAAGGGAA 
ACCTGCTACA 
TACCTTCTGG 
GCAAGGACTG 
AGTGCATCTG 
CATCGGGGAC 
CTGATGAAAA 
AGGGATACAG 
CCGAAAATGG 
TCCGATGTCA 
GTTTGTGGTC 
CGAAACATGG 
TTGCCTGTGA 
GCCAGTGGGA 
CCAAAGATGT 
TCTGCTATGT 
GTACCACTTC 
CTCCTCAAAT 
CCAATGTTAC 
ACGTTCATCC 
ACACGGCAAC 
ATGCAGAACC 
AGGTACATGC 
TCATTACCAG 
ATACAGCCAC 
GTTCTCCCTG 
ATACTGGAGT 
CTGACAAGTA 
GGCCAGACTG 
ACAAAGCAGC 
CGACCCTGGG 



21 
I 

TTGTTGCTGG 
GCGCAATTTC 
CGCGCCGCCC 
GTTCCGGCGA 
GGTGGATGAT 
CAAGCTGCTG 
CTCQTCCAAG 
GCACAAGTGC 
CTACACCAAG 
AACAAAAGTT 
TGCAGCGTCA 
CATTCGAGAG 
CAGTTTTGAA 
GAGTTTTATT 
CTGTGACCGA 
TGAAAAGGGG 
ATACAAACCT 
TCACACCTCT 
GGCATCTGGC 
TTACTTTATC 
CCCTGGATTT 
CGGTTCAGAG 
CCACATCAGC 
TGAAGGGTAC 
TGGGCCAGAA 
CATCATATCC 
AAGTTGCCGC 
TGGAAAATGG 
CAACTGTCCT 
CTGGCAGATT 
AGCTTTCACC 
TGACCTATCC 
AOCTGTCATA 
CGCAAGCTGG 
AAGTCATACA 
TGACCCCTCA 
TGAAATTCCA 
CAACTGTACA 
TTATTGTGCT 
TGCCAAAAAA 
TCGTTGTGAT 
AAAAATGGTC 



31 

I 

GGTCTGGCGC 
AGCTTCCGCC 
GCTCCTGGCG 
CGCGTGCGGC 
TCGTCCAGCG 
TCOGACTTCC 
AACTACGTGG 
GCGCTGCTCC 
GGCGCCTTCC 
GTATTTCTCA 
CTGCGAGATT 
CTGAATGACA 
GAATTTGAGG 
CAAGATGATA 
ATGGGAAGCT 
TATTACGGGA 
GAAGGCTCAC 
CCACCTGGAA 
CAGACCTGTG 
CAAAACACTT 
GATCTTGTGG 
AGCTACTGCA 
TGTTCTACAA 
AGACTAGAAG 
CCCCGGTGTG 
CCCCACAACT 
CAAGGGTTCA 
AATGTCGGAG 
AAGGACATAG 
CCAACAGCTA 
CCACCTTACC 
GGCAACCAGG 
GACTGGTGCA 
GATGAGCCTC 
CAAGGAGACC 
GGCAATAACA 
TTCACACCTG 
TTAACTTGCT 
TATGAAGATG 
OGTTTTGCAA 
GACACAGATC 
CCATCATTTT 



41 
I 



51 



TCGTTTCGGG CTGGGCGACC 
TCTTCCCCGA GACCGCGCCC 
ACGAAGCGGC GGGGAGCAGA 
TGCTGCGGGA GCTCAGCGAG 
TGGGCGAAGT CAACTTCCGC 
CCGTGGTGCC CACGGCCAOG 
TGCCGCGCGT CGATTACATC 
TCCAAGAGAT CCCTGCCATC 
AGCAAGCCGC GCAAATTCTT 
TCACTGATGG ATATTCCAAT 
CAGGAGTGGA GATCTTCACT 
TGGCTTCCAC CCCAAAGGAG 
CTTTAGCTCG CCGGGCATTG 
TGGTCCACTG CTCATATCTT 
GCAAATGTGG GACACACACA 
AAGGTCTGCA GTATGAATGC 
CAGGAGGAAT CAGCAGTTGC 
GCACATCCCC TGAAGACTGT 
AACTTGTCCA CTGCCCTGCC 
GCAACAACCA CTTCAATGCA 
GAAGCAGCAT CATCTTATGT 
GAGTAAGAAC ATGTCCTCAT 
GGGAAATGTT ATATAAGACA 
GCAGTQATAA GCTTACTTGT 
TGGAGCGCCA CTGTTCCACC 
GTGGCAAGCA GCCAGCCAAA 
TTTTATCTGG AGTCAAAGAA 
TTCAGGCAGC TGTGTGTAAA 
AGGCTAAGAC TCTGGAACAG 
AAGACAACTC TGGTGAAAAG 
TTTTCOCAAT TGGAGATGTT 
CCAGCTGCAT TTTCCATATC 
GATCTCCACC TCCCGTCCAG 
AGTTCTCAGA CAACTCAGGG 
TTTTCCCTCA AGGGGAGACT 
GGACATGTGA TATCCATATT 
TAAATGGGGA TTTTATATGC 
TGGAGGGCXA TGATTTCACA 
GCGTCTGGAA ACCAACATAT 
ACCACGGGTT CAAGTCCTTT 
TGATGAAGAA GTTTTCTGAA 
GTAGTGATGC AGAGGACATT 
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GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACQATGACT TCCTGGACAC TGT6CAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

TCAOGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

5 ACAGCTAGTG TGCCATTACC OGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 

CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 2880 

CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG COGACAGCAA TTCATTAGAA 2940 

ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGGG TATGTGTGTC 3000 

AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCOGGATC 3060 

10 GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 

ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 

TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 

TTTGGTTCCC GGAGCTQOCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 

GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTOQCG TTCTGGGTTA 3360 

15 ATGCCCTGTC ACCCATGTCC TOGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 

CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 3480 

TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3540 

TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 

CAGCAACTTG GGOGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 

20 GAAACAGACA TOGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720 

CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 

AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 

GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 3900 

AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 

25 TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTOGATGAG 4020 

TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG AOGGTGCCAA TAGCTTCAGA 4080 

TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 

TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 

TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 4260 

30 GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 

CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 

ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 

TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 

AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4S60 

35 GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 

GGTGCGTTAG TTCTGGGGCA AGAGGAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 

TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 

GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 

CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860 

40 TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 

AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 

GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTOCTCACTG TGAACGCATT 5040 

AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT S100 

GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 

45 TTCTGTACAG ATAATGGGAG CTGGAAOGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 5220 

TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 

ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 

TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 

A GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460 

50 TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520 

AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 

GTGAGATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 5640 

GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 

GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 

55 TATTCATGOG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CAOGGCTTCT 5820 

GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 

ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 

TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC OGACGGCAAG 6000 

- A TGGAGTAGAA GTGAGCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 

60 CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 

GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 

CCCCCAGAAG GTCAAGACAT GCCCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTOCATCG 6240 

GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 

£c AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 
65 GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG GTGTGGAGAG 6420 
CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6480 
GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 
ACAGGGCAGT GGAGTAGTCC TATACCGAOG TGCCACCCGG TATCTTGTGG TGAACCACCT 6600 
_ A AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 
70 TATCAGTGTA ACCOGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG OCAAGCCAAT 6720 
CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 6780 
CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840 
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 
AAATGGAATA AGAAGTCAAA TOCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCOGCCCCTC 6960 
75 TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 
TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 7080 
TGGAATGACT CTTTOCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 7140 
TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 
Q GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 
OO AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 7320 
GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380 
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 
GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA CGGACCTACA CTATGGACAG ACOGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 
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CGGCTCGAAG GTCCCAGTGC 
CCATCTTGCA ATGCCATCCA 
GGTGCAGATT ACAGCTATGG 
GCTGGTCATQ CCATGCAGAC 
ATGCCAATAG ACTGTGGCCT 
GATGACCAGG GATATTTTGA 
CACCCTCCTT ATCATTTGGG 
GCTACACATT CATCAAACTT 
TATGAACTTC TGGGGAACCC 
GCACCATCCT GCATTTCAAT 
CGTTTTACAG AGACTAGCAT 
CTAGCAGGCT CTGACTTAAG 
CGCTGTGAAG CCATTTCATG 
AGCAACTACA CATACCTGAG 
GGCACTGAGA GGAGAACATG 
ATTCCTGTGG ACTGCAGTTC 
TACACATTCC AAAAAGAGAT 
AGGAGTCGGG TTTGTCTTGC 
GTCAGATGTG CCACCCCGCC 
TTCATGAAGG AAGTAACATT 
CTCACCTGTC AGTCAGATGG 
TGTGGACCTC CTGAAGATCT 
GGCCATATAC AGTATCAGTG 
TGCCTCTCCA ATGGCTCCTG 
ACACCAGTAA TTGAATATGG 
CGGATTCAGT GCTTCAAAGG 
GATGGCCAGT GGAGCTCTGG 
ATGATACCAA ATGCGTTCAT 
AGCTGCAGGT CTGGATATGT 
GTATGGAGCC AGCCTTATCC 
GCCAATGCAG TGGCAACTGG 
CTGGAAGGTT ATACGATGGA 
TGGTTCCCTG AGAGAATCTC 
CATATACTTG TACATGGGGA 
GAAGGGTATA CCTTTGAGGG 
CCACCATTCT CCGATGAATC 
CATGGATTTG TGGTTGGCAG 
CCTGGCTATG AACTAGAGGG 
GGAGGGGTGG CAATATGCAA 
AAAGCTGACA TTGAAAACAG 
TACAGTCTTG AAGGGCCATC 
GTCCCTCTCT GCAAACCAAA 
CTGTCTGAAA AGGAGTTTTA 
CTGCTGCAGG GCCACGGCAT 
GCCAAATGTG AAAAAATCTC 
GGCGTACATT ATCAATATGG 
GAGGGTTTCC TGAGGAGTGT 
AGAGCTGTCT GTCGATTTCC 
TCCTGTCCAG AGGGCTGGAT 
CTGAACGGAG GTOGCTGTGT 
TCTCGCTGTC ATACAGCTGT 
CCAAACCGAT GTCACTGTCT 



CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 
CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 
TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 
CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 
CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 
GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 
AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 
TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 
TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 
TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 
GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 
GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 
CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 
CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 
CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 
ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 
TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 
CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 
ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 
CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 
CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 
TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 
CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 
GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 
AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 
CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 
GTTOCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 
CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 
CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 
AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 
AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 
TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 
CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 
CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 
AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 
TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 
TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 
GAACAGGGAA OGTGTCTGCC AGGAGAACAG ACAGTGGAGT 
AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 
GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 
TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 
TCCATGCCCT GTTCCTTTTG TGATTCCOGA GAATGCTCTG 
TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 
CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 
ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 
AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 
TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 
ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 
GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 
GGCCCCTTAC CAGTGTQACT GCCCGCCTGG CTGGACGGGG 
TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 
TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 



Seq ID NO: C13S DKA Sequence 

Nucleic Acid Accession #j pgenesh predicted 

Coding sequence : 1 . . 3 90 

1 11 21 31 41 51 

I 1 I ! I I 

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 
TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 
GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 
CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTOCCCCC 
TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 
TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 
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Seq ZD NO: C136 dna Sequence 
7U Nucleic Acid Accession BC035671.1 
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Coding sequence: 126.. 1745 

1 11 21 31 41 51 

t i I I I 1 

GGCAGCGACT GCGCCCCGTC CCGGCGCCGC GCTCGTCCGC AGAGGAGGCG GCCCGGCCCG 60 

GGCAGCTGCG GCTCGGGATC CGTCGAGGGG AGGCCGAGCT TGCCAAGCTG GCGCCCAGCG 120 

GGGTCATGGT GCCCGGCGCC CGCGGCGGCG GCGCACTGGC GCGGGCTGCC GGGCGGGGCC 180 

TCCTGGCTTT GCTGCTCGCG GTCTCCGCCC CGCTCCGGCT GCAGGCGGAG GAGCTGGGTG 240 

ATGGCTGTGG ACACCTAGTG ACTTATCAGG ATAGTGGCAC AATGACATCT AAGAATTATC 300 

CCGGGACCTA CCCCAATCAC ACTGTTTGCG AAAAGACAAT TACAGTACCA AAGGGGAAAA 360 

GACTGATTCT GAGGTTGGGA GATTTGGATA TCGAATCCCA GACCTGTGCT TCTGACTATC 420 

TTCTCTTCAC CAGCTCTTCA GATCAATATG GTCCATACTG TGGAAGTATG ACTGTTCCCA 480 

AAGAACTCTE GTTGAACACA AGTGAAGTAA CCGTCCGCTT TGAGAGTGGA TCCCACATTT 540 

CTGGCCGGGG TTTTTTGCTG ACCTATGCGA GCAGCGACCA TCCAGATTTA ATAACATGTT 600 
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TGGAACGAGC TAGCCATTAT TTGAAGACAG AATACAGCAA ATTCTGCCCA GCTGGTTGTA 660 

GAGAGGTAGC AGGAGACATT TCTGGGAATA TGGTAGATGG ATATAGAGAT ACCTCTTTAT 720 

TGTGCAAAGC TGCCATCCAT GCAGGAATAA TTGCTGATGA ACTAGGTGGC CAGATCAGTG 780 

TGCTTCRGOG CAAAGGGATC AGTCGATATG AAGGGATTCT GGCCAATGGT GT T C TTTC GA 840 

GGGATGGTTC CCTGTCAGAC AAGCGATTTC TGTTTACCTC CAATGGTTGC AGCAGATCCT 900 

TGAGTTTTGA ACCTGACGGG CAAATCAGAG CTTCTTCCTC ATGGCAGTCG GTCAATGAGA 960 

GTGGAGACCA AOTTCACTGG TCTCCTGGCC AAGCCCGACT TCAGGACCAA GGCCCATCAT 1020 

GGGCTTCGGG CGACAGTAGC AACAACCACA AACCACGAGA GTGGCTGGAG ATCGATTTGG 1060 

GGGAGAAAAA GAAAATAACA GGAATTAGGA CCACAGGATC TACACAGTCG AACTTCAACT 1140 

TTTATGTTAA GAGTTTTGTG ATGAACTTCA AAAACAATAA TTCTAAGTGG AAGACCTATA 1200 

AAGGAATTGT GAATAATGAA GAAAAGGTGT TTCAGGGTAA CTCTAACTTT CGGGACCCAG 1260 

TGCAAAACAA TTTCATCCCT CCCATCGTGG CCAGATATGT GCGGGTTGTC CCCCAGACAT 1320 

GGCACCAGAG GATAGCCTTG AAGGTGGAGC TCATTGGTTG CCAGATTACA CAAGGTAATG 1380 

ATTCATTGGT GTGGOGCAAG ACAAGTCAAA GCACCAGTGT TTCAACTAAG AAAGAAGATG 1440 

AGACAATCAC AAGGCCCATC CCCTOGGAAG AAACATCCAC AGGAATAAAC ATTACAACGG 1S00 

TGGCTATTCC ATTGGTGCTC CTTGTTGTCC TGGTGTTTGC TGGAATGGGG ATCTTTGCAG 1560 

CCTTTAGAAA GAAGAAGAAG AAAGGAAGTC CGTATGGATC AGCAGAGGCT CAGAAAACAG 1620 

ACTGTTGGAA GCAGATTAAA TATCCCTTTG CCAGACATCA GTCAGCTGAG TTTACCATCA 1680 

GCTATGATAA TGAGAAGGAG ATGACACAAA AGTTAGATCT CATCACAAGT GATATGGCAG 1740 

GTTAACTCCG TTGACTGCCA AAATAGCATC CCCAACGTGC AGCCCTCCGC ATCTATCAGC 1800 

AGGTTGCCCC GGATGGATCT CAGAGATGAG GATOGGAACA CCATGTTCTT TCCCACCCTA 1860 

ACAACAACAA AGGGCAGTAA ATTAAAGTAC TCTTTGTAAG GTACAGTTAC CGATTAATCT 1920 

AGAGATAAAA TATTTTCTTA AAAATATATT TCATTAAACA CCTATGCTGT CTCTATAAAA 1980 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2010 

Seq ID NO: CI 3 7 DMA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . . 1761 

1 11 21 31 41 51 

I 1 I I I I 

ATGGGATTCG GTGCGGGGCA GCGACTGCGC CCCGTCCCGG CGCCGOGCTC GTCCGCAGAG 60 

GAGGCGGCCC GGCCCGGGCA GCTGGGGCTC GGGATCCGTC GAGGGGAGGC CGAGCTTGCC 120 

AAGCTGGCGC CCAGCGGGGT CATGGTGCCC GGCGCCCGCG GCGGCGGCGC ACTGGCGCGG 180 

GCTGCCGGGC GGGGCCTCCT GGCTTTGCTG CTOGCGGTCT CCGCCCCGCT CCGGCTGCAG 240 

GCGGAGGAGC TGGGTGATGG CTGTGGACAC CTAGTGACTT ATCAGGATAG TGGCACAATG 300 

ACATCTAAGA ATTATCCCGG GACCTACCCC AATCACACTG TTTGCGAAAA GACAATTACA 360 

GTACCAAAGG GGAAAAGACT GATTCTGAGG TTGGGAGATT TGGATATCGA ATCCCAGACC 420 

TGTGCTTCTG ACTATCTTCT CTTCACCAGC TCTTCAGATC AATATGGTCC ATACTGTGGA 480 

AGTATGACTG TTCCCAAAGA ACTCTTGTTG AACACAAGTG AAGTAACCGT CCGCTTTGAG 540 

AGTGGATCCC ACATTXCTGG CCGGGGTTTT TTGCTGACCT ATGCGAGCAG CGACCATCCA 600 

GATTTAATAA CATGTTTGGA ACGAGCTAGC CATTATTTGA AGACAGAATA CAGCAAATTC 660 

TGCCCAGCTG GTTGTAGAGA CGTAGCAGGA GACATTTCTG GGAATATGGT AGATGGATAT 720 

AGAGATACCT CTTTATTGTG CAAAGCTGCC ATCCATGCAG GAATAATTGC TGATGAACTA 780 

GGTGGCCAGA TCAGTGTGCT TCAGCGCAAA GGGATCAGTC GATATGAAGG GATTCTGGCC 840 

AATGGTGTTC TTTCGAGGGA TGGTTCCCTG TCAGACAAGC GATTTCTGTT TACCTCCAAT 900 

GGTTGCAGCA GATCCTTGAG TTTTGAACCT GACGGGCAAA TCAGAGCTTC TTCCTCATGG 960 

CAGTCGGTCA ATGAGAGTGG AGACCAAGTT CACTGGTCTC CTGGCCAAGC CCGACTTCAG 1020 

GACCAAGGCC CATCATGGGC TTCGGGCGAC AGTAGCAACA ACCACAAACC ACGAGAGTGG 1080 

CTGGAGATCG ATTTGGGGGA GAAAAAGAAA ATAACAGGAA TTAGGACCAC AGGATCTACA 1140 

CAGTCGAACT TCAACTTTTA TGTTAAGAQT TTTGTGATGA ACTTCAAAAA CAATAATTCT 1200 

AAGTGGAAGA CCTATAAAGG AATTGTGAAT AATGAAGAAA AGGTGTTTCA GGGTAACTCT 1260 

AACTTTCGGG ACCCAGTGCA AAACAATTTC ATCCCTCCCA TCGTGGCCAG ATATGTGCGG 1320 

GTTGTCCCCC AGACATGGCA CCAGAGGATA GCCTTGAAGG TGGAGCTCAT TGGTTGCCAG 1380 

ATTACACAAG GTAATGATTC ATTGGTGTGG CGCAAGACAA GTCAAAGCAC CAGTGTTTCA 1440 

ACTAAGAAAG AAGATGAGAC AATCACAAGG CCCATCCCCT CGGAAGAAAC ATCCACAGGA 1500 

ATAAACATTA CAACGGTGGC TATTCCATTG GTGCTCCTTG TTGTCCTGGT GTTTGCTGGA 1560 

ATGGGGATCT TTGCAGCCTT TAGAAAGAAG AAGAAGAAAG GAAGTCCGTA TGGATCAGCA 1620 

GAGGC TCAGA AAACAGACTG TTGGAAGCAG ATTAAATATC CCTTTGCCAG ACATCAGTCA 1660 

GCTGAGTTTA CCATCAGCTA TGATAATGAG AAGGAGATGA CACAAAAGTT AGATCTCATC 1740 

ACAAGTGATA TGGCAGGTTA A 1761 

Seq ID NO: CI 3 8 DNA Sequence 

Nucleic Acid Accession #i FGENBSH predicted 

Coding sequence: 1..2310 

1 11 21 31 41 51 

i I I I I I 

ATGTTCCAGC GGCAGGAAAG ATTTCTTGAC TTATCTTCAG CTGAAGCAGT GGCAGCTTGG 60 

ATATTACATC AACATCCTGA CATTATTAAC AAAGGTGATG GCTGTGGACA CCTAGTGACT 120 

TATCAGGATA GTGGCACAAT GACATCTAAG AATTATCCCG GGACCTACCC CAATCACACT 180 

GTTTGCGAAA AGACAATTAC AGTACCAAAG GGGAAAAGAC TGATTCTGAG GTTGGGAGAT 240 

TTGGATATCG AATCCCAGAC CTGTGCTTCT GACTATCTTC TCTTCACCAG CTCTTCAGAT 300 

CAATATGGAA TGCAGAAGGA GGAGGAGACA GAAGTGCTTT GTCTTTCAGT GGCTGGCGCT 360 

CAGAGAGTGG ACATTCCTGT GCAGCTGTTG CCCAGCTTCC TGGAAGGGTG GAAGGGTCAT 420 

GCTGATGCAA GAGGTCCATA CTGTGGAAGT ATGACTGTTC CCAAAGAACT CTTGTTGAAC 480 
ACAAGTGAAG TAACCGTCCG CTTTGAGAGT GOATCCCACA TTTCTGGCCG GGGTTTTTTG 540 
CTGACCTATG CGAGCAGOGA CCATCCAGAT TTAATAACAT GTTTGGAACG AGCTAGCCAT 600 
TATTTGAAGA CAGAATACAG CAAATTCTGC CCAGCTGGTT GTAGAGACGT AGCAGGAGAC 660 
ATTTCTGGGA ATATGGTAGA TGGATATAGA GATACCTCTT TATTGTGCAA AGCTGCCATC 720 
CATGCAGGAA TAATTGCTGA TGAACTAGGT GGCCAGATCA GTGTGCTTCA GCGCAAAGGG 780 
ATCAGTCGAT ATGAAGGGAT TCTGGCCAAT GGTGTTCTTT CGAGGGATGG TTCCCTGTCA 840 
GACAAGCOAT TTCTGTTTAC CTCCAATGGT TGCAGCAGAT CCTTGAGTTT TGAACCTGAC 900 
GGGCAAATCA GAGCTTCTTC CTCATGGCAG TCGGTCAATO AGAGTGGAGA CCAAGTTCAC 960 
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TGGTCTCCTG GCCAAGCCCG ACTTCAGGAC CAAGGCCCAT CATGGGCTTC GGGCGACAGT 1020 

AGCAACAACC ACAAACCAOG AGAGTGGCTG GAGATOGATT TGGGGGAGAA AAAGAAAATA 1080 

ACAGGAATTA GGACCACAGG ATCTACACAG TCGAACTTCA ACTTTTATGT TAAGAGTTTT 1140 

GTGATGAACT TCAAAAACAA TAATTCTAAG TGGAAGACCT ATAAAGGAAT TGTGAATAAT 1200 

GAAGAAAAGG TGTTTCAGGG TAACTCTAAC TTTCGGGACC CAGTGCAAAA CAATTTCATC 1260 

CCTCCCATCG TGGCCAGATA TGTGCGGGTT GTCCCCCAGA CATGGCACCA GAGGATAGCC 1320 

TTGAAGGTGG AGCTCATTGG TTGCCAGATT ACACAAGGTA ATGATTCATT GGTGTGGGGC 1380 

AAGACAAGTC AAAGCACCAG TGTTTCAACT AAGAAAGAAG ATGAGACAAT CACAAGGCCC 1440 

ATCCCCTCGG AAGAAACATC CACAGATGCC ATGCCAGTGC AGATTGTCGG AGACCATACC 1500 

CAGATGATCT CACAAAGGGA GAATCTGGGA CCTGATGAGG GCAAAATACC TTTTAAAGGC 1560 

ACAGCGGAAA GCATGGTTAG AGTAGTGTTT GCTGTTGTGG TTAATGACCT TGGCATGCTG 1620 

TTCTTAGCAC ACACACCTGA GGAGGACATT GATCACTACT GTTGGAAGCA GATTAAATAT 1680 

CCCTTTGCCA GACATCAGTC AGCTGAGTTT ACCATCAGCT ATGATAATGA GAAGGAGATG 1740 

ACACAAAAGT TAGATCTCAT CACAAGTGAT ATGGCAGATT ACCAGCAGCC CCTCATGATT 1800 

GGCACCGGGA CAGTCACGAG GAAGGGCTCC ACCTTCCGGC CCATGGACAC GGATGCCGAG 1860 

GAGGCAGGGG TGAGCACCGA TGCCGGCGGC CACTATGACT GCCCGCAGCG GGCCGGCCGC 1920 

CACGAGTACG CGCTGCCCCT GGOGCCCCCG GAGCCOGAGT AOGCCACGCC CATCGTGGAG 1980 

CGGCAGGTGC TGCGCGCCCA CACGTTCTCT GCGCAGAGCG GCTACCGCGT CCCAGGGCCC 2040 

CAGCCCGGCC ACAAACACTC CCTCTCCTCG GGOGGCTTCT CCCCCGTAGC GGGTGTGGGC 2100 

GCCCAGGACG GAGACTATCA AAGGCCACAC AGCGCACAGC CTGCGGACAG GGGCTACGAC 2160 

CGGCCCAAAG CTGTCAGCGC CCTCGCCACC GAAAGCGGGC ACCCTGACTC TCAGAAGCCC 2220 

CCAACGCATC CCGGGAOGAG TGACAGCTAT TCTGCCCCCA GAGACTGCCT CACACCCCTC 2280 

AACCAGACGG CCATGACTGC CCTTTTGTGA 2310 

Seq ID NO: C139 DNA Sequence 

Nucleic Acid Accession ft: NM_C04616.2 

Coding sequence: 180.. 893 



1 11 21 31 41 51 

30 | II I | | 

AGTGCCCCAG GAGCTATGAC AAGCAAAGGA ACATACTTGC CTGGAGATAG CCTTTGCGAT 60 

ATTTAAATGT CCGTGGATAC AGAAATCTCT GCAGGCAAGT TGCTCCAGAG CATATTGCAG 120 

GACAAGCCTG TAACGAATAG TTAAATTCAC GGCATCTGGA TTCCTAATCC TTTTCCGAAA 180 

TGGCAGGTGT GAGTGCCTGT ATAAAATATT CTATGTTTAC CTTCAACTTC TTGTTCTGGC 240 

TATGTGGTAT CTTGATCCTA GCATTAGCAA TATGGGTACG AGTAAGCAAT GACTCTCAAG 300 

CAATTTTTGG TTCTGAAGAT GTAGGCTCTA GCTCCTACGT TGCTGTGGAC ATATTGATTG 360 

CTGXAGGTGC CATCATCATG ATTCTGGGCT TCCTGGGATG CTGCGGTGCT ATAAAAGAAA 420 

GTCGCTGCAT GCTTCTGTTG TTTTTCATAG GCTTGCTTCT GATCCTGCTC CTGCAGGTGG 480 

CGACAGGTAT CCTAGGAGCT GTTTTCAAAT CTAAGTCTGA TCGCATTGTG AATGAAACTC 540 

TCTATGAAAA CACAAAGCTT TTGAGCGCCA CAGGGGAAAG TGAAAAACAA TTCCAGGAAG 600 

CCATAATTGT GTTTCAAGAA GAGTTTAAAT GCTGOGGTTT GGTCAATGGA GCTGCTGATT 660 

GGGGAAATAA TTTTCAACAC TATCCTGAAT TATGTGCCTG TCTAGATAAG CAGAGACCAT 720 

GCCAAAGCTA TAATGGAAAA CAAGTTTACA AAGAGACCTG TATTTCTTTC ATAAAAGACT 780 

TCTTGGCAAA AAATTTGATT ATAGTTATTG GAATATCATT TGGACTGGCA GTTATTGAGA 840 

TACTGGGTTT GGTGTTTTCT ATGGTCCTGT ATTGCCAGAT CGGGAACAAA TGAATCTGTG 900 

GATGCATCAA CCTATCGTCA GTCAAACCCC TTTAAAATGT TGCTTTGGCT TTGTAAATTT 960 

AAATATGTAA GTGCTATATA AGTCAGGAGC AGCTGTCTTT TTAAAATGTC TCGGCTAGCT 1020 

AGACCACAGA TATCTTCTAG ACATATTGAA CACATTTAAG ATTTGAGGGA TATAAGGGAA 1080 

AATGATATGA ATGTGTATTT TTACTCAAAA TAAAAGTAAC TGTTTACGTT AAAAAAAAAA 1140 

AAAAAAAAAA AAAAAAAAA 1159 



Seq ID NO: C140 DNA Sequence 
Nucleic Acid Accession ft: NMJJ04617.2 
Coding sequence: 232.. 840 



Seq ID NO: C141 DNA Sequence 
Nucleic Acid Accession 8: NM 002381.2 



1 11 21 31 41 51 

I I I I I I 

CTTCAGGTCA GGGAGAATGT ATAAATGTCC ATTGCCATCG AGGTTCTGCT ATTTTTGAGA 60 

AGCTGAAGCA ACTCCAAGGA CACAGTTCAC AGAAATTTGG TTCTCAGCCC CAAAATACTG 120 

ATTGAATTGG AGACAATTAC AAGGACTCTC TGGCCAAAAA CCCTTGAAGA GGCCCCGTGA 180 

AGGAGGCAGT GAGGAGCTTT TGATTGCTGA CCTGTGTCGT ACCACCCCAG AATGTGCACT 240 

GGGGGCTGTG CCAGATGCCT GGGGGGGACC CTCATTCCCC TTGCTTTTTT TGGCTTCCTG 300 

GCTAAGATCC TGTTATTTTT TCCTGGAGGA AAAGTGATAG ATGACAACGA CCACCTTTCC 360 

c CAAGAGATCT GGTTTTTCGG AGGAATATTA GGAAGCGGTG TCTTGATGAT CTTCCCTGCG 420 

OD CTGGTGTTCT TGGGCCTGAA GAACAATGAC TGCTGTGGGT GCTCCGGCAA CGAGGGCTGT 480 

GGGAAGOGAT TTGCGATGTT CACCTCCACG ATATTTGCTG TGGTTGGATT CTTGGGAGCT 540 

GGATACTCGT TTATCATCTC AGCCATTTCA ATCAACAAGG GTCCTAAATG CCTCATGGCC 600 

AATAGTACAT GGGGCTACCC CTTCCACGAC GGGGATTATC TCAATGATGA GGCCTTATGG 660 

AACAAGTGCC GAGAGCCTCT CAATGTGGTT CCCTGGAATC TGACCCTCTT CTCCATCCTG 720 

CTGGTCGTAG GAGGAATCCA GATGGTTCTC TGCGCCATCC AGGTGGTCAA TGGCCTCCTG 780 

GGGACCCTCT GTGGGGACTG CCAGTGTTGT GGCTGCTGTG GGGGAGATGG ACCCGTTTAA 840 

ACCTCCGAGA TGAGCTGCTC AGACTCTACA GCATGACGAC TACAATTTCT TTTCATAAAA 900 

CTTCTTCTCT TCTTGGAATT ATIAATTCCT ATCTGCTTCC TAGCTGATAA AGCTTAGAAA 960 

AGGCAGTTAT TCCTTCTTTC CAACCAGCTT TGCTCGAGTT AGAATTTTGT TATTTTCAAA 1020 

TAAAAAATAG TTT GG CCACT TAACAAATTT GATTTATAAA TCTTTCAAAT TAGTTCCTTT 1080 

TTAGAATTTA CCAACAGGTT CAAAGCATAC TTTTCATGAT TTTTTTATTA CAAATGTAAA 1140 

ATGTATAAAG TCACATGTAC TGCCATACTA CTTCTTTGTA TATAAAGATG TTTATATCTT 1200 

TGGAAGTTTT ACATAAATCA AAGGAAGAAA GCACATTTAA AATGAGAAAC TAAGACCAAT 1260 

TTCTGTTTTT AAGAGGAAAA AGAATGATTG ATGTATCCTA AGTATTGTTA TTTGTTGTCT 1320 

TTTTTTGCTG CCTTGCTTGA GTTGCTTGTG ACTGATCTTT TGAGGCTGTC ATCATGGCTA 1380 

GGGTTCTTTT ATGTATGTTA AATTAAAACC TGAATTCAGA GGTAACGT 1428 
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Coding sequence: 64.. 1524 

1 11 21 31 41 51 

I I I I i I 

AAATCCQAGC CTOGCGTGGO CTCCTGGCCC CCGACGGACA CCACCAGGCC CACGGAGCCC 60 

ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT GCTCTGGCCG 120 

CTGCTGCTGC TGCCCTCCGC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 180 

GAGACCCGAG GTCCCGGGGG CAGCCCTGGA CGCCGCCCCT CTCCTGCGGC TCCCGACGGC 240 

GCGCCCGCTT CCGGGACCAG CGAGCCTGGC CGCGCCCGCG GTGCAGGTGT TTGCAAGAGC 300 

AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 360 

ACCAAAGTGA AAACTTTTGT CTCCGGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 

ACGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480 

TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540 

ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTCACAGT GGAGGCAGGG 600 

GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGOGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGCGTGG ACCGGGCAGA CATGGCGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 

GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCGAG 840 

GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCAOGTCPGC 900 

AT CAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140 

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTGACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTACACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500 

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCA GAATC AGCATGATTT TTCCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT TGGAGTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTG AAAGAAATGT ACTACTAAAG 2160 

AGCACTAGTT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG A TCCAAG TGA CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC CACCTGCCTC AGCCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 

TTAAAAAAAA AAAAAAAAA 2599 



Seq ID HO: C142 DNA Sequence 

Nucleic Acid Accession #i NMJ>16639.1 

Coding sequence: 40.. 42 9 

1 11 21 31 41 51 

I i I 1 I I 

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60 

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 

AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300 

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 

AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 998 

Seq ID NO: C143 DNA Sequence 
Nucleic Acid Accession fl: NM_001819 
Coding sequence: 113.. 2146 



OA 1 11 21 31 41 51 

80 | | | | | | 

CCAGGAGGCA CGCTGGTTTT CCGGGGCCGC TCCATCGCGC CTTCCTCCTG CGCCTCGCTT 60 

CTCCGGTCCA GCCGCCATCT TCCTTTCCGC ACAGGGGCCG CCGAGOGGGG CCATGCAGCC 120 

AACGCTGCTT CTCAGCCTCC TGGGAGCCGT GGGGCTGGOG GCTGTCAATT CCATGCCAGT 180 

GGATAACAGG AACCACAATG AAGGAATGGT GACTCGCTGC ATCATTGAGG TCCTCTCAAA 240 
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TGCCTTGTCG 
GAGTAGAAAA 
AAGATTGTTA 
GGCAGGAGCC 
GGCAGAGGGA 
CTCCGACAGC 
GGATQAGGAO 
TGAAGAGAAA 
GCAGGCTTCA 
GCATTCTCAG 
AGGGAGCCAG 
TGAGGAAGGT 
CCACCACGGG 
AAAGGGACAC 
AAAGAGGGAC 
AGAAATAAAG 
GGGCAGAGGA 
GGAGGACAAG 
AGAAAGTGAG 
AGGGGAGCCA 
AGGACACCAC 
GTGGAAAGAG 
GCAGCAGCAG 
TAAACAATAT 
CCCATACTAC 
TGACAATTTT 
CCCAGAATAC 
GTATGAGAAG 
GGTGGCCCAA 
CTATGATTCT 
TGACCAGACA 
TTTGGAACTA 
GTGGGCACTG 
ACCATTTATA 
TTGGAATTGT 
GCATATTCTT 
TCTTTGGAAA 



AAGTCCAGCG 
GACGTCAAAG 
AGAGACCCAG 
CCAGGGGAGG 
GGCGGGCACA 
CAAGTCTCTG 
GAGGAGGAGG 
CACCTTGAAG 
GCTATAAAAA 
GAGAAGACAC 
GAGAATCACC 
GAGGAAGATG 
AGGAGCAGGC 
CCCCAGGAGG 
CACCATTCAA 
GGTTATCCAG 
AGTGAAGAAT 
AGAAACTACC 
GAAGAGAGGG 
CGTGCCTATT 
CGTGTCCAAG 
CTGGACAGAA 
GGAGACCTGC 
AGCTCCCATC 
GACCCTCTCC 
CTCGAGGGTG 
AACTATGACT 
AGAAACCTCG 
CTGGACCAGC 
GAGGAGCCGG 
GTCCTGACAG 
CAGAAGATAG 
TTAAGAAGCA 
TACCCAAGGG 
CTTTAATTTC 
TTCTGCAAAA 
AATGTTTGTC 



CTCCACCCAT 
ACAAAGAGAC 
CTGATGCCTC 
AGGACATCCA 
GCCGAGAGCG 
AAGAAGTGAA 
GAGAGAACTA 
AGCCAGGAGA 
AAGAGGAGTT 
ATAGCCGAGA 
CCCAGGAGTC 
CCACCTCTGA 
CCGACAGGTC 
AATCTGAGGA 
CCCACTACAG 
GCGTCCAGGC 
ACAGGGCTCC 
CCAGCTTAGA 
GCCTTGAGCC 
TCATGTCTGA 
AAAACCAGAT 
ATTATCTCAA 
AGGACACTAA 
ACACAGCTGA 
AGTGGAAGAG 
AGGAGGAAAA 
GGTGGGAGAA 
CCAGGGTCCC 
TCCTTCACTA 
TQAGCACCCA 
AGGACGAGAA 
CTGAGAAATT 
GCCATCACAT 
CAGAAAGTAG 
TGTCAGAATG 
TAGACATATT 
TCAGTTGGAA 



CACCCCTGAG 
AACTGAAAAT 
GGAAGCCCAC 
AGGCCCAACA 
AGCGGATGAG 
GACACGCCAT 
TCAAAAAGGG 
GACACAAAAC 
AGTGGCCAGA 
GAAGAGTAGC 
TAAAGGCCAA 
GGTGGACAAA 
CTCTCAAGGA 
GTCAAACGTC 
GGCTTCAGAG 
CCCTGAGGAC 
AAGACCTCAG 
GCTTGATAAG 
GGGAAAGGGA 
CACCAGAGAA 
GGACAAGGCA 
CTAOGGTGAG 
AGAAAACAGG 
AAAGAGGAAG 
CAGCCATTTT 
TGAGCTGACC 
AAAGCCCTTC 
CAAGCTGGAC 
CAGGAAGAAG 
CCAGGAGGCA 
AAAAGAACTC 
CAGCCAAAGG 
GATCTGTTTT 
AACTTACTAT 
CTATTGAAAA 
AACATGCTTA 
ATAATAAAAG 



TGCCGCCAAG 



GAGTCCTCCA 
AAGGCAGACA 
CCCCAGTGGA 
TCTGAGAAGA 
GAGCGAGGGG 
GCTTTTCTCA 
TOGGAAACAC 
CAGGAGAGTG 
CCCOGAAGCC 
CGACGCACGA 
GGGAGTCTTC 
AGCATGGCCA 
GAAGAACCTG 
CTGGAGTGGG 
AGTGAGGAGA 
ATGGCACATG 
CGCCATCACA 
GAGAAAAGGT 
AGGAGGCATC 
GAAGGAGCCC 
GAGGAAGCTA 
AGATTAGGGG 
GAAAGAAGAG 
TTGAACGAGA 
TCTGAGGATG 
CTGAAAAGGC 
TCAGCTGAGT 
GAAAATGAAA 
GAAAACTTGG 
GGCTGACTGT 
TCACCACTTC 
TCATTAAATG 
TGTGAATTGC 
TGACAATGAC 
ATTCACCTGA 



TCCTGAAGAC 
AGTTTGAAGT 
GCAGGGGAGA 
CAGAGAAATG 
GCCTCTATCC 
GCCAGAGAGA 
AAGATAGCAG 
ATGAAAGAAA 
ATGCTGCCGG 
GAGAGGAGGC 
AGGAAGAATC 
GGCCCAGACA 
CCTCTGAGGA 
GTTTAGGGGA 
AATATGGAGA 
AGCGCTATAG 
GTTGGGATGA 
GATATGGTGA 
GAGGCAGGGG 
TCTTGGGTGA 
CACAAGGTGC 
CAGGGAAGTG 
GGTTTCAAGA 
AACTGTTCAA 
ACAACATGAA 
AGAATTTCTT 
TGAACTGGGG 
AATATGACAG 
TTCCAGACTT 
AGGACAGGGC 
CTGCAATGGA 
CATTGGAGCG 
ACTGAAAGAC 
TTTGACACAA 
ATGACTTGTA 
TGTGCTACTG 
GACC 



Seq ID NO: C144 UNA Sequence 
Nucleic Acid Accession ft: XM_093082. 
Coding sequence: 93.. 1988 



1 
1 

CTTCTTGTGG 
AATTTGTAAA 
AGCCCAACTG 
GTGGGGCTCT 
AGACAAAACT 
GCAAATTACA 
TTGGAATGGT 
ACTGTCAGGA 
TTGCTGCTGC 
CAGGAAGACT 
ACCCAGCATC 
ACTTAGAAGT 
CCCTACAAGT 
TGCCTAATTC 
ACGTGACTGT 
GAGAACAGCT 
CTCCTTGGGA 
AGCAGGCTGA 
TTAAAGATCG 
TGCAGCCAGC 
GCCAAAACCA 
GCGTTCAAGG 
GAACACCTTC 
AAGAAAACTT 
AAGGTTATTA 
TCACTTCTTC 
GTGCOGCCAT 
AAGAAAGAAA 
GAGAAAGCGA 
CCATTCATGA 
AGGAGCCTGC 
TCGAGCTGGA 
CAGAGTCAGA 
AGGCATAG 



11 
I 

TAGGGACCTC 
AAATACCCTC 
TGTATTACAG 
AGCAGCTGTT 
GGAAGCTTCA 
GAAGCTCAAC 
TGGAAATTGT 
CAGTTCTTTC 
AAAGGATTGT 
TGAGGCAGTG 
TCAGGGTGGG 
CAAGGCAGAA 
TGTCCTCCTT 
TCCTGGTCAG 
TGGATCTAAT 
TTCCATCCAG 
GGAGGGGAAG 
ACTCCAGATT 
AATTACAGGG 
AGACAGTGGA 
AGGCATCCTC 
AAGACCAGAA 
CCCTGTGTAC 
CAACCCAACC 
CCAGTGTACT 
ACATCCAGAA 
CATCATCTCT 
TTCTAAGACC 
AGCAATGCCA 
GACTGGCCCT 
CCCAGAGCCT 
GCTGGAGCCA 
GCCTGGGGTT 



21 
I 

TCCTCAGTAT 
GAAGATTCAG 
ACATTGAGGT 
CTTAGCACCA 
GCTTTGAAAT 
TTGCAGTTTT 
TCTGGTTTCT 
AAATGTGATC 
GGGAGTCCTA 
GAGGAGGTTT 
GGGCATTGTG 
CCAAGCCTGA 
TGCAAAATCT 
GTTAGTGTGG 
GTCACTCTCA 
TGGTCTTTCT 
TGGCCAGATG 
TACTTTTCTC 
TCCAACGATC 
ATTTACATCT 
AACGTCAGTG 
ACTGGCCACA 
TACTGGCATA 
ACCGGGATTT 
GCCATCAACA 
GTTGGAATCA 
GTTGTGTGCT 
ATCGOGGAAC 
AGAGAAGACG 
GATACCATCC 
GCCCCAGGAT 
GAAACGCAGT 
GTAGTTGAGC 



31 

! 

TTGAAACTAA 
GAATGAAGCT 
GGTACCGGTG 
GTCAGTGGCT 
TGCTCTATGG 
CTTTATCTGT 
CGGGATCATT 
TTTGTAAGCT 
AGTCCTTCCT 
TGGGGTTGGG 
AAAACTATGG 
GAAAAGGTGG 
TCTCCCTCAA 
TGCAAGTGAC 
TCTGCATCTA 
TCCATAAGAA 
TTGAGGCTGT 
AAGGTGGACA 
CAGGTAATGC 
GCGATGTTAA 
TGTTAGTGAA 
CTATTTCCCT 
AACTTGAGGG 
TGGTCATTGG 
GACTTGGCAA 
TTGTTGGGGC 
TCGCAAGGAA 
TTGAGCCAAT 
CTACCCAACT 
AAGAACCAGA 
CAGAGCCTAT 
CGGAATTGGA 
CCTTAAGTQA 



41 

I 

CCAGCATCTG 
TCTGTGTGAA 
CCTTATCTCT 
CACTGAACTG 
AGGCTTAAAA 
AACCGCTGCA 
GGTGCAATCT 
GCTCTGGCCT 
ATCAGAAGGG 
GGTGCTTGTA 
GTCTTTTAGA 
TATGGATCTC 
ACTATTTCTC 
CATCCCAGAC 
CACCACCACT 
GGAGATGGAG 
GAAGGGCACT 
AGCTGTAGCC 
ATCTATCACT 
CAACCCCCCA 
ACCTTCTAAG 
TTCCTGTCTC 
AAGAGACATC 
AAATCTGACA 
TAGTTCCTGC 
CTTGATTGGT 
TAAGGCAAAA 
GACAAAGATA 
AGAAGTAACT 
CTATGAGCCA 
GGCAGTGCCT 
GCCAGAGCCA 
AGATGAAAAG 



51 

I 

ACAGATTTCG 
GGATTAAAAC 
TCTGCTTCTT 
GAATTTAGTG 
GATCCAAATT 
AAACTTCCAG 
CATTTTGGCT 
TCCACCAGAG 
CTGAACTGGG 
CAGCCCGGTG 
GACTTGGTGG 
CAGAGACCCA 
TTTATTGCAT 
GGTTTCGTGA 
GTGGCCTCCC 
CCAATTTCTT 
CTTGATGGAC 
ATCGGGCAAT 
ATCTCGCATA 
GACTTTCTOG 
CCCCTTTGTA 
TCTGCGCTTG 
GTGCCAGTGA 
AATTTTGAAC 
GAAATCGATC 
AGCCTGGTAG 
GCAAAGGGAA 
AACCCAAGGG 
CTACCATCTT 
AAGCCTACTC 
GACCTTGACA 
GAGCCAGAGC 
GGAGTGGTTA 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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1680 

1740 
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60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

1988 



Seq ID NO: CL45 DNA Sequence 

Nucleic Acid Accession 8: FGOTESH predicted 

Coding sequence i 1 . . 1242 



11 



21 



31 



41 



51 

1290 



WO 03/042661 



PCT/US02/36810 



I I I I I I 

ATGGTGTTOG CATTTTGGAA GGTCTTTCTG ATCCTAAGCT GCCTTGCAGG TCAGGTTAGT 60 

GTGGTGCAAG TGACCATCCC AGACGGTTTC GTGAACGTGA CTGTTGGATC TAATGTCACT 120 

CTCATCTGCA TCTACACCAC CACTGTGGCC TCCCGAGAAC AGCTTTCCAT CCAGTGGTCT 180 

3 TTCTTCCATA AGAAGGAGAT GGAGCCAATT TCTTCTCCTT GGGAGGAGGG GAAGTGGCCA 240 

GATGTTGAGG CTGTGAAGGG CACTCTTGAT GGACAGCAGG CTGAACTCCA GATTTACTTT 300 

TCTCAAGGTG GACAAGCTGT AGCCATCGGG CAATTTAAAG ATCGAATTAC AGGGTCCAAC 360 

GATCCAGGTA ATGCATCTAT CACTATCTCG CATATGCAGC CAGCAGACAG TGGAATTTAC 420 

ATCTGCGATG TTAACAACCC CCCAGACTTT CTCGGCCAAA ACCAAGGCAT CCTCAACGTC 480 

1U AGTGTGTTAG TGAAACCTTC TAAGCCCCTT TGTAGCGTTC AAGGAAGACC AGAAACTGGC 540 

CACACTATTT CCCTTTCCTG TCTCTCTGCG CTTGGAACAC CTTCCCCTGT GTACTACTGG 600 

CATAAACTTG AGGGAAGAGA CATCGTGCCA GTGAAAGAAA ACTTCAACCC AACCACCGGG 660 

ATTTTGGTCA TTGGAAATCT GACAAATTTT GAACAAGGTT ATTACCAGTG TACTGCCATC 720 

AACAGACTTG GCAATAGTTC CTGCGAAATC GATCTCACTT CTTCACATOC AGAAGTTGGA 780 

I J ATCATTGTTG GGGCCTTGAT TGGTAGCCTG GTAGGTGCOG CCATCATCAT CTCTGTTGTG 840 

TGCTTCGCAA GGAATAAGGC AAAAGCAAAG GCAAAAGAAA GAAATTCTAA GACCATCGCG 900 

GAACTTGAGC CAATGACAAA GATAAACCCA AGGGGAGAAA GCGAAGCAAT GCCAAGAGAA 960 

GACGCTACCC AACTAGAAGT AACTCTACCA TCTTCCATTC ATGAGACTGG CCCTGATACC 1020 

ATCCAAGAAC CAGACTATGA GCCAAAGCCT ACTCAGGAGC CTGCCCCAGA GCCTGCCCCA 1080 

GGATCAGAGC CTATGGCAGT GCCTGACCTT GACATCGAGC TGGAGCTGGA GCCAGAAACG 1140 

CAGTGGGAAT TGGAGCCAGA GCCAGAGCCA GAGCCAGAGT CAGAGCCTGG GGTTGTAGTT 1200 

GAGCCCTTAA GTGAAGATGA AAAGGGAGTG GTTAAGGCAT AG 1242 



20 



Seq ID NO: C146 DMA Sequence 
Zj Nucleic Acid Accession #: NM_003020.1 
Coding sequence: 29.. 664 

1 11 21 31 41 SI 

o n I 1 I I I I 

D\) OGCTCCTCGG GCTGCCCCTC GGTTGACAAT GGTCTCCAGG ATGGTCTCTA CCATGCTATC 60 

TGGCCTACTG TTTTGGCTGG CATCTGGATG GACTCCAGCA TTTGCTTACA GCCCCCGGAC 120 

CCCTGACCGG GTCTCAGAAG CAGATATCCA GAGGCTGCTT CATGGTGTTA TGGAGCAATT 180 

GGGCATTGCC AGGCCCCGAG TGGAATATCC AGCTCACCAG GCCATGAATC TTGTGGGCCC 240 

CCAGAGCATT GAAGGTGGAG CTCATGAAGG ACTTCAGCAT TTGGGTCCTT TTGGCAACAT 300 

CCCCAACATC GTGGCAGAGT TGACTGGAGA CAACATTCCT AAGGACTTTA GTGAGGATCA 360 

GGGGTACCCA GACCCTCCAA ATCCCTGTCC TGTTGGAAAA ACAGATGATG GATGTCTAGA 420 

AAACACCCCT QACACTGCAG AGTXCAGTCG AGAGTTCCAG TTGCACCAGC ATCTCTTTGA 480 

TCCGGAACAT GACTATCCAG GCTTGGGCAA GTGGAACAAG AAACTCCTTT ACGAGAAGAT 540 

GAAGGGAGGA GAGAGACGAA AGCGGAGGAG TGTCAATCCA TATCTACAAG GACAGAGACT 600 

GGATAATGTT GTTGCAAAGA AGTCTGTOCC CCATTTTTCA GATGAGGATA AGGATCCAGA 660 

GTAAAGAGAA GATGCTAGAC GAAAACCCAC ATTACCTGTT AGGCCTCAGC ATGGCTTATG 720 

TGCACGTGTA AATGGAGTCC CTGTGAATGA CAGCATGTTT CTTACATAGA TAATTATGGA 780 

TACAAAGCAG CTGTATGTAG ATAGTGTATT GTCTTCACAC CGATGATTCT GCTTTTTGCT 840 

AAATTAGAAT AAGAGCTTTT TTGTTTCTTG GGTTTTTAAA ATGTGAATCT GCAATGATCA 900 

4 J TAAAAATTAA AATGTGAATG TCAACAATAA AAAGCAAGAC TATGAAAGGC TCAGATTTCT 960 

TGCAGTTTAA AATGGTGTCT GAGGTTGTAC TATTTTGGCC AAGTCTGTAG AAAGCTGTCA 1020 

TTTGATTTTG ATTATGTAGT TCATCCAGCC CTTGGGCATT GTTATACACC AGTAAAGAAG 1080 

GCTGTACTCA AGAGGAGGAG CTGACACATT TCACTTGGCT GCGTCTTAAT AAACATGAAT 1140 

GCAAGCATTG GC 1152 



35 
40 



50 
55 
60 
65 
70 



Seq ID NO: C147 DNA Sequence 

Nucleic Acid Accession fti NM_024021.2 

Coding sequence : 144 ..806 



1 11 21 31 41 51 

I i I I I I 

AACATTCCTG CAAATGGTTT CAATATATGC AGATGTCTCG ATATAGGAAT GAAATTACGT 60 

CTTTGGAACA ACTTAAATAA GTCAAATATA CTTGGAGCTT TAAAAATTAA AAGGAGAGAG 120 

ATTCGAGCAC CTTTTCTGCT GCCATGACAA CCATGCAAGG AATGGAACAG GCCATGCCAG 180 

GGGCTGGCCC TGGTGTGCCC CAGCTGGGAA ACATGGCTGT CATACATTCA CATCTGTGGA 240 

AAGGATTGCA AGAGAAGTTC TTGAAGGGAG AACCCAAAGT CCTTGGGGTT GTGCAGATTC 300 

TGACTGCCCT GATGAGCCTT AGCATGGGAA TAACAATGAT GTGTATGGCA TCTAATACTT 360 

ATGGAAGTAA CCCTATTTCC GTGTATATCG GGTACACAAT TTGGGGGTCA GTAATGTTTA 420 

TTATTTCAGG ATCCTTGTCA ATTGCAGCAG GAATTAGAAC TACAAAAGGC CTGGTCCGAG 480 

GTAGTCTAGG AATGAATATC ACCAGCTCTG TACTGGCTGC ATCAGGGATC TTAATCAACA 540 

CATTTAGCTT GGCGTTTTAT TCATTCCATC ACCCTTACTG TAACTACTAT GGCAACTCAA 600 

ATAATTGTCA TGGGACTATG TCCATCTTAA TGGGTCTGGA TGGCATGGTG CTCCTCTTAA 660 

GTGTGCTGGA ATTCTGCATT GCTGTGTCCC TCTCTGCCTT TGGATGTAAA GTGCTCTGTT 720 

GTACCCCTGG TGGGGTTGTG TTAATTCTGC CATCACATTC TCACATGGCA GAAACAGCAT 780 

CTCCCACACC ACTTAATGAG GTTTGAGGCC ACCAAAAGAT CAACAGACAA ATGCTCCAGA 840 

AATCTATGCT GACTGTGACA CAAGAGCCTC ACATGAGAAA TTACCAGTAT CCAACTTCGA 900 

TACTGATAGA CTTGTTGATA TTATTATTAT ATGTAATCCA ATTATGAACT GTGTGTGTAT 960 

AGAGAGATAA TAAATTCAAA ATTATGTTCT CATTTTTTTC CCTGGAACTC AATAACTCAT 1020 

O TTCACTGGCT CTTTATCGAG AGTACTAGAA GTTAAATTAA TAAATAATGC ATTTAATGAG 1080 

GCAACAGCAC TTGAAAGTTT TTCATTCATC ATAAGAACTT TATATAAAGG CATTACATTG 1140 

GCAAATAAGG TTTGGAAGCA GAAGAGCAAA AAAAAGATAT TGTTAAAATG AGGCCTCCAT 1200 

GCAAAACACA TACTTCCCTC CCATTTATTT AACTTTTTTT TTCTCCTACC TATGGGGACC 1260 

AAAGTGCTTT TTCCTTCAGG AAGTGGAGAT GCATGGCCAT CTCCCCCTCC CTTTTTC CTT 1320 

oU CTCCTGCTTT TCTTTCCCCA TAGAAAGTAC CTTGAAGTAG CACAGTCCGT CCTTGCATGT 1380 

GCACGAGCTA TCATTTGAGT AAAAGTATAC ATGGAGTAAA AATCATATTA AGCATCAGAT 1440 

TCAACTTATA TTTTCTATTT CATCTTCTTC CTTTCCCTTC TCCCACCTTC TACTGGGCAT 1500 

AATTATATCT TAATCATATA TGGAAATGTG CAACATATGG TATTTGTTAA ATACGTTTGT 1560 

TTTTATTGCA GAGCAAAAAT AAATCAAATT AGAAGCAATA AAAAAAAAAA AAAAAAAAA 1619 

1291 



WO 03/042661 



PCT/US02/36810 
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Seq ID NO: Cl 48 DNA Sequence 

Nucleic Acid Accession ft: NM_002091.1 

Coding sequence: 56.. 502 

1 11 21 31 41 51 

I I I I I I 

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60 

CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGOGC CCCGGGGGCG 120 

AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 180 

CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240 

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAO AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420 

AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480 

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720 

TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 

TAAAAGCTTA AACACAT 797 



Seq ID NO: C149 DNA Sequence 
Nucleic Acid Accession #: NM_012261.1 
25 Coding sequence: 203.. 1045 

1 11 21 31 41 51 

I I I I 1 I 

0 A GATTTGCTCT GCCAGCAGCT GTOGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60 

JU ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCOCC GCCTCTCGCT CACCCCGGCC 120 

CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180 

CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240 

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300 

- GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360 

35 TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420 

GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480 

TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540 

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600 

GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660 

41) CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720 

CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780 

TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840 

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960 

45 CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020 

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080 

CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACAOGA GATACACCAA 1140 

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200 

AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260 

5U ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTOG TGCTCATGGT GGCTTGGCTT 1320 

TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380 

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440 

TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500 

TCATGCTCCC TGCAGCAAGA COCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560 

55 GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 1620 

AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680 

GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740 

TTCTCTGGC 1749 

60 Seq ID NOx C150 DNA Sequence 

Nucleic Acid Accession #: NMJJ03226.1 
Coding sequence i 2 . . 226 

^- 1 11 21 31 41 51 

65 1 | | | | | 

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60 

GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGOGGCT ACCCCCATGT 120 

CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 

TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 

70 CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 

CTCAGTTTTT CTGTCCCTTT GCTCOCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360 

CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 398 



75 



Seq id NO: C151 DNA Sequence 

Nucleic Acid Accession 8: NM__002993.1 

Coding sequence: 64.. 408 



0/x 1 11 21 31 41 51 

80 i | | | | | 

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60 

ACTATGAGCC TCCCGTCCAG COGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240 

1292 



WO 03/042661 



PCT/US02/36810 



CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300 

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480 

J CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT S40 

GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600 

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660 

CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780 

1U ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840 

ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080 

15 CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 

ATTCTGGTCA CTAAATATAC ACTTTAGATA GATQAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1380 

2U TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500 

TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 1S47 
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Seq ID NO: CI 52 DNA Sequence 

Nucleic Acid Accession #i NM_005242.2 

Coding sequence: 148.. 1341 



CGGCCCGCCC 
TTTCTCTCGG 
GCGCCOGGOG 
GCCGCCATCC 
TCCTCTAAAG 
GGAGTTACAG 
AAACTGACCA 
AGTAACGGCA 
ATTTACATGG 
ATTGCCTATC 
ATTGGCTTTT 
CAGAGGTATT 
ATTGGCATCT 
GTGAAGCAGA 
GAGCAGCTCT 
CTGTTCCCAG 
TCTGCCATGG 
GTCCTGGCCA 
TTTCTGATTA 
CTCTCTACCC 
AGGGATCATG 
GTATCCCTCA 
ACTGTTAAGA 
TGGAACCTGT 
ACATACCACC 



11 
I 

TGGGGAGGCG 
TGCGTCCAGT 
TCGGGGCTTC 
TGCTAGCAGC 
GAAGAAGCCT 
TTGAAACAGT 
CGGTCTTCCT 
TGGCCCTGTG 
CCAATCTGGC 
ACATACATGC 
TCTATGGCAA 
GGGTCATCGT 
CCCTGGCAAT 
CCATCTTCAT 
TGGTGGGAGA 
CCTTCCTCAC 
ATGAAAACTC 
TGTACCTGAT 
AGAGCCAGGG 
TTAACAGCTG 
CAAAGAACGC 
CCTCAAAGAA 
CCTCCTATTG 
TTAATGTTAT 
G 



21 
I 

CGCAGCAGAG 
GGAGCTCTGA 
CAGGAGGATG 
CTCTCTCTCC 
TATTGGTAAG 
CTTTTCTGTG 
TCCAATTGTC 
GGTCTTTCTT 
CTTGGCTGAC 
CAACAACTGG 
CATGTACTGT 
GAACCCCATG 
ATGGCTGCTG 
TCCTGCCCTG 
CATGTTCAAT 
AGCCTCTGCC 
AGAGAAGAAA 
CTGCTTCACT 
CCAGAGCCAT 
CATCGACCCC 
TCTCCTTTGC 
ACACTCCAGG 
AGTTTTCCAG 
GAGGACGTGT 



31 
I 

GCTCCGATTC 
GTTTCGAATC 
CGGAGCCCCA 
TGCAGTGGCA 
GTTGATGGCA 
GATGAGTTTT 
TACACAATTG 
TTCCGAACTA 
CTCCTCTCTG 
ATTTATGGGG 
TCCATTCTCT 
GGGCACTCCA 
ATTCTGCTGG 
AACATCACGA 
TACTTCCTCT 
TATGTGCTGA 
AGGAAGAGGG 
CCTAGTAACC 
GTCTATGCCC 
TTTGTCTATT 
CGAAGTGTCC 
AAATCCAGCT 
GTCCTCAGAT 
CTGTTATTTC 



41 

I 

GGGGCAGGTG 
GGTGGCGGCG 
GCGCGGCGTG 
CCATCCAAGG 
CATCCCACGT 
CTGCATCTGT 
TGTTTGTGGT 
AGAAGAAGCA 
TCATCTGGTT 
AAGCTCTTTG 
TCATGACCTG 
GGAAGAAGGC 
TCACCATCCC 
CCTGTCATGA 
CTCTGGCCAT 
TGATCAGAAT 
CCATCAAACT 
TTCTGCTTGT 
TGTACATTGT 
ACTTTGTTTC 
GCACTGTAAA 
CTTACTCTTC 
GGGAATTGCA 
CTAATCAAAA 



51 

1 

AGAGGCTGAC 
GATTCCCCGC 
GCTGCTGGGG 
AACCAATAGA 
CACTGGAAAA 
CCTCACTGGA 
GGGTTTGCCA 
OCCTGCTGTG 
CCCCTTGAAG 
TAATGTGCTT 
CCTCAGTGTG 
AAACATTGCC 
TTTGTATGTC 
TGTTTTGCCT 
TGGGGTCTTT 
GCTGCGATCT 
CATTGTCACT 
GGTGCATTAT 
AGCCCTCTGC 
ACATGATTTC 
GCAGATGCAA 
AAGTTCAACC 
CAGTAGGATG 
AGGTCTCACC 



Seq ID NO: C153 DNA Sequence 

Nucleic Acid Accession #: NM_003469.2 

Coding s equence i 92 .. 1 94 5 



GAAACGGCCC 
CATATAAACA 
AGCAGCCCTG 
TCAGAGAAAC 
TCCCAGTCCT 
TAAGGAAGAA 
AAAAGAAAAT 
GATGAGAATA 
AGAAAATAAG 
TTATGAGACA 
TGAA0AGAAT 
TACTCCTCAA 
ACCAAACAAC 
AGATGATATC 
GAACCCAGTA 
GAATATAGGA 
CCAGGAAGAA 
AATTGCCTAT 
AAATGGGGAA 
GCTGATTGAA 
AACTGGGGAG 
CCTAGATGAC 



11 
I 

GAGAAGCTCG 
AAAAGAGGAA 
TCTCTTATCC 
CAGCTGCTTC 
GAAATGATCA 
AGCAGCCCAG 
GGCGATGAAA 
ATACTCGAAG 
CCCTATGCCT 
CAGCAGTGGC 
TCCAGGGATA 
AGCCTTGCTA 
CAGAAACGTG 
TACAAGGCTA 
GAGGAGAAAA 
AAAAATGAAC 
GATCTTCGGA 
TTGAAAAGGT 
AGGGCCACCA 
ATCXCAAGGA 
AAGCCGAATG 
ATCTCAGAGG 



21 
I 

CCOGGAGAAC 
ATCTTTCAAA 
CTTTAATTTT 
AGAAAGAACC 
GGGCTTTGGA 
ATTATAATCC 
GCCACTTGCC 
CTTTGAGACA 
TGAATTCAGA 
CAGAAAGAAA 
ACCCCTTTAA 
CATTGGAATC 
AGAGGATGGA 
ATAACATTGC 
TAGAGAGTCA 
AAATCAACGA 
AAGAGAGTAA 
TAGTAAATGC 
GGCTTTTTGA 
ATTTACAGAT 
GATCAGTGGA 
CTGACTTAGA 



31 
I 

GGGGAGGAAT 
CATGGCTGAA 
CCTCATCTCT 
AGACCTCAGG 
GTACATAGAA 
CTACCAAGGT 
CGAGAGGGAT 
GGCTGAAAAT 
AAAGAACTTT 
GCTTAAGCAC 
ACGCACAAAT 
TGTCTTCCAA 
TGAGGAGCAA 
CTATGAAGAT 
AACCCAGGAA 
TGAGATGAAA 
AGACCAACTC 
TGCAGGAAGT 
GAAACCTCTT 
ACCCCCAGAA 
ACCGGAGCGG 
CCATCCAGAC 



41 
I 

ATGCTGTGGA 
GCAAAGACCC 
GGGGCTGAAG 
TTGGAAAATG 
AACCTCCGAC 
GTCTCTGTCC 
TCACTGAGTG 
GAGCCTCAGT 
CCAATGGACA 
ATGCAATTCC 
GAAATAGTGG 
GAGCTGGGGA 
AAACTTTATA 
GTGGTCGGGG 
GAGGTGAGAG 
CGCTCAGGGC 
TCAGATGATG 
GGGAGGTTAC 
GATTCTCAGT 
GACTTAATTG 
GAGCTTGACC 
CTGTTCCAAA 



51 
I 

GCTCCTCTGC 
ACTGGCTTGG 
CAGCTTCATT 
TCCAAAAGTT 
AACAAGCTCA 
CCCTTCAGCA 
AAGAAGACTG 
CTGCAOCAAA 
TGAGTGATGA 
CTCCTATGTA 
AGGAACAATA 
AACTGACAGG 
CGGATGATGA 
GAGAAGACTG 
ACAGCAAAGA 
AGCTTGGCAT 
TCTCCAAAGT 
AGAATGGGCA 
CTATTTATCA 
AGATG CTCA A 
TTCCTGTTGA 
ATAGGATGCT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1451 
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CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380 

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 1440 

GTOGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500 

TGGAAQATCT AQATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560 

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620 

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680 

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740 

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800 

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860 

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920 

TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040 

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAGT CTGCTTCATT 2100 

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTCC TTGATAAAAA 2160 

ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220 

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280 

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GTTTTGTGGA GTACTGGACC 2340 

AAAAAAATAA AGCATTATAA ATATA 2365 

20 Seq ID NO: C154 bna Sequence 

Nucleic Acid Accession # « Nrt_03095S 
Coding sequence: 327.. 5108 

~- 1 11 21 31 41 51 

25 | | | | | | 

GAATTCCGGG AGCGGGCGGG CTGCGAGGCC GCGGGGCATG CGGGAGGCGG AGGGGTGGGA 60 

CCGGGTGGCT GCGCCCATTC CACACCCGCC GAAAGCGGAC ACTGTCAGCT GAATCACTCC 120 

CCTTTTAGGA GGAGGGAGGG GGAAAAGGTG TCTAGCTAAT TTCTGCTTAA AAAAGCACAG 180 

GAGATCGCGG GTCAGCTTTG CAGTCGCTGC CTTCTCGCGC CTGACCATGC ACCCCTGCAT 240 

jy) CTTCCTGCTG GGCACAGGOG AGCGCTTTAT TTCTGGAGCT GAGGGCTAAA ACTTTTTTCA 300 

CTTTTCTTCT CCTCAACATC TGAATCATGC CATGTGCCCA GAGGAGCTGG CTTGCAAACC 360 

TTTCCGTGGT GGCTCAGCTC CTTAACTTTG GGGCGCTTTG CTATGGGAGA CAGCCTCAGC 420 

CAGGCCCGGT TCGCTTCCCG GACAGGAGGC AAGAGCATTT ' TATCAAGGGC CTGCCAGAAT 480 

ACCACGTGGT GGGTCCAGTC CGAGTAGATG CCAGTGGGCA TTTTTTGTCA TATGGCTTGC 540 

5 J ACTATCCCAT CACGAGCAGC AGGAGGAAGA GAGATTTGGA TGGCTCAGAG GACTGGGTGT 600 

ACTACAGAAT TTCTCACGAG GAGAAGGACC TGTTTTTTAA CTTGACGGTC AATCAAGGAT 660 

TTCTTTCCAA TAGCTACATC ATGGAGAAGA GATATGGGAA CCTCTGCCAT GTTAAGATGA 720 

TGGCTTCCTC TGCCCOCCTC TGCCATCTCA GTGGCACGGT TCTACAGCAG GGCACCAGAG 780 

TTGGGACGGC AGCCCTCAGT GCCTGCCATG GACTGACTGG ATTTTTCCAA CTACCACATG 840 

4U GAGACTTTTT CATTGAACCC GTGAAGAAGC ATOCACTGGT TGAGGGAGGG TACCACCCGC 900 

ACATCGTTTA CAGGAGGCAG AAAGTTCCAG AAACCAAGGA GCCAACCTGT GGATTAAAGG 960 

ACAGTGTTAA CATCTCCCAG AAGCAAGAGC TATGGOGGGA GAAGTGGGAG AGGCACAACT 1020 

TGCCAAGCAG AAGCCTCTCT CGGCGTTCCA TCAGCAAGGA GAGATGGGTG GAGACACTGG 1080 

TGGTGGCCGA CACAAAGATG ATTGAATACC ATGGGAGTGA GAATGTGGAG TCCTACATCC 1140 

4 J TCACCATCAT GAACATGGTC ACTGGGTTGT TCCATAACCC AAGCATTGGC AATGCAATTC 1200 

ACATTGTTGT GGTTCGGCTC ATTCTACTCG AAGAAGAAGA GCAAGGACTG AAAATAGTTC 1260 

ACCATGCAGA AAAGACACTG TCTAGCTTCT GCAAGTGGCA GAAGAGTATC AATCCCAAGA 1320 

GTGACCTCAA TCCTGTTCAT CACGACGTGG CTGTCCTTCT CACCAGAAAG GACATCTGTG 1380 

CTGGTTTCAA TCGCCCCTGC GAGACCCTGG GCCTGTCTCA CCTTTCAGGA ATGTGTCAGC 1440 

jU CTCACCGCAG TTGTAACATC AATGAAGATT CGGGACTCCC TCTGGCTTTC ACAATTGCCC 1500 

ATGAGCTAGG ACACAGCTTC GGCATCCAGC ATGATGGGAA AGAAAATGAC TGTGAGCCTG 1560 

TGGGCAGACA TCCGTACATC ATGTCCCGCC AGCTCCAGTA CGATCCCACT CCGCTGACAT 1620 

GGTCCAAGTG CAGCGAGGAG TACATCACCC GCTTCTTGGA CCGAGGC7GG GGGTTCTGTC 1680 

TTGATGACAT ACCTAAAAAG AAAGGCTTGA AGTCCAAGGT CATTGCCCCC GGAGTGATCT 1740 

J J ATGATGTTCA CCACCAGTGC CAGCTACAAT ATCGACCCAA TGCTACCTTC TGCCAGGAAG 1800 

TAGAAAAOGT CTGCCAGACA CTGTGGTGCT COGTGAAGGG CTTTTGTCGC TCTAAGCTGG 1860 

ACGCTGCTGC AGATGGAACT CAATGTGGTG AGAAGAAGTG GTGTATGGCA GGCAAGTGCA 1920 

TCACAGTGGG GAAGAAACCA GAGAGGATTC CTGGAGGCTG GGGCCGCTGG TCACCCTGGT 1980 

- CCCACTGTTC CAGGACCTGT GGGGCTGGAG TCCAGAGOGC AGAGAGGCTC TGCAACAACC 2040 

OU CCGAGCCAAA GTTTGGAGGG AAATATTGCA CTGGAGAAAG AAAACGCTAT OGCTTGTGCA 2100 

ACGTCCACCC CTGTCGCTCA GAGGCACCAA CATTTCGGCA GATGCAGTGC AGTGAATTTG 2160 

ACACTGTTCC CTACAAGAAT GAACTCTACC ACTGQTTTCC CATTTTTAAC CCAGCACATC 2220 

CTTGTGAGCT CTACTGCCGA CCCATAGATG GCCAGTTTTC TGAGAAAATG CTGGATGCTG 2280 

c - TCATTGATGG TACCCCTTGC TTTGAAGGCG GCAACAGCAG AAATGTCTGT ATTAATGGCA 2340 

OJ TATGTAAGAT GGTTGGCTGT GACTATGAGA TCGATTCCAA TGCCACCGAG GATCGCTGCG 2400 

GTGTGTGCCT GGGAGATGGC TCTTCCTGCC AGACTGTGAG AAAGATGTTT AAGCAGAAGG 2460 

AAGGATCTGG TTATGTTGAC ATTGGGCTCA TTCCAAAAGG AGCAAGGGAC ATAAGAGTGA 2520 

TGGAAATTGA GGGAGCTGGA AACTTCCTGG CCATCAGGAG TGAAGATCCT GAAAAATATT 2580 

„ ACCTGAATGG AGGGTTTATT ATCCAGTGGA ACGGGAACTA TAAGCTGGCA GGGACTGTCT 2640 

/U TTCAGTATGA CAGGAAAGGA GACCTGGAAA AGCTGATGGC CACAGGTCCC ACCAATGAGT 2700 

CTGTGTGGAT CCAGCTTCTA TTCCAGGTGA CTAACCCTGG CATCAAGTAT GAGTACACAA 2760 

TCCAGAAAGA TGGCCTTGAC AATGATGTTG AGCAGATGTA CTTCTGGCAG TACGGOCACT 2820 

GGACAGAGTG CAGTGTGACC TGCGGGACAG" GTATCCGCCG CCAAACTGCC CATTGCATAA 2880 

AGAAGGGCCG CGGGATGGTG AAAGCTACAT TCTGTGACCC AGAAACACAG CCCAATGGGA 2940 

l~> GACAGAAGAA GTGCCATGAA AAGGCTTGTC CACCCAGGTG GTGGGCAGGG GAGTGGGAAG 3000 

CATGCTCGGC GACATGCGGG CCCCAOGGGG AGAAGAAGCG AACCGTGCTG TGCATCCAGA 3060 

CCATGGTCTC TGACGAGCAG GCTCTCCCGC CCACAGACTG CCAGCACCTG CTGAAGCCCA 3120 

AGACCCTCCT TTCCTGCAAC AGAGACATCC TGTGCCCCTC GGACTGGACA GTGGGCAACT 3180 

OA GGAGTGAGTG TTCTGTTTCC TGTGGTGGTG GAGTGOGGAT TCGCAGTGTC ACATGTGCCA 3240 

OU AGAACCATGA TGAACCTTGC GATGTGACAA GGAAACCCAA CAGCCGAGCT CTGTGTGGCC 3300 

TCCAGCAATG CCCTTCTAGC CGGAGAGTTC TGAAACCAAA CAAAGGCACT ATTTCCAATG 3360 

GAAAAAACCC ACCAACACTA AAGCCOGTCC CTCCACCTAC ATCCAGGCCC AGAATGCTGA 3420 

CCACACCCAC AGGGCCTGAG TCTATGAGCA CAAGCACTCC AGCAATCAGC AGCCCTAGTC 3480 

CTACCACAGC CTCCAAAGAA GGAGACCTGG GTGGGAAACA GTGGCAAGAT AGCTCAACCC 3540 
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AACCTGAGCT GAGCTCTCGC TATCTCATTT CCACTGGAAG CACTTCCCAG CCCATCCTCA 3600 

CTTCCCAATC CTTGAGCATT CAGCCAAGTG AGGAAAATGT TTCCAGTTCA GATACTGGTC 3660 

CTACCTCGGA GGGAGGCCTT GTAGCTACAA CAACAAGTGG TTCTGGCTTG TCATCTTCCC 3720 

GCAACCCTAT CACTTGGCCT GTGACTCCAT TTTACAATAC CTTGACCAAA GGTCCAGAAA 3780 

TGGAGATTCA CAGTGGCTCA GGGGAAQAAA GAGAACAGCC TGAGGACAAA GATGAAAGCA '3840 

ATCCTGTAAT ATGGACCAAG ATCAGAGTAC CTGGAAATGA CGCTCCAGTG GAAAGTACAG 3900 

AAATGCCACT TGCACCTCCA CTAACACCAG ATCTCAGCAG GGAGTCCTGG TGGCCACCCT 3960 

TCAGCACAGT AATGGAAGGA CTGCTCCCCA GCCAAAGGCC CACTACTTCC GAAACTGGGA 4020 

CACCCAGAGT TGAGGGGATG GTTACTGAAA AGCCAGCCAA CACTCTGCTC CCTCTGGGAG 4080 

GAGACCACCA GCCAGAACCC TCAGGAAAGA CGGCAAACCG TAACCACCTG AAACTTCCAA 4140 

ACAACATGAA CCAAACAAAA AGTTCTGAAC CAGTCCTGAC TGAGGAGGAT GCAACAAGTC 4200 

TGATTACTGA GGGCTTTTTG CTAAATGCCT CCAATTACAA GCAGCTCACA AACGGCCACG 4260 

GCTCTGCACA CTGGATCGTC GGAAACTGGA GCGAGTGCTC CACCACATGT GGCCTGGGGG 4320 

CCTACTGGAA AAGGGTGGAG TGCACCACCC AGATGGATTC TGACTGTGCG GCCATCCAGA 4380 

GACCTGACCC TGCAAAAAGA TGCCACCTCC GTCCCTGTGC TGGCTGGAAA GTGGGAAACT 4440 

GGAGCAAGTG CTCCAGAAAC TGCAGTGGGG GCTTCAAGAT ACGCGAGATT CAGTGCGTGG 4500 

ACAGCCGGGA CCACCGGAAC CTGAGGCCAT TTCACTGCCA GTTCCTGGCC GGCATTCCTC 4560 

CCCCATTGAG CATGAGCTGT AACCCGGAGC CCTGTGAGGC GTGGCAGGTG GAGCCTTGGA 4620 

GCCAGTGCTC CAGGTCCTGT GGAGGTGGAG TTCAGGAGAG AGGAGTGTTC TGTCCAGGAG 4680 

GCCTCTGTGA TTGGACAAAA AGACCCACAT CCACCATGTC TTGCAATGAG CACCTGTGCT 4740 

GTCACTGGGC CACTGGGAAC TGGGACCTGT GTTCCACTTC CTGTGGAGGT GGCTTTCAGA 4800 

AGAGGATTGT CCAATGTGTG CCCTCAGAGG GCAATAAAAC TGAAGACCAA GACCAATGTC 4860 

TATGTGATCA CAAACCCAGA CCTCCAGAAT TCAAAAAATG CAACCAGCAG GCCTGCAAGA 4920 

AAAGTGCCGA TTTACTTTGC ACTAAGGACA AACTGTCAGC CAGTTTCTGC CAGACACTGA 4980 

AAGCCATGAA GAAATGTTCT GTGCCCACOG TGAGGGCTGA GTGCTGCTTC TCGTGTCCCC 50 40 

AGACACACAT CACACACACC CAAAGGCAAA GAAGGCAAOG GTTGCTCCAA AAGTCAAAAG 5100 

AACTCTAAGC CCAAA 5115 

Seq ID NO: CI 55 DNA Sequence 

Nucleic Acid Accession 8: NM_ooi062.l 

Coding sequence: 76.. 13 80 

1 11 21 31 41 51 

I I I I I I 

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG CCAATTCATC AACATTCTGG 60 

TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 120 

TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 180 

CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 240 

AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 300 

ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 360 

GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 420 

TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 480 

CACAATGGCA CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 540 

CTGTTCAATG GGAACTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 600 

AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 660 

ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 720 

AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TGTCTGAGAA AAAAGAAAAT 780 

GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 840 

GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 900 

ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 960 

GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 1020 

AACATCTCCG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC ATATATCTCC 1080 

GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 1140 

TCTGTCTTCC TCAGTGTGAT GGAGAAAGCC CAGAAAATGA ATGATACTAT ATTTGGTTTC 1200 

ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 1260 

AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 1320 

GGTAGTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 1380 

GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 1440 

TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCCTTCTC 1500 

TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 1537 

Seq ID NO: CI 5 6 DNA Sequence 
Nucleic Acid Accession ft: NM_00459l 
Coding sequence: 5 9.. 34 9 

1 11 21 31 41 51 

I I I I I I 

CACTCCCAAA GAACTGGGTA CTCAACACTG AGCAGATCTG TTCTTTGAGC TAAAAACCAT 60 

GTGCTGTACC AAGAGTTTGC TCCTGGCTGC TTTGATGTCA GTGCTGCTAC TCCACCTCTG 120 

CGGCGAATCA GAAGCAGCAA GCAACTTTGA CTGCTGTCTT GGATACACAG ACCGTATTCT 180 

TCATCCTAAA TTTATTGTGG GCTTCACACG GCAGCTGGCC AATGAAGGCT GTGACATCAA 240 

TGCTATCATC TTTCACACAA AGAAAAAGTT GTCTGTGTGC GCAAATCCAA AACAGACTTG 300 

GGTGAAATAT ATTGTGCGTC TCCTCAGTAA AAAAGTCAAG AACATGTAAA AACTGTGGCT 360 

TTTCTGGAAT GGAATTGGAC ATAGCCCAAG AACAGAAAGA ACCTT GC TGG GGTTGGAGGT 420 

TTCACTTGCA CATCATGGAG GGTTTAGTGC TTATCTAATT TGTGCCTCAC TGGACTTGTC 480 

CAATTAATGA AGTTQATTCA TATTGCATCA TAGTTTGCTT TGTTTAAGCA TCACATTAAA 540 

GTTAAACTGT ATTTTATGTT ATTTATAGCT GTAGGTTTTC TGTGTTTAGC TATTTAATAC 600 

TAATTTTCCA TAAGCTATTT TGGTTTAGTG CAAAGTATAA AATTATATTT GGGGGGGAAT 660 

AAGATTATAT GGACTTTCTT GCAAGCAACA AGCTATTTTT TAAAAAAACT ATTTAACATT 720 

CTTTTGTTTA TATTGTTTTG TCTCCTAAAT TGTTGTAATT GCATTATAAA ATAAGAAAAA 780 

CATTAATAAG ACAAATATT 799 

Seq ID NO: C157 DNA Sequence 

Nucleic Acid Accession 8t nm 013271.1 
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Coding sequence: 27.. 809 

1 11 21 31 41 51 

r \ I I I I I 

J TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC QCTGCTCTGQ GGGCCGCGGG 60 

CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTOGGCCG CCCCCCGOGC 120 

TCTGCGCGOG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGOGTCTCGG CCCTTGGCTG 180 

AGACTGGCGC TCCTCGCCGC TTCOGGCGGT CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240 

TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300 

10 GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGAOGACGAC CCCGACGCGC 420 

CTGCAGCGCA GCTCGCTOGC GCTCTGCTCC GGGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 

CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTAOG S40 

ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600 

CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660 

TGGCAGCCCC GCGCCGCCTC CGCOGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720 

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780 

TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840 

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 900 

TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960 

GATCTGAGC 969 



15 



20 



,Seq ID NO: C158 DMA Sequence 
Nucleic Acid Accession #: NM_002245.2 
25 Coding sequence: 183.. 1193 ~ 

1 11 21 31 41 51 

I I I I I I 

GGGCAGGAAG ACGGCGCTGC CCGGAGGAGC GGGGCGGGCG GGCGCGCGGG GGAGCGGGCG 60 

30 GCGGGCGGGA GCCAGGCCCG GGCGCGGGCG GGGGCGGCGG GGCCAGAAGA GGCGGCGGGC 120 

CGCGCTCCGG CCGGTCTGCG GCGTTGGCCT TGGCTTTGGC TTTGGCGGCG GCGGTGGAGA 180 

AGATGCTGCA GTCCCTGGCC GGCAGCTCGT GCGTGCGCCT GGTGGAGCGG CACCGCTCGG 240 

CCTGGTGCTT CGGCTTCCTG GTGCTGGGCT ACTTGCTCTA CCTGGTCTTC GGCGCAGTGG 300 

TCTTCTCCTC GGTGGAGCTG CCCTATGAGG ACCTGCTGCG CCAGGAGCTG CGCAAGCTGA 360 

35 AGCGACGCTT CTTGGAGGAG CACGAGTGCC TGTCTGAGCA GCAGCTGGAG CAGTTCCTGG 420 

GCCGGGTGCT GGAGGCCAGC AACTACGGCG TGTCGGTGCT CAGCAACGCC TCGGGCAACT 480 

GGAACTGGGA CTTCACCTCC GCGCTCTTCT TCGCCAGCAC CGTGCTCTCC ACCACAGGTT 540 

ATGGCCACAC CGTGCCCTTG TCAGATGGAG GTAAGGCCTT CTGCATCATC TACTCCGTCA 600 

TTGGCATTCC CTTCACCCTC CTGTTCCTGA CGGCTGTGGT CCAGCGCATC ACCGTGCACG 660 

40 TCACCCGCAG GCCGGTCCTC TACTTCCACA TCCGCTGGGG CTTCTCCAAG CAGGTGGTGG 720 

CCATCGTCCA TGCCGTGCTC CTTGGGTTTG TCACTGTGTC CTGCTTCTTC TTCATCCCGG 780 

CCGCTGTCTT CTCAGTCCTG GAGGATGACT GGAACTTCCT GGAATCCTTT TATTTTTGTT 840 

TTATTTCCCT GAGCACCATT GGCCTGGGGG ATTATGTGCC TGGGGAAGGC TACAATCAAA 900 

AATTCAGAGA GCTCTATAAG ATTGGGATCA CGTGTTACCT GCTACTTGGC CTTATTGCCA 960 

45 TGTTGGTAGT TCTGGAAACC TTCTGTGAAC TCCATGAGCT GAAAAAATTC AGAAAAATGT 1020 

TCTATGTGAA GAAGGACAAG GAGGAGGATC AGGTGCACAT CATAGAGCAT GACCAACTGT 1080 

CCTTCTCCTC GATCACAGAC CAGGCAGCTG GCATGAAAGA GGACCAGAAG CAAAATGAGC 1140 

CTTTTGTGGC CACCCAGTCA TCTGCCTGCG TGGATGGCCC TGCAAACCAT TGAGCGTAGG 1200 

A ATTTGTTGCA TTATGCTAGA GCACCAGGGT CAGGGTGCAA GGAAGAGGCT TAAGTATGTT 1260 

JO CATTTTTATC AGAATGCAAA AGCGAAAATT ATGTCACTTT AAGAAATAGC TACTGTTTGC 1320 

AATGTCTTAT TAAAAAACAA CAAAAAAAGA CACATGGAAC AAAGAAGCTG TGACCCCAGC 1380 

AGGATGTCTA ATATGTGAGG AAATGAGATG TCCACCTAAA ATTCATATGT GACAAAATTA 1440 

TCTCGACCTT ACATAGGAGG AGAATACTTG AAGCAGTATG CTGCTGTGGT TAGAAGCAGA 1500 

TTTTATACTT TTAACTGGAA ACTTTGGGGT TTGCATTTAG ATCATTTAGC TGATGGCTAA 1560 

J J ATAGCAAAAT TTATATTTAG AAGCAAAAAA AAAAAGCATA GAGATGTGTT TTATAAATAG 1620 

GTTTATGTGT ACTGGTTTGC ATGTACCCAC CCAAAATGAT TATTTTTGGA GAATCTAAGT 1680 

CAAACTCACT ATTTATAATG CATAGGTAAC CATTAACTAT GTACATATAA AGTATAAATA 1740 

TGTTTATATT CTGTACATAT GGTTTAGGTC ACCAGATCCT AGTGTAGTTC TGAAACTAAG 1800 

ACTATAGATA TTTTGTTTCT TTTGATTTCT CTTTATACTA AAGAATCCAG AGTTGCTACA 1860 

ATAAAATAAG GGGAATAATA AACTTGAGAG TGAATAACCA T 1901 
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Seq ID NO: CIS 9 DNA Sequence 

Nucleic Acid Accession #: NM_005472.l 

Coding sequence: 93.. 404 

1 11 21 31 41 51 

I I I I i I 

AAAGGGACTC CTTGAAACTG ATTGAGAGCC CAGTGGATTT GCCAGCAGTT TGAGCTTCTA 60 

CCGAGTCTTC CCCCACCTCA ATCCCTGTTG CTATGGAGAC TACCAATGGA ACGGAGACCT 120 

GGTATGAGAG CCTGCATGCC GTGCTGAAGG CTCTAAATGC CACTCTTCAC AGCAATTTGC 180 

TCTGCCGGCC AGGGCCAGGG CTGGGGCCAG ACAACCAGAC TGAAGAGAGG CGGGCCAGCC 240 

TACCTGGCCG TGATGACAAC TCCTACATGT ACATTCTCTT TGTCATGTTT CTATTTGCTG 300 

TAACTGTGGG CAGCCTCATC CTGGGATACA CCCGCTCCCG CAAAGTGGAC AAGCGTAGTG 360 

ACCCCTATCA TGTGTATATC AAGAACCGTG TGTCTATGAT CTAACACGAG AGGGCTGGGA 420 

CGGTGGAAGA CCAAGACACC TGGGGATTGC GTCTGGGGCC TCCAGAACTC TGCTGTGGAC 480 

TGCATCAGGT CT 492 



Seq ID NO: CI 60 DNA Sequence 
Nucleic Acid Accession #: KM 005245.1 
oO Coding sequence: 187.. 13959 ~~ 

1 11 21 31 ' 41 51 

I I I I I I 

CTGGGCGGCC GGGCGCGGGG AGAGGGCGCG GGAGOGGCTC GTGCGGCAGG TACCATGCGG 60 

1296 



WO 03/042661 



ACGCGCGAGC CCGGCGAGGC CCCGGCAGGC CCGTCCCTGC TCGGGGGCGC GCTGAGACGG 120 

CGGGTGAGCT CCACGAGAGC GCCGTCGCCA CTTCGGGCCA ACTTTGCGAT TCCCGACAGT 180 

TAAGCAATGG GGAGACATTT GGCTTTGCTC CTGCTTCTGC TCCTTCTCTT CCAACATTTT 240 

GGAGACAGTG ATGGCAGCCA ACGACTTGAA CAGACTCCTC TGCAGTTTAC ACACCTCGAG 300 

TACAACGTCA CCGTGCAGGA GAACTCTGCA GCTAAGACTT ATGTGGGGCA TCCTGTCAAG 360 

ATGGGTGTTT ACATTACACA TCCAGCGTGG GAAGTAAGGT ACAAAATTGT TTCOGGAGAC 420 

AGTGAAAACC TGTTCAAAGC TGAAGAGTAC ATTCTOGGAG ACTTTTGCTT TCTAAGAATA 480 

AGGACCAAAG GAGOAAATAC AGCTATTCTT AATAGAGAAG TGAAGGATCA CTACACATTG 54 0 

ATAGTGAAAG CACTTGAAAA AAATACTAAT GTGGAGGCGC GAACAAAGGT CAGGGTGCAG 600 

GTGCTGGATA CAAATGACTT GAGACCGTTA TTCTCACCCA CCTCATACAG CGTTTCTTTA 660 

CCTGAAAACA CAGCTATAAG GACCAGTATC GCAAGAGTCA GCGCCACGGA TGCAGACATA 720 

GGAACCAACG GGGAATTTTA CTACAGTTTT AAAGATCGAA CAGATATGTT TGCTATTCAC 780 

CCAACCAGTG GTGTGATAGT GTTAACTGGT AGACTTGATT ACCTAGAGAC CAAGCTCTAT 840 

GAGATGGAAA TCCTCGCTGC GGACCGTGGC ATGAAGTTGT ATGGGAGCAG TGGCATCAGC 900 

AGCATGGCCA AGCTAAOGGT GCACATCGAA CAGGCCAATG AATGTGCTCC GGTGATAACA 960 

GCAGTGACAT TGTCACCATC AGAACTGGAC AGGGACCCAG CATATGCAAT TGTGACAGTG 1020 

GATGACTGCG ATCAGGGTGC CAATGGTGAC ATAGCATCTT TAAGCATOGT GGCAGGTGAC 1080 

CTTCTCCAGC AGTTTAGAAC AGTGAGGTCC TTTCCAGGGA GTAAGGAGTA TAAAGTCAAA 1140 

GCCATCGGTG ACATTGATTG GGACAGTCAT CCTTTCGGCT ACAATCTCAC ACTACAGGCT 12 0O 

AAAGATAAAG GAACTCCGCC CCAGTTCTCT TCTGTTAAAG TCATTCACGT GACTTCTCCA 1260 

CAGTTCAAAG CCGGGCCAGT CAAGTTTGAA AAGGATGTTT ACAGAGCAGA AATAAGTGAA 1320 

TTTGCTCCTC CCAACACACC TGTGGTCATG GTAAAGGCCA TTCCTGCTTA TTCCCATTTG 1380 

AGGTATGTTT TTAAAAGGAC ACCTGGAAAA GCTAAATTCA GTTTAAATTA CAACACTGGT 1440 

CTCATTTCTA TTTTAGAACC AGTTAAAAGA CAGCAGGCAG CCCATTTTGA ACTTGAAGTA 1500 

ACAACAAGTG ACAGAAAAGC GTCCACCAAG GTCTTGGTGA AAGTCTTAGG TGCAAATAGC 1560 

AATCCCCCTG AATTTACCCA GACAGCGTAC AAAGCTGCTT TTGATGAGAA CGTGCCCATT 1620 

GGTACTACTA TCATGAGCCT GAGTGCCGTA GACCCTGATG AGGGTGAGAA TGGGTACGTG 1680 

ACATACAGTA TCGCAAATTT AAATCATGTG CCGTTTGCGA TTGACCATTT CACTGGTGCC 1740 

GTGAGTACGT CAGAAAACCT GGACTACGAA CTGATGCCTC GGGTTTATAC TCTGAGGATT 1800 

CGTGCATCAG ACTGGGGCTT GCCGTACOGC CGGGAAGTCG AAGTCCTTGC TACAATTACT I860 

CTCAATAACT TGAATGACAA CACACCTTTG TTTGAGAAAA TAAATTGTGA AGGGACAATT 1920 

CCCAGAGATC TAGGCGTGGG AGAGCAAATA ACCACTGTTT CTGCTATTGA TGCAGATGAA 1980 

CTTCAGTTGG TACAGTATCA GATTGAAGCT GGAAATGAAC TGGATTTGTT TAGTTTAAAC 2040 

CCCAACTCGG GGGTATTGTC ATTAAAGCGA TCGCTAATGG ATGGCTTAGG TGCAAAGGTG 2100 

TCTTTCCACA GTCTGAGAAT CACAGCTACA GATGGAGAAA ATTTTGCCAC ACCATTATAT 2160 

ATCAACATAA CAGTGGCTGC CAGTCACAAG CTGGTAAACT TGCAGTGTGA AGAGACTGGT 2220 

GTTGCCAAAA TGCTGGCAGA GAAGCTCCTG CAGGCAAATA AATTACACAA CCAGGGAGAG 2280 

GTGGAGGATA TTTTCTTCGA TTCTCACTCT GTCAATGCTC ACATACOGCA GTTTAGAAGC 2340 

ACTCTTCCGA CTGGTATTCA GGTAAAGGAA AACCAGCCTG TGGGTTCCAG TGTAATTTTC 2400 

ATGAACTCCA CTGACCTTGA CACTGGCTTC AATGGAAAAC TGGTCTATGC TGTTTCTGGA 2460 

GGAAATGAGG ATAGTTGCTT CATGATTGAT ATGGAAACAG GAATGCTGAA AATTTTATCT 2520 

CCTCTTGACC GTGAAACAAC AGACAAATAC ACCCTGAATA TTACCGTCTA TGACCTTGGG 2580 

ATACCCCAGA AGGCTGCGTG GCGTCTTCTA CATGTCGTGG TTGTOGATGC CAATGATAAT 2640 

CCACCCGAGT TTTTACAGGA GAGCTATTTT GTGGAAGTGA GTGAAGACAA GGAGGTACAT 2700 

AGTGAAATCA TCCAGGTTGA AGCCACAGAT AAAGACCTGG GGCCCAACGG ACACGTGACG 2760 

TACTCAATTC TTACAGACAC AGACACATTT TCAATTGACA GCGTGACGGG TGTTGTTAAC 2820 

ATCGCACGCC CTCTGGATCG AGAGCTGCAG CATGAGCACT CCTTAAAGAT TGAGGCCAGG 2880 

GACCAAGOCA GAGAAGAGCC TCAGCTGTTC TCCACXGTCG TTGTGAAAGT ATCACTAGAA 2940 

GATGTTAATG ACAACCCACC TACATTTATT CCACCTAATT ATOGTGTGAA AGTCCGAGAG 3000 

GATCTTCCAG AAGGAACCGT CATCATGTGG TTAGAAGCCC ACGATCCTGA TTTAGGTCAG 3060 

TCTGGTCAGG TGAGATACAG CCTTCTGGAC CAOGGAGAAG GAAACTTOGA TGTGGATAAA 3120 

CTCAGTGGAG CAGTTAGGAT CGTCCAGCAG TTGGACTTTG AGAAGAAGCA AGTGTATAAT 3180 

CTCACTGTGA GGGCCAAAGA CAAGGGAAAG CCAGTTTCTC TGTCTTCTAC TTGCTATGTT 3240 

GAAGTTGAGG TGGTTGATGT GAATGAGAAC CTGCACCCAC CCGTGTTTTC CAGCTTTGTG 3300 

GAAAAGGGGA CAGTGAAAGA AGATGCACCT GTTGGTTCAT TGGTAATGAC GGTGTCGGCT 3360 

CATGATGAGG ACGCOGGAAG AQATGGGGAG ATCCGATACT CCATTAGAGA TGGCTCTGGC 3420 

GTTGGTGTTT TCAAAATAGG TGAAGAGACA GGTGTCATAG AGACGTCAGA TCGACTGGAC 3480 

CGTGAATCGA CCTCCCATTA TTGGCTAACA GTCTTTGCAA COGATCAGGG TGTOGTGCCT 3540 

CTTTCATCGT TCATAGAGAT CTACATAGAG GTTGAGGATG TGAATGACAA TGCACGACAG 3600 

ACATCAGAGC CTGTTTATTA CCCAGAAATC ATGGAAAATT CTCCTAAAGA TGTATCTGTG 3660 

GTCCAGATCG AGGCATTTGA TCCAGATTCG AGCTCTAATG ACAAGCTCAT GTACAAAATT 3720 

ACAAGTGGAA ATCCACAAGG ATTCTTTTCA ATACATCCTA AAACAGGTCT CATCACAACT 3780 

ACGTCAAGGA AGCTAGACCG AGAACAGCAA GATGAACACA TATTAGAGGT TACTGTGACA 3840 

GACAATGGTA GTCCCCCCAA ATCAACCATT GCAAGAGTCA TTGTGAAAAT CCTTGATGAA 3900 

AATGACAACA AACCTCAGTT TCTGCAAAAG TTCTACAAAA TCAGACTCCC TGAGCGGGAA 3960 

AAGCCAGACC GAGAAAGAAA TGCCAGACGG GAGCCGCTCT ATCGCGTCAT AGCCACCGAC 4020 

AAGGATGAGG GCCCCAATGC AGAAATCTCC TACAGCATCG AAGACGGGAA TGAGCATGGC 4080 

AAATTTTTCA TCGAACCGAA AACTGGAGTG GTTTCGTCCA AGAGGTTTTC AGCAGCTGGA 4140 

GAATATGATA TTCTTTCAAT TAAGGCAGTT GACAATGGTC GCCCTCAAAA GTCATCAACC 4200 

ACCAGACrCC ATATTGAATG GATCTCCAAG CCCAAACAGT CCCTGGAGCC CATTTCATTT 4260 

GAAGAATCAT TTTTTACCTT TACTGTGATG GAAAGTGACC COGTTGCTCA CATGATTGGA 4320 
GTAATATCTG TGGAGCCTCC TGGCATACCC C TTT G G TTT G ACATCACTGG TGGCAACTAC 4380 
GACAGTCACT TCGATGTGGA CAAGGGAACT GGAACCATCA TTGTTGCCAA ACCTCTTGAT 4440 
GCAGAACAGA AGTCAAACTA CAACCTCACA GTCGAGGCTA CAGATGGAAC CACCACTATC 4500 
CTGACTCAGG TATTCATCAA AGTAATAGAC ACAAATGACC ATCGTCCTCA GTTTTCXACA 4560 
TCAAAGTATG AAGTTGTTAT TCCTGAAGAT ACAGCGCCAG AAACAGAAAT TTTGCAAATC 4620 
AGTGCTGTGG ATCAGGATGA GAAAAACAAA CTAATCTACA CTCTGCAGAG CAGTAGAGAT 4680 
CCACTGAGTC TCAAGAAATT TCGTCTTGAT CCTGCAACCG GCTCTCTCTA TACTTCTGAG 4740 
AAACTGGATC ATGAAGCTGT TTCACCAGCA CACCTCACGG TCATGGTACG AGATCAAGAT 4800 
GTGCCTGTAA AACGCAACTT TGCAAGGATT GTGGTCAATG TCAGCGACAC GAATGACCAC 4860 
GCCCCGTGGT TCACCGCTTC CTCCTACAAA GGGCGGGTTT ATGAATCGGC AGCCGTTGGC 4920 
TCAGTTGTGT TGCAGGTGAC GGCTCTGGAC AAGGACAAAG GGAAAAATGC TGAAGTGCTG 4980 
TACTCGATCG AGTCAGGAAA TATTGGAAAT ATTGGAAATT CTTTTATGAT TGATCCTGTC 5040 
TTGGGCTCTA TTAAAACTGC CAAAGAATTA GATCGAAGTA ACCAAGCGGA GTATGATTTA 5100 
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ATGGTAAAAG CTACAGATAA GGGCAGTCCA CCAATGAGTG AAATAACTTC TGTGCGTATC 5160 

TTTGTCACAA TTGCTGACAA CGCCTCTCCG AAGTTTACAT CAAAAGAATA TTCTGTTGAA 5220 

CTTAGTGAAA CTGTCAGCAT TGGGAGTTTC GTTGGGATGG TTACAGCCCA TAGTCAATCA 5280 

TCAGTGGTGT ATGAAATAAA AGATGGAAAT ACAGGTGATG CTTTTGATAT TAATCCACAT 5340 

TCTGGAACTA TCATCACTCA GAAAGCCCTG GACTTTGAAA CTTTGCCCAT TTACACATTG 5400 

ATAATACAAG GAACTAACAT GGCTGGTTTG TCCACTAATA CAACGGTTCT AGTTCACTTG 5460 

CAGGATGAGA ATGACAACGC GCCAGTTTTT ATGCAGGCAG AATATACAGG ACTCATTAGT 5S20 

GAATCAGCCT CAATTAACAG CGTGGTCCTA ACAGACAGGA ATGTCCCACT GGTGATTCGA 5580 

GCAGCTGATG CTGATAAAGA CTCAAATGCT TTGCTTGTAT ATCACATTGT TGAACCATCT 5640 

GTACACACAT ATTTTGCTAT TGATTCTAGC ACTGGTGCTA TTCATACAGT ACTAAGTCTG 5700 

GACTATGAAG AAACAAGTAT TTTTCACTTT ACCGTCCAAG TGCATGACAT GGGAACCCCA 5760 

CGTTTATTTG CTGAGTATGC AGCGAATGTA ACAGTACATG TAATTGACAT TAATGACTGC 5820 

CCCCCTGTGT TTGCCAAGCC ATTATATGAA GCATCTCTTT TGTTACCAAC ATACAAAGGA 5880 

GTAAAAGTCA TCACAGTAAA TGCTACAGAT GCTGATTCAA GTGCATTCTC ACAGTTGATT 5940 

TACTCCATCA CCGAAGGCAA CATCGGGGAG AAGTTTTCTA TGGACTACAA GACTGGTGCT 6000 

CTCACTGTCC AAAACACAAC TCAGTTAAGA AGCCGCTACG AGCTAACCGT TAGAGCTTCC 6060 

GATGGCAGAT TTGCCGGCCT TACCTCTGTC AAAATTAATG TGAAAGAAAG CAAAGAAAGT 6120 

CACCTAAAGT TTACCCAGGA TGTCTACTCT GCGGTAGTGA AAGAGAATTC CACCGAGGCC 6180 

GAAACATTAG CTGTCATTAC TGCTATTGGG AGTCCAATCA ATGAGCCTTT GTTTTATCAC 6240 

ATCCTCAACC CAGATCGCAG ATTTAAAATA AGCOGCACTT CAGGGGTTCT GTCAACCACT 6300 

GGCACGCCCT TCGATCGTGA GCAGCAGGAG GCGTTTGATG TGGTTGTAGA AGTGATAGAG 6360 

GAACATAAGC CTTCTGCAGT GGCCCACGTT GTCGTGAAGG TCATTGTAGA AGACCAAAAT 6420 

GATAATGCGC CGGTGTTTGT CAACCTTCCC TACTACGCCG TTGTTAAAGT GGACACTGAG 6480 

GTGGGCCATG TCATTCGCTA TGTCACTGCT GTAGACAGAG ACAGTGGCAG AAACGGGGAA 6540 

GTGCATTACT ACCTCAAGGA ACATCATGAA CACTTTCAAA TTGGACCCTT GGGTGAAATT 6600 

TCACTGAAAA AGCAATTTGA GCTTGACACC TTAAATAAAG AATATCTTGT TACAGTGGTT 6660 

GCAAAAGATG GAGGGAACCC GGCCTTTTCA GCGGAAGTTA TCGTTCCGAT CACTGTCATG 6720 

AATAAAGCCA TGCCTGTGTT TGAAAAACCT TTCTACAGTG CAGAGATTGC AGAGAGCATC 6780 

CAGGTGCACA GCCCTGTGGT CCACGTGCAG GCTAACAGCC CGGAAGGCCT GAAAGTGTTC 6840 

TACAGCATCA CAGACGGAGA CCCTTTCAGC CAGTTCACTA TTAACTTCAA TACTGGAGTT 6900 

ATCAATGTCA TAGCTCCTCT GGACTTTGAG GCCCACCCGG CATATAAGCT GAGCATACGC 6960 

GCAACTGACT CCTTGACGGG CGCTCATGCT GAAGTATTTG TGGACATCAT AGTAGACGAC 7020 

ATCAATGATA ACCCTCCTGT GTTTGCTCAG CAGTCTTATG CGGTGACCCT GTCTGAGGCA 7080 

TCTGTAATTG GAACGTCTGT TGTTCAAGTT AGAGCCACCG ATTCTGATTC AGAACCAAAT 7140 

AGAGGAATCT CATACCAGAT GTTTGGGAAT CACAGCAAGA GTCATGATCA TTTTCATGTA 7200 

GACAGCAGCA CTGGCCTCAT CTCACTACTC AGAACCCTGG ATTACGAGCA GTCGCGGCAG 7260 

CACACGATTT TTGTGAGGGC AGTTGATGGT GGTATGCCCA CGCTGAGCAG TGATGTGATT 7320 

GTCACGGTGG ACGTTACCGA CCTCAATGGT AATCCACCAC TCTTTGAACA ACAGATTTAT 7380 

GAAGCCAGAA TTAGCGAGCA CGCCCCTCAT GGGCATTTCG TGACCTGTGT AAAAGCCTAT 7440 

GATGCAGACA GTTCAGACAT AGACAAGTTG CAGTATTCCA TTCTGTCTGG CAATGATCAT 7500 

AAACATTTTG TCATTGACAG TGCAACAGGG ATTATCACCC TCTCAAACCT GCACCGGCAC 7560 

GCCCTGAAGC CATTTTACAG TCTTAACCTG TCAGTGTCTG ATGGAGTTTT TAGAAGTTCC 7620 

ACCCAGGTTC ATGTAACTGT AATTGGAGGC AATTTGCACA GTCCTGCTTT CCTTCAGAAC 7680 

GAATATGAAG TGGAACTAGC TGAAAACGCT CCCCTACATA CCCTGGTGAT GGAGGTGAAA 7740 

ACTACGGATG GGGATTCTGG TATTTATGGT CACGTTACTT ACCATATTGT AAATGACTTT 7800 

GCCAAAGACA GATTTTACAT AAATGAGAGA GGACAGATAT TTACTTTGGA AAAACTTGAT 7860 

CGAGAAACCC CGGCGGAGAA AGTGATCTCA GTCCGTTTAA TGGCTAAGGA TGCTGGAGGA 7920 

AAAGTTGCTT TCTGCACCGT GAATGTCATC CTTACAGATG ACAATGACAA TGCACCACAA 7980 

TTTCGAGCAA CCAAATACGA AGTGAATATC GGGTCCAGTG CTGCTAAAGG GACTTCAGTC 8040 

GTAAAGTCTG CAAGTGATGC CGATGAGGGC 7CCAATGC0G ACATCACCTA TGCCATTGAA 8100 

GCAGACTCTG AAAGTGTAAA AGAGAATTTG GAAATTAACA AACTGTCCGG CGTAATCACT 8160 

ACAAAGGAGA GCCTCATTGG CTTGGAAAAT GAATTCTTCA CTTTCTTTGT TAGAGCTGTG 8220 

GATAATGGGT CTCCATCAAA AGAATCTGTT GTTCTTGTCT ATGTTAAAAT OCTTCCACCG 8280 

GAAATGCAGC TTCCAAAATT TTCAGAACCT TTCTATACCT TTACAGTGTC AGAGGACGTG 8340 

CCTGTTGGAA CAGAGATAGA TCTCATCCGA GCAGAACATA GTGGGACTGT TCTTTACAGC 8400 

CTGGTCAAAG GGAATACTCC AGAAAGCAAT AGGGATGAGT CCTTTGTGAT TGACAGACAG 8460 

AGCGGGAGAC TGAAGTTGGA GAAGAGTCTT GATCATGAGA CAACTAAGTG GTATCAGTTT 8520 

TCCATACTGG CCAGGTGCAC TCAAGATGAC CATGAGATGG TGGCTTCTGT AGATGTTAGT 8580 

ATCCAAGTGA AAGATGCAAA TGACAACAGC CCGGTCTTTG AATCTAGTCC ATATGAGGCA 8640 

TTCATTGTTG AAAACCTGCC AGGGGGAAGT AGAGTAATTC AGATCAGGGC ATCTGATGCT 8700 

GACTCAGGAA CCAACGGCCA AGTTATGTAT AGCCTGGATC AGTCACAAAG TGTGGAAGTC 8760 

ATTGAATCCT TTGCCATTAA CATGGAAACA GGCTGGATTA CAACTTTAAA GGAACTTGAC 8820 

CATGAAAAGA GAGACAATTA CCAGATTAAA GTGGTTGCAT CAGATCATGG TGAAAAGATC 8880 

CAGCTATCCT CCACAGCCAT TGTGGATGTT ACCGTCACCG ATGTCAACGA TAGTCCACCA 8940 

CGATTCACGG CCGAGATCTA TAAAGGGACT GTGAGTGAGG ATGACOCCCA AGGTGGGGTG 9000 

ATTGCCATCT TAAGTACCAC GGATGCTGAT TCTGAAGAGA TCAACAGACA AGTTACATAT 9060 

TTCATAACAG GAGGGGATCC TTTAGGACAG TTTGCOGTTG AAACTATACA GAATGAATGG 9120 

AAGGTATATG TGAAGAAACC TCTAGACAGG GAAAAAAGGG ACAATTACCT TCTTACTATC 9180 

AOGGCAACTG ATGGCACCTT CTCATCAAAA GCGATAGTTG AAGTGAAAGT TCTGGATGCA 9240 

AATGACAACA GTCCAGTTTG TGAAAAGACT TTATATTCAG ACACTATTCC TGAAGACGTC 9300 

CTTCCTGGAA AATTGATCAT GCAGATCTCT GCTACAGACG CAGACATCCG CTCTAACGCT 9360 

GAAATTACTT ACACGTTATT GGGTTCAGGT GCAGAAAAAT TCAAACTAAA TCCAGACACA 9420 

GGTGAACTGA AAACGTCAAC CCCCCTTGAT CGTGAGGAGC AAGCTGTTTA TCATCTTCTC 9480 

GTCAGGGCCA CAGATGGAGG AGGAAGATTC TGCCAAGCCA GTATTGTCGT CACGCTAGAA 9540 

GATGTGAACG ATAACGCCCC CGAATTCTCT GCCGATCCTT ATGCCATCAC CGTGTTTGAA 9600 

AACACAGAGC CGGGAACGCT GCTGACAAGA GTGCAGGCCA CAGATGCOGA CGCAGGATTA 9660 

AATCGGAAGA TTTTATACTC ACTGATTGAC TCTGCTGATG GGCAGTTCTC CATTAACGAA 9720 

TTATCTGGAA TTATTCAGTT AGAAAAACCT TTGGACAGAG AACTCCAGGC AGTATACACC 9780 

CTCTCTTTGA AAGCTGTGGA TCAAGGCTTG CCAAGGAGGC TGACTGCCAC TGGCACTGTG 9840 

ATTGTATCAG TTCTTGACAT AAATGACAAC CCCCCTGTGT TTGAGTACCG TGAATATGGT 9900 

GCCACCGTGT CTGAGGACAT TCTTGTTGGA ACTGAAGTTC TTCAAGTGTA TGCAGCAAGT 9960 

CGGGATATTG AAGCAAATGC AGAAATCACC TACTCAATAA TAAGTGGAAA TGAACATGGG 10020 

AAATTCAGCA TAGATTCTAA AACAGGGGCC GTATTTATCA TTGAGAATCT GGATTATGAG 10080 
AGCTCTCATG AGTATTACCT AACAGTAGAG GCCACTGATG GAGGCACGCC TTCACTGAGC 10140 
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GACGTTGCCA CTGTGAACGT TAATGTAACA GATATCAAOG ATAATACCCC TGTGTTCAGC 10200 

CAAGACACCT ACACGACAGT CATCAGTGAA GATGCCGTTC TTGAGCAGTC TGTCATCA03 10260 

GTTATGGCCG ATGATGCCGA TGGACCTTCC AACAGCCACA TCCACTACTC AATTATAGAT 10320 

OGCAACCAAG GAAGCTCGTT CACAATTGAC CCOGTCAGGG GAGAAGTCAA AGTGACCAAA 10380 

5 CTTCTCGACC GAGAAACGAT TTCAGGTTAC ACGCTCACGG TTCAAGCTTC TGATAATGGC 10440 

AGTCCACCCA GAGTCAACAC GACGACCGTG AACATCGATG TGTCCGATGT CAATGACAAC 10SOO 

GCGCCCGTCT TCTCCAGGGG AAACTACAGT GTCATTATCC AGGAAAATAA GCCAGTGGGC 10560 

TTCAGCGTGC TGCAGCTGGT AGTAACAGAT GAGGATTCTT CCCATAACGG TCCACCCTTC 10620 

TTCTTTACTA TTGTAACTGG AAATGATGAG AAGGCTTT TG AAGTTAACCC GCAAGGAGTC 10680 

10 CTCCTGACAT CATCTGCCAT CAAGAGGAAG GAGAAAGATC ATTACTTACT GCAGGTGAAG 10740 

GTGGCAGATA ATGGAAAGCC TCAGTTGTCA TCTTTGACAT ACATTGACAT TAGGGTAATT 10800 

GAGGAGAGCA TCTATCCGCC TGCGATTTTG CCCCTGGAGA TTTTCATCAC CTCTTCTGGA 10860 

GAAGAATACT CAGGTGGCGT CATTGGGAAG ATCCATGCCA CAGACCAGGA CGTGTATGAT 10920 

ACTCTAACCT ACAGTCTCGA CCCTCAGATG GACAACCTGT TCTCTGTTTC CAGCACAGGG 10980 

15 GGCAAGCTGA TAGCACACAA AAAGCTAGAC ATAGGGCAAT ACCTTCTCAA TGTCAGCGTA 11040 

ACAGATGGGA AGTTCACGAC GGTGGCCGAC ATCACAGTGC ATATCAGACA AGTCACACAG 11100 

GAGATGTTGA ACCACACCAT CGCGATCCGC TTTGCCAACC TCACTCCGGA AGAATTCGTT 11160 

GGTGACTACT GGCGCAACTT CCAGCGAGCT TTACGGAACA TOCTGGGTGT GAGGAGGAAC 11220 

GACATACAGA TTGTTAGTTT GCAGTCCTCT GAACCTCACC CACATCTGGA CGTCTTACTT 11280 

20 TTTGTAGAGA AACCAGGTAG TGCTCAGATC TCAACAAAAC AACTTCTGCA CAAGATTAAC 11340 

TCTTCOGTGA CTGACATTGA GGAAATCATT GGAGTTAGGA TACTGAATGT ATTCCAGAAA 11400 

CTCTGCGCGG GACTGGACTG CCCCTGGAAG TTCTGCQATG AAAAGGTGTC TGTGGATGAA 11460 

AGTGTGATGT CAACACACAG CACAGCCAGA CTGAGTTTTG TGACTCCCCG CCACCACAGG 11520 

GCAGCGGTGT GTCTCTGCAA AGAGGGAAGG TGCCCACCTG TCCACCATGG CTGTGAAGAT 11580 

25 GATCCGTGCC CTGAGGGATC CGAATGTGTG TCTGATCCCT GGGAGGAGAA ACACACCTGT 11640 

GTCTGTCCCA GCGGCAGGTT TGGTCAGTGC CCAGGGAGTT CATCTATGAC ACTGACTGGA 11700 

AACAGCTACG TGAAATACCG TCTGACGGAA AATGAAAACA AATTAGAGAT GAAACTGACC 11760 

ATGAGGCTCA GAACATATTC CAOGCATGCG GTTGTCATGT ATGCTCGAGG AACTGACTAT 11820 

A AGCATCTTGG AGATTCATCA TGGAAGGCTG CAGTACAAGT TTGACTGTGG AAGTGGCCCT 11880 

30 GGAATTGTCT CTGTTCAGAG CATTCAGGTC AATGATGGGC AGTGGCACGC AGTGGCCCTG 11940 

GAAGTGAATG GAAACTATGC TOGCTTGGTT CTAGACCAAG TTCATACTGC ATCGGGCACA 12000 

GCCCCAGGGA CTCTGAAAAC CCTGAACCTG GATAACTATG TGTTTTTTGG TGGCCACATC 12060 

CGTCAGCAGG GAACAAGGCA TGGAAGAAGT CCTCAAGTTG GTAATGGTTT CAGGGGTTGT 12120 

ATGGACTCCA TTTATTTGAA TGGGCAGGAG CTCCCTTTAA ACAGCAAACC CAGAAGCTAT 12180 

35 GCACACATCG AAGAGTCGGT GGATGTATCT CCAGGCTGCT TCCTGAOGGC CACGGAAGAC 12240 

TGCGCCAGCA ACCCTTGCCA GAATGGAGGC GTTTGCAATC CGTCACCTGC TGGAGGTTAT 12300 

TACTGCAAAT GCAGTGCCTT GTACATAGGG ACCCACTGTG AGATAAGCGT CAATCCGTGT 12360 

TCCTCCAACC CATGCCTCTA TGGGGGCAOG TGTGTTGTCO ACAACGGAGG CTTTGTTTGC 12420 

CAGTGTAGAG GATTATATAC TGGTCAGAGG TGTCAGCTTA GTCCATACTG CAAAGATGAA 12480 

40 CCCTGTAAGA ATGGCGGAAC ATGCTTTGAC AGTTTGGATG GCGCCGTTTG TCAGTGTGAT 12540 

TCGGGTTTTA GGGGAGAAAG GTGTCAGAGT GATATCGACG AGTGCTCTGG AAACCCTTGC 12600 

CTGCACGGGG CCCTCTGTGA GAACACGCAC GGCTCCTATC ACTGCAACTG CAGCCACGAG 12660 

TACAGGGGAC GTCACTGCGA GGATGCTGCG CCCAACCAGT ATGTGTCCAC GCCGTGGAAC 12720 

ATTGGGTTGG CGGAAGGAAT TGGAATCGTT GTGTTTGTTG CAGGGATATT TTTACTGGTG 12780 

45 GTGGTGTTTG TTCTCTGCCG TAAGATGATT AGTCGGAAAA AGAAGCATCA GGCTGAACCT 12840 

AAAGACAAGC ACCTGGGACC CGCTACGGCT TTCTTGCAAA GACCGTATTT TGATTCCAAG 12900 

CTAAATAAGA ACATTTACTC AGACATACCA CCCCAGGTGC CTGTCCGGCC TATTTCCTAC 12960 

ACCCCGAGTA TTCCAAGTGA CTCAAGAAAC AATCTGGACC GAAATTCCTT CGAAGGATCT 13020 

GCTATCCCAG AGCATCCCGA ATTCAGCACT TTTAACCCCG AGTCTGTGCA CGGGCACCGA 13080 

50 AAAGCAGTGG CGGTCTGCAG CGTGGCGCCA AACCTGCCTC COCCACCCCC TTCAAACTCC 13140 

CCTTCTGACA GCGACTCCAT CCAGAAGCCT AGCTGGGACT TTGACTATGA CACAAAAGTG 13200 

GTGGATCTTG ATCCCTGTCT TTCCAAGAAG CCTCTAGAGG AAAAGCCTTC CCAGCCATAC 13260 

AGTGCCCGGG AAAGCCTGTC TGAAGTGCAG TCCCTGAGCT CCTTCCAGTC CGAATCGTGC 13320 

GATGACAATG GGTATCACTG GGATACATCA GATTGGATGC CAAGCGTTCC TCTGCCGGAC 13380 

55 ATACAAGAGT TCCCCAACTA TGAGGTGATT GATGAGCAGA CACCCCTGTA CTCAGCAGAT 13440 

CCAAACGCCA TCGATACGGA CTATTACCCT GGAGGCTACG ACATCGAAAG TGATTTTCCT 13500 

CCACCCCCAG AAGACTTCCC CGCAGCTGAT GAGCTACCAC CGTTACCGCC CGAATTCAGC 13560 

AATCAGTTTG AATCCATCCA CCCTCCTAGA GACATGCCTG CCGCGGGTAG CTTGGGTTCT 13620 

TCATCAAGAA ACCGGCAGAG GTTCAACTTG AATCAGTATT TGCCCAATTT TTATCCCCTC 13680 

60 GATATGTCTG AACCTCAAAC AAAAGGCACT GGTGAGAATA GTACTTGTAG AGAACCCCAT 13740 

GCCCCTTACC CGCCAGGGTA TCAAAGACAC TTCGAGGCGC CCGCTGTCGA GAGCATGCCC 13800 

ATGTCTGTGT ACGCCTCCAC CGCCTCCTGC TCTGACGTGT CAGCCTGCTG CGAAGTGGAG 13860 

TCCGAGGTCA TGATGAGTGA CTATGAGAGC GGGGACGACG GCCACTTCGA AGAGGTGACG 13920 

- ATCCCGCCCC TGGATTCCCA GCAGCACAOG GAAGTCTGAC TCTCAACTCC COCCAAAGTG 13980 

65 CCTGACTTTA GTGAACCTAG AGGTGATGTG AGTAATCCGC GCTGTTCTTT GCAGCAGTGC 14040 

TTCCAAGCTT TTTTTGGTGA GCCGAATGGG CATGGCTGCG CTGGATCCTG CGCCTCTGGA 14100 

CGTGCTAGCC ATTTCCAGTG TCCCAACTAC TGTCATCGTG AGGTTTTCAT OGGCTGTGCC 14160 

ATTTCCCAAC GTCTTTTGGG ATTTACATCT GTCTGTGTTA AAATAATCAA ACGAAAAATC 14220 

AGTCCTGTGT TGTCAGCATG ATTCATGTAT TTATATAGAT TTGATTATTT TAATTTTCCT 14280 

70 GTCTCTTTTT TTTGTAAATT TTATGTACAG ATTTGATTTT TCATAGTTTT AACTAGATTT 14340 

CCAAGATATT TTGTGCATTT GTTTCAACTG AATTTTGGTG GTGTCAGTGC CATTATCTAG 14400 

CACCCTGATT TTTTTTTTTT TACTATAACC AGGGTTTCAT TCTGTCTTTT TCCACTGAAG 14460 

TGTGACATTT TGTTAGTACA TTTCAGTGTA GTCATTCATT TCTAGCTGTA CATAGGATGA 14520 

_ AGGAGAGATC AGATACATGA ACATGTCTTA CATGGGTTGC TGTATTTAGA ATTATAAACA 14580 

75 TTTTTCATTA TTGGAAAGTG TAACGGGGAC CTTCTGCATA CCTGTTTAGA ACCAA AACCA 14640 

CCATGACACA GTTTTTATAG TGTCTGTATA TTTGTGATGC AATGGTCTTG TAAAGGTTTT 14700 

TAATGAAAAC TACCATTAGC CAGTCTTTCT TACTGACAAT AAATTATTAA TAAAAT 14756 
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GTGGTGTTTG 
CTGAAGACAA 
CACGATGCAT 
TGCTTTACTT 
TGTGGTTCTT 
TCATTGGGCT 
GATGTGCGAT 
GTGTCATTGT 
AGTGGAACTA 
CCGAGTGCAC 
TGGCTCTTGG 
GAGGCATATG 
CCAGGACAGA 
ATTTGTAAAA 
TAAAGACTGG 
GTTTATTTGT 
TTACAGACTG 
AAATAACATT 
AGTTGCTTTT 
CACTGCTTGT 
AACTGCCTTG 
TGTTCCAATC 
GAGATTCAAA 
ATCATGTGTT 
AATAAAGCAG 
AGGAAATACC 
AAAAAAAAAA 



CTTTCTCCAC 
AGAGAAGGGG 
OGGACATTCT 
TCCCAATGGG 
TTCTGGCATC 
GGAACAGGAT 
GCTTTCTTCT 
GGCAGCCCTT 
CACCTTTGCC 
TGAACCCAAG 
TGGAATTGAA 
TG6 CTTTT 6C 
GCCACAATCT 
CTTTGTATTA 
CATCTTCACA 
TTTTGTTTTT 
AGTGACAGTA 
CCAATCACTA 
TATAAGACCA 
ATGATGTTTC 
TGTTCTGTGA 
CAAATGAATG 
TTTTTCTAAC 
TAAAAAAAGA 
TTGATCAGCA 
CTCAAAACTA 
AAAAAAAAAA 



CAGAAGGGCA 
GAGAAAACCT 
CTGGTGGGGC 
GAAACAAAGT 
GTAGGAGGTG 
GACTGCTGTG 
GTATTGGCTG 
GGCTTAGCAG 
AGCACCGAGG 
CACATTGTGG 
TTCATCTTGT 
TGCTCTCACC 
TCCTCTATTT 
GTGTAACATA 
GGATGTCAGT 
TTTTAAGGAA 
CTCAGTATAT 
TTGTATATAT 
AGAAGGAGAA 
CCATTCATAC 
GAAACAAATA 
CATCACAACT 
ATATGGAAAG 
AAGGCTACGA 
TCATTGGAAC 
ACTTGTTTAC 
AAA 



CACTTTCATC 
AGCAGACCAC 
TCGCCCTCCT 
ATGCCTCCGA 
GCCTGCTGAT 
GCTGCTGTGG 
CTCTCATTGG 
AAGGACCACT 
GCCAGTACCT 
AATGGAATGT 
GTCTTATTCA 
AACAGCAATA 
CATTGTAATT 
CTCCCCACAG 
GTTTAAATTT 
TGAGGAAACA 
CTGAGATAAA 
GTGCATGTAT 
AATCCGACAA 
ACCTATAAAT 
TTTACTTAGA 
TACAATGCTG 
CCTTTTGTCC 
TGACTGGGCA 
ATGGGGACGA 
AACAAAATAA 



TAATTTGGGG 
CATGTGCTAT 
GTGCATCGCG 
AAACCACCTC 
GCTCCTGCCA 
CCATGAAAAC 
AATTGCAGGA 
ATGTCTTGAT 
TCTGGATACC 
ATCTCTGTTT 
AGTAATAAAT 
TGACTGCTAA 
TATATATTTC 
TCTACTTTTA 
AGTAAACTTC 
AACCACCCTC 
CTCTATAATG 
TTTTTAAATT 
CCTGGAAAGA 
CTCTAACAAG 
GTGGAAGGAC 
CTCATTGTTG 
TCCAAAGATG 
AGAAGAAAGA 
GTGACGGCAG 
AGTATTCACT 



TATCACTGAG 
GGGAAG7GTG 
GCTAATATTT 
AGCCGCTTCG 
GCATTT6TCT 
TGTGGCAAAC 
TCTGGCTACT 
TCCCTOGGCC 
TCCACATGGT 
TCTATCCTCT 
GGAGTGCTTG 
AAGAACCAAC 
ACTTGTATTC 
CAAACGCCTG 

TGGGGGTAGT 
TTTTGGATAA 
AAAGATGTCT 
TTTTTGTTTT 
AGGCCCTTTG 
TGATTGAGAA 
TGAGTACTAT 
AGTACTAGGG 
TGGGAAACTG 
GAGGACCACG 
ACGAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

iaao 

1440 
1500 
1560 
1583 



Seq ID NO; CI 6 2 DNA Sequence 

Nucleic Acid Accession ft: NM_003 759.1 

Coding sequence: 150 .. 3257 " 

1 11 21 31 41 51 

I I I I I I 

GTTCTTTGTG ACACATCACA CAGAATTGGA GTGCTGTCCT TCTGGAGAGT QGTGGAGAAC 60 

CAAGATACAG TTCAGAACCA AAGGAATAGA GAAGGGCTTT GATTTCTTTT TGGCTTTAGA 120 

TTGGGGATTT GGGAGGCTTA GCAGGAAAGA TGTCCACTGA AAATGTGGAA GGGAAGCCCA 180 

GTAACCTTGG GGAGAGAGGA AGAGCCCGGA GCTCCACTTT CCTCAGGGTT GTCCAGCCAA 240 

TGTTTAACCA CAGTATTTTC ACTTCTGCAG TCTCTCCTGC TGCAGAACGC ATCCGATTCA 300 

TCTTGGGAGA GGAGGATGAC AGCCCAGCTC CCCCTCAGCT CTTCACGGAA CTGGATGAGC 360 

TGCTGGCCGT GGATGGGCAG GAGATGGAGT GGAAGGAAAC AGCCAGGTGG ATCAAGTTTG 420 

AAGAAAAAGT GGAACAGGGT GGGGAAAGAT GGAGCAAGCC CCATGTGGCC ACATTGTCCC 480 

TTCATAGTTT ATTTGAGCTG AGGACATGTA TGGAGAAAGG ATCCATCATG CTTGATCGGG 540 

AGGCTTCTTC TCTCCCACAG TTGGTGGAGA TGATTGTTGA CCATCAGATT GAGACAGGCC 600 

TATTGAAACC TGAACTTAAG GATAAGGTGA CCTATACTTT GCTCCGGAAG CACCGGCATC 660 

AAACCAAGAA ATCCAACCTT CGGTCCCTGG CTGACATTGG GAAGACAGTC TCCAGTGCAA 720 

GTAGGATGTT TACCAACCCT GATAATGGTA GCCCAGCCAT GACCCATAGG AATCTGACTT 780 

CCTCCAGTCT GAATGACATT TCTGATAAAC CGGAGAAGGA CCAGCTGAAG AATAAGTTCA 840 

TGAAAAAATT GCCACGTGAT GCAGAAGCTT CCAACGTGCT TGTTGGGGAG GTTGACTTTT 900 

TGGATACTCC TTTCATTGCC TTTGTTAGGC TACAGCAGGC TGTCATGCTG GGTGCCCTGA 960 

CTGAAGTTCC TGTGCCCACA AGGTTCTTGT TCATTCTCTT AGGTCCTAAG GGGAAAGCCA 1020 

AGTCCTACCA CGAGATTGGC AGAGCCATTG CCACCCTGAT GTCTGATGAG GTGTTCCATG 1080 

ACATTGCTTA TAAAGCAAAA GACAGGCACG ACCTGATTGC TGGTATTGAT GAGTTCCTAG 1140 

ATGAAGTCAT CGTCCTTCCA CCTGGGGAAT GGGATCCAGC AATTAGGATA GAGCCTCCTA 1200 

AGAGTCTTCC ATCCTCTGAC AAAAGAAAGA ATATGTACTC AGGTGGAGAG AATGTTCAGA 1260 

TGAATGGGGA TACGCCCCAT GATGGAGGTC ACGGAGGAGG AGGACATGGG GATTGTGAAG 1320 

AATTGCAGCG AACTGGAOGG TTCTGTGGTG GACTAATTAA AGACATAAAG AGGAAAGCGC 1380 

CATTTTTTGC CAGTGATTTT TATGATGCTT TAAATATTCA AGCTCTTTCG GCAATTCTCT 1440 

TCATTTATCT GGCAACTGTA ACTAATGCTA TCACTTTTGG AGGACTGCTT GGGGATGCCA 1500 

CTGACAACAT GCAGGGCGTG TTGGAGAGTT TCCTGGGCAC TGCTGTCTCT GGAGCCATCT 1560 

TTTGCCTTTT TGCTGGTCAA CCACTCACTA TTCTGAGCAG CACCGGACCT GTCCTAGTTT 1620 

TTGAGAGGCT TCTATTTAAT TTCAGCAAGG ACAATAATTT TGACTATTTG GAGTTTCGCC 1680 

TTTGGATTGG CCTGTGGTCC GCCTTCCTAT GTCTCATTTT GGTAGCCACT GATGCCAGCT 1740 

TCTTGGTTCA ATACTTCACA CGTTTCACGG AGGAGGGCTT TTCCTCTCTG ATTAGCTTCA 1800 

TCTTTATCTA TGATGCTTTC AAGAAGATGA TCAAGCTTGC AGATTACTAC CCCATCAACT 1860 

CCAACTTCAA AGTGGGCTAC AACACTCTCT TTTCCTGTAC CTGTGTGCCA CCTGACCCAG 1920 

CTAATATCTC AATATCTAAT GACACCACAC TGGCCCCAGA GTATTTGCCA ACTATGTCTT 1980 

CTACTGACAT GTACCATAAT ACTACCTTTG ACTGGGCATT TTTGTCGAAG AAGGAGTGTT 2040 

CAAAATACGG AGGAAACCTT GTCGGGAACA ACTGTAATTT TGTTCCTGAT ATCACACTCA 2100 

TGTCTTTTAT CCTCTTCTTG GGAACCTACA CCTCTTCCAT GGCTCTGAAA AAATTCAAAA 2160 

CTAGTCCTTA TTTTCCAACC ACAGCAAGAA AACTGATCAG TGATTTTGCC ATTATCTTGT 2220 

CCATTCTCAT CTTTTGTGTA ATAGATGCCC TAGTAGGCGT GGACACCCCA AAACTAATTG 2280 

TGCCAAGTGA GTTCAAGCCA ACAAGTCCAA ACCGAGGTTG GTTCGTTCCA CCGTTTGGAG 2340 

AAAACCCCTG GTGGGTGTGC CTTGCTGCTG CTATCCCGGC TTTGTTGGTC ACTATACTGA 2400 

TTTTCATGGA CCAACAAATT ACAGCTGTGA TTGTAAACAG GAAAGAACAT AAACTCAAGA 2460 

AAGGAGCAGG GTATCACTTG GATCTCTTTT GGGTGGCCAT CCTCATGGTT ATATGCTCCC 2520 

TCATGGCTCT TCCGTGGTAT GTAGCTGCTA CGGTCATCTC CATTGCTCAC ATCGACAGTT 2580 

TGAAGATGGA GACAGAGACT TCTGCACCTG GAGAACAACC AAAGTTTCTA GGAGTGAGGG 2640 

AACAAAGAGT CACTGGAACC CTTGTGTTTA TTCTGACTGG TCTGTCAGTC TTTATGGCTC 2700 

CCATCTTGAA GTTTATACCC ATGCCTGTAC TCTATGGTGT GTTCCTGTAT ATGGGAGTAG 2760 

CATCCCTTAA TGGTGTGCAG TTCATGGATC GTCTGAAGCT GCTTCTGATG CCTCTGAAGC 2820 

ATCAGCCTQA CTTCATCTAC CTGCGTCATG TTCCTCTGCG CAGAGTCCAC CTGTTCACTT 2880 

TCCTGCAGGT GTTGTGTCTG GCCCTGCTTT GGATCCTCAA GTCAAOGGTG GCTGCTATCA 2940 
TTTTTCCAGT AATGATCTTG GCACTTGTAG CTGTCAGAAA AGGCATGGAC TACCTCTTCT 3000 
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CCCAGCATGA CCTCAGCTTC CTGGATGATG TCATTCCAGA AAAGGACAAG AAAAAGAAGG 3060 

AGGATGAGAA GAAAAAGAAA AAGAAGAAGG GAAGTCTGGA CAGTGACAAT GATGATTCTG 3120 

ACTGCCCATA CTCAGAAAAA GTTCCAAGTA TTAAAATTCC AATGGACATC ATGGAACAGC 3180 

AACCTTTCCT AAGCGATAGC AAACCTTCTG ACAGAGAAAG ATCACCAACA TTCCTTGAAC 3240 

5 GCCACACATC ATGCTGATAA AATTCCTTTC CTTCAGTCAC TCGGTATGCC AAGTCCTCCT 3300 

AGAACTCCAG TAAAAGTTGC CTCAAATTAG ACTAGAACTT GAACCTGAAG ACAATGATTA 3360 

TTTCTGGAGG AGCAAGGGAA CAGAAACTAC ATTGTAACCT GTTTGTCTTT CTTAAAACTG 3420 

ACATTTGTTG TTAATGTCAT TTGTTTTTGT TTGGCTGTTT GTTTATTTTT TAACTTTTAT 3480 

TTCGTCTCAG TTTTTGGTCA CAGGCCAAAT AATACAGCGC TCTCTCTGCT TCTCTCTTGC 3540 

10 ATAGATACAA TCAAGACAAT AGTGCACCGT TCCTTAAAAA CAGCATCTGA GGAATCCCCC 3600 

TTTTGTTCTT AAACTTTCAG ATGTGTCCTT TGATAACCAA ATTCTGTCAC TCAAGACACA 3660 

GACACCCACA GACCCTGTCC TTTGCCTCTA TTAAGCAGAG GATGGAAGTA TTAAGGATTT 3720 

TGTAACACCT TTTATGAAAA TGTTGAAGGA ACTTAAAACT TTAGCTTTGG AGCTGTGCTT 3780 

ACTGGCTTGT CTTTGTCTGG TAGAACAAAC CTTGACCTCC AGACAGAGTC CCTTCTCACT 3840 

15 TATAGAGCTC TCCAGGACTG GAAAAAGTGC TGCTATTTTA ACTTGCTCTT GCTTGTAAAT 3900 

CCTAATCTTA GAGTTATCAA AAGAAGAAAA AACTGAAGGT ACTTTACTCC CTATAGAGAA 3960 

ACCATTGCCA TCATTGTAGC AAGTGCTGGA ATGTOCCTTT TTTCCTATGC AACTTTTTTA 4020 

TAACCCTTTA ATGAACTTAT CTGTGGAGTA CATTGAAGAA TATTTTTCTT CCTAGATTTT 4080 

GTTGTTTAAA TTATGGGGCC TAACCTGCCA CTTATTTTTT GTCAATTTTT AAAACTTTTT 4140 

20 TTTAATTACT GTAAAGAAAA TGAATTTTTT CCTGCAGCAG GAAACATAGT TTTCAGTAGT 4200 

TCTACCTCTT ATTTGTAGCT GCCAGGCTTT CTGTAAAAAT TGTATTGTAT ATAATGTGAT 4260 

TTTTACACAT ACATACACAC ACAAATACAC AATCTCTAGG GTAAGCCAGA AGGCAAGATC 4320 

AGATTAAAAA. CACCATGTTT CTAAGCATCC ATTTTTCCCT TTCTTTAAAA GAAACTTAAC 4380 

TGTTCTATGA AGGAGATTGA GGGAGAAGAG ACAAACTCCT ATGTCATGAG AATAACCGAT 4440 

25 GTTCTGATAA TAGTAGCATC TAGGTACAGA TGCTGGTTGT ATTACCACGT CAATGTCCTA 4500 

TGCAGTATTG TTAGACATTT TCTCATTTTG AAATATTTGT GTGTTTGTGT ATGTGCTCTG 4560 

TGCCATGGCT GGTGTATATA TGTGCAATGT TAGAAGGCAA AAGAGTGATG GTAGGCAGAG 4620 

GGCAAAGTCA TTGAATCTCT TATGCCAGTT TTCATAAAAC CCAAACCACA TATGA AAAAA 4680 

TCCATTAAGG GTCCAAGAAG TCTGTCCATA TGAAAATGAG GGTAAATATA GTTTATTTCC 4740 

30 CAGGTATCAG TCATTATAAT TGATATAATA GCTCTAACAT GCAATATAAA ATTCATAGGA 4 BOO 

GTATTAATAG CCCATTTACA CATCTATAAA ATGTAATGGG ATTGCAGAGC TGCAGAGTAC 4860 

AGTGTAACAG TACTCTCATG CAATTTTTTT CAGGATGCAA AGGCAATTAT TCTTTGTAAG 4920 

CGGGACATTT AGATATATTT GTGTACATAT TATATGTATG TATATTTCAA AGTACCACAC 4980 

TGAAAATTAG ACATTTATTA ACCAAATTTA ACGTGGTATT TAAAGGTAAT ATTTTTAATA 5040 

35 TGATACATTA CATATTGTGA ATGTATACTA AAAAAACATT TTAAATGTTA AAATTATAAT 5100 

TTCAGATTCA TATAACCACA ACTGTGATAT ATCCTAACTA TAACCAGTTG TTGAGGGGTA 5160 

TACTAGAAGC AGAATGAAAC CACATTTTTT GGTTTGATAA TATGCACTTA TTGACTCCCA 5220 

CTCATTGTTA TGTTAATTAA GTTATTATTC TGTCTCCTTG TAATTTTGAT TACAAAAATT 52 BO 

TTATTATCCT GAGTTAGCTG TTACTTTTAC AGTACCTGAT ACTCCTAAAA CTTTTAACTT 5340 

40 ATACAAATTA GTCAATAATG ACCCCAATTT TTTCATTAAA ATAATAGTGG TGAATTATAT 5400 

GTTATTGTGT TAAAACCTCA CTTGCCAAAT TCTGGCTTCA CATTTGTATT TAGGGCTATC 5460 

CTTAAAATGA TGAGTCTATA TTATCTAGCT TTCTATTACC CXAATATAAA CTGGTATAAG 5520 

AAGACTTTCC TTTTTTCTTT ATGCATGGAA GCATCAATAA ATTGTTTAAA AACCATGTAT 5580 

AGTAAATTCA GCTTAACCCG TGATCTTCTT AAGTTAAAGG TACTTTTGTT TTATAAAAGC 5640 

45 TCTAGATAAA ACTTTCTTTT CTGATCATGA ATCAAGTATC TGTGGTTTCA TGCCCCTCTC 5700 

TATACCTTTC AAAGAACTCC TGAAGCAACT TAACTCATCA TTTCAGCCTC TGAGTAGAGG 5760 

TAAAACCTAT GTGTACTTCT GTTTATGATC CATATTGATA TTTATGACAT GAACACAGAA 5820 

TAGTACCTTA CATTTGCTAA ACAGACAGTT AATATCAAAT CCTTTCAATA TTCTGGGAAC 5880 

CCAGGGAAGT TTTTAAAAAT GTCATTACTT TCAAAGGAAC AGAAGTAGTT AACCAAACTA 5940 

50 ACAAGCAAAA CCTGAGGTTT ACCTAGTGAC ACCAAATTAT CGGTATTTTA ACTGAATTTA 6000 

CCCATTGACT AAGAATGAAC CGGATTTGGT GGTGGTTTTG TTTCTATGCA AACTGGACAC 6060 

AAATTACAAC AGTAAATTTT TTTATAAGTG CTTCTCCCTT CTCCATGATG TGACTTCCGG 6120 

AGATAAAGGA TTCAAAAGAT AAAGACAAAG TACGCTCAGA GTTGTTAACC AGAAAGTCCT 6180 

GGCTGTGGTT GCAGAAACAC TGTTGGAAGA AAAGAGATGA CTAAGTCAAG TGTCTGCCTT 6240 

55 ATCAAAAGAG CAAAAATGCC TCTGGTTTTG TGTTTGGGAG AAAAATATCT TGGACGCACT 6300 

GTTTTCCTTG ATAAAAGTCA TCTTCTCTAC TGTGTGAAAT GAATACTTGG AATTCTAATT 6360 

GTTTTGTGTG CCAGGGGCAG TAATGTCCCT GCCTCTTCTC CCAATCAAGG TTGAGGAGTG 6420 

GGGCTGGGGA GAGGACTTAA CTGACTTAAG AAGTAGGAAA ACAAAAACCT CTCTCCTCAG 6480 

CCTTCCACCT CCAAGAGAGG AGGAAAAACA GTTGTCTGCT GTCTGTAATT CAGTTTGCGT 6S40 

OO GTATTTTATG CTCATGCACC AACCCATACA GAGTAAATCT TTTATCAACT ATATACTGGT 6600 

GTTTAATAGA GAATGATTGT CTTCCGAGTT TTTTGGTTCC TTTTTTAACT GTGTTAAAGT 6660 

ACTTGAAATG TATTGACTGC TGACTATATT TTAAAAACAA AATGAAATAA TTTGAGTTGT 6720 

ATTACAGAGG TTGACATTGT TCAGGGATGG GACAAAGCCT TCTTCAATCC TTTTCATACT 6780 

ACTTAATGAT TTTGGTGCAG GAACCTGAGA TTTTCTGATT TATATTTCAT GATATTTCAC 6840 

65 ATTTGCTCTT CACAGCATGA GCATGAAGCC CAGTGGCACC AAATGGCTGG GTACAATCAA 6900 

GTGATATTTT GTAGCACCTC ACTATCTGAA AGGCCATGAG TTTTCAGATG ATTTCATTGA 6960 

GCTTCATTGC AGCCTGAAAT TTTAAAAAAG TTGTGTAATA CGCCAACCAG TCAAGTTGTG 7020 

TTTTGGCCAG AGATTTAGAT ATGTCCAATT TCCTGGCTCA TTTCATTGTG CTCTATGGGT 7080 

ACGTATAAAA AGCAAGAATT CTGTTTCCTA GGCAAACATT GCAACTCAGG GCTAAAGTCA 7140 

70 TCCAGTGAAA CTTTTAGAGC CAGAAGTAAC TTTGTCCCAG TCCTACAATG TGAAAAGAGT 7200 

GAATAGTTGC CTCTTTTTAG CCATTTTCAT GGCTGGTACA TATTCGTACG CATTACTTTT 7260 

CAGAATCAAT ACGCACTTTC AGATATTCTT ATTTTTATTC TCTTAAGTCT TTATTAACTT 7320 
TGGAGAGAGA AATGATGCAT CTTTTTATTT TAAATGAAGT AGATCAACAT GGTGGAACAA 7380 
AATGATAAAG AACAGAAAAC ATTTCAATAT ATTACTAATA ACTTTTTCCA ATATAAATCC 7440 
75 TAAAATTCCT ATAACATAGT ATTTTACAGT TTTATGAAGC TTTCTATTGT GACTTTTATG 7500 
GAATTAAGAG ATGAAGAAGA TGAGATATTT TAGCATTTAT ATTTTTCAAA ATTATATGTA 7560 
TACTTAAAAA TAAAGTAACT TTATGC 7586 

Seq ID NO i CI 6 3 DMA Sequence 
80 Nucleic Acid Accession ft: NMJ)00958 
Coding sequence: 389.. 1855 
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CGGCACAGCC TCACACCTGA AOGCTGTCCT CCCGCAGACG AGACCGGCGG GCACTGCAAA 60 

GCTGGGACTC GTCTTTGAAG GAAAAAAAAT AGOGAGTAAG AAATCCAGCA CCATTCTTCA 120 

CTGACCCATC CCGCTGCACC TCTTGTTTCC CAAGTTTTTG AAAGCTGGCA ACTCTGACCT 180 

CGGTGTCCAA AAATCGACAG CCACTGAGAC CGGCTTTGAG AAGCCGAAGA TTTGGCAGTT 240 

TCCAGACTGA GCAGGACAAG GTGAAAGCAG GTTGGAGGCG GGTCCAGGAC ATCTGAGGGC 300 

TGACCCTGGG GGCTCGTGAG GCTGCCACCG CTGCTGCCGC TACAGACCCA GCCTTGCACT 360 

CCAAGGCTGC GCACCGCCAG CCACTATCAT GTCCACTCCC GGGGTCAATT CGTCCGCCTC 420 

CTTGAGCCCC GACCGGCTGA ACAGCCCAGT GACCATCCCG GCGGTGATGT TCATCTTCGG 480 

GGTGGTGGGC AACCTGGTGG CCATCGTGGT GCTGTGCAAG TCGCGCAAGG AGCAGAAGGA S40 

GACGACCTTC TACACGCTGG TATGTGGGCT GGCTGTCACC GACCTGTTGG GCACTTTGTT 600 

GGTGAGCCCG GTGACCATCG CCAOGTACAT GAAGGGCCAA TGGCCCGGGG GCCAGCCGCT 660 

GTGCGAGTAC AGCACCTTCA TTCTGCTCTT CTTCAGCCTG TCCGGCCTCA GCATCATCTG 720 

CGCCATGAGT GTCGAGCGCT ACCTGGCCAT CAACCATGCC TATTTCTACA GCCACTACGT 780 

GGACAAGCGA TTGGOGGGCC TCACGCTCTT TGCAGTCTAT GCGTCCAACG TGCTCTTTTG 840 

CGCGCTGCCC AACATGGGTC TCGGTAGCTC GCGGCTGCAG TACCCAGACA CCTGGTGCTT 900 

CATCGACTGG ACCACCAACG TGACGGCGCA CGCCGCCTAC TCCTACATGT ACGCGGGCTT 960 

CAGCTCCTTC CTCATTCTCG CCACCGTCCT CTGCAACGTG CTTGTGTGCG GCGCGCTGCT 1020 

CCGCATGCAC CGCCAGTTCA TGCGCCGCAC CTGGCTGGGC ACCGAGCAGC ACCACGCGGC 1080 

CGCGGCCGCC TCGGTTGCCT CCCGGGGCCA CCCCGCTGCC TCCCCAGCCT TGCCGCGCCT 1140 

CAGCGACTTT CGGCGCCGCC GGAGCTTCCG CCGCATCGCG GGCGCCGAGA TCCAGATGGT 1200 

CATCTTACTC ATTGCCACCT CCCTGGTGGT GCTCATCTGC TCCATCCCGC TCGTGGTGCG 1260 

AGTATTCGTC AACCAGTTAT ATCAGCCAAG TTTGGAGCGA GAAGTCAGTA AAAATCCAGA 1320 

TTTGCAGGCC ATCCGAATTG CTTCTGTGAA CCCCATCCTA GACCCCTGGA TATATATCCT 1380 

CCTGAGAAAG ACAGTGCTCA GTAAAGCAAT AGAGAAGATC AAATGCCTCT TCTGCCGCAT 1440 

TGGCGGGTCC CGCAGGGAGC GCTCCGGACA GCACTGCTCA GACAGTCAAA GGACATCTTC 1500 

TGCCATGTCA GGCCACTCTC GCTCCTTCAT CTCCCGGGAG CTGAAGGAGA TCAGCAGTAC 1560 

ATCTCAGACC CTCCTGCCAG ACCTCTCACT GCCAGACCTC AGTGAAAATG GCCTTGGAGG 1620 

CAGGAATTTG CTTCCAGGTG TGCCTGGCAT GGGCCTGGCC CAGGAAGACA CCACCTCACT 1680 

GAGGACTTTG CGAATATCAG AGACCTCAGA CTCTTCACAG GGTCAGGACT CAGAGAGTGT 1740 

CTTACTGGTG GATGAGGCTG GTGGGAGCGG CAGGGCTGGG CCTGCCCCTA AGGGGAGCTC 1800 

CCTGCAAGTC ACATTTCCCA GTGAAACACT GAACTTATCA GAAAAATGTA TATAATAGGC 1860 

AAGGAAAGAA ATACAGTACT GTTTCTGGAC CCTTATAAAA TCCTGTGCAA TAGACACATA 1920 
CATGTCACAT TTAGCTGTGC TCAGAAGGGC TATCATCA 1958 

Seq XD MO: C164 DNA Sequence 

Nucleic Acid Accession #: NM__002659.1 

Coding sequence; 427.. 1434 

1 11 21 31 41 * 51 

I I I I I I 

CAGTATCCCT CCTGACAAAA CTAACAAAAA TCCTGTTAGC CAAATAATCA GCCACATTCA 60 
TATTTACCGT CAAAGTTTTT ATCCTCATTT TACAGCAGTG GAGAGCGATT GCCCCGGGTC 120 
CCACGTTAGG AAGAGAGAGA ACTGGGATTT GCACCCAGGC AATCTGGGGA CAGAGCTGTG 180 
ATCACAACTC CATGAGTCAG GGCCGAGCCA GCCCCTTCAC CACCAGCCGG CCGCGCCCCG 240 
GGAAGGAAGT TTGTGGCGGA GGAGGTTCGT ACGGGAGGAG GGGGAGGCGC CCACGCATCT 300 
GGGGCTGACT CGCTCTTTCG CAAAACGTCT GGGAGGAGTC CCTGGGGCCA CAAAACTGCC 360 
TCCTTCCTGA GGCCAGAAGG AGAGAAGACG TGCAGGGACC CCGCGCACAG GAGCTGCCCT 420 
CGCGACATGG GTCACCCGCC GCTGCTGCCG CTGCTGCTGC TGCTCCACAC CTGCGTCCCA 480 
GCCTCTTGGG GCCTGCGGTG CATGCAGTGT AAGACCAACG GGGATTGCCG TGTGGAAGAG 540 
TGCGCCCTGG GACAGGACCT CTGCAGGACC ACGATCGTGC GCTTGTGGGA AGAAGGAQAA 600 
GAGCTGGAGC TGGTGGAGAA AAGCTGTACC CACTCAGAGA AGACCAACAG GACCCTGAGC 660 
TATCGGACTG GCTTGAAGAT CACCAGCCTT ACCGAGGTTG TGTGTGGGTT AGACTTGTGC 720 
AACCAGGGCA ACTCTGGCCG GGCTGTCACC TATTCCCGAA GCCGTTACCT CGAATGCATT 780 
TCCTGTGGCT CATCAGACAT GAGCTGTGAG AGGGGCCGGC ACCAGAGCCT GCAGTGCCGC 840 
AGCCCTGAAG AACAGTGCCT GGATGTGGTG ACCCACTGGA TCCAGGAAGG TGAAGAAGGG 900 
CGTCCAAAGG ATGACCGCCA CCTCCGTGGC TGTGGCTACC TTCCCGGCTG CCCGGGCTCC 960 
AATGGTTTCC ACAACAACGA CACCTTCCAC TTCCTGAAAT GCTGCAACAC CACCAAATGC 1020 
AACGAGGGCC CAATCCTGGA GCTTGAAAAT CTGCCGCAGA ATGGCCGCCA GTGTTACAGC 1080 
TGCAAGGGGA ACAGCACCCA TGGATGCTCC TCTGAAGAGA CTTTCCTCAT TGACTGCCGA 1140 
GGCCCCATGA ATCAATGTCT GGTAGCCACC GGCACTCACG AACCGAAAAA CCAAAGCTAT 1200 
ATGGTAAGAG GCTGTGCAAC CGCCTCAATG TGCCAACATG CCCACCTGGG TGACGCCTTC 1260 
AGCATGAACC ACATTGATGT CTCCTGCTGT ACTAAAAGTG GCTGTAACCA CCCAGACCTG 1320 
GATGTCCAGT ACCGCAGTGG GGCTGCTCCT CAGCCTGGCC CTGCCCATCT CAGCCTCACC 1380 
ATCACCCTGC TAATGACTGC CAGACTGTGG GGAGGCACTC TCCTCTGGAC CTAAACCTGA 1440 
AATCCCCCTC TCTGCCCTGG CTGGATCCGG GGGACCCCTT TGCCCTTCCC TCGGCTCCCA 1500 
GCCCTACAGA CTTGCTGTGT GACCTCAGGC CAGTGTGCCG ACCTCTCTGG GCCTCAGTTT 1560 
TCCCAGCTAT GAAAACAGCT ATCTCACAAA GTTGTGTGAA GCAGAAGAGA AAAGCTGGAG 1620 
GAAGGCCGTG GG CAATGGGA GAGCTCTTGT TATTATTAAT ATTGTTGCCG CTGTTGTGTT 1680 
GTTGTTATTA ATTAATATTC ATATTATTTA TTTTATACTT ACATAAAGAT TTTGTACCAG 1740 
TGG 1743 

Seq ID NO: CI 65 DNA Sequence 
Nucleic Acid Accession #: AK027843.1 
Coding sequence: 193.. 1731 



1 11 21 31 41 51 

i 1 I I I I 

TTGCTTGAGT CATCTTCTGA AGCTTTAAAA ACAATTGATG AATTGGCCTT CAAGATAGAC 60 

CTAAATAGCA CATCACATGT GAATATTACA ACTCGGAACT TGGCTCTCAG CGTATCATCC 120 

CTGTTACCAG GGACAAATGC AATTTCAAAT TTTAGCATTG GTCTTCCAAG CAATAATGAA 180 

TCGTATTTCC AGATGGATTT TGAGAGTGGA CAAGTGGATC CACTGGCATC TGTAATTTTG 240 

CCTCCAAACT TACTTGAGAA TTTAAGTCCA GAAGATTCTG TATTAGTTAG AAGAGCACAG 300 

TTTACTTTCT TCAACAAAAC TGGACTTTTC CAGGATGTAG GACCCCAAAG AAAAACTTTA 360 

GTGAGTTATG TGATGGCGTG CAGTATTGGA AACATTACTA TCCAGAATCT GAAGGATCCT 420 
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GTTCAAATAA AAATCAAACA TACAAGAACT CAGGAAGTGC ATCATCCCAT CTGTGCCTTC 480 

TGGGATCTGA ACAAAAACAA AAGTTTTGGA GGATGGAACA CGTCAGGATG TGTTGCACAC 540 

AGAGATTCAG ATGCAAGTGA GACAGTCTGC CTGTGTAACC ACTTCACACA CTTTGGAGTT 600 

CTGATGGACC TTCCAAGAAG TGCCTCACAG TTAGATGCAA GAAACACTAA AGTCCTCACT 660 

5 TTCATCAGCT ATATTGGGTG TGGAATATCT GCTATTTTTT CAGCAGCAAC TCTOCTGACA 720 

TATGTTGCTT TTGAGAAATT GCGAAGGGAT TATCCCTCCA AAATCTTGAT GAACCTGAGC 780 

ACAGCCCTGC TGTTCCTGAA TCTCCTCTTC CTCCTAGATG GCTGGATCAC CTCCTTCAAT 840 

GTGGATGGAC TTTGCATTGC TGTTGCAGTC CTGTTGCATT TCTTCCTTCT GGCAACCTTT 900 

ACCTGGATGG GGCTAGAAGC AATTCACATG TACATTGCTC TAGTTAAAGT ATTTAACACT 960 

10 TACATTCGCC GATACATTCT AAAATTCTGC ATCATTGGCT QGGGTTTGCC TGCCTTAGTG 1020 

GTGTCAGTTG TTCTAGCGAG CAGAAACAAC AATGAAGTCT ATGGAAAAGA AAGTTATGGG 1080 

AAAGAAAAAG GTGATGAATT CTGTTQGATT CAAGATCCAG TCATATTTTA TGTGACCTGT 1140 

GCTGGGTATT TTGGAGTCAT GTTTTTTCTG AACATTGCCA TGTTCATTGT GGTAATGGTG 1200 

CAGATCTGTG GGAGGAATGG CAAGAGAAGC AACCGGACCC TGAGAGAAGA AGTGTTAAGG 1260 

15 AACCTGCGCA GTGTGGTTAG CTTGACCTTT CTGTTGGGCA TGACATGGGG TTTTGCATTC 1320 

TTTGCCTGGG GACCCTTAAA TATCCCCTTC ATGTACCTCT TCTCCATCTT CAATTCATTA 1380 

CAAGGCTTAT TTATATTCAT CTTCCACTGT GCTATGAAGG AGAATGTTCA GAAACAGTGG 1440 

CGGCGGCATC TCTGCTGTGG TAGATTTCGG TTAGCAGATA ACTCAGATTG GAGTAAGACA 1500 

GCTACCAATA TCATCAAGAA AAGTTCTGAT AATCTAGGAA AATCTTTGTC TTCAAGCTCC 1560 

20 ATTGGTTCCA ACTCAACCTA TCTTACATCC AAATCTAAAT CCAGCTCTAC CACCTATTTC 1620 

AAAAGGAATA GCCACACAGA TAATGTCTCC TATGAGCATT CCTTCAACAA AAGTGGATCA 1680 

CTCAGACAGT GCTTCCATGG ACAAGTCCTT GTCAAAACTG GCCCATGCTG ATGGAGATCA 1740 

AACATCAATC ATCCCTGTCC ATCAGGTCAT TGATAAGGTC AAGGGTTATT GCAATGCTCA 1800 

TTCAGACAAC TTCTATAAAA ATATTATCAT GTCAGACACC TTCAGCCACA GCACAAAGTT 1860 

25 TTAATGTCTT TAAGAAAAAG AAATCAATCT GCAGAAATGT GAAGATTTGC AAGCAGTGTA 1920 

AACTGCAACT AGTGATGTAA ATGTGCTATT ACCTAGGTAA CTGCATATAT ATAAGGAATG 1980 

TATTTTGTTA AGAAGGCTTT TGTGAAATTC AGAATTTTTC TTTTTAATAT ATTTCTTCCA 2040 

TGGAAGAGTT GTCATCACTA AAACTTCAGT ACTGAGAGTA ACATGACTCA GTAGCCACAG 2100 

AAGCTATGAT TTGTAAAATA TATAATTGAA TCAGAGTAAT CATAATGCAG GGGAGACATT 2160 

30 CAAATTAGAG ACAAGGGAGA AGCAATGCTG AGGAAGACCC TAGATAGAGC TCATTTTACT 2220 

CCACCTAATC GTTATATCTG GATATACCCA TTTTCTGCAT CTTCTTTCTC AACAATAAAC 2280 

TGTCCTTGCT TTGGAGACTT TAAGACATTT CCTAAAGCAC AAATAAAAGC CTCGTATTTC 2340 

CCCATTGAGA GTTTTGTTCC AAGGAATATG AAGTGAGACA TATGGGTGAG TCATAATAAT 2400 

CAAAATAATT TATGAAGAGC TGGGTCTGCA ATAGCTAGTC TAAAAACTAC TTGTGTGTCA 2460 

35 GTCCTCTGGT TATAGTATAT AAGAGCCTGA GGAGGTCTGG CAAGATAGAT GGTGTATTAT 2520 

TTATGGATCA GGCTGCTGCA TACAAACCTT GCATACTATT ATGCAGCTTA CCTAACTCTC 2580 

AGACTATTCT GAGTAATGCT TGCTTGCTAA TGAATGTATA GGAGACCACA TTGTAATTGT 2640 

TCTTAGATGA TGGAGTCCAT GCAGTTTCTT AGAAATCGGT CTCAGTGCAT GCTGTGCTTT 2700 

TTCACATTTG CTCTGGGTTA TCTGGGAAGT ATCAGGTTCT GGGAGGCAAC AGCATTAAGT 2760 

40 GATAAGAAAA GGAGACATTC TGGCAAAGCC AATCTGCTTA AAGGCAAAGT CCAGAACCTG 2820 

GAACCTAGAG GCCTTTCTCT CTGCACGAAA AACAGGTAGT TTGCAGTCTG AGATATGGGA 2880 

GAGCTTTTAG GCTACACAGC AACCCAAGGG ACCTCTCACC TTTTGCTGAG CTTCAATCAG 2940 

GAAGCTATTT GCCTGGCTCC AGCAGATGAT GAGATAATGA GGTAGTGGGT TTTTTATTAC 3000 

TGTTCCATTT TGCAACATCC TGCAACACCA TCCTGGGAGA CAAGAGCATT ACCCAGCTTG 3060 

45 GCTTTCACGG GGGAGGGTTG TATTCAGT 3088 
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Seq ID NO: C166 DMA Sequence 
Nucleic Acid Accession #: NM__000574.l 
Coding sequence i 66.. 1211 

1 11 21 31 41 51 

1 I I I I I 

CCGCTGGGCG TAGCTGCGAC TCGGCGGAGT CCCGGCGGCG CGTCCTTGTT CTAACCCGGC 60 

GCGCCATGAC CX3TOGC6CGG CCGAGCOTOC OCGOGG0GCT GCCCCTCCTC GGGGAGCTGC 120 

CCCGGCTGCT GCTGCTGGTG CTGTTGTGCC TGCCGGCCGT GTGGGGTGAC TGTGGCCTTC 180 

CCCCAGATGT ACCTAATGCC CAGCCAGCTT TGGAAGGCOG TACAAGTTTT CCCGAGGATA 240 

CTGTAATAAC GTACAAATGT GAAGAAAGCT TTGTGAAAAT TCCTGGCGAG AAGGACTCAG 300 

TGATCTGCCT TAAGGGCAGT CAATGGTCAG ATATTGAAGA GTTCTGCAAT CGTAGCTGCG 360 

AGGTGCCAAC AAGGCTAAAT TCTGCATCCC TCAAACAGCC TTATATCACT CAGAATTATT 420 

TTCCAGTCGG TACTGTTGTG GAATATGAGT GCCGTCCAGG TTACAGAAGA GAACCTTCTC 480 

TATCACCAAA ACTAACTTGC CTTCAGAATT TAAAATGGTC CACAGCAGTC GAATTTTGTA 540 

AAAAGAAATC ATGCCCTAAT CCGGGAGAAA TACGAAATGG TCAGATTGAT GTACCAGGTG 600 

GCATATTATT TGGTGCAACC ATCTCCTTCT CATGTAACAC AGGGTACAAA TTATTTGGCT 660 

CGACTTCTAG TTTTTGTCTT ATTTCAGGCA GCTCTGTCCA GTGGAGTGAC CCGTTGCCAG 720 

AGTGCAGAGA AATTTATTGT CCAGCAOCAC CACAAATTGA CAATGGAATA ATTCAAGGGG 780 

AACGTGACCA TTATGGATAT AGACAGTCTG TAACGTATGC ATGTAATAAA GGATTCACCA 840 

TGATTGGAGA GCACTCTATT TATTGTACTG TGAATAATGA TGAAGGAGAG TGGAGTGGCC 900 

CACCACCTGA ATGCAGAGGA AAATCTCTAA CTTCCAAGGT CCCACCAACA GTTCAGAAAC 960 

CTACCACAGT AAATGTTCCA ACTACAGAAG TCTCACCAAC TTCTCAGAAA ACCACCACAA 1020 

AAACCACCAC ACCAAATGCT CAAGCAACAC GGAGTACACC TGTTTCCAGG ACAACCAAGC 1080 

ATTTTCATGA AACAACCCCA AATAAAGGAA GTGGAACCAC TTCAGGTACT ACCCGTCTTC 1140 

TATCTGGGCA CACGTGTTTC ACGTTGACAG GTTTGCTTGG GACGCTAGTA ACCATGGGCT 1200 

TGCTGACTTA GCCAAAGAAG AGTTAAGAAG AAAATACACA CAAGTATACA GACTGTTCCT 1260 

AGTTTCTTAG ACTTATCTGC ATATTGGATA AAATAAATGC AATTGTGCTC TTCATTTAGG 1320 

ATGCTTTCAT TGTCTTTAAG ATGTGTTAGG AATGTCAACA GAGCAAGGAG AAAAAAGGCA 1380 

GTCCTGGAAT CACATTCTTA GCACACCTAC ACCTCTTGAA AATAGAACAA CTTGCAGAAT 1440 

TGAGAGTGAT TCCTTTCCTA AAAGTGTAAG AAAGCATAGA GATTTGTTCG TATTTAGAAT 1500 

GGGATCACGA GGAAAAGAGA AGGAAAGTGA TTTTTTTCCA CAAGATCTGT AATGTTATTT 1S60 

CCACTTATAA AGGAAATAAA AAATGAAAAA CATTATTTGG ATATCAAAAG CAAATAAAAA 1620 

CCCAATTCAG TCTCTTCTAA GCAAAATTGC TAAAGAGAGA TGAACCACAT TATAAAGTAA 1680 

TCTTTGGCTG TAAGGCATTT TCATCTTTCC TTCGGGTTGG CAAAATATTT TAAAGGTAAA 1740 

ACATGCTGGT GAACCAGGGG TGTTGATGGT GATAAGGGAG GAATATAGAA TGAAAGACTG 1800 

AATCTTCCTT TGTTGGACAA ATAGAGTTTG GAAAAAGCCT GTGAAAGGTG TCTTCTTTGA 1860 

CTTAATGTCT TTAAAAGTAT CCAGAGATAC TACAATATTA ACATAAGAAA AGATTATATA 1920 
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TTATTTCTGA ATCGAGATGT CCATAGTCAA ATTTGTAAAT CTTATTCTTT TGTAATATTT 1980 

ATTTATATTT ATTTATGACA GTGAACATTC TGATTTTACA TGTAAAACAA GAAAAGTTGA 2040 

AGAAGATATG TGAAGAAAAA TGTATTTTTC CTAAATAGAA ATAAATGATC CCATTTTTTG 2100 

GT 2102 

Seq ID NO: C167 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence : 1 . .2651 



1 11 21 31 41 51 

I I I i I I 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

- - GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

I-> CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCOGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300 

QCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

C TGCAT TCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG S40 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

„ GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

JU GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

G CACGA CATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTGG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTOG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTOC AGCATCACTT ATGAOCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 

CCATGTCTTA A 26S1 



Seq ID NO: C168 DNA Sequence 

Nucleic Acid Accession ft: NM_003667.2 

Coding sequence: 49.-2772 

1 U 21 31 41 51 

t I I I I I 

TGCTGCTCTC CGCCCGCGTC CGGCTCGTGG CCCCCTACTT CGGGCACCAT GGACACCTCC 60 

CGGCTCGGTG TGCTCCTGTC CTTGCCTGTG CTGCTGCAGC TGGCGACCGG GGGCAGCTCT 120 

CCCAGGTCTG GTGTGTTGCT GAGGGGCTGC CCCACACACT GTCATTGCGA GCCCGACGGC 180 

AGGATGTTGC TCAGGGTGGA CTGCTCCGAC CTGGGGCTCT CGGAGCTGCC TTCCAACCTC 240 

AGCGTCTTCA CCTCCTACCT AGACCTCAGT ATGAACAACA TCAGTCAGCT GCTCCCGAAT 300 

CCCCTGCCCA GTCTCCGCTT CCTGGAGGAG TTACGTCTTG CGGGAAACGC TCTGACATAC 360 

ATTCCCAAGG GAGCATTCAC TGGCCTTTAC AGTCTTAAAG TTCTTATGCT GCAGAATAAT 420 

CAGCTAAGAC ACGTACCCAC AGAAGCTCTG CAGAATTTGC GAAGCCTTCA ATCCCTGCGT 480 

CTGGATGCTA ACCACATCAG CTATGTGCCC CCAAGCTGTT TCAGTGGCCT GCATTCCCTG 540 

AGGCACCTGT GGCTGGATGA CAATGCGTTA ACAGAAATCC OCGTCCAGGC TTTTAGAAGT 600 

TTATCGGCAT TGCAAGCCAT GACCTTGGCC CTGAACAAAA TACACCACAT ACCAGACTAT 660 

GCCTTTGGAA ACCTCTCCAG CTTGGTAGTT CTACATCTCC ATAACAATAG AATCCACTCC 720 

CTGGGAAAGA AATGCTTTGA TGGGCTCCAC AGCCTAGAGA CTTTAGATTT AAATTACAAT 780 

AACCTTGATG AATTCCCCAC TGCAATTAGG ACACTCTCCA ACCTTAAAGA ACTAGGATTT 840 

CATAGCAACA ATATCAGGTC GATACCTGAG AAAGCATTTG TAGGCAACCC TTCTCTTATT 900 

ACAATACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 960 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 1020 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 1080 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1140 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1200 

GAAATCTACG. AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1260 
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TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1320 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1380 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1440 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGT6C ATTTGGAGTG 1500 

5 TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1S60 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1620 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1680 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1740 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1800 

10 TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1860 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1920 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1980 

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 2040 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 2100 

15 ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2160 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2220 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2280 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2340 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2400 

20 TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2460 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2520 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2580 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTOG AAAAACAGTC CTGTGACTCA 2640 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2700 

25 GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2760 

CCATGTCTCT AATTAATATG TGAAGGAAAA TGTTTTCAAA GGTTGAGAAC CTGAAAATGT 2820 

GAGATTGAGT ATATCAGAGC AGTAATTAAT AAGAAGAGCT GAGGTGAAAC TCGGTTTAAA 2880 

Seq ID HO: Ci 69 DNA Sequence 
30 Nucleic Acid Accession #: NMJ)03S06.1 
Coding sequence: 2S9..2379 

1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 
TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 
GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 
CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 
GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 
TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 
CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 
GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 
CTGTGCCTTT GTGTGTTTGT TGGGCXCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 
CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 
ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 
TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 
CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 
TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 
GGAAGCAAAA AGACATGCAC AGAATGGGCT GCGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 
CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 
TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 
TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 
ATT ACT AG CC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 
ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 
TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 
GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 
AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 
TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 
_ AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 
75 CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 
TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 
TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 
AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 
_ _ CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 
OU TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 
GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 
ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2860 
ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 
GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 
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TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 
GGTGCTTACT CAAAGAGTGT CCACTATTQA TTGTATTATG CTGCTCACTG ATCCTTCTGC 
ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 
GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTQ ATTGTATTAT 
AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: C170 DNA Sequence 
Nucleic Acid Accession #: NMJ>00582 
Coding sequence: 88~990 



3060 
3120 
3180 
3240 
3300 
3342 



GCAGAGCACA 
AACGCCGACC 
GGCATCACCT 
CTTTACAACA 
CAGAATCTCC 
GATQATATGG 
AAOGACTCTG 
TCTGATGAAT 
TTCACTCCAG 
GGACTGAGGT 
GACGAGGACA 
CCCGTTGCCC 
GAAACGAGTC 
TATAAGCGGA 
CTTTCCAAAG 
GTTGTAGACC 
TTAGATAGTG 
ATTTAGTCAA 
CTCAGTTTAT 
ATTAGTTTAG 
CTATGTTCAT 
TCATGAATAG 
ATAACATTTT 
TATCTTTTTG 
AATTGCTTAT 
GCCTAAAAAA 



11 
I 

GCATCGTCGG 
AAGGAAAACT 
GTGCCATACC 
AATACCCAGA 
TAGCCCCACA 
ATGATGAAGA 
ATGATGTAGA 
CTGATGAACT 
TTGTCCCCAC 
CAAAATCTAA 
TCACCTCACA 
AGGACCTGAA 
AGCTGGATGA 
AAGCCAATGA 
TCAGCCGTGA 
CCAAAAGTAA 
CATCTTCTGA 
AAGAAAAAAT 
TGGTTGAATG 

TCTATAGAAG 
AAATTTATGT 
ATGTCACTAT 
TGGTGTGAAT 
TTGTTTTCCC 
AAAAAAAAAA 



21 
I 

GACCAGACTC 
CACTACCATG 
AGTTAAACAG 
TGCTGTGGCC 
GACCCTTCCA 
TGATGATGAC 
TGACACTGAT 
GGTCACTGAT 
AGTAGACACA 
GAAGTTTCGC 
CATGGAAAGC 
CGCGCCTTCT 
CCAGAGTGCT 
TGAGAGCAAT 
ATTCCACAGC 
GGAAGAAGAT 
GGTCAATTAA 
GCTTTATAGC 
TOT AT CT ATT 
CATGGAAACT 
AAATGCAAAC 
AGAAGCAAAC 
AATCTTTTGT 
AAATCTTTTA 
ACGGTTGTCC 



31 
I 

GTCTCAGGCC 
AGAATTGCAG 
GCTGATTCTG 
ACATGGCTAA 
AGTAAGTCCA 
CATGTGGACA 
GATTCTCACC 
TTTCCCACGG 
TATGATGGCC 
AGACCTGACA 
GAGGAGTTGA 
GATTGGGACA 
GAAACCCACA 
GAGCATTCCG 
CATGAATTTC 
AAACACCTGA 
AAGGAGAAAA 
AAAATGAAAG 
TGAGTCTGGA 
CCCTGTAAAC 
TATCACTGTA 
AAAATACTTT 
TTTTTAAGTT 
TCTTGAATGT 
AGCAATTAAT 



41 
I 

AGTTGCAGCC 
TGATTTGCTT 
GAAGTTCTGA 
ACCCTGACCC 
ACGAAAGCCA 
GCCAGGACTC 
AGTCTGATGA 
ACCTGCCAGC 
GAGGTGATAG 
TCCAGTACCC 
ATGGTGCATA 
GCCGTGGGAA 
GCCACAAGCA 
ATGTGATTGA 
ACAGCCATGA 
AATTTCGTAT 
AATACAATTT 
AGAACATGAA 
AATAACTAAT 
TAAAAGCTTC 
TTTTAATATT 
TACCCACTTA 
AGTGTATATT 
AATAAGAATT 
AAAACATAAC 



SI 
I 

TTCTCAGCCA 
TTGCCTCCTA 
GGAAAAGCAG 
ATCTCAGAAG 
TGACCACATG 
CATTGACTCG 
GTCTCACCAT 
AACCGAAGTT 
TGTGGTTTAT 
TGATGCTACA 
CAAGGCCATC 
GGACAGTTAT 
GTCCAGATTA 
TAGTCAGGAA 
AGATATGCTG 
TTCTCATGAA 
CTCACTTTGC 
ATGCTTCTTT 
GTGTTTGATA 
AGGGTTATGT 
TGTTATTCTC 
AAAAGAGAAT 
TTGTTGTGAT 
TGGTGGTGTC 
CTTTTTTACT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1S00 

1524 



Seq ID NO: Cl 71 DNA Sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: ISO.. 3362 

1 11 21 31 41 51 

III!!! 

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCOGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAG CCCGCCGCGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240 

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTC TTT G AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAOAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTOQAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GT6CCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA OCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TOGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 
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AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

OAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2S20 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGOGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACOGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 

TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 4187 

Seq ID NO: C172 dna Sequence 

Nucleic Acid Accession 8: NM_0023O9.2 

Coding sequence : 65 . . 673 

1 11 21 31 41 51 

1 i I I I I 

ATGAACCTCT GAAAACTGCC GGCATCTGAG GTTTCCTCCA AGGCCCTCTG AAGTGCAGCC 60 

CATAATGAAG GTCTTGGCGG CAGGAGTTGT GCCCCTGCTG TTGGTTCTGC ACTGGAAACA 120 

TGGGGCGGGG AGCCCCCTCC CCATCACCCC TGTCAACGCC ACCTGTGCCA TACGCCACCC 180 

ATGTCACAAC AACCTCATGA ACCAGATCAG GAGCCAACTG GCACAGCTCA ATGGCAGTGC 240 

CAATGCCCTC TTTATTCTCT ATTACACAGC CCAGGGGGAG CCGTTCCCCA ACAACCTGGA 300 

CAAGCTATGT GGCCCCAACG TGACGGACTT CCCGCCCTTC CACGCCAACG GCACGGAGAA 360 

GGCCAAGCTG GTGGAGCTGT ACOGCATAGT CGTGTACCTT GGCACCTCCC TGGGCAACAT 420 

CACCCGGGAC CAGAAGATCC TCAACCCCAG TGCCCTCAGC CTCCACAGCA AGCTCAACGC 480 

CACCGCCGAC ATCCTGCGAG GCCTCCTTAG CAACGTGCTG TGCCGCCTGT GCAGCAAGTA 540 

CCACGTGGGC CATGTGGACG TGACCTACGG CCCTGACACC TCGGGTAAGG ATGTCTTCCA 600 

GAAGAAGAAG CTGGGCTGTC AACTCCTGGG GAAGTATAAG CAGATCATCG CCGTGTTGGC 660 

CCAGGCCTTC TAGCAGGAGG TCTTGAAGTG TGCTGTGAAC CGAGGGATCT CAGGAGTTGG 720 

GTCCAGATGT GGGGGCCTGT CCAAGGGTGG CTGGGGCCCA GGGCATCGCT AAACCCAAAT 780 

GGGGGCTGCT GGCAGACCCC GAGGGTGCCT GGCCAGTCCA CTCCACTCTG GGCTGGGCTG 840 

TGATGAAGCT GAGCAGAGTG GAAACTTCCA TAGGGAGGGA GCTAGAAGAA GGTGCCCCTT 900 

CCTCTGGGAG ATTGTGGACT GGGGAGCGTG GGCTGGACTT CTGCCTCTAC TTGTCCCTTT 960 

GGCCCCTTGC TCACTTTGTG CAGTGAACAA ACTACACAAG TCATCTACAA GAGCCCTGAC 1020 

CACAGGGTGA GACAGCAGGG CCCAGGGGAG TGGACCAGCC CCCAGCAAAT TATCACCATC 1080 

TGTGCCTTTG CTGCCCCTTA GGTTGGGACT TAGGTGGGCC AGAGGGGCTA GGATCCCAAA 1140 

GGACTCCTTG TCCCCTAGAA GTTTGATGAG TGGAAGATAG AGAGGGGCCT CTGGGATGGA 1200 

AGGCTGTCTT CTTTTGAGGA TGATCAGAGA ACTTGGGCAT AGGAACAATC TGGCAGAAGT 1260 

TTCCAGAAGG AGGTCACTTG GCATTCAGGC TCTTGGGGAG GCAGAGAAGC CACCTTCAGG 1320 

CCTGGGAAGG AAGACACTGG GAGGAGGAGA GGCCTGGAAA GCTTTGGTAG GTTCTTCGTT 1380 

CTCTTCCCOG TGATCTTCCC TGCAGCCTGG GATGGCCAGG GTCTGATGGC TGGACCTGCA 1440 

GCAGGGGTTT GTGGAGGTGG GTAGGGCAGG GGCAGGTTGC TAAGTCAGGT GCAGAGGTTC 1500 

TGAGGGACCC AGGCTCTTCC TCTGGGTAAA GGTCTGTAAG AAGGGGCTGG GGTAGCTCAG 1560 

AGTAGCAGCT CACATCTGAG GCOCTGGGAG GTCTTGTGAG GTCACACAGA GGTACTTGAG 1620 

GGGGACTGGA GGCCGTCTCT GGTCCCCAGG GCAAGGGAAC AGCAGAACTT AGGGTCAGGG 1680 

TCTCAGGGAA CCCTGAGCTC CAAGCGTGCT GTGCGTCTGA CCTGGCATGA TTTCTATTTA 1740 

TTATGATATC CTATTTATAT TAACTTATTG GTGCTTTCAG TGGCCAAGTT AATTCCCCTT 1800 

TCCCTGGTCC CTACTCAACA AAATATGATG ATGGCTCCCG ACACAAGOGC CAGGGCCAGG 1860 

GCTTAGCAGG GCCTGGTCTG GAAGTCGACA ATGTTACAAG TGGAATAAGC TTACGGGTGA 1920 

AGCTCAGAGA AGGGTCGGAT CTGAGAGAAT GGGGAGGCCT GAGTGGGAGT GGGGGGCCTT 1980 

GCTCCACCCC CATCCCCTAC TGTGACTTGC TTTAGCGTGT CAGGGTCCAG GCTGCAGGGG 2040 

CTGGGCCAAT TTGTGGAGAG GCCGGGTGCC TTTCTGTCTT GCTTCCAGGG GGCTGGTTCA 2100 

CACTGTTCTT GGGCGCCCCA GCATTGTGTT GTGAGGCGCA CTGTTCCTGG CAGATATTGT 2160 

GCCCCCTGGA GCAGTGGGCA AGACAGTCCT TGTGGCCCAC CCTGTCCTTG TTTCTGTGTC 2220 

CCCATGCTGC CTCTGAAATA GCGCCCTGGA ACAACCCTGC CCCTGCACCC AGCATGCTCC 2280 
GACACAGCAG GGAAGCTCCT CCTGTGGCCC GGACACCCAT AGACGGTGCG GGGGGCCTGG 2340 
CTGGGCCAGA CCCCAGGAAG GTGGGGTAGA CTGGGGGGAT CAGCTGCCCA TTGCTCCCAA 2400 
GAGGAGGAGA GGGAGGCTGC AGACGCCTGG GACTCAGACC AGGAAGCTGT GGGCCCTCCT 2460 
GC TCCA CCCC CATCCCACTC CCACCCATGT CTGGGCTCCC AGGCAGGGAA CCCGATCTCT 2520 
TCCTTTGTGC TGGGGCCAGG CGAGTGGAGA AACGCCCTCC AGTCTGAGAG CAGGGGAGGG 2580 
AAGGAGGCAG CAGAGTTGGG GCAGCTGCTC AGAGCAGTGT TCTGGCTTCT TCTCAAACCC 2640 
TGAGCGGGCT GCCGGCCTCC AAGTTCCTCC GACAAGATGA TGGTACTAAT TATGGTACTT 2700 
TTCACTCACT TTGCACCTTT CCCTGTCGCT CTCTAAGCAC TTTACCTGGA TGGCGCGTGG 2760 
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10 



GCAGTGTGCA GGCAGGTCCT GAGGCCTGGG GTTGGGGTGG AGGGTGCGGC CCGGAGTTGT 2820 

CCATCTGTCC ATCCCAACAG CAAGACGAGG ATGTGGCTGT TGAGATGTGG GCCACACTCA 2880 

CCCTTGTCCA GGATGCAGGG ACTGCCTTCT CCTTCCTGCT TCATCCGGCT TAGCTTGGGG 2940 

CTGGCTGCAT TCCCCCAGGA TGGGCTTCGA GAAAGACAAA CTTGTCTGGA AACCAGAGTT 3000 

GCTGATTCCA CCCGGGGGGC CCGGCTGACT CGCCCATCAC CTCATCTCCC TGTGGACTTG 3060 

GGAGCTCTGT GCCAGGCCCA CCTTGCGGCC CTGGCTCTGA GTCGCTCTCC CACCCAGCCT 3120 

GGACTTGGCC CCATGGGACC CATCCTCAGT GCTCCCTCCA GATCCCGTCC GGCAGCTTGG 3180 

CGTCCACCCT GCACAGCATC ACTGAATCAC AGAGCCTTTG OGTGAAACAG CTCTGCCAGG 3240 

CCGGGAGCTG GGTTTCTCTT CCCTTTTTAT CTGCTGGTGT GGACCACACC TGGGCCTGGC 3300 

CGGAGGAAGA GAGAGTTTAC CAAGAGAGAT GTCTCCGGGC CCTTATTTAT TATTTAAACA 3360 

TTTTTTTAAA AAGCACTGCT AGTTTACTTG TCTCTCCTCC CCATCGTCCC CATCGTCCTC 3420 

CTTGTCCCTG ACTTGGGGCA CTTCCACCCT GACCCAGCCA GTCCAGCTCT GCCTTGCOGG 3480 

CTCTCCAGAG TAGACATAGT GTGTGGGGTT GGAGCTCTGG CACCCGGGGA GGTAGCATTT 3540 

CCCTGCAGAT GOTACAGATG TTCCTGCCTT AGAGTCATCT CTAGTTCCCC ACCTCAATCC 3600 

I J CGGCATCCAG CCTTCAGTCC CGCCCACGTG CTAGCTCCGT GGGCCCACCG TGCGGCCTTA 3660 

GAGGTTTCCC TCCTTCCTTT CCACTGAAAA GCACATGGCC TTGGGTGACA AATTCCTCTT 3720 

TGATGAATGT ACCCTGTGGG GATGTTTCAT ACTGACAGAT TATTTTTATT TATTCAATGT 3780 

CATATTTAAA ATATTTATTT TTTATACCAA ATGAATCACT TTTTTTTTTA AGAAAAAAAA 3840 

2Q GAGAAATGAA TAAAGAATCT ACTCTTCG 3868 

Seq ID NO: C173 DNA Sequence 
Nucleic Acid Accession 8: XM_097508 
Coding sequence: 44.. 2788 ~~ 

25 i 11 21 31 41 51 

I 1 I 1 I I 

TGAAAGGCGG TTGTGGTGCA AAGGAAAACC CACAGGCCAA GGAATGGGAA GACCAAGGTT 60 

GACACTTGTT TGTCACGTGT CAATAATCAT CTCTGCCCGG GACCTCAGCA TGAACAACCT 120 

CACAGAGCTT CAGCCTGGCC TCTTCCACCA CCTGCGCTTC TTGGAGGAGC TGCGTCTCTC 180 

JU TGGGAACCAT CTCTCACACA TCCCAGGACA AGCATTCTCT GGTCTCTACA GCCTGAAAAT 240 

CCTGATGCTG CAGAACAATC AGCTGGGAGG AATCCCCGCA GAGGCGCTGT GGGAGCTGCC 300 

GAGCCTGCAG TCGCTGCGCC TAGATGCCAA CCTCATCTCC CTGGTCCCGG AGAGGAGCTT 360 

TGAGGGGCTG TCCTCCCTCC GCCACCTCTG GCTGGACGAC AATGCACTCA CGGAGATCCC 420 

„ TGTCAGGGCC CTCAACAACC TCCCTGCCCT GCAGGCCATG ACCCTGGCCC TCAACCGCAT 480 

3D CAGCCACATC CCOGACTACG CGTTCCAGAA TCTCAOCAGC CTTGTGGTGC TGCATTTGCA 540 

TAACAACGGC ATCCAGCATC TGGGGACCCA CAGCTTCGAG GGGCTGCACA ATCTGGAGAC 600 

ACTAGACCTG AATTATAACA AGCTGCAGGA GTTCCCTGTG GCCATCCGGA CCCTGGGCAG 660 

ACTGCAGGAA CTGGGGTTCC ATAACAACAA CATCAAGGCC ATCCCAGAAA AGGCCTTCAT 720 

GGGGAACCCT CTGCTACAGA CGATACACTT TTATGATAAC CCAATCCAGT TTGTGGGAAG 780 

4U ATCGGCATTC CAGTACCTGC CTAAACTCCA CACACTATCT CTGAATGGTG CCATGGACAT 840 

CCAGGAGTTT CCAGATCTCA AAGGCACCAC CAGCCTGGAG ATCCTGACCC TGACCCGCGC 900 

AGGCATCCGG CTGCTCCCAT CGGGGATGTG CCAACAGCTG CCCAGGCTCC GAGTCCTGGA 960 

ACTGTCTCAC AATCAAATTG AGGAGCTGCC CAGCCTGCAC AGGTGTCAGA AATTGGAGGA 1020 

. AATCGGCCTC CAACACAACC GCATCTGGGA AATTGGAGCT GACACCTTCA GCCAGCTGAG 1080 

4S CTCCCTGCAA GCCCTGGATC TTAGCTGGAA CGCCATCOGG TCCATCCACC CCGAGGCCTT 1140 

CTCCACCCTG CACTCCCTGG TCAAGCTGGA CCTGACAGAC AACCAGCTGA CCACACTGCC 1200 

CCTGGCTGGA CTTGGGGGCT TGATGCATCT GAAGCTCAAA GGGAACCTTG CTCTCTCCCA 1260 

GGCCTTCTCC AAGGACAGTT TCCCAAAACT GAGGATCCTG GAGGTGCCTT ATGCCTACCA 1320 

GTGCTGTCCC TATGGGATGT GTGCCAGCTT CTTCAAGGCC TCTGGGCAGT GGGAGGCTGA 1380 

!>U AGACCTTCAC CTTGATGATG AGGAGTCTTC AAAAAGGCCC CTGGGCCTCC TTGCCAGACA 1440 

AGCAGAGAAC CACTATGACC AGGACCTGGA TGAGCTCCAG CTGGAGATGG AGGACTCAAA 1500 

GCCACACOCC AGTGTCCAGT GTAGCCCTAC TCCAGGCCCC TTCAAGCCCT GTGAGTACCT 1S60 

CTTTGAAAGC TGGGGCATCC GCCTGGCOGT GTGGGCCATC GTGTTGCTCT CCGTGCTCTG 1620 

CAATGGACTG GTGCTGCTGA COGTGTTCGC TGGCGGGCCT GTCCCCCTGC CCCCGGTCAA 1680 

D*> GTTTGTGGTA GGTGCGATTG CAGGCGCCAA CACCTTGACT GGCATTTCCT GTGGCCTTCT 1740 

AGCCTCAGTC GATGCCCTGA CCTTTGGTCA GTTCTCTGAG TACGGAGCCC GCTGGGAGAC 1800 

GGGGCTAGGC TGCCGGGCCA CTGGCTTCCT GGCAGTACTT GGGTCGGAGG CATCGGTGCT 1860 

GCTGCTCACT CTGGCCGCAG TGCAGTGCAG CGTCTCCGTC TCCTGTGTCC GGGCCTATGG 1920 

GAAGTCCCCC TCCCTGGGCA GCGTTCGAGC AGGGGTCCTA GGCTGCCTGG CACTGGCAGG 1980 

OU GCTGGCCGCC GCGCTGCCCC TGGCCTCAGT GGGAGAATAC GGGGCCTOCC CACTCTGCCT 2040 

GCCCTACGCG CCACCTGAGG GTCAGCCAGC AGCCCTGGGC TTCACCOTGG CCCTGGTGAT 2100 

GATGAACTCC TTCTGTTTCC TGGTCGTGGC CGGTGCCTAC ATCAAACTGT ACTGTGACCT 2160 

GCCGOGGGGC GACTTTGAGG CCGTGTGGGA CTGCGCCATG GTGAGGCACG TGGCCTGGCT 2220 

CATCTTCGCA GAOGGGCTCC TCTACTGTCC CGTGGCCTTC CTCAGCTTTG CCTCCATGCT 2280 

Oj GGGCCTCTTC CCTGTCACGC CCGAGGCCGT CAAGTCTGTC CTGCTGGTGG TGCTGCCCCT 2340 

GCCTGCCTGC CTCAACCCAC TGCTGTACCT GCTCTTCAAC CCCCACTTCC GGGATGACCT 2400 

TCGGCGGCTT CGGCCCCGCG CAGGGGACTC AGGGCCCCTA GCCTATGCTG CGGCCGGGGA 2460 

GCTGGAGAAG AGCTCCTGTG ATTCTACCCA GGCCCTGGTA GCCTTCTCTG ATGTGGATCT 2520 

CATTCTGGAA GCTTCTGAAG CTGGGCGGCC CCCTGGGCTG GAGACCTATG GCTTCCCCTC 2580 

/U AGTGACCCTC ATCTCCTGTC AGCAGCCAGG GGCCCCCAGG CTGGAGGGCA GCCATTGTGT 2640 

AGAGCCAGAG GGGAACCACT TTGGGAACCC CCAACCCTCC ATGGATGGAG AACTGCTGCT 2700 

GAGGGCAGAG GGATCTACGC CAGCAGGTGG AGGCTTGTCA GGGGGTGGCG GCTTTCAGCC 2760 

CTCTGGCTTG GCCTTTGCTT CACAGGTGTA AATATCCCTC CCCATTCTTC TCTTCCCCTC 2820 

TCTTCCCTTT CCTCTCTCCC CCTOGGTGAA TGATGGCTGC TTCTAAAACA AATACAACCA 2880 

/-> AAACTCAGCA GTGTGATCTA TAGCAGGATG GCCCAGTCCC TGGCTCCACT GATCACCTCT 2940 

CTCCTGTGAC CATCACCAAC GGGTGCCTCT TGGCCTGGCT TTCCCTTGGC CTTCCTCAGC 3000 

TTCACCTTGA TACTGGGCCT CTTCCTTGTC ATGTCTGAAG CTGTGGACCA GAGACCTGGA 3060 

CTTTTGTCTG CTTAAGGGAA ATGAGGGAAG TAAAGACAGT GAAGGGGTGG AGGGTTGATC 3120 

CA AGGGCACAGT GGACAGGGAG ACCTCACAGA GAAAGGCCTG GAA6GTGATT TCCCGTGTGA 3180 

OU CTCATGGATA GGATACAAAA TGTGTTCCAT GTACCATTAA TCTTGACATA TGCCATGCAT 3240 

AAAGACTTCC TATTAAAATA AGCTTTGGAA GAG 3273 

Seq id NO: C174 DNA Sequence 
Nucleic Acid Accession #: NM_130849 
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Coding sequence: 101.. 2044 

1 11 21 31 41 51 

c I I ! I I I 

J AGTCTGGCCC TGGACAACCC CAGCAAAGCC GCCCTCAGCC AGCCCAGAAG CACTGGGCCT 60 

TGGCCACAGC AACACCCACT GAGCAOGCTG GGAGCTGAGT ATGGCGTCCC TGGTCTCGCT 120 

GGAGCTGGGG CTGCTTCTGG CT GTG CTGGT GGTGACGGCG ACGGCGTCCC CGCCTGCTGG 180 

TCTGCTGAGC CTGCTCACCT CTGGCCAGGG CGCTCTGGAT CAAGAGGCTC TGGGCGGCCT 240 

GTTAAATACG CTGGCGGACC GTGTGCACTG CACCAACGGG CCGTGTGGAA AGTGCCTGTC 300 

1U TGTGGAGGAC GCCCTGGGCC TGGGCGAGCC TGAGGGGTCA GGGCTGCCCC OGGGCCCGGT 360 

CCTGGAGGCC AGGTACGTCG CCCGCCTCAG TGCCGCCGCC GTCCTGTACC TCAGCAACCC 420 

CGAGGGCACC TGTGAGGACA CTCGGGCTGG CCTCTGGGCC TCTCATGCAG ACCACCTCCT 4 BO 

GGCCCTGCTC GAGAGCCCCA AGGCCCTGAC CCCGGGCCTG AGCTGGCTGC TGCAGAGGAT 540 

GCAGGCCCGG GCTGCCGGCC AGACCCCCAA GACGGCCTGC GTAGATATCC CTCAGCTGCT 600 

ID GGAGGAGGCG GTGGGGGCGG GGGCTCCGGG CAGTGCTGGC GGCGTCCTGG CTGCCCTGCT 660 

GGACCATGTC AGGAGCGGGT CTTGCTTCCA CGCCTTGCCG AGCCCTCAGT AC7TCGTGGA 720 

CTTTGTGTTC CAGCAGCACA GCAGCGAGGT CCCTATGACG CTGGCCGAGC TGTCAGCCTT 780 

GATGCAGCGC CTGGGGGTGG GCAGGGAGGC CCACAGTGAC CACAGTCATC GGCACAGGGG 840 

AGCCAGCAGC CGGGACCCTG TGCCCCTCAT CAGCTCCAGC AACAGCTCCA GTGTGTGGGA 900 

CACGGTATGC CTGAGTGCCA GGGACGTGAT GGCTGCATAT GGACTGTCGG AACAGGCTGG 960 

GGTGACCCCG GAGGCCTGGG CCCAACTGAG CCCTGCCCTG CTCCAACAGC AGCTGAGTGG 1020 

AGCCTGCACC TCCCAGTCCA GGCCCCCCGT CCAGGACCAG CTCAGCCAGT CAGAGAGGTA 1080 

TCTGTACGGC TCCCTGGCCA CGCTGCTCAT CTGCCTCTGC GOGGTCTTTG GCCTCCTGCT 1140 

GCTGACCTGC ACTGGCTGGA GGGGGGTCGC CCACTACATC CTGCAGACCT TCCTGAGCCT 1200 

GGCAGTGGGT GCACTCACTG GGGAOGCTGT CCTGCATCTG ACGCCCAAGG TGCTGGGGCT 1260 

GCATACACAC AGCGAAGAGG GCCTCAGCCC ACAGCCCACC TGGCGCCTCC TGGCTATGCT 1320 

GGCCGGGCTC TACGCCTTCT TCCTGTTTGA GAACCTCTTC AATCTCCTGC TGCCCAGGGA 1380 

CCCGGAGGAC CTGGAGGACG GGCCCTGCGG CCACAGCAGC CATAGCCACG GGGGCCACAG 1440 

„ CCACGGTGTG TCCCTGCAGC TGGCACCCAG CGAGCTCCGG CAGCCCAAGC CCCCCCACGA 1500 

3U GGGCTCCOGC GCAGACCTGG TGGCGGAGGA GAGCCCGGAG CTGCTGAACC CTGAGCCCAG 1560 

GAGACTGAGC CCAGAGTTGA GGCTACTGCC CTATATGATC ACTCTGGGCG ACGCCGTGCA 1620 

CAACTTCGCC GACGGGCTGG CCGTGGGCGC CGCCTTCGCG TCCTCCTGGA AGACCGGGCT 1680 

GGCCACCTCG CTGGCCGTGT TCTGCCACGA GTTGCCA'CAC GAGCTGGGGG ACTTCGCCGC 1740 

CTTGOTGCAC GCGGGGCTGT CCGTGCGCCA AGCACTGCTG CTGAACCTGG CCTCCGCGCT 1800 

CACGGCCTTC GCTGGTCTCT ACGTGGCACT CGCGGTTGGA GTCAGCGAGG AGAGCGAGGC 1860 

CTGGATCCTG GCAGTGGCCA CCGGCCTGTT CCTCTACGTA GCACTCTGCG ACATGCTCCC 1920 

GGCGATGTTG AAAGTACGGG ACCCGCGGCC CTGGCTCCTC TTCCTGCTGC ACAACGTGGG 1980 

CCTGCTGGGC GGCTGGACCG TCCTGCTGCT GCTGTCCCTG TACGAGGATG ACATCACCTT 2040 

CTGATACCCT GCCCTAGTCC CXXACCTTTG ACTTAAGATC CCACACCTCA CAAACCTACA 2100 

GCCCAGAAAC CAGAAGCCCC TATAGAGGCC CCAGTCCCAA CTCCAGTAAA GACACTCTTG 2160 

TCCTTGGAAA AAAAAAAAAA AAAAAAAAAA AA 2192 
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Seq XD NO: C175 UNA Sequence 
Nucleic Acid Accession #: NMJ>18971 
4!) Coding sequence: 1..1128 
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1 11 21 31 41 51 

I I 1 I I I 

ATGGCGAACG CGAGCGAGCC GGGTGGCAGC GGCGGCGGCG AGGCGGCCGC CCTGGGCCTC 60 

AAGCTGGCCA CGCTCAGCCT GCTGCTGTGC GTGAGCCTAG CGGGCAACGT GCTGTTCGCG 120 

CTGCTGATCG TGCGGGAGCG CAGCCTGCAC CGCGCCCCGT ACTACCTGCT GCTCGACCTG 180 

TGCCTGGCCG ACGGGCTGCG CGCGCTCGCC TGCCTCCOGG CCGTCATGCT GGCGGCGCGG 240 

CGTGCGGCGG CCGOGGCGGG GGOGCCGCCG GGCGCGCTGG GCTGCAAGCT GCTCGCCTTC 300 

CTGGCCGOGC TCTTCTGCTT CCACGCCGCC TTCCTGCTGC TGGGCGTGGG CGTCACCCGC 360 

TACCTGGCCA TCGCGCACCA CCGCTTCTAT GCAGAGCGCC TGGCCGGCTG GCCGTGCGCC 420 

GCCATGCTGG TGTGCGCCGC CTGGGCGCTG GCGCTGGCCG CGGCCTTCCC GCCAGTGCTG 480 

GACGGCGGTG GCGACGACGA GGAOGCGCCG TGCGCCCTGG AGCAGCGGCC CGACGGCGCC 540 

CCCGGCGCGC TGGGCTTCCT GCTGCTGCTG GCCGTGGTGG TGGGCGCCAC GCACCTCGTC 600 

TACCTCCGCC TGCTCTTCTT CATCCACGAC CGCCGCAAGA TGCGGCCOGC GCGCCTGGTG 660 

CCCGCCGTCA GCCACGACTG GACCTTCCAC GGCCCGGGCG CCACCGGCCA GGCGGCCGCC 720 

AACTGGAOGG CGGGCTTCGG CCGCGGGCCC ACGCCGCCCG CGCTTGTGGG CATCCGGCCC 780 

GCAGGGCCGG GCCGCGGCGC GCGCCGCCTC CTCGTGCTGG AAGAATTCAA GACGGAGAAG 840 

AGGCTGTGCA AGATGTTCTA CGCCGTCACG CTGCTCTTCC TGCTCCTCTG GGGGCCCTAC 900 

GTGGTGGCCA GCTACCTGCG GGTCCTGGTG CGGCCCGGCG CCGTCCCCCA GGCCTACCTG 960 

ACGGCCTCCG TGTGGCTGAC CTTCGOGCAG GCCGGCATCA ACCCCGTCGT GTGCTTCCTC 1020 

TTCAACAGGG AGCTGAGGGA CTGCTTCAGG GCCCAGTTCC CCTGCTGCCA GAGCCCCCGG 1080 

ACCACCCAGG CGACCCATCC CTGCGACCTG AAAGGCATTG GTTTATGA 1128 



„ Seq ID NO: C176 DNA Sequence 

/U Nucleic Acid Accession §: NM 005631 



Coding sequence: 290.. 2653 

1 11 21 31 41 SI 

I I I I I I 

GGCACGAGGG GGCTGAAGAC AACTTGGATT GCGAGGCTAG GGCTTGGGGA GTCGTGCATC 60 

CCGTTCCGGG CCTCCGCAGC CCAACATGGG CCCCGGGTTC CAAAGTTTGC GAAGTTGGGC 120 

GCCGAGGGGC OGGGGCGCGC GGAGCGTCCG GGGGGGCCCG GGCCCGGATT CTCTGGGCGC 180 

ACAGGTCGCC TGAGCCGCCT CCGGGGCCGC CGAGGTCGTG CGTGTGGCCG GGGGGCTCCG 240 

AGGAGCAGGC GGGGGCGCCG GGGCTTTTGC TGAGTTGGCG GGGTTGGCCA TGGCCGCTGC 300 

CCGCCCAGCG CGGGGGCCGG AGCTCCCGCT CCTGGGGCTG CTGCTGCTGC TGCTGCTGGG 360 

GGACCCGGGC OSGGGGGCGG CCTCGAGCGG GAACGCGACC GGGCCTGGGC CTCGGAGCGC 420 

GGGCGGGAGC GOGAGGAGGA GCGCGGCGGT GACTGGCCCT CCGCCGCCGC TGAGCCACTG 480 

CGGCCGGGCT GCCCCCTGCG AGCCGCTGCG CTACAACGTG TGCCTGGGCT CGGTGCTGCC 540 

CTACGGGGCC ACCTCCACAC TGCTGGCCGG AGACTCGGAC TCCCAGGAGG AAGCGCACGG 600 
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CAAGCTCGTG CTCTGGTOGG GCCTCCGGAA TGCCCCCCGC TOCTQGGCAO TGATCCAGCC 660 

CCTGCTGTGT GCCGTATACA TGCCCAAGTG TGAGAATGAC CGGGTGGAGC TGCCCAGCCG 720 

TACCCTCTGC CAGGCCACCC GAGGCCCCTG TGCCATOGTG GAGAGGGAGC GGGGCTGGCC 780 

TGACTTCCTG CGCTGCACTC CTGACCGCTT CCCTGAAGGC TGCACGAATG AGGTGCAGAA 840 

CATCAAGTTC AACAGTTCAG GCCAGTGCGA AGTGCCCTTG GTTOGGACAG ACAACCCCAA 900 

GAGCTGGTAC GAGGACGTGG AGGGCTGCGG CATCCAGTGC CAGAACCCGC TCTTCACAGA 960 

GGCTGAGCAC CAGGACATGC ACAGCTACAT OGCGGCCTTC GGGGCCGTCA CGGGCCTCTG 1020 

CACGCTCTTC ACCCTGGCCA CATTOGTGGC 7GACTGGCGG AACTCGAATC GCTACCCTGC 1080 

TGTTATTCTC TTCTAOGTCA ATGOGTGCTT CTTTGTGGGC AGCATTGGCT GGCTGGCCCA 1140 

GTTCATGGAT GGTGCCOGCC GAGAGATOGT CTGCCGTGCA GATGGCACCA TGAGGCTTGG 1200 

GGAGCCCACC TCCAATGAGA CTCTGTCCTG CGTCATCATC TTTGTCATCG TGTACTACGC 1260 

CCTGATGGCT GGTGTGGTTT GGTTTGTGGT CCTCACCTAT GCCTGGGACA CTTCCTTCAA 1320 

AGCCCTGGGC ACCACCTACC AGCCTCTCTC GGGCAAGACC TCCTACTTCC ACCTGCTCAC 1380 

CTGGTCACTC CCCTTTGTCC TCACTGTGGC AATCCTTGCT GTGGCGCAGG TGGATGGGGA 1440 

CTCTGTGAGT GGCATTTGTT TTGTGGGCTA CAAGAACTAC CGATACGGTG OGGGCTTCGT 1500 

GCTGGCCCCA ATCGGCCTGG TGCTCATCGT GGGAGGCTAC TTCCTCATCC GAGGAGTCAT IS 60 

GACTCTGTTC TCCATCAAGA GCAACCACCC CGGGCTGCTG AGTGAGAAGG CTGCCAGCAA 1620 

GATCAACGAG ACCATGCTGC GCCTGGGCAT TTTTGGCTTC CTGGCCTTTG GCTTTGTGCT 1680 

CATTACCTTC AGCTGCCACT TCTACGACTT CTTCAACCAG GCTGAGTGGG AGCGCAGCTT 1740 

CCGGGACTAT GTGCTATGTC AGGCCAATGT GACCATCGGG CTGCCCACCA AGCAGCCCAT 1800 

CCCTGACTGT GAGATCAAGA ATCGCCCGAG CCTTCTGGTG GAGAAGATCA ACCTGTTTGC IB 60 

CATGTTTGGA ACTGGCATCG CCATGAGCAC CTGGGTCTGG ACCAAGGCCA CGCTGCTCAT 1920 

CTGGAGGCGT ACCTGGTGCA GGTTGACTGG GCAGAGTGAC GATGAGCCAA AGCGGATCAA I960 

GAAGAGCAAG ATGATTGCCA AGGCCTTCTC TAAGOGGCAC GAGCTCCTGC AGAACCCAGG 2040 

CCAGGAGCTG TCCTTCAGCA TGCACACTGT GTCCCACGAC GGGCCGGTGG CGGGCTTGGC 2100 

CTTTGACCTC AATGAGCCCT CAGCTGATGT CTCCTCTGCC TGGGCOCAGC ATGTCACCAA 2160* 

GATGGTGGCT CGGAGAGGAG CCATACTGCC CCAGGATATT TCTGTCACOC CTGTGGCAAC 2220 

TCCAGTGCCC CCAGAGGAAC AAGCCAACCT GTGGCTGGTT GAGGCAGAGA TCTCCCCAGA 2280 

GCTGCAGAAG CGCCTGGGCC GGAAGAAGAA GAGGAGGAAG AGGAAGAAGG AGGTGTGCCC 2340 

GCTGGCGCCG CCCCCTGAGC TTCACCCCCC TGCCCCTGCC CCCAGTACCA TTCCTCGACT 2400 

GCCTCAGCTG CCCCGGCAGA AATGCCTGGT GGCTGCAGGT GCCTGGGGAG CTGGGGACTC 2460 

TTGCCGACAG GGAGCGTGGA CCCTGGTCTC CAACCCATTC TGCCCAGAGC CCAGTCCCCC 2520 

TCAGGATCCA TTTCTGCCCA GTGCACCGGC CCCCGTGGCA TGGGCTCATG GCCGCCGACA 2580 

GGGCCTGGGG CCTATTCACT CCCGCACCAA CCTGATGGAC ACAGAACTCA TGGATGCAGA 2640 

CTCGGACTTC TGAGCCTGCA GAGCAGGACC TGGGACAGGA AAGAGAGGAA CCAATACCTT 2700 

CAAGGCTCTT CTTCCTCACC GAGCATGCTT CCCTAGGATC CCGTCTTCCA GAGAACCTGT 2760 

GGGCTGACTG CCCTCCGAAG AGAGTTCTGG ATGTCTGGCT CAAAGCAGCA GGACTGTGGG 2820 

AAAGAGCCTA ACATCTCCAT GGGGAGGCCT CACCCCAGGG ACAGGGCCCT GGAGCTCAGG 2880 

GTCCTTGTTT CTGCCCTGCC AGCTGCAGCC TGGTTGGCAG CATCTGCTCC ATCGGGGCAG 2940 

GGGGTATGCA GAGCTTGTGG TGGGGCAGGA ACGGTGGAGG CAGAGGTGAC AGTTCCCAGA 3000 

GTGGGCTTTG GTGGCCAGGG AGGCAGCCTA GCCTATGTCT GGCAGATGAG GGCTGGCTGC 3060 

CGTTTTCTGG GCTGATGGGT GCCCTTTCCT GGCAGTCTCA GTCCAAAAGT GTTGACTGTG 3120 

TCATTAGTCC TTTGTCTAAG TAGGGCCAGG GCACCGTATT CCTCTCCCAG GTGTTTGTGG 3180 

GGCTGGAAGG ACCTGCTCCC ACAGGGGCCA TGTCCTCTCT TAATAGGTGG CACTACCCCA 3240 

AACCCATCTT TTGTTCTCCT ATATCCTCCT TCTCCTGTTC CATTTCAGTT CAGTTTCAGC 3300 

GGTGCCAACC TCTTTGCGTT TCCTTTTTGT TGATGAGGAC CCAGAGCTGC TGCACACACT 3360 

CACCTCTAAC CCCCTCCCCT CGCTGCTGGG CCCCATCTCC ACAGGAGAGA CTGGTTCGGC 3420 

TCTAGGGCCT CAGTCTGGAG TGGGATAGGA GCAGTGAGTG ACAAAGCCTC TGAAAGATGC 3480 

ATCATCTCTT CCTCACACCC ATTTAGTGGG GGATGGGTCC TCTAGACTTG AGGGGCTACC 3540 

CTGGGAAGCT GCOGTAGCTT CAGCCAGGCA AGAAAGCTTC CTTCAACCTG CATAGCCGGT 3600 

GGGTGAGGAG ATTCCCACCT TCCATAGCCT CCAAACATGT TCCCAAGGCC CCACTTTCAA 3660 

GAATCAGACA GCAGGAAGCC ATAGATGCTG GCTGGGTTCC AGGTTATGGG GAGAAGAAAT 3720 
ACAGTCAATA AAAGGTTTTT GTATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3780 
A 3781 

Seq ID NO: C177 DNA Sequence 
Nucleic Acid Accession #i AK094595 
Coding sequence: 1..2853 



1 11 21 31 41 SI 

I I I I I I 

ATGGGGGCCC GGAGCGGAGC TOGGGGCGCG CTGCTGCTGG CACTGCTGCT CTGCTGGGAC 60 

CCGAGGCTGA GCCAAGCAGG TAGGAAGCGA TCGGGTGAGG TGCTCCCTGA CTCCTTCCCG 120 

TCAGCGCCAG CAGAGCCGCT GCCCTACTTC CTGCAGGAGC CACAGGACGC CTACATTGTG 180 

AAGAACAAGC CTGTGGAGCT CCGCTGCCGC GCCTTCCCCG CCACACAGAT CTACTTCAAG 240 

TGCAACGGCG AGTGGGTCAG CCAGAAOGAC CACGTCACAC AGGAAGGCCT GGATGAGGCC 300 

ACCCTGGGGG CGCGGGGCGG CCTGCGGGTG CGCGAGGTGC AGATCGAGGT GTCGCGGCAG 360 

CAGGTGGAGG AGCTCTTTGG GCTGGAGGAT TACTGGTGCC AGTGCGTGGC CTGGAGCTCC 420 

GCGGGCACCA CCAAGAGTOG CCGAGCCTAC GTCOGCATCG CCTACCTGOG CAAGAACTTC 480 

/U GATCAGGAGC CTCTGGGCAA GGAGGTGCCC CTGGACCATG AGGTTCTCCT GCAGTGCCGC 540 

CCGCCGGAGG GGGTGCCTGT GGCCGAGGTG GAATGGCTCA AGAATQAGGA TGTCATOGAC 600 

CCCACCCAGG ACACCAACTT CCTGCTCACC ATCGACCACA ACCTCATCAT CCGCCAGGCC 660 

OGCCTGTCGG ACACTGCCAA CTATACCTGC GTGGCCAAGA ACATCGTGGC CAAACGCOGG 720 

AGCACCACTG CCACCGTCAT CGTCTACGTG AATGGCGGCT GGTCCAGCTG GGCAGAGTGG 780 

/D TCACCCTGCT CCAACGGCTG TGGCOGAGGC TGGCAGAAGC GCACCOGGAC CTGCACCAAC 840 

CCCGCTCCAC TCAACGGAGG GGCCTTCTGC GAGGGCCAGG CATTCCAGAA GACCGCCTGC 900 

ACCACCATCT GCCCAGTGGA TGGGGCGTGG AOGGAGTGGA GCAAGTGGTC AGCCTGCAGC 960 

ACTGAGTGTG CCCACTGGCG TAGCCGCGAG TGCATGGCGC CCCCACCCCA GAACGGAGGC 1020 

OA CGTGACTGCA GCGGGACGCT GCTCGACTCT AAGAACTGCA CAGATGGGCT GTGCATGCAA 1080 

OU AATAAGAAAA CTCTAAGCGA CCCCAACAGC CACCTGCTGG AGGCCTCAGG GGATGCGGCG 1140 

CTGTATGOGG GGCTCGTGGT GGCCATCTTC GTGGTOGTGG CAATCCTCAT GGCGGTGGGG 1200 

GTGGTGGTGT ACCGCCGCAA CTGCCQTGAC TTCGACACAG ACATCACTGA CTCATCTGCT 1260 

GCCCTGACTG GTGGTTTCCA CCCCGTCAAC TTTAAGACGG CAAGGCCCAG CAACCOGCAG 1320 

CTCCTACACC CCTCTGTGCC TCCTGACCTG ACAGCCAGCG COGGCATCTA CCGCGGACCC 1380 
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GTGTATGCCC TGCAGGACTC CACCGACAAA ATCCCCATGA CCAACTCTCC TCTGCTGGAC 1440 

CCCTTACCCA GCCTTAAGGT CAAGGTCTAC AGCTCCAGCA CCACGGGCTC TGGGCCAGGC IS 00 

CTGGCAGATG GGGCTGACCT GCTGGGGOTC TTGCCGCCTG GCACATACCC TAGCGATTTC 1560 

GCCCGGGACA CCCACTTCCT GCACCTGCGC AGCGCCAGCC TCGGTTCCCA GCAGCTCTTG 1620 

J GGCCTGCCCC GAGACCCAGG GAGCAGCGTC AGCGGCACCT TTGGCTGCCT GGGTGGGAGG 1680 

CTCAGCATCC CCGGCACAGG GGTCAGCTTG CTGGTGQCCA ATGGAGCCAT TCCCCAGGGC 1740 * 

AAGTTCTACG AGATGTATCT ACTCATCAAC AAGGCAGAAA GTACCCTCCC GCTTTGAGAA 1800 

GGGACCCAGA CAGTATTGAG CCCCTCGGTG ACCTGTGGAC CCACAGGCCT CCTGCTGTGC 1860 

1 A CGCCCCGTCA TCCTCACCAT GCCCCACTGT GCCGAAGTCA GTGCCCGTGA CTGGATCTTT 1920 

1U CAGCTCAAGA OCCAGGCCCA CCAGGGCCAC TGGGAGGAGG TGGTGACCCT GGATGAGGAG 1980 

ACCCTGAACA CACCCTGCTA CTGCCAGCTG GAGCCCAGGG CCTGTCACAT CCTGCTGGAC 2040 

CAGCTGGGCA CCTAOGTGTT CACGGGOGAG TCCTATTCCC GCTCAGCAGT CAAGOGGCTC 2100 

CAGCTGGCCG TCTTCGCCCC CGCCCTCTGC ACCTCOCTGG AGTACAGCCT CCGGGTCTAC 2160 

TGCCTGGAGG ACACGCCTGT AGCACTGAAG GAGGTGCTGG AGCTGGAGCG GACTCTGGGC 2220 

GGATACTTGG TGGAGGAGCC GAAACGGCTA ATGTTCAAGG ACAGTTACCA CAACCTGCGC 2280 

CTCTCCCTCC ATGACCTCCC CCATGCCCAT TGGAGGAGCA AGCTGCTGGC CAAATACCAG 2340 

GAGATCCCCT TCTATCACAT TTGGAGTGGC AGCCAGAAGG CCCTCCACTG CACTTTCACC 2400 

CTGGAGAGGC ACAGCTTGGC CTCCACAGAG CTCACCTGCA AGATCTGOGT GOGGCAAGTG 2460 

GAAGGGGAGG GCCAGATATT CCAGCTGCAT ACCACTCTGG CAGAGACACC TGCTGGCTCC 2520 

CTGGACACTC TCTGCTCTGC CCCTGGCAGC ACTGTCACCA CCCAGCTGGG ACCTTATGCC 2580 

TTCAAGATCC CACTGTCCAT CCGCCAGAAG ATATGCAACA GCCTAGATGC CCCCAACTCA 2640 

CGGGGCAATG ACTGGCGGAT GTTAGCACAG AAGCTCTCTA TGGACCGGTA CCTGAATTAC 2700 

TTTGCCACCA AAGCGAGCCC CACGGGTGTG ATCCTGGACC TCTGGGAAGC TCTGCAGCAG 2760 

GAOGATGGGG ACCTCAACAG CCTGGCGAGT GCCTTGGAGG AGATGGGCAA GAGTGAGATG 2820 

CTGGTGGCTG TGGCCACOGA CGGGGACTGC TGAGCCTCCT GGGACAGCGG GCTGGCAGGG 2880 

ACTGGCAGGA GGCAGGTGCA GGGAGGCCTG GGGCAGCCTC CTGATGGGGA TGTTTGGCCT 2940 

CTGCTTCCTC CCAGTTCACA GCCAGAGTTG CCTCTCCTCC TCCTCTTCCC CAACCCCCAG 3000 

ACCATGACCA GCCTTAGAAA ATCCATGTAC TCTGTTGTTA GAGGGCCCAG AGTTCCTTCT 3060 

CCACCCCCGC TCTCTCTCTC TTGGCCTGAG ATCTCTGTGC AGGAACCAAG ATGGGGCTGA 3120 

AGCCTCTGGA GGCAGTTGGT TGGGGGGGGG CAGGCAGGAG GCCCTCCCTC CACCCCCCCA 3180 

CCCTCAGCCC GGCAACTTCT GGGTTCCATG GGTTTTAGTT CCGTTCTOGT TTTCTTCCTC 3240 

CGTTATTGAT TTCTCCTTTC TCCCTAAGCC CCCTTCTGCT TCCACGCOCT TTTCCTCTTT 3300 

GAAGAGTCAA GTACAATTCA GACAAACTGC TTTCTCCTGT CCAAAAGCAA AAAGGCAAAG 3360 

GAAAGAAAGA AAGCTTCAGA CCGCTAGTAA GGCTCAAAGA AGAAGAAAAA CACCAAAACC 3420 

ACAAGGGAAA A GAAAAA CCC AGTTTCTTAG GAAACGCAAA CGATTTATTA TCCAGATTAT 3480 

TTGGATAAGT CCTTTTTAAA A 3501 
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Seq ID NO: CI 7 8 OKA Sequence 
Nucleic Acid Accession #: KM_004625 
*H) Coding sequence: 310.. 1359 
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1 11 21 31 41 51 

I I 1 I I I 

GAGGGAGGGG CGGGGGCTGG AGGCAGCAGC GCCCCCGCAC TCCCCGCGTC TCGCACACTT 60 

GCACCGGTCG CTCGCGCGCA GCCCGGCGTC GCCCCACGCC GCGCTCGCTC CTCCCTCCCT 120 

CCTCCCGCTC CGTGGCTCCC GTGCTCCTGG CGAGGCTCAG GCGCGGAGCG CGCGGACGGG 180 

CGCACCGACA GACGGCCCCG GGGACGCCTC GGCTCGCGCC TCCCGGGCGG GCTATGTTGA 240 

TTGCCCOGCC GGGGCCGGCC CGCGGGATCA GCACAGCCCG GCCCGCGGCC CCGGCGGCCA 300 

ATCGGGACTA TGAACCGGAA AGCGCGGCGC TGCCTGGGCC ACCTCTTTCT CAGCCTGGGC 360 

ATGGTCTACC TCCGGATCGG TGGCTTCTCC TCAGTGGTAG CTCTGGGCGC AAGCATCATC 420 

TGTAACAAGA TCCCAGGCCT GGCTCOCAGA CAGCGGGCGA TCTGCCAGAG CCGGCCCGAC 480 

GCCATCATCG TCATAGGAGA AGGCTCACAA ATGGGCCTGG ACGAGTGTCA GTTTCAGTTC 540 

CGCAATGGCC GCTGGAACTG CTCTGCACTG GGAGAGCGCA CCGTCTTCGG GAAGGAGCTC 600 

AAAGTGGGGA GCCGGGAGGC TGGGTTCACC TACGCCATCA TTGCCGCCGG CGTGGCCCAC 660 

GCCATCACAG CTGCCTGTAC CCAGGGCAAC CTGAGCGACT GTGGCTGCGA CAAAGAGAAG 720 

CAAGGCCAGT ACCACCGGGA OGAGGGCTGG AAGTGGGGTG GCTGCTCTGC CGACATCCGC 780 

TACGGCATCG GCTTCGCCAA GGTCTTTGTG GATGCCCGGG AGATCAAGCA GAATGCCCGG 840 

ACTCTCATGA ACTTGCACAA CAACGAGGCA GGCCGAAAGA TCCTGGAGGA GAACATGAAG 900 

CTGGAATGTA AGTGCCACGG CGTGTCAGGC TCGTGCACCA CCAAGACGTG CTGGACCACA 960 

CTGCCACAGT TTCGGGAGCT GGGCTAOGTG CTCAAGGACA AGTACAACGA GGCCGTTCAC 1020 

GTGGAGCCTG TGCGTGCCAG CCGCAACAAG CGGCCCACCT TCCTGAAGAT CAAGAAGCCA 1080 

CTGTCGTACC GCAAGCCCAT GGACACGGAC CTGGTGTACA TCGAGAAGTC GCCCAACTAC 1140 

TGGGAGGAGG ACCCGGTGAC CGGCAGTGTG GGCACCCAGG GCCGCGCCTG CAACAAGACG 1200 

GCTCCCCAGG CCAGCGGCTG TGACCTCATG TGCTGTGGGC GTGGCTACAA CACCCACCAG 1260 

TACGCCCGCG TGTGGCAGTG CAACTGTAAG TTCCACTGGT GCTGCTATGT CAAGTGCAAC 1320 

ACGTGCAGCG AGCGCACGGA GATGTACACG TGCAAGTGAG CCCCGTGTGC ACACCACCCT 1380 

CCCGCTGCAA GTCAGATTGC TGGGAGGACT GGACOGTTTC CAAGCTGCGG GCTCCCTGGC 1440 

AGGATGCTGA GCTTGTCTTT TCTGCTGAGG AGGGTACTTT TCCTGGGTTT CCTGCAGGCA 1500 

TCCGTGGGGG AAAAAAAATC TCTCAGAGCC CTCAACTATT CTGTTCCACA CCCAATGCTG 1560 

CTCCACCCTC CCCCAGACAC AGCCCAGGTC CCTCCGCGGC TGGAGCGAAG OCTTCTGCAG 1620 

CAGGAACTCT GGACCCCTGG GCCTCATCAC AGCAATATTT AACAATTTAT TCTGATAAAA 1680 

ATAATATTAA TTTATTTAAT TAAAAAGAAT TCTTCCACAA AAAAAAAAAA AAAAAA 1736 



Seq ID NO: C179 DNA Sequence OBR3 
/J Nucleic Acid Accession ft: NM 003786 



Coding sequence: 71.. 4654 

1 11 21 31 41 51 

I I 1 I I I 

CTCOGGCGCC CGCTCTGCCC GCCGCTGGGT CCGACCGCGC TCGCCTTCCT TGCAGCCGCG 60 

CCTCGGCCCC ATGGACGCCC TGTGCGGTTC CGGGGAGCTC GGCTCCAAGT TCTGGGACTC 120 

CAACCTGTCT GTGCACACAG AAAACCCGGA CCTCACTCCC TGCTTCCAGA ACTCCCTGCT 180 

GGCCTGGGTG CCCTGCATCT ACCTGTGGGT CGCCCTGCCC TGCTACTTGC TCTACCTGCG 240 

GCACCATTGT CGTGGCTACA TCATCCTCTC CCACCTGTCC AAGCTCAAGA TGGTCCTGGG 300 
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TGTCCTGCTG TGGTGCGTCT CC7GGG0GGA CCTTTTTTAC TCCTTCCATG GCCTGGTCCA 360 

TGGCCGGGCC CCTGCCCCTG TTTTCTTTGT CACCCCCTTG GTGGTGGGGG TCACCATGCT 420 

GCTGOCCACC CTGCTGATAC AGTATGAGCG GCTGCAGGGC GTACAGTCTT CGGGGGTCCT 480 

CATTATCTTC TGGTTCCTGT GTGTGGTCTG CGCCATCGTC CCATTCCGCT CCAAGATCCT 540 

TTTAGCCAAG GCAGAGGGTG AGATCTCAGA CCCCTTCCGC TTCACCACCT TCTACATOCA 600 

CTTTGOCCTG GTACTCTCTG CCCTCATCTT GGCCTGCTTC AGGGAGAAAC CTCCATTTTT 660 

CTCCGCAAAG AATGTOGACC CTAACCCCTA CCCTGAGACC AGCGCTGGCT TTCTCTCCCG 720 

CCTGTTTTTC TGGTGGTTCA CAAAGATGGC CATCTATGGC TACCGGCATC CCCTGGAGGA 780 

GAAGGACCTC TGGTCCCTAA AGGAAGAGGA CAGATCCCAG ATGGTGGTGC AGCAGCTGCT 840 

GGAGGCATGG AGGAAGCAGG AAAAGCAGAC GGCACGACAC AAGGCTTCAG CAGCACCTGG 900 

GAAAAATGCC TCCGGOGAGG ACGAGGTGCT GCTGGGTGCC CGGCCCAGGC CCOQGAAGCC 960 

CTCCTTCCTG AAGGCCCT6C TGGCCACCTT CGGCTCCAGC TTCCTCATCA GTGCCTGCTT 1020 

CAAGCTTATC CAGGACCTGC TCTCCTTCAT CAATCCACAG CTGCTCAGCA TCCTGATCAG 1080 

GTTTATCTCC AACCCCATGG (XCCCTCCTG GTGGGGCTTC CTGGTGGCTG GGCTGATGTT 1140 

CCTGTGCTCC ATGATGCAGT CGCTGATCTT ACAACACTAT TACCACTACA TCTTTGTGAC 1200 

TGGGGTGAAG TTTOGTACTG GGATCATGGG TGTCATCTAC AGGAAGGCTC TGGTTATCAC 1260 

CAACTCAGTC AAAOGTGCGT CCACTGTGGG GGAAATTGTC AACCTCATGT CAGTGGATGC 1320 

CCAGCGCTTC ATGGACCTTG CCCCCTTCCT CAATCTGCTG TGGTCAGCAC CCCTGCAGAT 1380 

CATCCTGGCG ATCTACTTOC TCTGGCAGAA CCTAGGTCCC TCTGTCCTGG CTGGAGTOGC 1440 

TTTCATGGTC TTGCTGATTC CACTCAACGG AGCTGTGGCC GTGAAGATGC GCGCCTTCCA 1500 

GGTAAAGCAA ATGAAATTGA AGGACTCGCG CATCAAGCTG ATGAGTGAGA TCCTGAACGG 1560 

CATCAAGGTG CTGAAGCTGT ACGCCTGGGA GCCCAGCTTC CTGAAGCAGG TGGAGGGCAT 1620 

CAGGCAGGGT GAGCTCCAGC TGCTGCGCAC GGOGGCCTAC CTCCACACCA CAACCACCTT 1680 

CACCTGGATG TGCAGCCCCT TCCTGGTGAC CCTGATCACC CTCTGGGTGT ACGTGTACGT 1740 

GGACCCAAAC AATGTGCTGG ACGCCGAGAA GGCCTTTGTG TCTGTGTCCT TGTTTAATAT 1800 

CTTAAGACTT CCCCTCAACA TGCTGCCCCA GTTAATCAGC AACCTGACTC AGGCCAGTGT 1860 

GTCTCTGAAA CGGATCCAGC AATTCCTGAG CCAAGAGGAA CTTGACCCCC AGAGTGTGGA 1920 

AAGAAAGACC ATCTCCCCAG GCTATGCCAT CACCATACAC AGTGGGACCT TCACCTGGGC 1980 

CCAGGACCTG CCCCCCACTC TGCACAGCCT AGACATCCAG GTCCCGAAAG GGGCACTGGT 2040 

GGCCGTGGTG GGGCCTGTGG GCTGTGGGAA GTCCTCCCTG GTGTCTGCCC TGCTGGGAGA 2100 

GATGGAGAAG CTAGAAGGCA AAGTGCACAT GAAGGGCTCC GTGGCCTATG TGCCCCAGCA 2160 

GGCATGGATC CAGAACTGCA CTCTTCAGGA AAACGTGCTT TTCGGCAAAG CCCTGAACCC 2220 

CAAGCGCTAC CAGCAGACTC TGGAGGCCTG TGCCTTGCTA GCTGACCTGG AGATGCTGCC 2280 

TGGTGGGGAT CAGACAGAGA TTGGAGAGAA GGGCATTAAC CTGTCTGGGG GCCAGCGGCA 2340 

GCGGGTCAGT CTGGCTCGAG CTGTTTACAG TGATGCCGAT A TTTTCTT GC TGGATGACCC 2400 

ACTGTCOGCG GTGGACTCTC ATGTGGCCAA GCACATCTTT GACCAOGTCA TCGGGCCAGA 2460 

AGGOGTGCTG GCAGGCAAGA CGGGAGTGCT GGTGACGCAC GGCATTAGCT TCCTGCCCCA 2520 

GACAGACTTC ATCATTGTGC TAGCTGATGG ACAGGTGTCT GAGATGGGCC CGTACCCAGC 2580 

CCTGCTGCAG CGCAACGGCT CCTTTGCCAA CTTTCTCTGC AACTATGCCC CCGATGAGGA 2640 

CCAAGGGCAC CTGGAGGACA GCTGGACCGC GTTGGAAGGT GCAGAGGATA AGGAGGCACT 2700 

GCTGATTGAA GACACACTCA GCAACCACAC GGATCTGACA GACAATGATC CAGTCACCTA 2760 

TGTGGTCCAG AAGCAGTTTA TGAGACAGCT GAGTGCCCTG TCCTCAGATG GGGAGGGACA 2B20 

GGGTCGGCCT GTACCCCGGA GGCACCTGGG TCCATCAGAG AAGGTGCAGG TGACAGAGGC 2880 

GAAGGCAGAT GGGGCACTGA CCCAGGAGGA GAAAGCAGCC ATTGGCACTG TGGAGCTCAG 2940 

TGTGTTCTGG GATTATGCCA AGGCCGTGGG GCTCTGTACC ACGCTGGCCA TCTGTCTCCT 3000 

GTATGTGGGT CAAAGTGCGG CTGCCATTGG AGCCAATGTG TGGCTCAGTG CCTGGACAAA 3060 

TGATGCCATG GCAGACAGTA GACAGAACAA CACTTOCCTG AGGCTGGGCG TCTATGCTGC 3120 

TTTAGGAATT CTGCAAGGGT TCTTGGTGAT GCTGGCAGCC ATGGCCATGG CAGCGGGTGG 3180 

CATCCAGGCT GCCCGTGTGT TGCACCAGGC ACTGCTGCAC AACAAGATAC GCTCGCCACA 3240 

GTCCTTCTTT GACACCACAC CATCAGGCCG CATCCTGAAC TGCTTCTCCA AGGACATCTA 3300 

TGTCGTTGAT GAGGTTCTGG CCCCTGTCAT CCTCATGCTG CTCAATTCCT TCTTCAACGC 3360 

CATCTCCACT CTTGTGGTCA TCATGGCCAG CACGCOGCTC TTCACTGTGG TCATCCTGCC 3420 

CCTGGCTGTG CTCTACACCT TAGTGCAGCG CTTCTATGCA GCCACATCAC GGGAACTGAA 3480 

GCGGCTGGAA TCAGTCAGCC GCTCACCTAT CTACTCCCAC TTTTCGGAGA CAGTGACTGG 3540 

TGCCAGTGTC ATCCGGGCCT ACAACOGCAG CCX3GGATTTT GAGATCATCA GTGATACTAA 3600 

GGTGGATGCC AACCAGAGAA GCTGCTACCC CTACATCATC TCCAACCGGT GGCTGAGCAT 3660 

CGGAGTGGAG TTCGTGGGGA ACTGCGTGGT GCTCTTTGCT GCACTATTTG CCGTCATCGG 3720 

GAGGAGCAGC CTGAACCOGG GGCTGGTGGG CCTTTCTGTG TCCTACTCCT TGCAGGTGAC 3780 

ATTTGCTCTG AACTGGATGA TACGAATGAT GTCAGATTTG GAATCTAACA T0GTGGCTGT 3840 

GGAGAGGGTC AAGGAGTACT CCAAGACAGA GACAGAGGOG CCCTGGGTGG TGGAAGGCAG 3900 

CCGCOCTCCC GAAGGTTGGC OCCCAOGTGG GGAGGTGGAG TTCCGGAATT ATTCTGTGCG 3960 

CTACGGGCGG GGCCTAGACC TGGTGCTGAG AGACCTGAGT CTGCATGTGC ACGGTGGCGA 4020 

GAAGGTGGGG ATCGTGGGCC GCACTGGGGC TGGCAAGTCT TCCATGACCC TTTGCCTGTT 4080 

CCGCATCCTG GAGGCGGCAA AGGGTGAAAT CCGCATTGAT GGCCTCAATG TGGCAGACAT 4140 

CGGCCTCCAT GACCTGCGCT CTCAGCTGAC CATCATCCCG CAGGACCCCA TCCTGTTCTC 4200 

GGGGACCCTG CGCATGAACC TGGACCCCTT OGGCAGCTAC TCAGAGGAGG ACATTTGGTG 4260 

GGCTTTGGAG CTGTCCCACC TGCACACGTT TGTGAGCTCC CAGCCGGCAG GCCTGGACTT 4320 
CCAGTGCTCA GAGGGCGGGG AGAATCTCAG OGTGGGCCAG AGGCAGCTOG TGTGCCTGGC 4380 
CCGAGCCCTG CTCCGCAAGA GCCGCATCCT GGTTTTAGAC GAGGCCACAG CTGCCATGGA 4440 
CCTGGAGACT GACAACCTCA TCCAGGCTAC CATCCGCACC CAGTTTGATA CCTGCACTGT 4500 
CCTGACCATC GCACACCGGC TTAACACTAT CATGGACTAC AOCAGGGTCC TGGTCCTGGA 4S60 
CAAAGGAGTA GTAGCTGAAT TTGATTCTCC AGCCAACCTC ATTGCAGCTA GAGGCATCTT 4620 
CTACGGGATG GCCAGAGATG CTGGACTTGC CTAAAATATA TTCCTGAGAT TTCCTCCTGG 4680 
CCTTTCCTGG TTTTCATCAG GAAGGAAATG ACACCAAATA TGTCCGCAGA ATGGACTTGA 4740 
TAGCAAACAC TGGGGGCACC TTAAGATTTT GCACCTGTAA AGTGCCTTAC AGGGTAACTG 4800 
TGCTGAATGC TTTAGATGAG GAAATGATCC CCAAGTGGTG AATGACACGC CTAAGGTCAC 4860 
AGCTAGTTTG AGCCAGTTAG ACTAGTCCCC OGGTCTCCCG ATTCCCAACT GAGTGTTATT 4920 
TGCACACTGC ACTGTTTTCA AATAACGATT TTATGAAATG ACCTCTGTCC TCCCTCTGAT 4980 
TTTTCATATT TTCCTAAAGT TTOGTTTCTG TTTTTTAATA AAAAGCTTTT TCCTCCTGGA 5040 
ACAGAAGACA GCTGCTGGGT CAGGCCACCC CTAGGAACTC AGTCCTGTAC TCTGGGGTGC 5100 
TGCCTGAATC CATTAAAAAT GGGAGTACTG ATGAAATAAA ACTACATGGT CAACAGTAAA 5160 
AAAAAAAAAA AAAAAA 5176 



Seq ID NO: C180 DNA Sequence 



1312 



WO 03/042661 
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Nucleic Acid Accession ft: nm 004626 
Coding sequence j 124.. 1188 

c 1 11 21 ' 31 41 SI 

TAACCCGCCG CCTCOGCTCT CCCCGGCTGC AGGCGGCGTG CAGGACCAGC GGOGGCOGTG 60 

CAGGCGGAGG ACTTCGGCGC GGCTCCTCCT GG6TGTGACC CCGGGCGCGC CCGCCGCGCG 120 

ACGATGAGGG CGCGGCCGCA GGTCTGCGAG GCGCTGCTCT TOGCCCTGGC GCTCCAGACC 180 

n GGCGTGTGCT ATGGCATCAA GTGGCTGGCG CTGTCCAAGA CACCATCGGC CCTGGCACTG 240 

1U AACCAGACGC AACACTGCAA GCAGCTGGAG GGTCTGGTGT CTGCACAGGT GCAGCTGTGC 300 

CGCAGCAACC TGGAGCTCAT GCACAOGGTG GTGCAOGCCG CCCGCGAGGT CATGAAGGCC 360 

TGTCGCCGGG CCTTTGCCGA CATGCGCTGG AACTGCTCCT CCATTGAGCT CGCCCCCAAC 420 

TATTTGCTTG ACCTGGAGAG AGGGACCCGG GAGTCGGCCT TCGTGTATGC GCTGTCGGCC 480 

GCCGCCATCA GCCACGCCAT CGCCCGGGCC TGCACCTCCG GCGACCTGCC CGGCTGCTCC 540 

I J TGCGGCCCCG TCCCAGGTGA GCCACOOGGG CCCGGGAACC QCTGGGGAGG ATGTGCGGAC 600 

AACCTCAGCT AOGGGCTCCT CATGGGGGCC AAGTTTTCCG ATGCTCCTAT GAAGGTGAAA 660 

AAAACAGGAT CCCAAGCCAA TAAACTGATG CGTCTACACA ACAGTGAAGT GGGGAGACAG 720 

GCTCTGCGCG CCTCTCTGGA AATGAAGTGT AAGTGOCATG GGGTGTCTGG CTCCTGCTCC 780 

ATCCGCACCT GCTGGAAGGG GCTGCAGGAG CTGCAGGATG TGGCTGCTGA CCXCAAGACC 840 

2U CGATACCTGT CGGCCACCAA GGTAGTGCAC CGACCCATGG GCACCCGCAA GCACCTGGTG 90 0 

CCCAAGGACC TGGATATCCG GCCTGTGAAG GACTCGGAAC TCGTCTATCT GCAGAGCTCA 960 

CCTGACTTCT GCATGAAGAA TGAGAAGGTG GGCTCCCACG GGACACAAGA CAGGCAGTGC 1020 

AACAAGACAT CCAACGGAAG OGACAGCTGC GACCTTATGT GCTGCGGGCG TGGCTACAAC 1080 

CCCTACACAG ACCGCGTGGT CGAGCGGTGC CACTGTAAGT ACCACTGGTG CTGCTACGTC 1140 

Z5 ACCTGCCGCA GGTGTGAGCG TACCGTGGAG CGCTATGTCT GCAAGTGAGG CCCTGCCCTC 1200 

CGCCCCACGC AGGAGCGAGG ACTCTGCTCA AGGACCCTCA GCAACTGGGG CCAGGGGCCT 1260 

GGAGACACTC CATGGAGCTC TGCTTGTGAA TTCCAGATGC CAGGCATGGG AGGCGGCTTG 1320 

TGCTTTGCCT TCACTTGGAA GCCACCAGGA ACAGAAGGTC TGGCCACCCT GGAAGGAGGG 1380 

CAGGACATCA AAGGAAACCG ACAAGATTAA AAATAACTTG GCAGCCTGAG GCTCTGGAGT 1440 

30 GCCCACAGGC TGGTGTAAGG AGCGGGGCTT GGGATCGGTG AGACTGATAC AGACTTGACC 1500 

TTTCAGGGCC ACAGAGACCA GCCTCCGGGA AGGGGTCTGC CCGCCTTCTT CAGAATGTTC 1560 

TGCGGGACCC CCTGGCCCAC CCTGGGGTCT GAGCCTGCTG GGCCCACCAC ATGGAATCAC 1620 

TAGCTTGGGT TGTAAATGTT TTCTTTTGTT TTTTGCTTTT TCTTCCTTTG GGATGTGGAA 16 B0 

GCTACAGAAA TATTTATAAA ACATAGCTTT TTC TTTGG GG TGGCACTTCT CAATTCCTCT 1740 

J 5 TTATATATTT TATATATATA AATATATATG TATATATATA ATGATCTCTA TTTTAAAACT 1800 

AGCTTTTTAA GCAGCTGTAT GAAATAAATG CTGAGTGAGC CCCAGCCOGC 1 CCCTGCAGTT 1860 

CCCGGCCTCG TCAAGTGAAC TCGGCAGACC CTGGGGCTGG CAGAGGGAGC TCTCCAGTTT 1920 

CCAGGCA 1927 

40 Seq ID NO: C181 DNA Sequence 1 
Nucleic Acid Accession #: NM_031866 
Coding sequence: 6.. 2090 

AP 1 11 21 31 41 51 

45 | ! | | | | 

ACAGCATGGA GTGGGGTTAC CTGTTGGAAG TGACCTCGCT GCTGGCCGCC TTGGCGCTGC 60 

TGCAGCGCTC TAGCGGCGCT GCGGCCGCCT CGGCCAAGGA GCTGGCATGC CAAGAGATCA 120 

CCGTGCCGCT GTGTAAGGGC ATCGGCTACA ACTACACCTA CATGCCCAAT CAGTTCAACC 180 

ACGACACGCA AGACGAGGOG GGCCTGGAGG TGCACCAGTT CTGGCCGCTG GTGGAGATCC 240 

AGTGCTCGCC CGATCTCAAG TTCTTCCTGT GCAGCATGTA CACGCCCATC TGCCTAGAGG 300 

ACTACAAGAA GCCGCTGCCG CCCTGCCGCT CGGTGTGCGA GCGCGCCAAG GCCGGCTGCG 360 

CGCCGCTCAT GCGCCAGTAC GGCTTOGCCT GGCCCGACCG CATGCGCTGC GACCGGCTGC 420 

COGAGCAAGG CAACCCTGAC ACGCTGTGCA TGGACTACAA CCGCACCGAC CTAACCACCG 480 

CCGCGCCCAG CCCGCCGCGC CGCCTGCCGC CGCCGCCGCC CGGCGAGCAG CCGCCTTOGG 540 

GCAGCGGCCA CGGCCGCCCG CCGGGGGCCA GGCCCCCGCA CCGCGGAGGC GGCAGGGGCG 600 

GTGGCGGCGG GGACGCGGCG GCGCCCCCAG CTCGCGGCGG CGGCGGTGGC GGGAAGGCGC .660 

GGCCCCCTGG GGGCGGCGCG GCTCCCTGCG AGCCCGGGTG CCAGTGCCGC GCGCCTATGG 720 

TGAGCGTGTC CAGCGAGCGC CACCCGCTCT ACAACCGCGT CAAGACAGGC CAGATCGCTA 780 

ACTGCGCGCT GCCCTGCCAC AACCCCTTTT TCAGCCAGGA CGAGCGCGCC TTCACCGTCT 840 

TCTGGATCGG CCTGTGGTCG GTGCTCTGCT TCGTGTCCAC CTTCGCCACC GTCTCCACCT 900 

TOCTTATCGA CATGGAGCGC TTCAAGTACC CGGAGCGGCC CATTATCTTC CTCTCGGCCT 960 

GCTACCTCTT CGTGTCGGTG GGCTACCTAG TGOGCCTGGT GGCGGGCCAC GAGAAGGTGG 1020 

CGTGCAGCGG TGGCGCGCCG GGCGCGGGGG GCGCTGGGGG CGCGGGCGGC GCGGCGGCGG 1080 

GCGCGGGCGC GGCGGGCGCG GGCGCGGGCG GCCCGGGCGG GCGCGGCGAG TACGAGGAGC 1140 

TGGGCGCGGT GGAGCAGCAC GTGCGCTACG AGACCACOGG CCCCGCGCIG TGCACCGTGG 1200 

TCTTCTTGCT GGTCTACTTC TTCGGCATGG CCAGCTCCAT CTGGTGGGTG ATCTTGTCGC 1260 

TCACATGGTT CCTGGCGGCC GGTATGAAGT GGGGCAACGA AGCCATCGCC GGCTACTCGC 1320 

AGTACTTCCA CCTGGCCGOG TGGCTTGTGC CCAGCGTCAA GTCCATCGCG GTGCTGGCGC 1380 

TCAGCTCGGT GGACGGCGAC CCGGTGGCGG GCATCTGCTA CGTGGGCAAC CAGAGCCTGG 1440 

ACAACCTGCG CGGCTTCGTG CTGGCGCCGC TGGTCATCTA CCTCTTCATC GGCACCATGT 1500 

TCCTGCTGGC CGGCTTCGTG TCCCTGTTCC GCATCCGCTC GGTCATCAAG CAACAGGACG 1560 
GCCCCACCAA GACGCACAAG CTGGAGAAGC TGATGATCCG CCTGGGCCTG TTCACCGTGC 1620 
TCTACACCGT GCCCGCCGCG GTGGTGGTCG CCTGCCTCTT CTACGAGCAG CACAACCGCC 1680 
CGCGCTGGGA GGCCACGCAC AACTGCCCGT GCCTGCGGGA CCTGCAGCCC GACCAGGCAC 1740 
GCAGGCCCGA CTACGCCGTC TTCATGCTCA AGTACTTCAT GTGCCTAGTG GTGGGCATCA 1800 
CCTCGGGOGT GTGGGTCTGG TCCGGCAAGA CGCTGGAGTC CTGGCGCTCC CTGTGCACCC 1860 
GCTGCTGCTG GGCCAGCAAG GGCGCCGCGG TGGGCGGGGG CGCGGGCGCC ACGGCOGCGG 1920 
GGGGTGGCGG CGGGCCGGGG GGCGGCGGCG GCGGGGGACC CGGOGGCGGC GGGGGGCCGG 1980 
GCGGCGGCGG GGGCTCCCTC TACAGCGACG TCAGCACTGG CCTGACGTGG CGGTCGGGCA 2040 
CGGCGAGCTC CGTGTCTTAT CCAAAGCAGA TGCCATTGTC CCAGGTCTGA GOGGAGGGGA 2100 
GGGGGCGCCC AGGAGGGGTG GGGAGGGGGG CGAGGAGACC CAAGTGCAGC GAAGGGACAC 2160 
TTGATGGGCT GAGGTTCCCA CCCCTTCACA GTGTTGATTG CTATTAGCAT GATAATGAAC 2220 
TCTTAATGGT ATCCATTAGC TGGGACTTAA ATGACTCACT TAGAACAAAG TACCTGGCAT 2280 
TGAAGCCTCC CAGACCCAGC CCCTTTTCCT CCATTGATGT GCGGGGAGCT CCTCCCGCCA 2340 
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CGCGTTAATT 
TGGAGOCCTC 
GAGAACCTCT 
GAGGAGGGGT 
AAATGCCTTA 
TTCTTACATT 
AAAATATGTA 
TACCCCTOTA 
GATTTAACCA 
AGGACGCATG 
TGGAGAAAAT 
TGAACTGAAA 
CTCTTTTCTG 
AGGCAGTAAT 
AACCCTGAAA 



TCTGTTGGCT 
CCTGGCTGCA 
TTTTCTCCCT 
GACCGCCACC 
AGTAAACAAA 
AGAGGATGTA 
TATATCCAAA 
AGAACAGATA 
TTGCCCTCTC 
GGGAAAATGG 
TACTTGTTAA 
TCAAAATTGA 
ATCCATGGAT 
TCCCCTTAGG 



GAGGAGGGTG 
CTTGGCTGGG 
CGACTCTTCC 
TGATGGGATT 
CAAGAAATGT 
TTTATATAAT 
GATATAGTGT 
TAAGTATTCT 
CCCOGCCTCT 
ACATTTTCTG 
ACTCTAGTTC 
GTTTTTGCAC 
AACAACTCTC 
CCTTGTTATT 



GACTCTGCGG 
TTTGCAGTCA 
TAOGTAAACT 
GCACGGTTTG 
CTTAATTATA 
TATTTGTTAA 
GTACATTTTT 
ATTTTGTCAA 
TCTGAGCTGT 
GCTTGTCATT 
TTAAGTTGTT 
CTTCCCCAAA 
ACTTTAGTGG 
TATCCTGCAT 



CGTTTCCAGA 
GATACACAGA 
CCCACCCCTG 
OGTATTCTTA 
CACCCCACGT 
ATTGTAAAAA 
TTGTAAAAAG 
TAAAATGACT 
CACCTTTAAA 
CTGTACACTG 
AGCCAAGTAA 
GACGGTGTTT 
ATGTAAATGG 
GGTATCACTA 



ACCCGAGATT 
TTTCACCTGG 
ACTTACCCTG 
ATGACCAGGC 
AAATACGGGT 
AAAAAAGTGT 
TTTAGAGGCT 
TTTGATAAAT 
GTGCTTGCTA 
ACCTTAGGCA 
ATATCATTGT 
TTCATGGGAG 
AACTTCTGCA 
AAGGTTTCAA 



Seq ID NO: CI 82 DNA Sequence 
Nucleic Acid Accession #: XM 050625 
Coding sequence: 2 2 2.. 1109 ~ 



I 

CCGGGTCGGA 
CCGGTGTCCC 
GCACCCAGCG 
CGGCTCCGCT 
CGCTGCTGCT 
TCTTTGGCCA 
TGCAGCTGTG 
AGACCATGAA 
GCCACCCGGA 
TAGACGAGAC 
CGGTCATGTC 
AGGACAACGA 
AAGCTCCAAA 
AAACGCTTTG 
ACCGAGATAC 
TGTCCGAAAG 
GTGAGGAGAT 
AGCTGGTGAT 
CCCGCAGCAT 
CTCCAGAGCA 
TCCTAGCTGC 
CCAGCATTTC 
GCCCACCCGA 
TT 



11 
I 

GCCCCCCGGA 
GCTTCTCCGC 
AAGAGAGCGG 
CCCTCTGCCC 
GCTCTTCCTC 
GCCCGACTTC 
CCACGGCATC 
GGAGGTGCTG 
CACCAAGAAG 
CATCCAGCCA 
CGCCTTCGGC 
CCTTTGCATC 
GGTATGTGAA 
TAAAAATGAT 
CAAAATCATC 
GGACCTGAAG 
GAACGACATC 
CACCTCGGTG 
CCGCAAGCTG 
CGGCTGACCA 
TCCAGTCTCA 
CTGAGTTATA 
ATCTTGTAGA 



21 
I 

GCTGCGCGCG 
GCCCCAGCCG 
GCCCGGGACA 
CCTOGGGGTC 
GCCTCGCACT 
TCCTACAAGC 
GAATACCAGA 
GAGCAGGCCG 
TTCCTGTGCT 
TGCCACTCGC 
TTCCCCTGGC 
CCCCTCGCTA 
GCCTGCAAAA 
TTTGCACTGA 
CTGGAGACCA 
AAATCGGTGC 
AACGCGCCCT 
AAGCGGTGGC 
CAGTGCTAGT 
TTTCTGCTCC 
GCCTGGGCAG 
AGGCCACAGG 
AATATTCAAA 



31 
I 

GGCTTGCAGC 
CCGGCTGCCA 
AGCTCGAACT 
GCGCGCCCAC 
GCTGCCTGGG 
GCAGCAATTG 
ACATGCGGCT 
GCGCTTGGAT 
CGCTCTTCGC 
TCTGCGTGCA 
CCGACATGCT 
GCAGCGACCA 
ATAAAAATGA 
AAATAAAAGT 
AGAGCAAGAC 
TGTGGCTCAA 
ATCTGGTCAT 
AGAAGGGGCA 
CCCGGCATCC 
GGGATCTCAG 
CTTCCCCCTG 
AGTGGATAGC 
CTAATAAAAT 



41 
I 

GCCTCGCCCG 
GCTTTTCQOQ 
CCGGCCGCCT 
GATGCTGCAG 
CTCGGCGCGC 
CAAGCCCATC 
GCCCAACCTG 
CCCGCTGGTC 
CCCCGTCTGC 
GGTGAAGGAC 
TGAGTGCGAC 
CCTCCTGCCA 
TGATGACAAC 
GAAGGAGATA 
CATTTACAAG 
AGACAGCTTG 
GGGACAGAAA 
GAGAGAGTTC 
TGATGGCTCC 
CTCCCGTTCC 
CCTTTTGCAC 
TGTTTTCACC 
CATGAATATT 



51 
I 

CGCTGTCCTC 
GCCCCGAGTC 
CGCCCTTCCC 
GGCCCTGGCT 
GGGCTCTTCC 
CCTGCCAACC 
CTGGGCCACG 
ATGAAGCAGT 
CTCGATGACC 
CGCTGCGCCC 
CGTTTCCCCC 
GCCACCGAGG 
GACATAATGG 
ACCTACATCA 
CTGAAOGGTG 
CAGTGCACCT 
CAGGGTGGGG 
AAGCGCATCT 
GACAGGCCTG 
CCAAGCACAC 
GTTTGCATCC 
TAAAGGAAAA 
TTTATGAAGT 



Seq ID NO: C183 DNA Sequence 

Nucleic Acid Accession #: NM_001306.1 

Coding sequence: 199.. 861 



i 

AATTCGGCAC 
GCGCGCGCOG 
CGCCAGGCCC 
GGCCTTGCCG 
CTGGGCTGGC 
ATCGGCAGCA 
GTGCAGAGCA 
GACCTTCAGG 
CTAGTGGCGC 
AAGATCACCA 
GTGTCCTGGT 
CAGAAGOGCS 
CTGGGGGGCG 
AAGGTCGTCT 
GACCGCAAGG 
CAACACCACC 
GCCTCGGAGG 
TCCCCAGCAG 
GCATGGACTG 
ACCACCCCGT 
GCCCTGGAAG 



11 
I 

GAGGGCAGGT 
TCGGTGAGTC 
AGCGGCCCCG 
CGGCAGCCAT 
TGGGCACCAT 
ACATCATCAC 
CCGGCCAGAT 
CGGCCCGCGC 
TGGTGGGCGC 
TCGTGGCAGG 
CGGCCAACAC 
AGATGGGCGC 
CGCTGCTCTG 
ACTCCGCGCC 
ACTACGTCTA 
ACCACCACCG 
CCAGCCCACC 
CCACGGCTTT 
TGAAACCTCA 
CGAGCCCCAT 
GGGCACTTGA 



21 
I 

GCAGGCGCAC 
AGTCOGTCCG 
GCCCCTCGTC 
GTCCATGGGC 
CGTGTGCTGC 
GTCGCAGAAC 
GCAGTGCAAG 
CCTCATCGTG 
CCAGTGCACC 
CGTGCTGTTC 
CATTATCCGG 
GGGCCTGTAC 
CTGCTCGTGT 
GCGCTCCACC 
AGGGACAGAC 
CGAGCTGGAG 
CCCAGAAGCC 
GCGGGCCGGG 
CCCTTCTGGA 
CGGGCCGCTG 
TATTTTTCAA 



31 
I 

GCGGCGAGAG 
TCCGTCCGTC 
TCCCCGCACC 
CTGGAGATCA 
GCGTTGCCCA 
ATCTGGGAGG 
GTGTACGACT 
GTGGCCATCC 
AACTGCGTGC 
CTTCTCGCCG 
GACTTCTACA 
GTGGGCTGGG 
CCCCCACGCG 
GGCCCGGGAG 
GCAGGGAGAC 
CGCGCACCAG 
AGGAAGCCCC 
CAGTCGACTT 
GCACGGGGCC 
CCCCCATGTC 
TAAAAGCCTC 



41 
I 

CGTATGQAGC 
CGTCGGGGCG 
CGGAGCCACC 
CGGGCACCGC 
TGTGGCGCGT 
GCCTGTGGAT 
CGCTGCTGGC 
TGCTGGCCGC 
AGGACGACAC 
CCCTGCTCAC 
ACCCOGTGGT 
CGGCCGCGGC 
AGAAGAAGTA 
CCAGCCTGGG 
CCCACCACCA 
GCCATCCAGC 
CGCGCTGGAC 
CGGGGCCCAG 
TGGGTGACCG 
GCGCTGGGCA 
TCGTTTTAGC 



51 
I 

CGAGCCGTTA 
CCGCAGCTCC 
CGGTGGAGOG 
GCTGGCCGTG 
GTCGGCCTTC 
GAACTGCGTG 
ACTGCCACAG 
CTTCGGGCTG 
GGCCAAGGCC 
CCTCGTGCCG 
GCCCGAGGCG 
GCTGCAGCTG 
CACGGCCACC 
CACAGGCTAC 
CCACCACCAC 
GTGCAGCCTT 
TGGGGCAGCT 
GGACCAACCT 
CCAATACTTG 
GGGACCGGCA 



Seq ID NO: C184 DNA Sequence 

Nucleic Acid Accession NM_012449.1 

Coding sequence: 66.. 1085 



ll 



21 



31 



41 



51 
I 



COGAGACTCA CGGTCAAGCT AAGGOGAAGA GTGGGTGGCT GAAGCCATAC TATTTTATAG 
AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 
GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 
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2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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3060 
3120 
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60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1060 

1140 

1200 

1260 

1320 

1360 

1382 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 » 

1080 

1140 

1200 

1250 



60 

120 

180 



WO 03/042661 
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AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT 240 

CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGCTG 300 

CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 360 

CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 420 

3 CAATGGTTTC CATCACTCTC TTGGCATTGG TTTACCTGCC AGGTGTGATA GCAGCAATTG 480 

TCCAACTTCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGGATGT 540 

TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 600 

GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC 660 

AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 720 

lU ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 780 

CATCTGTGAG TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 840 

TTGTTTCCCT TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 900 

ATATAAAACA ATTTGTATGG TATACAOCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 960 

TTGTTGTCCT GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 1020 

AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 1080 

TGTAGAATTA CTGTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 1140 

TCAAGTTTGT ATTTGTTAAT AAAATGATTA TTCAAGGAAA AAAAAAAAAA AAAAA 1195 



15 
20 



35 
40 
45 
50 



Seq ID NO: CI 85 DNA Sequence 

Nucleic Acid Accession fl: NMJJ01775.1 

Coding sequences 70.. 972 



1 11 21 31 41 51 

« I I I I I I 

CTAAAGCTCT CTTGCTGCCT AGCCTCCTGC CGGCCTCATC TTCGCCCAGC CAACCCCGCC 60 

TGGAGCCCTA TGGCCAACTG CGAGTTCAGC CCGGTGTCCG GGGACAAACC CTGCTGCCGG 120 

CTCTCTAGGA GAGCCCAACT CTGTCTTGGC GTCAGTATCC TGGTCCTGAT CCTCGTCGTG 180 

GTGCTOGCGG TGGTCGTCCC GAGGTGGCGC CAGACGTGGA GCGGTCCGGG CACCACCAAG 240 

„ CGCTTTCCCG AGACCGTCCT GGCGCGATGC GTCAAGTACA CTGAAATTCA TCCTGAGATG 300 

JU AGACATGTAG ACTGCCAAAG TGTATGGGAT GCTTTCAAGG GTGCATTTAT TTCAAAACAT 360 

CCTTGCAACA TTACTGAAGA AGACTATCAG CCACTAATGA AGTTGGGAAC TCAGACCGTA 420 

CCTTGCAACA AGATTCTTCT TTGGAGCAGA ATAAAAGATC TGGCCCATCA GTTCACACAG 480 

GTCCAGCGGG ACATGTTCAC CCTGGAGGAC ACGCTGCTAG GCTACCTTGC TGATGACCTC 540 

ACATGGTGTG GTGAATTCAA CACTTCCAAA ATAAACTATC AATCTTGCCC AGACTGGAGA 600 

AAGGACTGCA GCAACAACCC TGTTTCAGTA TTCTGGAAAA CGGTTTCCCG CAGGTTTGCA 660 

GAAGCTGCCT GTGATGTGGT CCATGTGATG CTCAATGGAT CCCGCAGTAA AATCTTTGAC 720 

AAAAACAGCA CTTTTGGGAG TGTGGAAGTC CATAATTTGC AACCAGAGAA GGTTCAGACA 780 

CTAGAGGCCT GGGTGATACA TGGTGGAAGA GAAGATTCCA GAGACTTATG CCAGGATCCC 840 

ACCATAAAAG AGCTGGAATC GATTATAAGC AAAAGGAATA TTCAATTTTC CTGCAAGAAT 900 

ATCTACAGAC CTGACAAGTT TCTTCAGTGT GTGAAAAATC CTGAGGATTC ATCTTGCACA 960 

TCTGAGATCT GAGCCAGTCG CTGTGGTTGT TTTAGCTCCT TGACTCCTTG TGGTTTATGT 1020 

CATCATACAT GACTCAGCAT ACCTGCTGGT GCAGAGCTGA AGATTTTGGA GGGTCCTCCA 1080 

CAATAAGGTC AATGCCAGAG ACGGAAGCCT TTTTCCCCAA AGTCTTAAAA TAACTTATAT 1140 

CATCAGCATA CCTTTATTGT GATCTATCAA TAGTCAAGAA AAATTATTGT ATAAGATTAG 1200 

AATGAAAATT GTATGTTAAG TTACTTCCTT TAG 1233 



Seq ID NOt C186 DNA Sequence 

Nucleic Acid Accession #: XMJL20513.2 

Coding sequence : 1 . . 2 2 08 



1 11 21 31 41 51 

I i I I i 1 

ATGGTGTCAT GCACGTTCTC GGGGCCCCTA CGGGAAACAA ATGAAAACGT GAAAAAGTTC 60 

TACGCCTTGC GAGCTTTTAT GTTCCGCATG AGCTCAGAGG CCGCGATGCT CGGGGAAAGC 120 

J 3 AGGACCCCAA AGCCCCGTAA ACACCGCGCG ACCACCCGGG CCAAGATCTT CAAGAGGTTC 180 

TTTTCAGAAG GATCGGAGAG CAATTCCCGA TTGGTAGAAG AACTTGCTGT AATACACACG 240 

TACTCTGACG ACCCCGCCCC AACGACTAGC CCCTCCTCTG TGCAACCCCG AGAGTTTGGG 300 

GTCATGCAGG GGGCGCCACG AGCTCGTTTC GGAAGCCGGA CCCCGCCCGC AGCCGCAGAA 360 

- n GCCTCGAGTC CACATCTGGG CATTGGCGAG GCAGCCTGTC AATCAGGAGC TCGGGCGGCA 420 

OU GCCCCCCGCG CGGGGGCTCG GCGATGCCAG OCTCAGCGAC AGGCGGGGGC GGCGGCGGCC 480 

ACGGCACAGA CACACACCCT CCCACACGCG CGCACCAGGG CAGACCCGGC GGGCAGGCGG 540 

OGGAGGCACC CTCGGAGCCC GGCGCCCGGC GGGGAGGGGA CGTGCTCCGA GGGACCGGCC 600 

CCGAGGCGCC GGATGGAGGA AGAGATGCAG CCGGCAGAGG AGGGGOCCAG CGTCCCCAAA 660 

- ATCTACAAGC AGCGCAGCCC CTACAGCGTC CTCAAGACGT TCCCCAGCAA GAGACCGGCG 720 

UJ CTGGCCAAGC GCTACGAGCG ACCCACCCTG GTGGAGCTGC CGCACGGCCA CTTGAGGACT 780 

CCGGCGCAGC CGCCGCCCGC GTCCCCCGCC GCCTCCTCGT CGTCTTCGTT CGCCGCTGTC 840 

GTCAGACTCG GGGCTCCTCC GCGTCCGCCA CGCCGTGGAT TTCGGGCGCG GGGAACCATC 900 

CCGCCCCTCC TTCCTGCCCC GGGAGTCGCA GGCACTCTGC TCCCACCGCC CACGTCGTCG 960 

TCCCCGCCAT CCCCTCGTCC CCGTCCGTGG CACGCTGCCG CGCOGCGGGG AGGGACCTCA 1020 

/U CATACACATA TGTGGAGGTC CCAGTCCACA CTTCCAGGAT CTGACACCAT GGTCTCTGTC 1080 

TTTGGATTGA TGGCTCAGAG AAGATGGCAG CATAGATCTT TAAAGCAGTT TGAGTGGGGA 1140 

ATTCTTGGAT CTTGGGGTAC TTGGCCATGT GGACAGGATT GGCTGGAGAA GGAGGGTCAG 1200 

GTGGCGGTCC TGCTGCCAAG GTCTGAGGGT AATACTGCTC CTAAGAAGAG TCGAATGATC 1260 

TTGGATGCCT TTGCCCAGCA GTGCAGTCGA GTTCTTAGCC TCTTAAATTG TGGAGGAAAA 1320 

/ J CTCCTGGACT CCAACCATTC TCAGTCCATG ATTTCTTGCG TAAAGCAGGA AGGCTCAAGT 1380 

TACAACGAAA GACAGGAGCA CTGTCACATT GGGAAAGGGG TCCACAGTCA GACCTCAGAC 1440 

AATGTAGACA TAGAGATGCA GTATATGCAA AGGAAACAAC AAACTTCTGC CTTTTTGAGG 1500 

GTTTTCACTG ACTCTCTACA AAATTACCTG CTCTCGGGAA GCTTTCCAAC TCCAAACCCC 1560 

A TCGTCAGCCA GTGAATATGG CCATCTGGCC GACGTGGATC CTCTGTCAAC CTCTCCTGTG 1620 

OU CATACATTAG AAAATATTTC ACTTGATTCC ACAGCTTCCC TGTGTAAATC TAGGCATCTA 1680 

TCCAGAGAGC CCCCAGTCAA GAGTGATTTT CCAAATCCTT TGCAGCAGGC CTTGGCTGGG 1740 

GGTGCTTCAA GACCATTTTC AGGGGCACAG CAAAGCATCG CTTACAGGGT GAACTCTGAA 1800 

CTTGAGGATG GCATCCGCAG CCCCGTCCCT TTGAGTTGTG AGGCCTTGGA AATGGATTTG 1860 

ACCTCCTTGG GAAGCAAGCA GCTGTTGAAC AACTATCCTG TCTACATAAC GAGCAAACAG 1920 
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TGGGATGAGQ CTGTAAATTC TTCAAAGAAA GATGGGAGAC GGCTCCTTCQ ATACCTCATC 
AQATTTGTTT TCACAACCGA TGAGCTTAAG TACTCATGCG GCCTTGGGAA AAGGAAAAGG 
TCAGTGCAGT CAGGAGAGAC AGGTCCCGAA AGACGCCCTC TGGATCCAGT TAAAGTAACA 
TGCCTCCGAG GTACTGCATC CTTCOGCTCA GTGTCACCAT CTGTGATCTC ATTTCACCGC 
ATTGGCTGTG GCTCTCCCOG TACAAGTGTT CAGCCTTCTG TATTTTGA 

Seq ID NO: C187 DNA Sequence 
Nucleic Acid Accession #: AB037745.1 
Coding sequence: 26.. 1744 



1980 
2040 
2100 
2160 
2208 



41 



SI 



1 11 21 31 

I I I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGACG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT OCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACOGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AQAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580 

GCGTGGGCCT GGAAGGOGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTGA GGCTATCCTC 2820 

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 

TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3540 

GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600 

AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 

AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCCGCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 
TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 
CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 
CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 
GAAAGGGGAT GGCATTTGCA AGTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 
TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 
GGCCATCAAC JVGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 
TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320 



1316 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TGTTGGAACT 
TTCTGTAGCC 
CAAAGGCCAC 
TTGACTTCTT 
ACTACCCCTT 
6TTACGCTAC 
ACAGCTTTAT 
ATGGTTTTCA 
TTCTTTTCCT 
CCATGATCTT 
CCTCCCCAGT 
TAGTAGAGAT 
ATCCACCCAC 
CTACGTGTTC 
TGTCCAAGCT 
TTGAGTCCTG 
ATACAAATGA 
GTGCCAACAA 
AACAGAAACT 
CTGGTTGGAT 
TATTGCATGT 



TCCCCCAGTG 
TCAGCAATAC 
AGACAGCCCT 
TATTCTGACT 
ACCGTGCTGA 
TGGGGTCACC 
TGAGATATAA 
GCAAACTCAC 
TTTTTTTTTT 
GGCTCACTGC 
AGCTGGGATT 
GGGGTTTCAC 
CTCGGCCTCC 
AATTTTCTAT 
GTTGTTGGTA 
GAACACGCAG 
AGATTGTGGT 
AACTTTTTTT 
TTGTAATAGA 
TTCCTTTTTA 
CACTTTTTTT 




CAAAGTGCTG 
GAACAAAGGC 
GAGGGAGTAT 
TAACTGTCAT 
GTATTCAAGC 
TGTCTGACTA 
TTTTTTCAGC 
CCCTGTAATC 
TTTTTGTAAA 



TACGCAGCTC 
GCTGTCTCAC 
CGGAAACAGT 
TAGCCATAAT 
TCTGCCCTGT 
CTAGCAAGTT 
ATACAATTCA 
GCCCACTTGA 
CAGCTTTGTC 
TCCTGGGTTC 
GCCACCACGC 
AGGCTGGTCT 
GGATTGCAGG 
TTTAGTCCTT 
GATAAAATGT 
GCTATAQTCA 
AGTAGGGTTT 
CATTAAAGAT 
TTTGTGAAAT 
CAAGCGTTAA 
ATAAAAACAT 



CTGAGCTCCA 
TGAATAGCTT 
AGGAAAAATT 
ACTTTAAGGA 
GACCTGTCAG 
AGGCATGTCA 
CCTTTAAAAC 
GAGCAAACAC 
GCCCAGGCTG 
AAGTGATCCT 
CTAGCTAATT 
CAAACTCCTG 
TGTGAGCCAC 
GACCCAGGGC 
TTAAATCTCA 
TCATCTGTAT 
TTGCTTTTGT 
AAGACTGACT 
CGAATTTTTT 
TAGTTTGTTA 
ACCTTAC 



GCCTAAAGTC 
TCTTTTGTGA 
ACATATGTCT 
GCTACTTTTT 
GAACTCCTGA 
TATATTTTTA 
ATACGATTCA 
ATGTTCAATT 
GAGTGCAGTG 
TCTGCTTCAG 
TTTGTGTTTT 
GACTCAAGTG 
CGTGCCTGGC 
TAAAGTGGTC 
TTTGGTTACC 
TTGGCTGGGA 
TTTTGTTTTA 
ATATTTATAC 
TTCATCAGGG 
GAAGATGGGT 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5567 



Seq ID NO: C188 DNA Sequence 

Nucleic Acid Accession ft: NM_014324.1 

Coding sequence: 89.. 1237 



51 



1 11 21 31 41 

I I.I I I I 

GGCGCCGGGA TTGGGAGGGC TTCTTGCAGG CTGCTGGGCT GGGGCTAAGG GCTGCTCAGT 60 

TTCCTTCAGC GGGGCACTGG GAAGCGCCAT GGCACTGCAG GGCATCTCGG TCGTGGAGCT 120 

GTCCGGCCTG GCCCCGGGCC GTNTCTGTGC TATGGTCCTG GCTGACTTCG GGGCGCGTGT 180 

GGTACGCGTG GACCGGCCCG GCTCCCGCTA CGACGTGAGC OGCTTGGGCC GGGGCAAGCG 240 

CTCGCTAGTG CTGGACCTGA AGCAGCCGCG GGAGCCGCGT GCTGCGGCGT CTGTGCAAGC 300 

GGTCGGATGT GCTGCTGGAG CCCTTCCGCC GCGGTGTCAT GGAGAAACTC CAGCTGGGCC 3S0 

CAGAGATTCT GCAGCGGGAA AATCCAAGGC TTATTTATGC CAGGCTGAGT GGATTTGGCC 420 

AGTTCAGGAA AGCTTCTGCC GGTTAGCTGG CCACGATATC AACTATTTGG CTTTGTCAGG 480 

TGTTCTCTCA AAAATTGGCA GAAGTGGTGA GAATCCGTAT GCCCCGCTGA ATCTCGTGGC 540 

TGACTTTGCT GGTGGTGGCC TTATGTGTGC ACTGGGCATT ATAATGGCTC TTTTTGACOG 600 

CACACGCACT GACAAGGGTC AGGTCATTGA TGCAAATATG GTGGAAGGAA CAGCATATTT 660 

AAGTTCTTTT CTGTGGAAAA CTCAGAAATC GAGTCTGTGG GAAGCACCTC GAGGACAGAA 720 

CATGTTGGAT GGTGGAGCAC CTTTCTATAC GACTTACAGG ACAGCAGATG GGGAATTCAT 780 

GGCTGTTGGA GCAATAGAAC CCCAGTTCTA CGAGCTGCTG ATCAAAGGAC TTGGACTAAA 840 

GTCTGATGAA CTTCCCAATC AGATGAGCAC GGATGATTGG CCAGAAATGA AGAAGAAGTT 900 

TGCAGATGTA TTTGCAAAGA AGACGAAGGC AGAGTGGTGT CAAATCTTTG ACGGCACAGA 960 

TGCCTGTGTG ACTCCGGTTC TGACTTTTGA GGAGGTTGTT CATCATGATC ACAACAAGGA 1020 

ACGGGGCTCG TTTATCACCA GTGAGGAGCA GGACGTGAGC CCCCGCCTTG CACCTCTGCT 1080 

GTTAAACACC CCAGCCATCC CTTCTTCCAA AGGGGATCCT TTCATAGGAG AACACACTGA 1140 

GGAGATACTT GAAGAATTTG GATTCAGCCG AGAAGAGATT TATCAGCTTA ACTCAGATAA 1200 

AATCATTGAA AGTAATAAGG TAAAAGCTAG TCTCTAACTT CCAGGCCCAC GGCTCAAGTG 1260 

AATTTGAATA CTGCATTTAC AGTGTAGAGT AACACATAAC ATTGTATGCA TGGAAACATG 1320 

GAGGAACAGT ATTACAGTGT CCTACCACTC TAATCAAGAA AAGAATTACA GACTCTGATT 1380 

CTACAGTGAT GATTGAATTC TAAAAATGGT TATCATTAGG GCTTTTGATT TATA AAACT T 1440 

TGGGTACTTA TACTAAATTA TGGTAGTTAT TCTGCCTTCC AGTTTGCTTG ATATATTTGT 1500 

TGATATTAAG ATTCTTGACT TATATTTTGA ATGGGTTCTA GTGAAAAAGG AATGATATAT 1560 

TCTTGAAGAC ATCGATATAC ATTTATTTAC ACTCTTGATT CTACAATGTA GAAAATGAGG 1620 

AAATGCCACA AATTGTATGG TGATAAAAGT CACGTGAAAC AGAGTGATTG GTTGCATCCA 1680 

GGCCTTTTGT CTTGGTGTTC ATGATCTCCC TCTAAGCACA TTCCAAACTT TAGCAACAGT 1740 

TATCACACTT TGTAATTTGC AAAGAAAAGT TTCACCTGTA TTGAATCAGA ATGCCTTCAA 1800 

CTGAAAAAAA CATATCCAAA ATAATGAGGA AATGTGTTGG CTCACTACGT AGAGTCCAGA 1860 

GGGACAGTCA GTTTTAGGGT TGCCXGTATC CAGTAACTCG GGGCCTGTTT OCCCGTGGGT 1920 

CTCTGGGCTG TCAGCTTTCC TTTCTCCATG TGTTTGATTT CTCCTCAGGC TGGTAGCAAG 1980 

TTCTGGATCT TATACCCAAC ACACAGCAAC ATCCAGAAAT AAAGATCTCA GGACCCCCCA 2040 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2068 

Seq ID NO: CI 8 9 DNA Sequence 

Nucleic Acid Accession ft: XMJ>91332.1 

Coding sequence: 1..1401 



1 
I 

ATGCAAAGGT 
CCACAAACAG 
CCCCTGTTCT 
TGGGCTGAGG 
GCCTCCATCC 
CCCGGCATCC 
GAATGGACTG 
GGCGTCCACG 
GAGCAGCTAC 
TATCTCCCTG 
AAGAGCCTGA 
CAAAAGGCAA 
GGGTCATGGA 
CAGGAAGGCC 



11 
I 

GGACACTGTG 
ACATCAGTAT 
GGATGGAGTT 
CCGACCTCTA 
ACAGCTGGGA 
CAGCTGACGT 
ATGGCTCATC 
CGGACCCAGA 
AGGCCCCAGA 
AGGATTTCCC 
AACCATCCAA 
AGACCCGACA 
AAGAAAGAAT 
GGCTCCGGTG 



21 
I 

GGCTGCAGCC 
CAGTCCAGCC 
CAAAGGCCAC 
CTGTTCTGAG 
GGAGAATGTC 
CTGGACAGGC 
CTATGACTAC 
AGAAGAGGAC 
GCCCCAGTTA 
AGCTGAGCCC 
GAGTCATCTT 
TAGGAGCCTG 
GAATGCAGAC 
CAGGGAGCGC 



31 
I 

TTCCTGACCC 
CTGCCAGAGC 
TGCTATCGAT 
TTCTCTGTGG 
TTTGTATATG 
CTTCATGATC 
AGCTACTGGG 
TGCGTGCAGA 
CCCTTATCAA 
AAGCTCATGG 
ATGGAGCCAC 
CGCGGCGTCT 
TACGGGCGAA 
CGCCTGCGGG 



41 
I 

TCCACTCTGC 
TGCCCCTGCC 
TCTTCCCTCT 
GCAGGAAGTC 
ACCTCGTQAA 
ACAGACAGGA 
ATGGCAGCCA 
TATGGTACAG 
TCTCAGAGGC 
ACCAGTCCTG 
CCACTCCAGT 
GGTGGCCATC 
GAAAGCGATC 
CTGCTTCGGG 



51 
I 

ACAGGCCTTT 
TTCCCTGTGC 
CAATAAGACC 
CGCCAAGCTG 
CAGCTGTGTT 
AGGGCAGTTT 
GCCAGATGAT 
GCCTACCAGT 
CACAGATGTC 
GGTGTCCAGG 
GGCCAAGCAC 
AGGTAAGGCT 
GGCCCCGAGG 
CCAGGGTCGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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CCCGAGGGCC 
GGAGGGGTGT 
CAGCGGGGTG 
TGGAGGGGGC 
CTTCCGGGGA 
GAAGAAAAGA 
TGTGACCAAQ 
GGGCCTTACC 
CTACAAAGTG 
ATTGCTCGAG 



AGCGCAAGCA 
CCCCAATGCG 
CGGCGCOGGA 
AGCGGCTCCA 
ATGCACACGG 
ATAATCGCAC 
ATAGCAAGGA 
GGTTAGTTAA 
GAGGGGAAGT 
AACCTCCATA 



GCGGCAACAG 
GGGCGCCCAA 
GTGCGGGGAG 
GCCCCAQACC 
CCTGGCCGCC 
CCGTTTCTCA 
GCAGTCTTTA 
GAAAAAAAGA 
TTGTCTGGAT 
A 



GAGCGCCAGG 
GCGTGGCAGC 
AACCACCAGG 
GCCGOGCTCT 
GCCTTCGTGC 
GGTGCTTACT 
AGGOGACAQG 
GGACCTGTTG 
TTTCCTGTAG 



AGAGAGGCTG 
AOGGGCTGGG 
CGCCGGAATT 
GTCACTTTGC 
AGCCCGCCCT 
TCACCATGTC 
GCAGAGAGGC 
CCTGTCCCTC 
AACTGAGGGC 



GGAAGAACTG 
AGCGGGGAGC 
GGGGAGCACG 
ATTAAGAAAG 
GCAGGTGCAG 
CGATCCGACG 
AGAAAAAGAT 
TAGCTTTGAA 
AGGGACCTGG 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1401 



Seq ID NO: CI 90 DNA Sequence 

Nucleic Acid Accession XM_0S4869.2 

Coding sequence: 26.. 2902 

1 11 21 31 41 51 

I I I I I 1 

TAGACGCGGA GCCCAAGGAG GTAAAATGCA CACTTGCTGC CCCCCAGTAA CTTTGGAACA 60 

GGACCTTCAC AGAAAAATGC ATAGCTGGAT GCTGCAGACT CTAGCGTTTG CTGTAACATC 120 

TCTCGTCCTT TCGTGTGCAG AAACCATCGA TTATTATGGG GAAATCTGTG ACAATGCATG 180 

TCCTTGTGAG GAAAAGGACG GCATTTTAAC TGTGAGCTGT GAAAACCGGG GGATCATCAG 240 

TCTCTCTGAA ATTAGCCCTC CCCGTTTCCC AATCTACCAC CTCTTGTTGT CCGGAAACCT 300 

TTTGAACCGT CTCTATCCCA ATGAGTTTGT CAATTACACT GGGGCTTCAA TTTTGCATCT 360 

AGGTAGCAAT GTTATCCAGG ACATTGAGAC CGGGGCTTTC CATGGGCTAC GGGGTTTGAG 420 

GAGATTGCAT CTAAACAATA ATAAACTGGA ACTTCTGCGA GATGATACCT TCCTTGGCTT 480 

GGAGAACCTG GAGTACCTAC AGGTCGATTA CAACTACATC AGCGTCATTG AACCCAATGC 540 

TTTTGGGAAA CTGCATTTGT TGCAGGTGCT TATCCTCAAT GACAATCTTT TGTCCAGTTT 600 

ACCCAACAAT CTTTTCCGTT TTGTGCCCTT AACGCACTTG GACCTCCGGG GGAACCGGCT 660 

GAAACTTCTG CCCTACGTGG GGCTCTTGCA GCACATGGAT AAAGTTGTGG AGCTACAGCT 720 

GGAGGAAAAC CCTTGGAATT GTTCTTGTGA GCTGATCTCT CTAAAGGATT GGTTGGACAG 780 

CATCTCCTAT TCAGCCCTGG TGGGGGATGT AGTTTGTGAG ACCCCCTTCC GCTTACACGG 840 

AAGGGACTTG GACGAGGTAT CCAAGCAGGA ACTTTGCCCA AGGAGACTTA TTTCTGACTA 900 

CGAGATGAGG CCGCAGACGC CTTTGAGCAC CACGGGGTAT TTACACACCA CCCCGGCGTC 960 

AGTGAATTCT GTGGCCACTT CTTCCTCTGC TGTTTACAAA CCCCCTTTGA AGCCCCCTAA 1020 

GGGGACTCGC CAACCCAACA AGCCCAGGGT GCGCCCCACC TCTCGGCAGC CCTCTAAGGA 1080 

CTTGGGCTAC AGCAACTATG GCCCCAGCAT CGCCTATCAG ACCAAATCCC CGGTGCCTTT 1140 

GGAGTGTCCC ACCGCGTGCT CTTGCAACCT GCAGATCTCT GATCTGGGCC TCAAOGTAAA 1200 

CTGCCAGGAG CGAAAGATOG AGAGCATCGC TGAACTGCAG CCCAAGCCCT ACAATCCCAA 1260 

GAAAATGTAT CTGACAOAGA ACTACATCGC TGTCGTGCGC AGGACAGACT TCCTGGAGGC 1320 

CACGGGGCTG GACCTCCTGC ACCTGGGGAA TAACCGCATC TCGATGATCC AGGACCGCGC 1380 

TTTCGGGGAT CTCACCAACC TGAGGCGCCT CTACCTGAAT GGCAACAGGA TCGAGAGGCT 1440 

GAGCCCGGAG TTATTCTATG GCCTGCAGAG CCTGCAGTAT CTCTTCCTCC AGTACAATCT 1500 

CATCCGCGAG ATTCAGTCTG GAACTTTTGA CCCGGTCCCA AACCTCCAGC TGCTATTCTT 1560 

GAATAACAAC CTCCTGCAGG CCATGCCCTC AGGCGTCTTC TCTGGCTTGA CCCTCCTCAG 1620 

GCTAAACCTG AGGAGTAACC ACTTCACCTC CTTGCCAGTG AGTGGAGTTT TGGACCAGCT 1680 

GAAGTCACTC ATCCAAATCG ACCTGCATGA CAATCCTTGG GATTGTACCT GTGACATTGT 1740 

GGGCATGAAG CTGTGGGTGG AGCAGCTCAA AGTGGGCGTC CTAGTGGACG AGGTGATCTG 1800 

TAAGGCGCCC AAAAAATTCG CTGAGACCGA CATGCGCTCC ATTAAGTOGG AGCTGCTGTG 1860 

CCCTGACTAT TCAGATGTAG TAGTTTCCAC GCCCACACCC TCCTCTATCC AGGTCCCTGC 1920 

GAGGACCAGC GCCGTGACTC CTGCGGTCCG GTTGAATAGC ACCGGGGCCC CCGCGAGCTT 1980 

GGGCGCAGGC GGAGGGGCGT CGTCGGTGCC CTTGTCTGTG TTAATTCTCA GCCTCCTGCT 2040 

GGTTTTCATC ATGTCCGTCT TCGTGGCCGC CGGGCTCTTC GTGCTGGTCA TGAAGCGCAG 2100 

GAAGAAGAAC CAGAGCGACC ACACCAGCAC CAACAACTCC GACGTGAGCT CCTTTAACAT 2160 

GCAGTACAGC GTGTACGGCG GCGGCGGCGG CACGGGCGGC CACCCACACG CGCACGTGCA 2220 

TCACCGCGGG CCCGCGCTGC CCAAGGTGAA GACGCCCGCG GGCCACGTGT ATGAATACAT 2280 

CCCCCACCCA CTGGGCCACA TGTGCAAAAA CCCCATCTAC CGCTCCOGAG AGGGCAACTC 2340 

CGTAGAGGAT TACAAAGACC TGCACGAGCT CAAGGTCACC TACAGCAGCA ACCACCACCT 2400 

GCAGCAGCAG CAGCAGCCGC CGCCGCCACC GCAGCAGCCA CAGCAGCAGC CCCCGCCGCA 2460 

GCTGCAGCTG CAGCCCGGGG AGGAGGAGAG GCGGGAAAGC CACCACTTGC GGAGCCCCGC 2520 

CTACAGCGTC AGCACCATCG AGCCCCGGGA GGACCTGCTG TCGCCGGTGC AGGAOGOCGA 2580 

CCGCTTTTAC AGGGGCATTT TAGAACCAGA CAAACACTGC TCCACCACCC CCGCCGGCAA 2640 

TAGCCTCCCG GAATATCCCA AATTCCCGTG CAGCCCCGCT GCTTACACTT TCTCCCCCAA 2700 

CTATGACCTG AGACGCCCCC ATCAGTATTT GCACCCGGGG GCAGGGGACA GCAGGCTACG 2760 

GGAACCGGTG CTCTACAGCC CCCCGAGTGC TGTCTTTGTA GAACCCAACC GGAACGAATA 2820 

TCTGGAGTTA AAAGCAAAAC TAAACGTTGA GCCGGACTAC CTCGAAGTGC TGGAAAAACA 2880 

GACCACGTTT AGCCAGTTCT AAAAGCAAAG AAACTCTCTT GGAGCTTTTG CATTTAAAAC 2940 

AAACAAGCAA GCAGACACAC ACAGTGAACA CATTTGATTA ATTGTGTTGT TTCAACGTTT 3000 

AGGGTGAAGT GCCTTGGCAC GGGATTTCTC AGCTTCGGTG GAAGATACGA AAAGGGTGTG 3060 

CAATTTCCTT TAAAATTTAC ACGTGGGAAA CATTTGTGTA AACTGGGCAC ATCACTTTCT 3120 

CTTCTTGCGT GTGGGGCAGG TGTGGAGAAG GGCTTTAAGG AGGCCAATTT GCTGCGCGGG 3180 

TGACCTGTGA AAGGTCACAG TCATTTTTGT AGTGGTTGGA AGTGCTAAGA ATGGTGGATG 3240 

ATGGCAGAGC ATAGATTCTA CTCTTCCTCT TTTGCTTCCT CCCCCTCCCC CGCCCCTGCC 3300 

CCACCTCTCT TTCTCCCCTT TTAAGCCATG GGTGGGTCTA ACTGGCTTTT GTGGAGAAAT 3360 

TAGCACACCC CAACTTTAAT AGGAAATTTG TTCT CTTTT T CCGCCCCTCT CCTTCTCTCC 3420 
TCCCCTCCCC TCCCTTCTCA TTCCTTTTCT TTGTTTTTAA AGGATGTGTT TGTATGCATT 3480 
CTGGACATTT GAATTAAAAA AAAAGTATTG TGATCCTGTA AAGGATCACC ATAGATGTGG 3540 
ACAAATCATT AAAATTACAG AGCTATATGA TCCATAATTG ATTAGTCAAA ATAACTTATT 3600 
GATGAAATAT ACAAATATTT TATTGTAGCA CCTATTTTTA TATGCACATT TAGCATTCCT 3660 
CTTTCCTTCA CTATTTAGCC TATGATTTTG CAGAGGTGTC ACACTGTATT AGGATCTGCA 3720 
TTTCTAAAAC TGACGTGGTA TCAGGAAGGC ATTTTCAATC ATTCAAAATG TGGAGAATTT 3780 
AATGGCTAAA TCTTTAAAAG CCAATGCAAC CCACCCAATT GAATCTGCAT TTTCTTTTAA 3840 
GAAAACAGAG CTGATTGTAT CCCAATGTAT TTTAAAAAAT AGGGCAATTG ATT GGGCC AT 3900 
TCCGAGAGAA TTGTTTGCAA GTTTTGGQTT TTATTAGAAA ATATTTGAAA GTATTTTTAT 3960 
TAATGAACCA AAATGACATG TTCATTTGAC TACTATTGTA GCCGATTTTC GATTGTTTAA 4020 
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CCAAACCCAG TTGCATTTGT ACAGATCCAC GTGTACTGGC ACCTCAQAAG AOCAAATCAT 4080 

GGACTGTACA AGTCTCTATA CAATGTCTTT ATCCCTGTGG 6CAGCAAGCA ATGATGATAA 4140 

TGACAAACAG GATATCTGTA AGATGGGGCT ACTGTTGTTA CAGTCTCATA TGTATCCCAG 4200 

^ CACATGTAAT TTTTTAAATA GTTTCTGAAT AAACACTTGA TAACTATGTC 4250 

Seq ID NO: C191 DNA Sequence 

Nucleic Acid Accession #: NM_000793.2 

Coding sequence: 401.. 1222 ~ 

10 1 11 21 31 41 51 

I 1 1.1 I I 

GCCTGCAGAG AGAGGCACTT TGCACCACAG. ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 

15 GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGCGC ACAAGGGAAC TGACTCAGCJA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

20 CTATGACTOG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 540 

TOGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 600 

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 780 

25 CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 900 

AGATGGCTGG GCGATACCGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

30 TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 1380 

35 CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

40 TTGAACCCAG GCCATGTGGT TAGACGTTGG. TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

45 AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 

50 GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 

ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520 
TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 

55 ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC COCCGGTATA TCATGGAATT 2640 
TCCATTGACA TTTGAATTTG ■GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700 
AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 
ATTTTTTTCT TTTTAAAATA GAGACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 

- AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880 
60 ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 

CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 
AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 
CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 

- _ GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 
65 CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 

ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC 3300 
TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 
. CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 

70 TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3540 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT TTATGTGTGC AGACAATTCT 3600 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 
_ GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 

75 TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATOGCAA CATGAATTTT 3840 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 
QA GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080 
80 AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 4140 
GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 
ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380 
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G TTTCA TACA GCTATTAAAA TOTAACTQAC CTCCTTAGAG GCAGATTAGT AACTGTTCCT 4440 

ACTTTGTATA GCTAAGTGAC AGTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT 4500 

CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC TTGATTTTTG GTAGTATCAA 4560 

CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT 4620 

CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT 4680 

GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC 4740 

ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GG CCTTTCGG GAAAATTCAA 4800 

AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG 4860 

AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAQAGGAT GGATGAATCT 4920 

GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA 4980 

TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG 5040 

AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 5100 

TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC 5160 

TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT 5220 

TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG 5280 

TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 5340 

AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG 5400 

GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 5460 

GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 

ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 

CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 

TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT 5700 

CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 

AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 

AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT 5880 

CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940 

AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 6000 

TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CGCAAGTAAT CGGGCGCCCC 6060 

GAAGAGGGAG ACAGAGATGT GCCAGAGTTG AOCCAGTGTG CGGATGATAA CTACTGACGA 6120 

AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 

TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 6240 

G TGAGAA TGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 6300 

GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 6360 

TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTQAATAA AAGAATAAAA AAAAAAAAAA 6420 

AAAAAAAAA 6429 

Seq ID NO: CI 92 DNA Sequence 

Nucleic Acid Accession #: NM_006549.2 

Coding sequence: 824.. 2590 



1 11 21 31 41 51 

I I I I I I 

GAGCCTGGGG AGGTCGAGGG TGCAGCGAGC CGTGATCGTG CTACTGCACT CCAGCCTGGG 60 

CAACACAGAG AGACCCTGTC TCAAAACAAA CAAACAAACA AACAAACAAA CAAAACAAAG 120 

AAAAAAAAAT GGGAGTGGGC CGGGCGCGGT GACTCACACC TGTAATCCCA GCACTTTCGG 1B0 

AGGCCAAGGC GGGTGGATCA CGAGGTCAGG AATTCAAGAT TAGCCTGGAC AACATGGTGA 240 

AACCCCATCT CTACGAAAAA TACAAAAATT AGCCAAGTAT GGTGGCOGGC GCCTGTAATC 300 

CCAGCTACTC GGGAGACTGA GGCAGAGAAC TGCTTGAACC TGGGAGGCAG AGGTTGCAGT 360 

GATCCGAGAT CGCGTCACTG CACTCCAGCG TGGGCGACAG AGCGAGACTC CGTTTCAGAA 420 

AAGAAAAAAA AAAAAAAAAA AAAAAGGGAG TCGGGGTGGA GCTCTCATTG GCTCGTTGCA 480 

TGTGAGTGTC CCTACGGCCT AGAAATACCA GAGAAGCACA TCGGAACGGG CTGGAAATCC 540 

ACCCAGTTAA CTAGAGGGCT TTGAACCTTT TATTAACTTG GAGGTTGACT CTCCTGTCAA 600 

CTCGATTCCC TTTTGGCTGT TTGGCAGGGT CAGTGAGACA TCCCCTGGGT CGCTCGACCC 660 

CGTAGGACGG TTCAGGGAGC CCTCCAGGTC TTCGTTTCTC CTCTTCCCCG CACAGTGCTG 720 

TTATCCAGCT GGGGGATCCA ACGCACACTT AAGGCTCCAG CAAAGTGGCT CCGCTGCCGG 780 

ATGGGAGTGC CCCAGTGTGC TGGATGAAGC TGGCGCATGC ACCATGTCAT CATGTGTCTC 840 

TAGCCAGCCC AGCAGCAACC GGGCCGCCCC CCAGQATGAG CTGGGGGGCA GGGGCAGCAG 900 

CAGCAGCGAA AGCCAGAAGC CCTGTGAGGC CCTGCGGGGC CTCTCATCCT TGAGCATCCA 960 

CCTGGGCATG GAGTCCTTCA TTGTGGTCAC CGAGTGTGAG CCQGGCTGTG CTGTGGACCT 1020 

CGGCTTGGCG CGGGACCGGC CCCTGGAGGC GGATGGCCAA GAGGTCCCCC TTGACTCCTC 1080 

CGGGTCCCAG GCCCGGCCCC ACCTCTCCGG TCGCAAGCTG TCTCTGCAAG AGCGGTCCCA 1140 

GGGTGGGCTG GCAGCCGGTG GCAGCCTGGA CATGAACGGA CGCTGCATCT GCCCGTCCCT 1200 

GCCCTACTCA CCCGTCAGCT CCCCGCAGTC CTCGCCTCGG CTGCCCCGGC GGCCGACAGT 1260 

GGAGTCTCAC CACGTCTCCA TCACGGGTAT GCAGGACTGT GTGCAGCTGA ATCAGTATAC 1320 

CCTGAAGGAT GAAATTGGAA AGGGCTCCTA TGGTGTCGTC AAGTTGGCCT ACAATGAAAA 1380 

TGACAATACC TACTATGCAA TGAAGGTGCT GTCCAAAAAG AAGCTGATCC GGCAGGCCGG 1440 

CTTTCCACGT CGCCCTCCAC CCCGAGGCAC CCGGCCAGCT CCTGGAGGCT GCATCCAGCC 1500 

CAGGGGCCCC ATTGAGCAGG TGTACCAGGA AATTGCCATC CTCAAGAAGC TGGACCACCC 1560 

n[ . CAATGTGGTG AAGCTGGTGG AGGTCCTGGA TGACCCCAAT GAGGACCATC TGTACATGGT 1620 

/U GTTCGAACTG GTCAACCAAG GGCCCGTGAT GGAAGTGCOC ACCCTCAAAC CACTCTCTGA 1680 

AGACCAGGCC CGTTTCTACT TCCAGGATCT GATCAAAGGC ATCGAGTACT TACACTACCA 1740 

GAAGATCATC CACCGTGACA TCAAACCTTC CAACCTCCTG GTCGGAGAAG ATGGGCACAT 1800 

CAAGATCGCT GACTTTGGTG TGAGCAATGA ATTCAAGGGC AGTGACGCGC TCCTCTCCAA 1860 

CACCGTGGGC ACGCCCGCCT TCATGGCACC CGAGTCGCTC TCTGAGACCC GCAAGATCTT 1920 

CTCTGGGAAG GCCTTGGATG TTTGGGCCAT GGGTGTGACA CTATACTGCT TTGTCTTTGG 1980 

CCAGTGCCCA TTCATGGACG AGCGGATCAT GTGTTTACAC AGTAAGATCA AGAGTCAGGC 2040 

CCTGGAAfTT CCAGACCAGC CCGACATAGC TGAGGACTTG AAGGACCTGA TCACCCGTAT 2100 

GCTGGACAAG AACCCCGAGT CGAGGATCGT GGTGCCGGAA ATCAAGCTGC ACCCCTGGGT 2160 

CACGAGGCAT GGGGCGGAGC CGTTGCOGTC GGAGGATGAG AACTGCACGC TGGTCGAAGT 2220 

GACTGAAGAG GAGGTCGAGA ACTCAGTCAA ACACATTCCC AGCTTGGCAA CCGTGATCCT 2280 

GGTGAAGACC ATGATACGTA AACGCTCCTT TGGGAACCCA TTCGAGGGCA GCCGGCGGGA 2340 

GGAACGCTCA CTGTCAGCGC CTGGAAACTT GCTCACCAAA AAACCAACCA GGGAATGTGA 2400 

GTCCCTGTCT GAGCTCAAGG AAGCAAGGCA GCGAAGACAA CCTCCAGGGC ACCGACCCGC 2460 

CCCCCGTGGG GGAGGAGGAA GTGCTCTTGT GAGAGGCAGT CCCTQCGTGG AAAGTTGCTG 2520 
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GGCCCCCGCC CCCGGCTCCC COGCAOGaft GCATCCACTG CGGCCGGAGG AGGCCATGGA 2S80 

GCCCGAGTAG CTGCCTGGAT CGCTOGACCT GGCATGCGCG CGGCGTOGCC TCTGGGGGGC 2640 

TGCTGCACOG CGTTTCCATA GCAGCATGTC CTAOGGAAAC CCAGCACGTG TGTAGAGCCT 2700 

CGATOGTCAT CTCTGGTTAT TT G TTTTTT C CTTTGTTGTT TTAAAGGGGA CAAAAAAAAA 2760 

AAAAAAAGGA CTTGACTCCA TGACGTCGAC CGTGGCOGCT GGCTGGCTGG ACAGGCGGGT 2820 

GTGAGGAGTT GCAGACCCAA ACCCACGTGC ATTTTGGGAA ATTGCTTTTT AAAAGATTTT 2880 

TATGCCAAAA ATCCTTCATT GTGATTTTCA GAACCACGTC AGATATACCA AGTGACTGTG 2940 

TGTGGGGTTT GACAACTGTG GAAAGGOGAG CAGAAAACTC OGGGGGTCTG AGGCCATGGA 3000 

GGTGGTTGCT GCATTTGAGA GGGAGTAGGG GGCTAGATGT GGCTCCTAGT GCAAACOGGA 3060 

AACCATGGCA CCTTCCAGAG CCGTGGTCTC AAGGAGTCAG AGCAGGGCTG GCCCTCAGTA 3120 

GCTGCAGGGA GCTTTGATGC AACTTATTTG TAAGAAGGAT TTTTAAATTT TTTATGGGTA 3180 

GAATTGTAGT CAGGAAAACA GAAAGGGCTT GAAATTTAAT AAGTGCTGCT GGAAGGGGAT 3240 

TTTCCAAGCC TGGAAGGGTA TTCAGCAGCT GTGGTGGGGA AACATTTCTC CTGAAAGACT 3300 

GAACGTGTTT CTTCATGACA GCTGCTCAAA GCAGGTTTCT GAGATAGCTG ACCGAGCTCT 3360 

GGTAAATCTC TTTGTCAAAT TACGAAJACT TCAGGGTGAA ATCCTATGCT TTCATGTACA 3420 

TTACATGGCT TAAGATTAAA CAAAAACATT TTTCAAGTCT CTAACTAGAG TGAACTCTAG 3480 

AGCACAGTAG TTCAGAAACT ATTTAGAGCT TCCAGGATAT ATTTCACAGC TTCAGGCATG 3540 

TGATCAGTTA GAGCCGATGA AACCTATGCC CGCCTGTATA TATATTAGCA GCTTAGCTAG 3600 

TTCATAACCT GTATATTCTA AAGACTGCTA AGGTTTT G TT TTCATTTTAA ATCCTAGCTG 3660 

ATTGTTGTGG TCAATGAAAT ACCCAGTTTC TGAAGGGCCA GGTGGGAAAT GCTTTCACTG 3720 

GACCAACACA CAAATGATCA TCCTGAGGAT CTGAGCTTCC CTAGACTCCA CACAATAACC 3780 

TTGGGGCACC CTTTTAGAGA AGACTGTTGA AACCCACAGC ACTCGTTGGG GTATGAGGAA 3840 

ACCAGGGCTT GGCACAGGAA GTTCCCCTTT GTAGCTAAAA GTCCAGAAAG AAAGGGTTCA 3900 

TCTTTTTGAC TTCCAACTGA TATTGGGAAG TTTGGTTGAG GTTCAAGTGT GACTCCTTCC 3960 

AGAGCCACAG GTAGGGGAGT GTGAAGTTGA GGGGGAGGAA AGCTGGAAGG ACTCTGCCTT 4020 

GGGAGATTCC CAGCTCTGCT TTCCAGCGCT TGGTGGAATC TGGGCTGGGG AAAGACGGCA 4080 

CCGGGAAACT CTGCTTCCCC ATTGTTTCCA TCTGATCAGC TGTGGTGTGA GGACTTCTCA 4140 

GACAAAGGCA AGGCCTOGTG CCCCTGCCCA GCCCATTCAT GGAGCCCTGG GCCTTCTTGG 4200 

CTTCCATAGA TCCTAAGCTC TTGACTGTAG TTTAGCCAGA CTTGTTTTGC TATCTTATAA 4260 

GCAGTTCAGA ATTAGGGAAT GCTGGTTTTG AAGAGCAAAG GACAGGTAGT CTAGAGAGGG 4320 

TCGTCTGGCC TGCTTGCTGG GTCTTTGTAA CCCAGCACTT CCTCTTGCCC TCCTGGCTTT 4380 

ATGTTTATGG GAGAGGACTC AATAGCTCCA CCCCTTCTGG CACCAGATGG GGCTTGGTTA 4440 

GTTTGCAATA AGCACCTTGC AGAGGTTAAA GCCAGCGGGT CCCTAGTCTT AGGCCCAGCC 4S00 

TGCTTGTGTG GGCTCTGGCC TGGCCTGGTG GCTGGCCCAG GGGGCAGCAG TGCTTAGAGC 4560 

TTCTGCAGGG CTTCTCTTGT TTACACAGCT GCATCAGACA ATGCCATTTC TCCCCACCAC 4620 

GGAACCTTCC ATCTAAGATT TCTTCCAGGG AATGCCAGCA ATCAGGCAGC ACCCAGCTGT 4680 

GGGGGCAGTG GGGTGGGGGA GACCCACATT GATGACTTTT TTTTTTTCTT TTAATGAAGA 4740 

AACACCAAAG AAAGCTGTGG AAAGGACCTG CCCCACATGA AAAGGATAAG CCAAGATGGC 4800 

TGTAAACACA GAGCATTTGA GCTGCCACTC TTGGAGCACA TTGATTTTTC AAAAGCCAGC 4B60 

TCTGTCAGGA AAGGAGGTGC TGTTATGACA GCTCTTCCAG TGGGCAAAGA GGACGCCCAT 4920 

AATTTCTTCC ATTGCTAGCT CATCTGTGGG ACCAATTTGG TGTAAGCAAC CTGTGGCCTG 4980 

CACTTGTGGC CTCGAAGGAA GCACAAACCC TCCATCCACT TCCCATTTCC TCTGCCCTTT 5040 

TCCACCTCCC CCTTCCATCC CACCAGCTGC CAGTGGCTCC CAGAAAGCCT TATTGAGCCC 5100 

CTTGTTGACA CTTGGGGCTG CGGAGGCCTT TCCCTACTGG TCTGGCCXTT CCTGAGAGGC 5160 

AGGTCTTCCG TOCTCAGAGC CTTTCTGGAA CAAGGAGAAT GCCTGTGCAG GTGGACACAC 5220 

AGGCCTGGCC TGTCGCTCTC ACTTGTCTTC CAGCGGGGAG CTTCACGTTG CCGAGTGGAA 5280 

GAACCATGAC CTCCACTTGC TTCCAAGGTG CTAGGGAAGT TTCAGGGTAC GCTGGTTCCC 5340 

CTCTCCAGCT GGAGGCCGAG TTTCTGGGGA CTGCAGATTT TTCTACTCTG TGATCG ATTC 5400 

AATGCCCGAT GCTTCTGTTT CATTCCCGAC CCTTTCTACT ATGCATTTTC CTTTTATCAG 5460 

GTGTATAAAG TTAAATACTG TGTATTTATC ACTAAAAAGT ACATGAACTT AAGAGACAAC 5520 

TAAGCCTTTC GTGTTTTTCC ACAGGTGTTT AAGCTTCTCT GTACAGTTGA AATAAACAGA 5580 

CAGCAAAATG GTAAAAAAAA AAAAAAAAAA A 5611 

Seq ID NO: C193 DNA Sequence 
Nucleic Acid Accession 8: NM_018646 
Coding sequence: 217.. 2394 

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCAA GTGTAACAAA CTCACAGCCC TCTCCAAACT GGCTGGGGCT GCTGGGAGAC 60 
TCCCAAGGAA CTCGTCAGGA AGGCAGGAGA CAGGAGACGG GACCTCTACA GGGAGACGGT 120 
GGGCCGGCCC TTGGGGGGGC TGATGTGGCC CCAAGGCTGA GTCCCGTCAG GGTCTGGCCT 180 
CGGCCTCAGG CCCCCAAGGA GCCGGCCCTA CACCCCATGG GTTTGTCACT GCCCAAGGAG 240 
AAAGGGCTAA TTCTCTGCCT ATGGAGCAAG TTCTGCAGAT GGTTCCAGAG ACGGGAGTCC 300 
TGGGCCCAGA GCCGAGATGA GCAGAACCTG CTGCAGCAGA AGAGGATCTG GGAGTCTCCT 360 
CTCCTTCTAG CTGCCAAAGA TAATGATGTC CAGGCCCTGA ACAAGTTGCT CAAGTATGAG 420 
GATTGCAAGG TGCACCAGAG AGGAGCCATG GGGGAAACAG CGCTACACAT AGCAGCCCTC 480 
TATGACAACC TGGAGGCCGC CATGGTGCTG ATGGAGGCTG CCCCGGAGCT GGTCTTTGAG 540 
CCCATGACAT CTGAGCTCTA TGAGGGTCAG ACTGCACTGC ACATCGCTGT TGTGAACCAG 600 
AACATGAACC TGGTGCGAGC CCTGCTTGCC CGCAGGGCCA GTGTCTCTGC CAGAGCCACA 660 
GGCACTGCCT TCCGCCGTAG TCCCTGCAAC CTCATCTACT TTGGGGAGCA CCCTTTGTCC 720 
TTTGCTGCCT GTGTGAACAG TGAGGAGATC GTGCGGCTGC TCATTGAGCA TGGAGCTGAC 780 
ATCCGGGCCC AGGACTCCCT GGGAAACACA GTGTTACACA TCCTCATCCT CCAGCCCAAC 840 
AAAACCTTTG CCTGCCAGAT GTACAACCTG TTGCTGTCCT ACGACAGACA TGGGGACCAC 900 
CTGCAGCCOC TGGACCTCGT GCCCAATCAC CAGGGTCTCA CCCCTTTCAA GCTGGCTGGA 960 
GTGGAGGGTA ACACTGTGAT GTTTCAGCAC CTGATGCAGA AGCGGAAGCA CACCCAGTGG 1020 
ACGTATGGAC CACTGACCTC GACTCTCTAT GACCTCACAG AGATCGACTC CTCAGGGGAT 1080 
GAGCAGTCCC TGCTGGAACT TATCATCACC ACCAAGAAGC GGGAGGCTCG CCAGATCCTG 1140 
GACCAGACGC CGGTGAAGGA GCTGGTGAGC CTCAAGTGGA AGCGGTACGG GCGGCCGTAC 1200 
TTCTGCATGC TGGGTGCCAT ATATCTGCTG TACATCATCT GCTTCACCAT GTGCTGCATC 1260 
TACCGCCCCC TCAAGCCCAG GACCAATAAC CGCACGAGCC CCCGGGACAA CACCCTCTTA 1320 
CAGCAGAAGC TACTTCAGGA AGCCTACATG ACCCCTAAGG ACGATATCCG GCTGGTCGGG 1380 
GAGCTGGTGA CTGTCATTGG GGCTATCATC ATCCTGCTGG TAGAGGTTCC AGACATCTTC 1440 
AGAATGGGGG TCACTCGCTT CTTTGGACAG ACCATCCTTG GGGGCCCATT CCATGTCCTC 1500 
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ATCATCACCT 
G6GGA6GTGG 
GCCCGAGGAT 
GACCTGATGC 
TATATCATCT 
GCCCTGTTCA 
GTGGACCTGC 
CTCATGCTCA 
CGGGATGAGC 
CCTCGCTGCC 
TGGTTCCTGC 
CAGGCCTTCC 
CTGGGCTGTC 
TCCOGCAGCA 
GGGATAATCA 
GTTCTCACTT 
AACCAAACAC 
CGCCAAGGAA 
GGCAGAGGAA 
TGCTCATCAC 
TGCTTGGGCG 
ACCCAGGCAG 
AACGGCTCTG 
TTTTCGGATC 



ATGCCTTCAT 
TACCCATGTC 
TCCAGATGCT 
GATTCTGCTG 
TCCAGACAGA 
GCACCTTCGA 
CCTTCATGTA 
ACCTCCTCAT 
TGTGGAGGGC 
TGTGGCCTCG 
GGGTGGAAGA 
ACACCCGGGG 
CCTTCAGCCC 
GTGCCAATTG 
ACAGGGGTCT 
CGCTTCCTGG 
CCAGAGGTCT 
TGTACGTTGA 
GCCCAGCCCA 
CCTTCCGACA 
CTCGGTCTCC 
GTGCAGGGAA 
GGGGTGGGAA 
CCTGAAAAAA 



GGTGCTGGTO 
CTTTGCACTC 
AGGCCCCTTC 
GCTGATGGCT 
GGACCCCGAG 
GCTGTTCCTT 
CAGCATCACC 
TGCCATGATG 
CCAGATTGTG 
CTCCGGGATC 
CAGGCAAGAT 
CTCTGAGGAT 
CCACCTGTCC 
GGAAAGGCTT 
GGAGGACGGG 
AACTTGCTCT 
CATCTCCCAG 
GAATCACTGC 
AGCACGGGGC 
GGAGCACTGC 
ACCCCAGGGT 
GTGCAGAGCT 
GTGGGGCTAG 
AAAAAAAAAA 



ACCATGGTGA 
GTGCTGGGCT 
ACCATCATGA 
GTGGTCATCC 
GAGCTAGGCC 
ACCATCATCG 
TATGCTGCCT 
GGCGACACTC 
GCCACCACGG 
TGCGGACGGG 
CTCAACCGGC 
TTGGACAAAG 
CTTCCTATGC 
CGGCAAGGGA 
GAGAGCTGGG 
CATTTTCCTG 
GCCCCAGGGG 
TCCAGGCCTG 
TGGCAGGGCG 
ATGTCAGAGC 
CATAAGTGGG 
TGTGGAAAGC 
GTCTTGCCAA 



TGCGGCTCAT 
GGTGCAACGT 
TTCAGAAGAT 
TGGGCTTTGC 
ACTTCTACGA 
ATGGCCCAGC 
TTGCCATCAT 
ACTGGCGAGT 
TGATGCTGGA 
AGTATGGCCT 
AGCGGATCCA 
ACTCAGTGGA 
CCTCAGTGTC 
CCCTGAGGAG 
AATATCAGAT 
GGTGCATCAA 
AGAAAGAGGA 
CATTACTCCT 
TGAGGAACTC 
ACTTTAAAAA 
GAGAGAGCCC 
GTGTGAGTGA 
CTCCATCTTC 



CAGTGCCAGC 
CATGTACTTC 
GATTTTTGGC 
TTCAGCCTTC 
CTACCCCATG 
CAACTACAAC 
CGCCACACTG 
GGCCCATGAG 
GCGGAAGCTG 
GGGAGACCGC 
ACGCTACGCA 
AAAACTAGAG 
TCGAAGTACC 
AGACCTGCGT 
CTGACTGCGT 
ACAAAACAAA 
GTAGCATGAA 
TCAGCTCTGG 
TCCTGTGGCC 
CAGGCCAGCC 
TTCCCAGGGC 
GGGAGACAGG 
AATAAAGTCG 



Seq ID NO: Cl 94 DMA Sequence 

NUclelc Acid Accession ft: NM_021910.1 

Coding sequence: 260.. 601 



GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
CCCACTCCAC 
CCCGGGCCAG 
CTTTCCTGTC 
GCACAGCCTC 
CATCGTCATG 
TCCCCTGACC 
GTCCGGTCAC 
ATGAGGACAG 
TGCACAGAAA 
GGCAGGGCCT 
AATGATTGTG 
TTGGGGCACC 
CTGCTTGTTG 
CGCCCTCTGC 
AAGGAATGAA 
CATTCCAGGA 
ATAATAGTAA 
TTTCCAAATA 
AACATACAAT 
CCCAATACCT 
CTCAAAACAC 
ACAATAAACT 



11 

I 

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
GGTGCAGCTG 
CGCTCTGACA 
CTGGACGCCA 
CAGGTTGGCG 
AGTGAGTGGA 
ACTCAGCTCT 
CATCCAGGGG 
ACCAGCTGAA 
CTTGAACTCC 
CATCTCACCT 
CCTCTGCCCA 
ATGAGAAGGT 
GGAAAAGCCC 
CTCCTCAGCC 
AGAGAGCTCT 
AAACTGGGAC 
GCCCCTGTGA 
CCCAAATATC 
GACCACATAG 
AATAAGGCAT 
CCAAAAATAT 
CGTGACTTGT 



21 
I 

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
CGGCTTATCT 
TGCAGAAGGT 
ATGACCTAGA 
GGCTCATCTG 
GGAGCTCGGG 
CCCCAACAGG 
AGACTCCACC 
ATTGGGTGGA 
AGGATGGAAT 
CTCGCAAGAG 
AGCAGCCTGG 
TGGCGTGCCC 
ACAGGCCTGT 
ATGGGACCTC 
AACCAGATGG 
ATAGGATCGT 
CCCTCTCACT 
CCTGCAAGCC 
TGTGAGAACT 
GCGAAATGTT 
CTCCTCCAAT 
CCCCTC 



31 
I 

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
CTCAGCCCAG 
GACCCTGGGC 
AGATAAAAAC 
OGCTGGGGTT 
GGAGCAGGCG 
TGCAAAATGC 
TCTCATCACC 
GGACCGTTCT 
TCTTCCTCCT 
GGTCTCTTTG 
AGACTTCCTA 
TGGAGGCTGA 
TCCCTTGTGG 
ATATGCAATT 
AACACTGGAA 
CCCGCTATGA 
TACCCCGAGA 
CGTTAAATAA 
TCCAACAAGC 
CTCATGAACT 
GTCCTGAGAC 



41 
I 

ACATCATCTA 
GGACTCCGCC 
GGCCCGAGTT 
CGAGATGCCA 
CTGCTTGTGT 
AGTCCTTTCT 
CTGTGCGCCA 
GGCCGGGGCT 
AAATGCAAGT 
CCOGGCTCAG 
CTGTCCCCAG 
CTGCTGGGAC 
TTCAATTTTT 
TGTGTGCATT 
CACAGAGGCT 
CTTGGGACAT 
TGGGATTTAC 
CATTCCAGTG 
TGGAAGTGTT 
CCTCACTTTA 
TTCCCTATGC 
CTCAAAGTCC 
ACCCCACAAC 
ATGAACCCAA 



51 
I 

AGGAAAAGAA 
TGGCAGCCCG 
TCACCCAGTC 
GCCTTCCTGT 
TCCTGGCAGG 
ACTATGACTG 
TGGGCATCAT 
GGGGCTCCCC 
TTGGCCAGAA 
CCCAAAGCTG 
GTCCTGTCTC 
TCCTTTGCAT 
TTTAATCTAA 
GGGGTGGGGC 
GGCACTGAGC 
GGCACAGGCC 
TAGTAGCCAA 
GACCCTGGAC 
CAGACAGTTT 
TTACAAGATC 
TACCCTTAAT 
CTTGAGACTC 
ACGCCTAAAA 
AAAGAGACCC 



Seq ID NO i C19S DNA Sequence 

Nucleic Acid Accession #s NM_005971.2 

Coding sequence: 176.. 4 39 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2918 



GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
GAAGGTGACC 
CCTAGAAGAT 
CATCTGCGCT 
ATGCAAGTTT 
AGGCTCAGCC 
GTCCCCAGGT 
GCTGGGACTC 
CAATTTTTTT 
TGTGCATTGG 
CAGAGGCTGG 
TGGGACATGG 
GGATTTACTA 
TTCCAGTGGA 
GAAGTGTTCA 
TCACTTTATT 
CCCTATGCTA 
CAAAGTCCCT 
CCCACAACAC 



11 
I 

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
CTGGGCCTGC 
AAAAACAGTC 
GGGGTTCTGT 
GGCCAGAAGT 
CAAAGCTGAT 
CCTGTCTCTG 
CTTTGCATGG 
TAATCTAAAA 
GGTGGGGCTT 
CACTGAGCCT 
CACAGGCCCG 
GTAGCCAAAA 
CCCTGGACCA 
GACAGTTTAT 
ACAAGATCTT 
CCCTTAATAA 
TGAGACTCCC 
GCCTAAAACT 



21 
I 

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
TTGTGTTCCT 
CTTTCTACTA 
GCGCCATGGG 
CCGGTCACCA 
GAGGACAGAC 
CACAGAAACT 
CAGGGCCTCA 
TGATTGTGCC 
GGGGCACCAT 
GCTTGTTGGG 
CCCTCTGCCT 
GGAATGAAAG 
TTCCAGGAAA 
AATAGTAAGC 
TCCAAATACC 
CATACAATGA 
CAATACCTAA 
CAAAACACCC 



31 
I 

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
GGCAGGCTTT 
TGACTGGCAC 
CATCATCATC 
TCCAGGGGAG 
CAGCTGAAAT 
TGAACTCCAG 
TCTCACCTCT 
TCTGCCCAAG 
GAGAAGGTTG 
AAAAGCCCAC 
CCTCAGCCAT 
AGAGCTCTAA 
ACTGGGACAT 
CCCTGTGACC 
CAAATATCCC 
CCACATAGTG 
TAAGGCATGC 
AAAAATATCT 



41 
I 

ACATCATCTA 
GGACTCCGCC 
GGCGAGCGCT 
CCTGTCCTGG 
AGCCTCCAGG 
GTCATGAGTG 
ACTCCACCTC 
TGGGTGGAGG 
GATGGAATTC 
CGCAAGAGGG 
CAGCCTGGAG 
GCGTGCCCTG 
AGGCCTGTTC 
GGGACCTCAT 
CCAGATGGAA 
AGGATCGTCC 
CTCTCACTTA 
TGCAAGCCCG 
TGAGAACTTC 
GAAATGTTCT 
CCTCCAATGT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1466 



51 
I 

AGGAAAAGAA 
TGGCAGCCCG 
CTGACATGCA 
ACGCCAATGA 
TTGGCGGGCT 
CAAAATGCAA 
TCATCACCCC 
ACCGTTCTCT 
TTCCTCCTCT 
TCTCTTTGTT 
ACTTCCTATG 
GAGGCTGACA 
CCTTGTGGCT 
ATGCAATTTG 
CACTGGAACA 
CGCTATGATG 
CCCCGAGACC 
TTAAATAATT 
CAACAAGCCT 
CATGAACTAC 
CCTGAGACAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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Seq ID NO: C196 UNA Sequence 

Nucleic Acid Accession 8: NM_004961.2 

Coding sequence: 55..1S7S 

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCOGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 120 

CCTCAGACTG AATCAAAGAA TGAAGOCTCT TCCCGTGATG TTGTCTATGG COCCCAGCCC 180 

CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 240 

AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 300 

GACCACAAAC TGCGCCCTGG CATTGGAOfAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 360 

GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 420 

TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 480 

AATGGCAATG TGGTGAGCCA GCTATGGATC COGGACACCT TTTTTAGGAA TTCTAAGAGG 540 

ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600 

GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660 

CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720 

ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780 

TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 84 0 

GACTTCATGQ TCATOACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 

CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 

ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020 

TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATAT CACAG C CTTGGATTTC 1080 

TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140 

TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TOCTCGTATC 1200 

AATAGCCGTG CCCAT6CCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCGG CCAACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GOGOOCGTCT 1320 

TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 

AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT 1440 

TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 1500 

TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 1560 

TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 1620 

GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 1680 

AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 1740 

CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 1800 

TGGGCCACCT CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 1860 

ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 1920 

AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 

CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 2040 

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 2100 

TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 2160 

CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 

TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 2280 

ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 2400 
ACTTTCCCAG TGACTTCCCC TAGCCCTGAC OCAGGCACTA GGCCTTGGTG ACTTCCTGGG 2460 
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 2520 
ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 2580 
CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 
GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 2700 
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 2760 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 2820 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 2880 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 2940 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 3000 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 3060 
TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 3120 
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 3168 

Seq ID NO: C197 DNA Sequence 

Nucleic Acid Accession S: NM_021984.1 

Coding sequence: 572.. 1753 ~ 



1 
I 

GCCAGAGCGT 
TCCAAAGTTC 
CAGAGAAGTG 
GTGTAAAGAA 
CACTGCCTCC 
TCAGACTGAA 
GCCTCTGGAA 
CAGAGTTGGC 
CCACAAACTG 
CAACAGCCTT 
CCAGACCTGG 
TGGCAATGTG 
CCACGAGCAT 
GTTGTACACA 
AATGGATTCT 
GATCTACAAG 



11 
I 

GAGCCGCGAC 
TTCCAGTCCT 
CTCAAATCAT 
AGCCAAATCA 
CAGCAAAGGC 
TCAAAGAATG 
AATCAGCTCC 
AAACTGCCAG 
CGCCCTGGCA 
GGTCCTCTCT 
TACGAOGAAC 
GTGAGCCAGC 
GAGATCACCA 
ATTAGGATGA 
CACTCTTGCC 
TGGGAAAATT 



21 

1 

CTCOGCGCAG 
CCTAGGCATC 
AAGTGTACAG 
AGGACCCGAA 
AGCACTATCC 
AAGCCTCTTC 
TCTCTGAGGA 
AAGCCTCTCG 
TTGGAGAGAA 
CTATCCTAGA 
GCCTCTGTTA 
TATGGATCCC 
TGCCCAACCA 
CCATTGATGC 
CTCTATCTTT 
TCAAGCTTGA 



31 
I 

GTGGTCGCGC 
TTATTGATCC 
CTGATGAGTT 
TGTGAGCAGG 
GGACTTCTAA 
CCGTGATGTT 
AACAAAGTCA 
CATCCTGAAC 
GCCCACTGTG 
CATGGAATAC 
CAACGACACC 
GGACACCTTT 
GATGGTCCGC 
CGGATGCTCA 
CTCTAGCTTT 
AATCAATGAG 



41 
I 

CGGTCTCCGC 
TCCAGTCGAG 
GTCAAAAAAT 
ACCTCAGAAG 
CACCATCGGG 
GTCTATGGCC 
ACTGAGACTG 
ACTATCCTGA 
GTCACTGTTG 
ACCATTGACA 
TTTGAGTCTC 
TTTAGGAATT 
ATCTACAAGG 
CTCCACATGC 
TCCTATCCTG 
AAGAACTCCT 



51 
1 

GGAAATGTTG 
AACATGTATA 
GACCACAGCG 
CCCCCTTTGT 
TCGAGGGACC 
CCCAGCCCCA 
AGACTGGGAG 
GTAATTATGA 
AGATCTCCGT 
TCATCTTCTC 
TTGTTCTGAA 
CTAAGAGGAC 
ATGGCAAGGT 
TCAGATTTCC 
AGAATGAGAT 
GGAAGCTCTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080 

AAACTATGTC CCTTCTTCOG TGACCACGAT GCTCTOCTGG QTTTCCTTTT GOATCAAGAC 1140 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGQ GATCACCTCT GTTCTGACCA TGACCACGTT 1200 

GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260 

TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTOGTATCAA 1380 

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCOGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCXX3TCTTG 1500 

CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCOGCCT CTGCATCCAT GTCTACGGCC TGGATAACTA 1680 

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTOCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640 

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT CGCCATTGAT TGGTGCCCAC 2700 

CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC TGGACCCATA AACCAGTCCA 2760 

CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 2820 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGG AGCTTCATGA 2880 

TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG AAAACCCTGA 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 3000 

CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC ATGATTCTCT TCTGTCACCT 3060 

CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGACCTC TCTTTAGCTC AATTTCTGGG 3120 

CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 3180 

ATGAATTTGQ ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTCT CTGTAATTTG 3240 

TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 3300 

AAATATGTAA ATAAATATAT CAGCAAAGC 3329 

Seq ID NO: CI 9 8 DNA Sequence 
Nucleic Acid Accession 8: NM_021987.1 
Coding sequence: 572.. 1657 

1 11 21 31 41 51 

I I I I i I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTOGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT TCCGTGACCA CGATGCTCTC 1020 

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380 

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

CA A TGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740 

CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTGTC TTTGCTGCCC CTCTOCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 

CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 

GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TQCTTTO GG C CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 2160 

GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 

1324 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCTTTCTCTG 
AGAGAGCCTA 
GCTGGCTCCA 
TTTTGTGTGT 
GACCCCTGTG 
AGGCCTTGGT 
ACTCGCCATT 
CCCTGGACCC 
CAATCAAATT 
TGTGTCTGAT 
CATTTTGCAG 
GCTATCCAAG 
ACCATGATTC 
CTCTCTTTAG 
CCTGGCTGTA 
GTGAAATCTG 
TGTCACCATC 



CAGATAGATA 
TTTGGGACAG 
TCTTTCGTCT 
GATTATAGTA 
ACTCTTTCTG 
GACTTCCTGG 
GATTGGTQCC 
ATAAACCAGT 
CCCTTAAATT 
TGGAGCTTCA 
ATGAAAACCC 
AGCCCACTGT 
TCTTCTGTCA 
CTCAATTTCT 
GTAACCCAGT 
TGTCTGTAAT 
ATCTGAAATG 



GACACTGGCA 
CATTCCTCTC 
GCACTACCAA 
ACTACTCCCT 
TAACTTTCCC 
GGCCAAGAAA 
CACCCAGGGC 
CCACTGTTAT 
TGTATGGCAC 
TGATAGCCTT 
TGAGTCACAG 
CACCTTCTAG 
CCTCTGCTGG 
GGGCCTGAGG 
GGAATGAATT 
TTGTTGGGGG 
GGGAAATATG 



TTATCCCTTT 
TCTCTGCTGC 
TTCAATGCCC 
GCTTTATATG 
AGTGACTTCC 
CTAAGGAAAC 
ACACTGTCGG 
ACCCGGGGCA 
TGGAACTTTG 
GTGACATCTT 
ATTTCTGTGG 
ACCACATGAT 
CACACCAGTG 
TGCTCAGACT 
TGGACATGCC 
GTGGATAGGG 
TAAATAAATA 



AGGAAGAGGG 
TGTGACATCT 
TTCATCCAAT 
CCACCCTCTT 
CCTAGCCCTG 
TCGGCTTTGC 
AGTTCTATCA 
CTCTAACCAT 
GCAAAGCACT 
TAGGGCAGQA 
GACTGTGGAT 
AGGGCTAGAC 
GCAAGGCCCA 
GCCCCCAAGA 
CCAATGCTTC 
TGGGGTCTCC 
TATCAGCAAA 



Seq ID NOs C199 DNA Sequence 

Nucleic Acid Accession #: NM_021990.1 

Coding sequence: 1309.. 2490 



11 



GGGGGCAGCA 
CCCTCTCCTT 
GGGTATCTAT 
CCTTCTCTTT 
ACCAGGCACT 
AACAGGCATT 
CTTGCTTGAC 
CACAATCAAT 
TTTGACAAGT 
TTCTTATCCC 
CTCACTGGAA 
AGCTCAGTTC 
GAATGGCGAC 
TCAAATCTCT 
TATATGCTAA 
ATCTACTTTT 
GC 



51 



2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3232 



21 31 41 

I 1 I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCOG TGATGTTGTC 1080 

TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 1500 

TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGGGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 
CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 
TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 
TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 
CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 
TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 
AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 
GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 
CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 
TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 
TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 
ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 
TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAG GAAGA G 3000 
ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 
CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 
CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 
CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 
TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 
GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 
GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 
ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 
CCCATAAACC AGTCCACTGT TATACCOGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 
ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 
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GATTGGAGCT TCATGATAGC CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 

CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTO GATCTCACTG GAAGCTATCC 3720 

AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCOCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 

CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT TTTTGTCACC 4020 

ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC AAAGC 4 065 

Seq ID NO: C200 DNA Sequence 

Nucleic Acid Accession #: NM_021819.1 

Coding sequence : 39.. 1619 

1 11 21 31 41 51 

I I I I I I 

TAGGGCGGTC GCGTAATTCG GCACGAGGGC GCTCACGTAT GCCGGCGGTC AGTGGTCCAG 60 

GTCCCTTATT CTGCCTTCTC CTCCTGCTCC TGGACCCCCA CAGCCCTGAG AOGGGGTGTC 120 

CTCCTCTACG CAGGTTTGAG TACAAGCTCA GCTTCAAAGG CCCAAGGCTG GCATTGCCTG 180 

GGGCTGGAAT ACCCTTCTGG AGCCATCATG GAGACGCCAT CCTGGGCCTG GAGGAAGTGC 240 

GGCTGACGCC ATCCATGAGG AACCGGAGTG GCGCCGTGTG GAGCAGGGCC TCTGTCCCCT 300 

TCTCTGCCTG GGAAGTAGAG GTGCAAATGA GGGTGACGGG ACTGGGGCGC CGGGGAGCCC 360 

ACGGCATGGC CGTGTGGTAC ACCCGGGGCA GGGGCCATGT AGGCTCTGTC CTTGGGGGGC 420 

TGGCTTCGTG GGACGGCATC GGGATCTTCT TTGACTCTCC GGCAGAGGAT ACTCAGGACA 480 

GTCCTGCCAT CCGTGTGCTG GCCAGCGACG GGCACATCCC CTCTGAGCAG CCTGGGGATG 540 

GAGCTAGCCA AGGGCTGGGC TCCTGTCATT GGGACTTCCG GAACCGGCCA CACCCCTTCA 600 

GAGCACGGAT CACCTACTGG GGGCAGAGGC TGCGCATGTC CTTGAACAGT GGCCTCACTC 660 

CCAGTGATCC AGGTGAGTTC TGTGTGGATG TGGGGCCCCT GCTTTTGGTC CCTGGAGGTT 720 

TCTTTGGGGT CTCAGCAGCC ACCGGCACCC TGGCAGATGA TCATGATGTC CTGTCCTTCC 780 

TGACCTTCAG CCTGAGTGAG CCCAGCCCAG AGGTTCCCCC TCAGCCCTTC CTGGAGATGC 840 

AGCAGCTCCG CCTGGCGAGG CAGCTGGAAG GGCTGTGGGC AAGGCTGGGC TTGGGCACCA 900 

GGGAGGATGT AACTCCAAAA TCAGACTCTG AAGCTCAAGG AGAAGGGGAA AGGCTCTTTG 960 

ACCTGGAGGA GACGCTGGGC AGACACCGCC GGATCCTGCA GGCTCTGCGG GGTCTCTCCA 1020 

AGCAGCTGGC CCAGGCTGAG AGACAATGGA AGAAGCAGCT GGGGCCCCCA GGCCAAGCCA 1080 

GGCCTGACGG AGGCTGGGCC CTGGATGCTT CCTGCCAGAT TCCATCCACC CCAGGGAGGG 1140 

GTGGCCACCT CTCCATGTCA CTCAATAAGG ACTCTGCCAA GGTOGGTGCC CTGCTCCATG 1200 

GACAGTGGAC TCTGCTCCAG GCCCTGCAAG AGATGAGGGA TGCAGCTGTC CGCATGGCTG 1260 

CAGAAGCCCA GGTCTCCTAC CTGCCTGTGG GCATTGAGCA TCATTTCTTA GAGCTGGACC 1320 

ACATCCTGGG CCTCCTGCAG GAGGAGCTTC GGGGCCCGGC GAAGGCAGCA GCCAAGGCCC 1380 

CCCGCCCACC TGGCCAGCCC CCAAGGGCCT CCTCGTGCCT GCAGCCTQGC ATCTTCCTGT 1440 

TCTACCTCCT CATTCAGACT GTAGGCTTCT TCGGCTACGT GCACTTCAGG CAGGAGCTGA 1500 

ACAAGAGCCT TCAGGAGTGT CTGTCCACAG GCAGCCTTCC TCTGGGTCCT GCACCACACA 1560 

CCCCCAGGGC CCTGGGGATT CTGAGGAGGC AGCCTCTCCC TGCCAGCATG CCTGCCTGAC 1620 

CCACCTCAGA GCCTGCTTTG CATCACTGGG AAGCAGGCAG TGTCTTGGGT GGGGGCTTGG 1680 
TCAGTATCCT CTCCGTCTGG GTGCCCAGCT CCCACGCACA CCTGAGCTTT CGGCATGCTC 1740 
CCACCTCGTT AAAGGTGATT TCCCTCTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1800 
A 1801 

Seq ID NO: C201 DNA Sequence 

Nucleic Acid Accession #: XMJ.17036.1 

Coding sequence: 25.. 495 

1 11 21 31 41 51 

I I I I I I 

AGCCAAGAGA GGGGGGACAG ACAGATGGAA AGACGGACAC GGGGAGCTCT GGGGAGCAGG 60 
CGGCCCCCAC CCCCACTTCC AGCTCTGAGA CATTTGTGCA CAGGCCTGCA GGCGGCTGGG 120 
ATGGCCTGGC CGGGCACGCT CTGGAGACAC ACATGCCAAG GCAGGGCCNG GGCAGCCGAG 180 
GGCCCGTGGG GGCTGTTCCG ACCACACAGG TGCCCGAGGG AGGCAGGGCA GGCCCCGGTG 240 
GGACCCAGCC CCGAAACGCA GGGTGTCGCC CACGTTTGCA GCCGCGCTCG CGTGTCGGTG 300 
GATGAGCGGG AACCGGGGGG CGGCGCCTAC GCAATGCACG TTAOGCCGCG GTGGAAGGGG 360 
TGTCACOGCC ACAGCGGCCG CACCGTGCGT GGCTCTGTTT CCTGGAAACG TCCAGAACAG 420 
GCAGCCCCCG AGACAGGGAG GGGCCCAGCG GTCGCCAGGG GAAGTGGGGA TGGGAATGAG 480 
TGCGGATGGG GCTGAGGTTC TTCCTGAAGG ATGAATGCTC TGAAACTGTG AATAGACACA 540 
CGGAACTGTG CACGTTCACN GGTGAACCGT GTGGGGAGTG AATCCATCTC AACAGAGCTG 600 
TTACACAGTG AACAAGAGCA CGANGGGTGG AGGCCNCTTG TCTGCTCCTG CAGATCGAGC 660 
CGGCTGCTCA GAGCCGGGAG ATGGCCACTC AGCTCCTTTA ACTG 704 

Seq ID NO: C202 DNA Sequence 
Nucleic Acid Accession 8: XM_167803.2 
Coding sequence: 11 62.. 1488 "~ 

1 11 21 31 41 51 

I I I I I I 

AACATCATAC ATAGTAGGTG AATCGTTTTG TAGAGTGAAG AATGCTAATG TAAAGCAAAT 60 
AGTCACCCAC GTTCCTTGTA AATCCAAATG TTTCTATATT GTAGCTTTGC TTAAAATGGG 120 
GTCGGCCCCA ACTGCATCCT CCTCTTTGGC GGGCTGGGGA GCGGCCCCCA GCCGGGACGG 180 
GAGGGCAGCG ACCCOGAGGC CTCGTGGACG TGGGAGAGAG TGTGGTGGGA AGTCTTGAGC 240 
GGAGGAGGGG ATCTGCCCTT CTCCACTCCT CTCTTGGATC CGCCTCGGTT TCCTGTCCCC 300 
CCACCACCCG CCTGCCCCGC GGAAGACCGC CCAGTGAGCC AGCCCCCACC TTCCAGGCGC 360 
CTTCGCCCTG GGGATCCAAC CAACTTGTAT CGAGTGGGCG GGGCAACGGC TCCCCATTTT 420 
TCCCGAGCCC CGCCCACAGA GCTCTTAGCC AATCCTATGC AGAGAGCATC TCCTGGCAGG 480 
GGTCTCTTCC CAACCAGACC CCACCCAGGC ACATTAGCGA CCAGGCTTGG GCTTCCCCAG 540 
CGCCCCACCA CCACCACGTG CAGGTGGAGC TCTGGGATGC TATGTTGGGG CGGCAAGCGG 600 
TGGGCCGAGG GCCGGGTAGG CTAGCAGGGG AGGTAAGGGT GGTATGGGAT GGGGCGGGGG 660 
CGGTCTAGGG CAATAGGAGA GCAGAGAATG GGGGAACTTG AGGGTGGGGG GAGGGCACCG 720 

1326 



WO 03/042661 



PC17US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GAGCCTTGCC 
CCCCATCTGT 
CACCATCACC 
AGrGTAATTA 
AGCTAGCTGA 
GGCTTCGTCT 
TTTCTTGTGA 
GCACCTGCCC 
CCAGGGGGGC 
GAGTGCAAGT 
GCAGOGAAGA 
ACAAAATTTT 
TATTTTTGTG 
CGTGAAAAAG 



ACCATCCCAG 
AAAATGATGG 
TGACCTGGGG 
CCGAATCCTG 
GGCCAACGCA 
GCCATCTTTG 
CCATCTCCCC 
TCTGGCACCA 
TCCCCGGAGC 
GTTTTATAAA 
GAAATGAAGA 
TCCACCACAC 
GGAATGGGCC 
T 



GACTTTGGGC 
TAATAATACT 
GTCAAGGCAG 
ACTGCAAGGC 
GGCCCATCCG 
TGGGTGCCCT 
ATGAAAGTGC 
GATGCCAGGG 
CCCAGGGGTG 
AAATAAAACA 
AAAACTGAAG 
ATACCCTCTA 
CTGCTTTTTG 



AAGTCACCCG 
TCACCTACCT 
GAGGACTCCG 
CCACCTGCCC 
TCTCTTCACT 
AGACTTAGTC 
TGTACAAATT 
AAGGGACAGA 
GGGATTGGTG 
AAAACCCACC 
AAAAAAAAAA 
AGCCAGCAAG 
TGGCAAGGCC 



CACTCCCTGG 
CATAGGGGAG 
AAGGTGCTAC 
CTCCCCCACA 
CTGTCGCAGG 
CTTATCTTGT 
CCACCCGCCC 
GGAAAACAGC 
GCCACTGTTT 
ATCACAAAAA 
AACAGGAAAA 
ATTTCCTCTT 
TGTTCTGATT 



GCCTCGGTTT 
GTTGTGAGGC 
CCGTGAGCAA 
GAGCCTCCAG 
CCCTTTCATG 
CCTGGTTTCC 
CAGGACCCCC 
CACAAACAAG 
GTATGTTCTT 
AAAAAATTTT 
AAAGAACCAT 
TGCAAAATCA 
AATAAAGGAT 



Seq ID NO: C203 DNA Sequence 

Nucleic Acid Accession 8: NM_024780.1 

Coding sequence : 31 . . 1023 



ATTAATCTGG 
CCACGGCACG 
GGCATTCTTT 
ACCCTCATTG 
AATTCCTTCC 
GGCCTTCAGG 
GTGTGGATTG 
ATCATGTTCT 
TGGOGGGCCT 
GGGGTCTTGT 
CCTTTTCGAG 
ACACGGCCTG 
TTCTTTTTCA 
GAGGGAAGGA 
AAAATGTTCC 
AGCTCACTTG 
CATGATGGCA 
TGATGACTCT 
ATTTCTTCCA 
CAGGAGCGGA 
TTTAATACCT 
GTTGTAATAT 
ATTATCTTCA 
TCCCAGACCT 
CATGTAATAC 
ACTTTGAACT 
CTGAACTTTC 
CCCCTGTGAG 
AGTACOGTAG 
ATACCCCACC 
GAGACTGAAT 



11 

I 

CCGTGCCATG 
AAGTCTACGT 
GTTACTATTG 
GCCAGGACAT 
TGGGGGAGTT 
AGTTTGACAT 
GCATCTTCTT 
ACTCCAAAAA 
CACAGATGAT 
GCACCCTGGC 
GTCTGCCTCT 
GCTACCTGTQ 
TCCTCACCCT 
AGATTATGAT 
TGATAGAAAA 
TTCTGGAAAG 
GTCTTGACTT 
TTTGGTAACC 
TGCCACCTGT 
AACTGACTAC 
GGATTGCTGA 
TGAAATGAGC 
AGAAGTGTGT 
GGCAAAGGTT 
TCCAGCAGGG 
AAAGACATCC 
TTGCGTACTG 
GTGTTATACA 
CCAATGAGAT 
GTGATGACTT 
AAG 



21 
I 

CATCTACTCC 
TCTCCTGATC 
GCTCAACACC 
CTACCGGCTC 
TCTGAGGAGA 
TGCCAGGAAC 
CTGCCCCCTG 
TATCAGCCTG 
GACTTTCTTC 
CATCACCATC 
CTTCATTCAC 
GGTTGTTTGG 
CATTGTGCTA 
AAGGCTGCTC 
ATTGATCAAG 
GAGAGAGGTG 
GCGATCTAGA 
AGACACCAAT 
GCCTTTAGGA 
CATGTAATTA 
TTTTTCAAGA 
CTACAAAAAC 
GCAGGAATGA 
TAGAAACTGT 
ATATGAAGCC 
CCCATGTCCC 
ATTACATGAG 
TGACCATCAA 
TTATCCGAGA 
GAAATATAAT 



31 
I 

ATGTTCAGGC 
CGAAACATCT 
GTGGCCCTGT 
CTTCTGATGG 
ATCATTGGGA 
GTTCTAGAAC 
CTGCCCTTTA 
ATGATGAATT 
ATCTTCTTGC 
TGGAGATTGA 
TCCATCTACA 
ATCTATCGGA 
ATCATCACCT 
CATGAGCAGA 
CTGCAGGATA 
GAGCAACAAG 
AGATCAGTTC 
CAAATAAGGG 
ACTGCCCAGA 
TCAAAGTAAA 
CAAAATACTT 
CTAGGAAGAG 
TTGGTTCTTA 
TGCTAAGAAA 
TCTGAATTGT 
AAAGGTGGAA 
TCTTTGGAGT 
AGTCCTACGT 
CGCGATTATT 
CAGCGCTGGC 



41 

I 

TTGTGGAGAG 
TTTTGAAAAT 
CTGGTGAAGA 
ATTTTGTGTT 
TGCAACTGAT 
TGATCTATGC 
TCCAAATGAT 
TCCAGCCTCC 
TCTTTTTCCC 
AGCCTTCAGC 
GCTGGATCGA 
ACCTCATTGG 
ATCTTTACTG 
TCATTAATGA 
TGGAGAAGAA 
GCTTTTTGCA 
AAGAAGGTAA 
GAGGAGAOGA 
AGAAAATCCA 
ATTGGGCATT 
GGGGTTTTCC 
ATAACTAGGG 
GAAATCTCTC 
AGTGGTCCAT 
AGAACCTGCA 
TACAACCAGA 
CGGGGATGGA 
CAAGCTAGCT 
GCTAATTGGA 
AATTTTTGAC 



51 

I 

GTACGAGATG 
ATCAATCATT 
GTGTTGGGAA 
CTCTTTAGTC 
CACAAGTCTT 
ACAAACTCTG 
TATGCTTTTC 
GAGCAAAGCC 
ATCCTTCACC 
TGACTGTGGC 
CACCCTAAGT 
AAGTGTGCAC 
GCAGATCACA 
GGGCAAAGAT 
AGCAAACCCC 
TTTGGGGGAA 
TCCAAGGGCC 
AAATGGAATG 
AGGCTTTAGC 
CCATGCTATT 
AATAAAGATT 
AATAATGTAT 
CTGCCAGACT 
CCTGAATAAA 
TTTATTTGTG 
GGTCTCATCT 
GGAGGTTCTG 
TTGCAGTGGC 
AATTTTCCCA 
AGTCTCTACG 



seq ID NO: C204 Protein Sequence 
Protein Accession #: Eos sequence 



1 
I 

AGCGGGGTTA 
ATCCTGGCAT 
CCCTGOGAGT 
CAGTGGAGAA 
TCATCATACA 
TTTTTCAAGA 
TTGATGTGAT 
TGCGGTTTTC 
TGGGCATAGA 
CCCTTATACC 
TGGAAGAGAA 
GTATCCTTCA 
GATCCAAGAA 
AGATCATTGG 
TTTTGAAGTT 
TTACCGTGGC 
GTTATTTTAG 
GGAACAGCGG 
TGACCACCTG 
TTAATCCCCA 
TCACTCATGA 
ACCTGTCAGA 
TCTCTGCCTA 
CCGTTTATTA 
CGGTGCTGTT 
CCATGTTCAG 



11 
I 

GATGCAGGGA 
TTTAAAAATA 
CCCAATCAAT 
GAAGGCTTGC 
GGTTGAGAAT 
AAAGGTGCTG 
AGACTCTCAG 
AACATCTTTG 
CACTCCTGGT 
CATGACATCC 
AAGGAACCTT 
GCTCAATTGC 
CAGCCTGTCG 
AGGCAAGTTT 
CAACATTTTC 
CAAAAAGAAC 
GGACACAGTG 
GGCATCCTAC 
CTTCTTCAGT 
CATTTACTCC 
AAAAGCTGTG 

gctcogtcag 

CATGGTAGCC 

cctggctgag 

ACTGCCTTTC 
GCTTGTGGAG 



21 
I 

GTTATGTTGT 
ATGTGAATGG 
GCGGGTGTGA 
AGGAGGCAGG 
GTTTCCTCTG 
CTAGACTCAA 
ACAGTCAGCA 
AATGAGTCGA 
TCTTCACATG 
AGAGACAGAA 
AGGAAAATAG 
TGTATTCAGT 
GAAATTCTGA 
GGAACCAGCG 
TCATTCATCC 
ACCCTCCAGT 
ATGTACTATG 
AACATGCAGC 
TTGCTGTTCA 
GGAGGGATCA 
AAGCTAAAAC 
GAGAATTCCA 
TGGGTTGTCT 
TACAACTTAG 
GTTGTGTCCT 
AGGTACGAGA 



31 
I 

GATTTGTGTA 
TGTATCCCTT 
CTGCTGTATG 
AGATGCTGTC 
GGGTCCAAAG 
GCATCAACAT 
AAAGGAATGA 
TGTCTCAGAC 
AAACTGTTCA 
TTAAAGCCAT 
TTGACAAAGA 
GTCTGAACTC 
ATTCCATCAG 
TCCTCTCCTA 
TGAACTTCAG 
TCACTGGGCT 
GCTTTTACAC 
TGGCCTACAT 
GCATGGCCAA 
CCAAGCTGAT 
AGAAGAATCT 
AGTTGACGTT 
CTACAGGAGT 
AGTTCCTGAA 
GCATTAATCT 
TGCCACGGCA 



41 
I 

GGGGTGGCTT 
TGTGATCATC 
AAGTCTCAGG 
CGATGACCAC 
CCACCCATCC 
GGTTTTGTCA 
CCAAAAGGGT 
CCTTCATAGC 
AGGACAGAAG 
CAGGAACCAG 
AAAAAGCAAA 
CATTTCCCGG 
CCTGTGGCAG 
TTTCAACTTT 
CTTCATCATA 
GGAGTTTTTC 
CAATTCCACC 
CTTCACAATC 
GTATTTCCGG 
CTTTTGCTGG 
TAGCACTGAG 
CAATCAGCTG 
GGCCATAGCC 
GACACACAGT 
GGCCGTGCCA 
CGAAGTCTAC 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1511 



51 
I 

TGGTGAGCTC 
ATCACTTTTT 
AAGCCCACCC 
GTGAATGAAA 
TCAAATCAGA 
ATATCTGACA 
AACCAGGTGC 
CTAGAATGCA 
TTAATCGCAT 
CCAAGGACCA 
CAGACCCATC 
GCTTATCGGA 
AAGACGCTGA 
CTGAGATGGC 
ATCCCTCAGT 
ACTGGGGTGG 
ATCCAGCACG 
GGAGCATGCT 
AACAACTTCA 
GACTTCACTG 
ATAAGGGAGA 
CTGACCCGCT 
TGCTGTGCAG 
AACCCTGGGG 
TGCATCTACT 
GTTCTCCTGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1813 



60 
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180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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TCCGAAACAT CTTTTTGAAA ATATCAATCA TTGGCATTCT TTGTTACTAT TGGCTCAACA 1620 

COGTGGCCCT GTCTGGTGAA GAGTGTTGGG AAACCCTCAT TGGCCAGGAC ATCTACCGGC 1680 

TCCTTCTGAT GGATTTTGTG TTCTCTTTAG TCAATTCCTT CCTGGGGGAG TTTCTGAGGA 1740 

GAATCATTGG GATGCAACTG ATCACAAGTC TTGGCCTTCA GGAGTTTGAC ATTGCCAGGA 1800 

J ACGTTCTAGA ACTGATCTAT GCACAAACTC TGGTGTGGAT TGGCATCTTC TTCTGCCCCC 1860 

TGCTGCCCTT TATCCAAATG ATTATGCTTT TCATCATGTT CTACTCCAAA AATATCAGCC 1920 

TGATGATGAA TTTCCAGCCT CCGAGCAAAG CCTGGOGGGC CTCACAGATG ATGACTTTCT 1980 

TCATCTTCTT GCTCTTTTTC CCATCCTTCA COGGGGTCTT GTGCACCCTG GCCATCACCA 2040 

TCTGGAGATT GAAGCCTTCA GCTGACTGTG GCCCTTTTCG AGGTCTGCCT CTCTTCATTC 2100 

1U ACTCCATCTA CAGCTGGATC GACACCCTAA GTACACGGCC TGGCTACCTG TGGGTTGTTT 2160 

GGATCTATCG GAACCTCATT GGAAGTGTGC ACTTCTTTTT CATCCTCACC CTCATTGTGC 2220 

TAATCATCAC CTATCTTTAC TGGCAGATCA CAGAGGGAAG GAAGATTATG ATAAGGCTGC 2280 

TCCATGAGCA GATCATTAAT GAGGGCAAAG ATAAAATGTT CCTGATAGAA AAATTGATCA 2340 

AGCTGCAGGA TATGGAGAAG AAAGCAAACC CCAGCTCACT TGTTCTGGAA AGGAGAGAGG 2400 

15 TGGAGCAACA AGGCTTTTTG CATTTGGGGG AACATGATGG CAGTCTTGAC TTGCGATCTA 2460 

GAAGATCAGT TCAAGAAGGT AATCCAAGGG CCTGATGACT CTTTTGGTAA CCAGACACCA 2520 

ATCAAATAAG GGGAGGAGAY GAAAATGGAA TGATTTCTTC CATGCCACCT GTGCCTTTAG 2580 

GAACTGCCCA GAAGAAAATC CAAGGCTTTA GCCAGGAGCG GAAACTGACT ACCATGTAAT 2640 

TATCAAAGTA AAATTGGGCA TTCCATGCTA TTTTTAATAC CTGGATTGCT GATTTTTCAA 2700 

GACAAAATAC TTGGGGTTTT CCAATAAAGA TTGTTGTAAT ATTGAAAWRA RMMWAMAAAA 2760 

ACCTAGGAAG AGATAACTAG GGAATAATGT ATATTATCTT CAAGAAGTGT GTGCAGGAAT 2820 

GATTGGTTCT TAGAAATCTC TCCTGCCAGA CTTCCCAGAC CTGGCAAAGG TTTAGAAACT 2880 

GTTGCTAAGA AAAGTGGTCC ATCCTGAATA AACATGTAAT ACTCCAGCAG GGATATGAAG 2940 

CCTCTGAATT GTAGAACCTG CATTTATTTG TGACTTTGAA CTAAAGACAT CCCCCATGTC 3000 

CCAAAGGTGG AATACAACCA GAGGTCTCAT CTCTGAACTT TCTTGCGTAC TGATTACATG 3060 

AGTCTTTGGA GTCGGGGATG GAGGAGGTTC TGCCCCTGTG AGGTGTTATA CATGACCATC 3120 

AAAGTCCTAC GTCAAGCTAG CTTTGCAGTG GCAGTACCGT AGCCAATGAG ATTTATCCGA 3180 

GACGCGATTA TTGCTAATTG GAAATTTTCC CAATACCCCA CCGTGATGAC TTGAAATATA 3240 

ATCAGCGCTG GCAATTTTTG ACAGTCTCTA CGGAGACTGA ATAAG 3285 



20 



25 



30 
35 
40 
45 
50 
55 
60 
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70 
75 
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Seq ID NO: C205 DNA Sequence 

Nucleic Acid Accession NM_002250.1 

Coding sequence: 397.. 1680 "~ 

1 11 21 31 41 51 

I I I I I I 

GTCCTTCGGT GTCTGGGTGT GGTGAGTAGA GGTGTGTGTC ACAAAGTACA GACCATTGTG 60 

TGTGACAAAG CCCATCGTGT GTCTGTGTGT GTCTTTATCC ACGTGGATGG ACGTCTCTTT 120 

CTTGCTCTGC CCCAAGACAC ACCCTAGCCC CTCCTTATTC TCAAAAGGGG GAGCTGGGGA 180 

GCCTCCCCCT ACCCTGGGGC CTCCCCTGCC CCTCCCCGCC CTGCCTGGCC GTCACCACTC 240 

CCCAGAGGGC ACAGGGCTCT GCTGTGCCTC AGAGCAAAAG TCCCAGAGCC AGCAGAGCAG 300 

GCTGACGACC TGCAAGCCAC AGTGGCTGCC CTGTGCGTGC TGCGAGGTGG GGGACCCTGG 360 

GCAGGAAGCT GGCTGAGCCC CAAGACCCCG GGGGCCATGG GCGGGGATCT GGTGCTTGGC 420 

CTGGGGGCCT TGAGACGCCG AAAGCGCTTG CTGGAGCAGG AGAAGTCTCT GGCCGGCTGG 480 

GCACTGGTGC TGGCAGGAAC TGGCATTGGA CTCATGGTGC TGCATGCAGA GATGCTGTGG 540 

TTCGGGGGGT GCTCGTGGGC GCTCTACCTG TTCCTGGTTA AATGCACGAT CAGCATTTCC 600 

ACCTTCTTAC TCCTCTGCCT CATCGTGGCC TTTCATGCCA AAGAGGTCCA GCTGTTCATG 660 

ACCGACAACG GGCTGCGGGA CTGGCGCGTG GCGCTGACOG GGCGGCAGGC GGCGCAGATC 720 

GTGCTGGAGC TGGTGGTGTG TGGGCTGCAC CCGGCGCCCG TGCGGGGCCC GCCGTGCGTG 780 

CAGGATTTAG GGGCGCCGCT GACCTCCCOG CAGCCCTGGC CGGGATTCCT GGGCCAAGGG 840 

GAAGCGCTGC TGTCCCTGGC CATGCTGCTG CGTCTCTACC TGGTGCCCCG CGCCGTGCTC 900 

CTGCGCAGCG GCGTCCTGCT CAACGCTTCC TACCGCAGCA TCGGCGCTCT CAATCAAGTC 960 

CGCTTCCGCC ACTGGTTCGT GGCCAAGCTT TACATGAACA CGCACCCTGG CCGCCTGCTG 1020 

CTCGGCCTCA CGCTTGGCCT CTGGCTGACC ACCGCCTGGG TGCTGTCCGT GGCCGAGAGG 1080 

CAGGCTGTTA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 1140 

CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT OGTCTGCCTG 1200 

TGCACTGGAG TCATGGGTGT CTGCTGCACA GCCCTGCTGG TGGCCGTGGT GGCCOGGAAG 1260 

CTGGAGTTTA ACAAGGGAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTATACC 1320 

AAAGAGATGA AGOAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 1380 

ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAAGCTGCT GGCCGCCATC 1440 

AAOGCGTTCC GCCAGGTGCG GCTGAAACAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 1500 

GTGGACATCT CCAAGATGCA CATGATOCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 1560 

CACCGGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATGC CCTGACTGAG 1620 

CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 16 B0 

CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 1740 

CTGCCACTCC TGACCCAGCC CTGAACAAAG CACCTCAAGT GCAAGGACCA AAGGGGGCCC 1800 

TGGCTTGGAG TGGGTTGGCT TGCTGATGGC TGCTGGAGGG GACGCTGGCT AAAGTGGGTA 1860 

GGCCTTGGCC CACCTGAGGC CCCAGGTGGG AACATGGTCA CCCCCACTCT GCATACCCTC 1920 

ATCAAAAACA CTCTCACTAT GCTGCTATGG ACGACCTCCA GCTCTCAGTT ACAAGTGCAG 1980 

GCGACTGGAG GCAGGACTCC TGGGTCCCTG GGAAAGAGGG TACTAGGGGC CCGGATCCAG 2040 

GATTCTGGGA GGCTTCAGTT ACCGCTGGCC GAGCTGAAGA ACTGGGTATG AGGCTGGGGC 2100 

GGGGCTGGAG GTGGCGCCCC CTGGTGGGAC AACAAAGAGG ACACCATTTT TCCAGAGCTG 2160 

CAGAGAGCAC CTGGTGGGGA GGAAGAAGTG TAACTCACCA GCCTCTGCTC TTATCTTTGT 2220 

AATAAATGTT AAAGCCAG 2238 

Seq ID NO: C206 DNA Sequence 

Nucleic Acid Accession #: NMJ)2S257.l 

Coding sequence: 1..2139 

1 11 21 31 41 51 

I 1 I I I I 

ATGGGGGGAA AGCAGCGGGA CGAGGATGAC GAGGCCTACG GGAAGCCAGT CAAATACGAC 60 
CCCTCCTTTC GAGGCCCCAT CAAGAACAGA AGCTGCACAG ATGTCATCTG CTGCGTCCTC 120 
TTCCTGCTCT TCATTCTAGG TTACATCGTG GTGGGGATTG TGGCCTGGTT GTATGGAGAC 180 

1328 



WO 03/042661 
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CCCCGGCAAG TCCTCTACCC CAGGAACTCT ACTGGGGCCT ACTGTGGCAT GGGGGAGAAC 240 

AAAGATAAGC CGTATCTCCT GTACTTCAAC ATCTTCAGCT GCATCCTGTC CAGCAACATC 300 

ATCTCAGTTG CTGAGAACGG CCTACAGTGC CCCACACCCC AGGTGTGTGT GTCCTCCTGC 360 

CCGGAGGACC CATGGACTQT GGGAAAAAAC GAGTTCTCAC AGACTGTTGG GGAAGTCTTC 420 

TATACAAAAA ACAGOAACTT TTGTCTGCCA GGGGTAOCCT GGAATATGAC GGTGATCACA 4 BO 

AGCCTGCAAC AGGAACTCTG CCCCAGTTTC CTCCTCCCCT CTGCTCCAGC TCTGGGACGC 540 

TGCTTTCCAT GGACCAACAT TACTCCACCG GCGCTCCCAG GGATCACCAA TGACACCACC 600 

ATACAGCAGG GGATCAGCGG TCTTATTGAC AGCCTCAATG OCOGAGACAT CAGTGTTAAG 660 

ATCTTTGAAG ATTTTGCCCA GTCCTGGTAT TGGATTCTTG TTGCCCTGGG GGTGGCTCTG 720 

GTCTTGAGCC TACTGTTTAT CTTGCTTCTG OGCCTGGTGG CTGGGCCCCT GGTGCTGGTG 780 

CTGATCCTGG GAGTGCTGGG OGTGCTGGCA TATGGCATCT ACTACTGCTG GGAGGAGTAC 840 

CGAGTGCTGC GGGACAAGGG CGCCTCCATC TCCCAGCTGG GTTTCACCAC CAACCTCAGT 900 

GCCTACCAGA GCGTGCAGGA GACCTGGCTG GCCGCCCTGA TCGTGTTGGC GGTGCTTGAA 960 

GCCATCCTGC TGCTGGTGCT CATCTTCCTG CGGCAGOGGA TTOGTATTGC CATCGCCCTC 1020 

CTGAAGGAGG CCAGCAAGGC TGTGGGACAG ATGATGTCTA CCATGTTCTA CCCACTGGTC 1080 

ACCTTTGTCC TCCTCCTCAT CTGCATTGCC TACTGGGCCA TGACTGCTCT GTATCCTCTG 1140 

CCCAOGCAGC CAGCCACTCT TGGATATGTG CTCTGGGCAT CCAACATCAG CTCCCCOGGC 1200 

TGTGAGAAAG TGCCAATAAA TACATCATGC AACCCCAOGG CCCACCTTGT GAACTCCTCG 1260 

TGCCCAGGGC TGATGTGCGT CTTCCAGGGC TACTCATCCA AAGGCCTAAT CCAAOGTTCT 1320 

GTCTTCAATC TGCAAATCTA TGGGGTCCTG GGGCTCTTCT GGACCCTTAA CTGGGTACTG 1380 

GCCCTGGGCC AATGCGTCCT CGCTGGAGCC TTTGCCTCCT TCTACTGGGC CTTCCACAAG 1440 

CCCCAGGACA TCCCTACCTT CCCCTTAATC TCTGCCTTCA TCCGCACACT CCGTTACCAC 1500 

ACTGGGTCAT TGGCATTTGG AGCCCTCATC CTGACCCTTG TGCAGATAGC CCGGGTCATC 1560 

TTGGAGTATA TTGACCACAA GCTCAGAGGA GTGCAGAACC CTGTAGCCOG CTGCATCATG 1620 

TGCTGTTTCA AGTGCTGCCT CTGGTGTCTG GAAAAATTTA TCAAGTTCCT AAACCGCAAT 1680 

GCATACATCA TOATCGCCAT CTACGGGAAG AATTTCTGTG TCTCAGCCAA AAATGCGTTC 1740 

ATGCTACTCA TGGGAAACAT TGTCAGGGTG GTCGTCCTGG ACAAAGTCAC AGACCTGCTG 1800 

CTGTTCTTTG GGAAGCTGCT GGTGGTCGGA GGCGTQGGGG TCCTGTOCTT CTTTTTTTTC 1860 

TCCGGTOGCA TCCGGGGGCT GGGTAAAGAC TTTAAGAGCC CCCACCTCAA CTATTACTGG 1920 

CTGCCCATCA TGACCTCCAT CCTGGGGGCC TATGTCATOG CCAGCGGCTT CTTCAGCGTT 1980 

TTCGGCATGT GTGTGGACAC GCTCTTCCTC TGCTTCCTGG AAGACCTGGA GOGGAACAAC 2040 

GGCTCCCTGG ACCGGCCCTA CTACATGTCC AAGAGCCTTC TAAAGATTCT GGGCAAGAAG 2100 

AACGAGGCGC CCCCGGACAA CAAGAAGAGG AAGAAGTGAC AGCTCCGGCC CTGATCCAGG 2160 

ACTGCACCCC ACCCCCACCG TCCAGCCATC CAACCTCACT TCGCCTTACA GGTCTCCATT 2220 
TTGTGGTAAA AAAAGGTTTT AGGCCAGGCG CCGTGGCTCA OGCCTGTAAT CCAACACTTT 2280 
GAGAGGCTGA GGCGGGCGGA TCACCTGAGT CAGGAGTTCG AGACCAGCCT GGCCAACATG 2340 
GTGAAAC 2347 

Seq ID NO: C207 DNA Sequence 

Nucleic Acid Accession #: NM_016180.1 

Coding sequence: 26.. 1618 

1 11 21 31 41 51 

I I i I I I 

CAGGAAGGTT CCTCTCCCAG TGGCCATGGG TAGCAACAGT GGGCAGGCTG GCOGCCACAT 60 
CTATAAATCC CTAGCTGATG ATGGCCCCTT TGACTCTGTG GAGCCGCCTA AAAGACCCAC 120 
CAGCAGACTC ATCATGCACA GCATGGCCAT GTTCGGAAGA GAGTTCTGCT ACGCGGTGGA 180 
GGCAGCGTAT GTGACCCCAG TCCTGCTCAG CGTAGGTCTG CCCAGCAGCC TGTACAGCAT 240 
TGTGTGGTTC CTCAGCCCCA TCCTGGGATT CCTGCTGCAG CCCGTGGTCG GATCGGCCAG 300 
CGACCACTGC CGGTCCAGGT GGGGCCGCCG GAGACCCTAC ATCCTCACCC TGGGAGTCAT 360 
GATGCTCGTG GGCATGGCTC TGTACCTCAA TGGGGCTACT GTTGTAGCAG CTTTGATTGC 420 
TAACCCAAGG AGGAAGCTGG TTTGGGCCAT AAGTGTCACC AT GATA GGTG TCGTTCTCTT 480 
TGATTTTGCT GCCGACTTCA TTGATGGGCC CATCAAAGCC TACTTATTTG ATGTCTGCTC 540 
CCATCAGGAC AAGGAGAAGG GCCTCCACTA CCATGCCCTC TTCACAGOTT TTGGAGGTGC 600 
CCTGGGTTAC CTTTTGGGTG CTATAGACTG GGCCCATCTG GAGCTGGGAA GACTGTTGGG 660 
TACAGAATTC CAGGTCATGT TCTTCTTCTC TGCATTGGTG CTCACTTTGT GTTTTACTGT 720 
TCATCTGTGC AGTATCTCTG AAGCCCCACT TACAGAGGTT GCAAAGGGCA TTCCCCCACA 780 
GCAAACCCCT CAGGACCCTC CATTGTCATC AGATGGAATG TACGAGTATG GTTCTATCGA 840 
GAAAGTTAAA AATGGTTAGG TAAATCCAGA GCTGGCAATQ CAGGGAGCAA AAAACAAAAA 900 
TCATGCTGAA CAGACTCGCA GGGCAATGAC ATTAAAGTCA CTGCTGAGAG CACTGGTGAA 960 
CATGCCTCCT CACTACCGCT ACCTTTGCAT CAGCCACCTC ATTGGATGGA CGGCCTTCCT 1020 
GTCCAACATG CTGTTCTTCA CAGATTTGAT GGGCCAGATT GTGTACCGCG GGGATCCCTA 1080 
TAGTGCACAC AACTCCACAG AGTTTCTCAT CTAOGAAAGA GGAGTCGAGG TTGGATGTTG 1140 
GGGCTTCTGC ATCAACTCCG TGTTTTCCTC ACTTTATTCT TACTTTCAGA AAGTTTTGGT 1200 
ATCCTACATT GGATTAAAGG GTCTTTACTT CACGGGATAT TTGCTGTTTG GCCT GGGGA C 1260 
GGGATTTATT GGGCTCTTCC OGAATGTCTA CTCCACCCTG GTCCTGTGCA GCCTGTTTGG 1320 
TGTAATGTCC AGCACCCTGT ACACTGTGCC CTTTAACCTC ATTACTGAGT ACCACCGCGA 1380 
GGAAGAAAAG GAGAGGCAGC AGGCCCCAGG AGGGGACCCA GAGAACAGCG TGAGAGGGAA 1440 
GGGCATGGAC TGCGCCACCC TCACATGCAT GGTGCAGCTG GCTCAGATCC TGGTCGGAGG 1500 
TGGCCTGGGC TTTCTGGTCA ACACAGCCGG GACCGTTGTC GTCGTGGTGA TCACAGCGTC 1560 
TGCGGTGGCA CTGATAGGCT GTTGCTTTGT CGCTCTCTTT GTTAGATATG TGGATTAGGT 1620 
CAATAAAGAG ACAATGACCC TAAAAAAAAA 1650 

Seq ID NO i C208 DNA Sequence 

Nucleic Acid Accession 8: NM_003273.1 

Coding sequences 255.. 2024 

1 11 21 31 41 51 

I I I I I I 

CGCCGCGGGG CCGGATCCTC CGCGCGGCCG AGTCCATCTC CTGGGAAATG GGGCGGACAG 60 
TGTTTCCTTG ACTGACTATT GTGAGCGCCC TCTCTCTCCG GCGGA GOGGA GACCATGGCC 120 
CCCACTCAGG CCCCCGGGCC CCGCTGGAAT TCGGAGGGCC CCTGGGTAAT GGGGCAGAGA 180 
GATGGGACCT GGGGCAAAGG CTAAGCGAAG GAGAGCIGGA GCGGGTGAAC TAAGA GCGGG 240 
GGCGAGATCT GAGGATGGAA GGCTTTGGGG GTGTCGGAGG CAGAGGGACC CGGGGGTTTG 300 
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CAGCGAAGGG 
GGGGATTTAT 
GACTAAGATG 
CTGAGGACGG 
GGOCGCOGTO 
CTTCCCGCAG 
CTGGCGGCCC 
GAGGTGCTGT 
GCGCTCTACC 
GAAGCGAATG 
CCGCAGTGGC 
GCTTCCAGGC 
CTCTGGGGGC 
TCATCTTCAG 
CTGGGGGGAA 
GTATCTGTTT 
TCCTCATCAA 
CCATGTGGCT 
AGGCCGTCCT 
GGGACATGGC 
CGCAGCCCCT 
ACATCTTCCG 
GAGTGGCTGG 
GGTGGGGTAT 
CCTTGCCCTG 
TGCTGGTGCA 
GCAGGAGTAC 
CACCACCCCA 
CGACACACTT 
AGAAGAGGTG 



TGTCTGGAGA 
GGTGTCGACT 
GAOGCCTGGG 
GGTGCGGAGG 
CGGGCCGGCG 
GOGCOGCGGC 
GTTCGGGCCC 
GGAGCCCACG 
TACTGCCGGC 
GGCTCGGCGA 
CGAGGGGCAG 
CCTGGTGCTG 
GCTCCCGGAA 
CCTCTTTCTC 
CTCAGGCAAT 
CTTCGACTTC 
CCTGGCCCTG 
GGTCAATGGC 
CACCACCATG 
CTGGGTGCCC 
GGGGTTGCCC 
TGGGGCGAAT 
GCTTGAGACC 
GGTCCGCCAT 
CGGGGTGTCA 
CCGTGAGGCC 

GGTGGGGCAT 
GGGACTCAAG 
GTTTAGAGCA 



GGGAGAGCTG AGGAGGGGCC 
GGGAGCAGGA GGAGGGTCTT 
AAGGGAACTG GGAGGCAGCG 
CGCACTCTGG GAATGCCGAG 
GGGAGCTGGG GGGCTAGGGG 
TCTGCTACTG CTGCTGCCCG 
CGCGCGCCTG CTGGGTCCAC 
GGCGCTGCTG CTGTGGCTCG 
GCGCAAGGTG CGGGCCCCGC 
GGGAAAGGAC GCCCOGGGCC 
GAATTGAAGG ACAAGAGTCG 
ACAGCCCTGT TGGTGGGGCT 
ATGCTCCTGC CCTTGGOGTT 
TACATGAAGG OGCAGGTAGC 
CCGATTTACG ACTTTTTTCT 
AAATATTTCT GTGAACTGCG 
TTGATGAAGG AGGCAGAGCT 
TTCCAGTTGC TCTACGTGGG 
GATATCACAC ATGACGGGTT 
TTCACCTACA GCCTGCAGGC 
ATGGCCTCTG TCATCTGCCT 
TCCCAGAAAA ACACTTTCCG 
ATCTCTACAG CCACAGGGCG 
CCCAACTATC TTGGAGACCT 
CACCTGCTGC CCTACTTCTA 
CGGGATGAGC GGAGTGCCTG 
TGCCTTACCG CATCATGCCC 
GTGCCCACTC ATCCACCAGC 
GGCTTGCACC CCACCCAGCC 
AGGAAAAAAA TGAAACCAGT 



GGTTCTGGGG GCTGCAGAAC 360 

CGAGGGGCCT GGGGGCGGGG 420 

GGGTGCCTGG GGGCCGAGGG 480 

AGGGTCCCGC AGAGACGTCA 540 

CGGACGCCGA CGTGATGGCC 600 

CCACCATGTT CCACCTGCTC 660 

CCGCGTCCCT GCCCGGGCTG 720 

CCTGGCTCGG CCTGCAGGCG 780 

TCGCGGACGC TCGGGGGAGG 640 

TTATCAGAGC CCCCTTGGAC 900 

CCTGCGCTAT CCTATTAACG 960 

GGGGATGTCA GCGGGGCTGC 1020 

TGTCGCCACC CTCACCGCTT 1080 

CCCAGTTTOG GCCCTGGCAC 1140 

GGGACGAGAG CTCAACCCTC 1200 

ACCCGGCCTC ATCGGCTGGG 1260 

TCGAGGCAGT CCCTCACTGG 1320 

TGATGCCCTC TGGCACGAGG 1380 

TGGCTTCATG CTGGCGTTTG 1440 

CCAGTTCCTG CTGCACCACC 1500 

CATCAATGCT ACTGGTTACT 1560 

AAAGAATCCT TCTGACCCCA 1620 

GAAACTGCTG GTGTCTGGGT 1680 

CATCATGGCT CTGGCTTGGT 1740 

CCTCCTCTAC TTCACCGCGC 1B00 

CAGAAGTACG GCCTGGCCTG 1860 

TACATCTACT GAAGCGGCTC 1920 

ACACCCAGGA CCAGGAGCCT 1980 

CTGAGGATGA ACAACCTCAG 2040 

GACCAAAAAA AAAAAAAAAA 2100 



Seq ID NO: C209 DNA Sequence 

Nucleic Acid Accession #: NM_015720.1 

Coding sequence: 21.. 1838 

1 11 21 31 41 51 

I I I I I i 

CCAGTTCCGG CACGAGGACC ATGGGCCGGC TGCTGCGGGC CGCCCGGCTG CCGCCGCTGC 60 

TTTCGCCGCT GCTGCTTCTG CTGGTTGGGG GAGCGTTCCT GGGTGCCTGT GTGGCTGGGT 120 

CTGATGAGCC TGGCCCAGAG GGCCTCACCT CCACCTCCCT GCTAGACCTC CTGCTGCCCA 180 

CTGGCTTGGA GCCACTGGAC TCAGAGGAGC CTAGTGAGAC CATGGGCCTG GGAGCTGGGC 240 

TGGGAGCCCC TGGCTCAGGC TTCCCCAGCG AAGAGAATGA AGAGTCTCGG ATTCTGCAGC 300 

CACCACAGTA CTTCTGGGAA GAGGAGGAAG AGCTGAATGA CTCAAGTCTG GACCTGGGAC 360 

CCACTGCAGA TTATGTTTTT CCTGACTTAA CTGAGAAGGC AGGTTCCATT GAAGACACTA 420 

GCCAGGCTCA AGAGCTGCCA AACCTCCCCT CTCCCTTGCC CAAGATGAAT CTGGTTGAGC 480 

CTCCCTGGCA TATGCCTCCC AGAGAGGAGG AAGAAGAGGA AGAGGAAGAG GAGGAGAGGG 540 

AGAAGGAAGA GGTAGAGAAA CAAGAGGAGG AGGAAGAGGA GGAGCTGCTC CCTGTGAATG 600 

GATCCCAAGA AGAAGCCAAG CCTCAGGTCC GTGACTTTTC TCTCACCAGC AGCAGCCAGA 660 

CCCCAGGGGC CACCAAAAGC AGGCATGAAG ACTCCGGGGA CCAGGCCTCA TCAGGTGTGG 720 

AGGTGGAGAG CAGCATGGGG CCCAGCTTGC TGCTGCCTTC AGTCACCCCA ACTACAGTGA 780 

CTCOGGGGGA CCAGGACTCC ACCAGCCAAG AOGCAGAGGC CACAGTGCTG CCAGCTGCAG 840 

GGCTTGGGGT AGAGTTCGAG GCTCCTCAGG AAGCAAGCGA GGAAGCCACT GCAGGAGCAG 900 

CTGGTTTGTC TGGCCAGCAC GAGGAGGTGC CGGCCTTGCC TTCATTCCCT CAAACCACAG 960 

CTCCCAGTGG GGCCGAGCAC CCAGATGAAG ATCCCCTTGG CTCTAGAACC TCAGCCTCTT 1020 

CCCCACTGGC CCCTGGAGAC ATGGAACTGA CACCTTCCTC TGCTACCTTG GGACAAGAAG 1080 

ATCTCAACCA GCAGCTCCTA GAAGGGCAGG CAGCTGAAGC TCAATCCAGG ATACCCTGGG 1140 

ATTCTACGCA GGTGATCTGC AAGGACTGGA GCAATCTGGC TGGGAAAAAC TACATCATTC 1200 

TGAACATGAC AGAGAACATA GACTGTGAGG TGTTCCGGCA GCACCGGGGG CCACAGCTCC 1260 

TGGCCCTGGT GGAAGAGGTG CTGCGCGGCC ATGGCAGTGG CCACCATGGG GCCTGGCACA 1320 

TCTCTCTGAG CAAGCCCAGC GAGAAGGAGC AGCACCTTCT CATGACACTG GTGGGCGAGC 1380 

AGGGGGTGGT GCCCACTCAA GATGTCCTTT CCATGCTGGG TGACATCCGC AGGAGCCTGG 1440 

AGGAGATTGG CATCCAGAAC TATTCCACAA CCAGCAGCTG CCAGGCGCGG GCCAGCCAGG 1500 

TGCGCAGCGA CTACGGCACG CTCTTCGTGG TGCTGGTGGT CATTGGGGCC ATCTGCATCA 1560 

TCATCATTGC GCTTGGCCTG CTCTACAACT GCTGGCAGCG COGGCTGCCC AAGCTCAAGC 1620 

ACGTGTCGCA CGGCGAGGAG CTGCGCTTCG TGGAGAACGG CTGCCACGAC AACCCCACGC 1680 

TGGACGTGGC CAGOGACAGC CAGTCGGAGA TGCAGGAGAA GCACCCCAGC CTGAACGGCG 1740 

GCGGGGCCCT CAACGGCCCG GGGAGCTGGG GGGCGCTCAT GGGGGGCAAG CGGGACCCCG 1800 

AGGACTCGGA CGTGTTCGAG GAGGACACGC ACCTGTGAGC GCAGCGAGGC GCAGGCCGAG 1860 

TGGGCCGCCA GGACCAAGCG AGGTGGACCC CGAAACGGAC GGCCCGGAGC CCGCACCAGC 1920 

CCCGCGCCTA CCCGGCCGCC CCOGCGCCTG GCCCTOGGOG OGGGCTCCTT CCCGCTTCCC 1980 

CCGACTTCAC ACGGCGGCTT CGGACCAACT CCCTCACTCC CGCCCGAGGG GCAGGCCTCA 2040 

AAGCCCGCCT TGGCCCCGCT TTCCOGCCCC TGAACCCCGG CCCCGCGGGC GGOGGGCGCG 2100 

CTTCCTGCGC CCCGGGACTC AATTAAACCC GCCCGGAGAC CACGCGGGCC CAGCGAAAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2220 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 2269 

Seq ID NO: C210 DNA Sequence 

Nucleic Acid Accession ft: NM_001197.3 

Coding sequence : 61 . . 543 

1 11 21 31 41 SI 

I I I I I I 

GACACGAAGC CTCCCGGGTG GCTTACAGAC GCTGCCAGCA TCGCCGCCGC CAGAGGAGAA 60 

ATGTCTGAAG TAAGACOOCT CTCCAGAGAC ATCTTGATGG AGACCCTCCT GTATGAGCAG 120 
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CTCCTGGAAC 
CCTATGGAGG 
CTGGCCTGCA 
TCCGAGGTGG 
AGGGATGTTC 
TTCTGGAGAT 
CTGCTGCTGC 
TGAGGCCCCG 
GGCGGCCTGC 
CCCGAGATAG 
TTCCAGTTTT 
ACTGTGGCCT 
AGGGGGAGTG 
GAATAGATTC 



CCCCGACCAT 
ACTTCGATTC 
TCGGGGACGA 
CCATGCACAG 
TTAGAAGTTT 
CCCCGAACCC 
TGGCGCTGCT 
GCGGCTCAGG 
TGCTGTTATC 
TGCTGGAACA 
CGTTTTTTCT 
GTGCCCAGGA 
CTGGTCACAC 
OGAGGAGCAG 



GGAGGTTCTT 
TTTGGAATGC 
GATGGACGTG 
CCTGGGTCTG 
CATGGACGGT 

GCTGCCGCTG 
GCGGGGCTGG 
TTTTTAACTG 
CTGCTGAGGT 
AAAAGATGAA 
AGAGCCATTC 
CCCTGTGTGA 
GAGTGCTCAA 



GGCATGACTG 
ATGGAGGGCA 
AGCCTCAGGG 
GCTTTCATCT 
TTCACCACAC 
GTGTCCTGGG 
CTCAGCGGGG 
CCCCACCCCC 
TTTTCTCATG 
TTTATACTCA 
TTGCTATGGC 
ACTCCTGCCC 
TATGTGATGC 
TAAAATGTTG 



ACTCTGAAGA 
GTGACGCATT 
CCCCGCGCCT 
ACGACCAGAC 
TTAAGGAGAA 
AACAGGTGCT 
GCCTGCACCT 
ATGACCACTG 
ATGCCTTTTT 
GGTTTTTTGT 
TCTGCAATTG 
CTGCCCACAC 
CCTOGGCAAA 
GTTTCCAGCA 



GGACCTGGAC 
GGCCCTGOGG 
GGCCCAGCTC 
TGAGGACATC 
CATAATGAGG 
GCTGGCGCTG 
GCTGCTCAAG 
CCCTGGAGGT 
ATATTTAAAC 
TTTTTTTTTA 
TCACCGGTTA 
GGCAGGTAGC 
GAATCTACTG 
AAAAAAAAAA 




Seq ID NO: C211 DNA Sequence 
Nucleic Acid Accession ft: AF272357 
Coding sequence: 83.. 1060 



GCTGCTCCCG 
GCCCCGGCGG 
GCGGCTGCTG 
CGCCGGCCAC 
GGCAAGGTGT 
CCAGCAAGGG 
CAGACTGGAA 
ATCAACTCCG 
CTCGGCACGG 
CACGCCCCAC 
GGAGCCCCGG 
GGCCGGTGCA 
CCGCCTGACT 
COGGATCTCG 
CCAACGGCAA 
GGCCTCCTCG 
GGCCCCGACC 
GCCCCTGCCG 
CAOCTGCTCC 
AACATGTTTT 
TGCCAGGGAG 
GTTCGGTTTA 
GGGTAATTCT 



GATGAGATTG 
CCCCTACCCA 
GGGCAGGGGC 
ACCTCCCTGG 
GGAGGGCAAG 
GCCGCCCTCT 
CAGAAGGCCG 
CCTGGGGACC 
CAGATGCTGT 
GATGAGGAGA 
GGGGAAATGG 
GCCCCCAGCT 
CCGACCTCGA 
GATGCTGTGT 
ACCCCCGAAC 
GACCCCCAAA 
AGAGACAAAA 



Seq ID NO: C212 DNA Sequence 

Nucleic Acid Accession #: NM_00444S.l 

Coding sequence: 799.. 3819 



1 11 21 

! ! I 

CGGAGGGGGC GGGCOGGGCT GCGTTCGCTC 
GACCCGGGAC CCAGCTTGGC GACGGCGATT 
GCCCCACCTC TGGAGCAGCC CCTGCOGCCA 
TCTCGGCGCC GAACGGACCC GGGCCGGTGC 
GTGGCACCGT GCGAGCTCCA GGAGCCCCGG 
CTGCAGAGAC TGCGGCCAAC GGGAAGAAAT 
CTTCTGAGAC TCAGACACGA GGAGAGATAG 
AGGAGGTGCC AAGCCTGTTT GTCTTCATTG 
AAGATCAGGG TGCCGGCATG GTCAGTTCCT 
CCCTCTTCTT TGTTGTGTCC TCGAATGGCA 
GAGAGTAAAT GAAGAGAAAG AACTGGAATA 
CTTAGCTGTA CACCCTGAGT CTTGCAAAAG 
CTGGGGCGAT GGTGGACGCC CTGAAGATGT 
GGGAACAGAG TGGCGGGCAT GGTGTGTAGC 
CTGGCTCTGG AAGAGGTATT GCTGGACACC 
ACCTACCCAC CAGGGGGGTG GGAOGAGGTG 
CGGACCTTTG AGGCATGTCA TGTGGCAGGG 
TTGCAGACAC ACTTTGTGGA GCGGCGCGGG 
TCTGTGCGGG CATGCTCCAG CCTGGQTGTG 
CTTTACTACC GTCAGGCTGA GGAGCCCGAC 
AAACGCTGGA CCAAGGTGGA CACAATTGCA 
TCCTCCTCCT CTTCTTCCTC TGCAGCGTGG 
GCTGGACTGC AACTGAACGT CAAAGAGCGG 
TACGTGGCCT TCCAGGACAC GGGGGCCTGC 
TACACCTGCC CTGCCGTGCT COGATCCTTT 
GCTGGGGGGG CCTCCCTGGT GGCAGCTGTG 
GAGGATGGAG TAGGGGGCCA GGCAGGAGGC 
GGCAAGTGGA TGGTAGCTGT CGGGGGCTGC 
GGAGACAAGG CCTGCCAAGC CTGCCCACGG 
CCCTGCTCAC CATGCCCTGC CCGCAGTCAC 
TGCCTGGAGG GCTTCTACCG GGCCAGTTCC 
CCATCGGCTC CCCAGGAGCT TTGGTTTGAG 



GGCACCTGTG 
AGCCCCCCCA 
CGCTGCCAGC 
GGGCTCTATA 
GCTCCCAACC 
GACCCACCAG 
GTGCAAGGCT 



41 
I 

TCTACAGCAG 
GCCCCCAGGA 
CACCCOGGAA 
CCGGACTGGA 
AGGCCTCGGG 
TAGTCCACCC 
ATCTCTAGAT 
GTCTAGATGC 
CTCTTCTAGG 
TGGCTGTTGG 
GAAAAAAAAA 
ACCCAGGAGC 
CTGAAGGGGC 
TGCTCCTGGT 
CATCTGAGAT 
ACGACCAGCG 
GCACCGGGCA 
CGCACATTCG 
CCTGCCGGGA 
GCGTTTCCTC 
GCTTTCCCTC 
CCCACGGGGC 
CTCTCACCCA 
TCGCTGTCAG 
CAGAGACGCA 
TGGCTCATGC 
GGCTGCACTG 
CTGGATACCA 
AGTCTTCTGC 
CAGCAGCCCC 
AGGCCCCCTG 
CAGCACTCAT 



1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
963 




51 

I 

CGGGCGGCGG 
TTCTCCCGGC 
TCCCAGGGAC 
GAAGACGCGG 
GGGCGCAGAC 
AATTCACAGA 
CAACAAGCAA 
TGGGAAGTCC 
TTTCAGACTG 
AGGAAGGGAG 
AAAAGGGAAG 
AGGGTGGTGG 
TGCCCAGTTA 
GTCTTCAGTT 
TGGCTGGCTC 
ACGCCTGACT 
GGACAATTGG 
ACTCCACTTC 
GACCTTCACC 
CTGGCACCTC 
CTCCTCCTCC 
TGGGCAGCGG 
ACGCGGCTTC 
GCTCTTCTCC 
GGCCAGTGGG 
AGAGCCAGAG 
CAACGGGGAG 
ACCAGCACGA 
TGGGAATGCT 
CGTTTGCCCC 
CACTOGTCCT 
GCTACACTGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



1331 



WO 03/042661 PCTAJS02/36810 



CGCCTGCCTC GGGAGCTGGG GGGTCGAGGG GACCTGCTCT TCAATGTCGT GTGCAAGGAG 1980 

TGTGAAGGCC GCCAGGAACC TGCCAGOGGT GGTGGG6GCA CTTGTCACOG CTGCAGGGAT 2040 

GAGGTOCACT TCGACCCTCG CCAGAGAGGC CTGACTGAGA GCCGAGTGTT AGTGGGGGGA 2100 

CTCCGGGCAC ACGTACCCTA CATCTTAGAG GTGCAGGCTG TTAATGGGGT GTCTGAGCTC 2160 

5 AGCCCTGACC CTCCTCAGGC TGCAGCCATC AATGTCAGCA CCAGCCATGA AGTGCCCTCT 2220 

GCTGTCCCTG TGGTGCACCA GGTGAGCOGG GCATCCAACA GCATCACGGT GTCCTGGCCG 2280 

CAGCCCGACC AGACCAATGG GAACATCCTG GACTATCAGC TCCGCTACTA TGACCAGGCA 2340 

GAAGACGAAT CCCACTCCTT CACCCTGACC AGCGAGACCA ACACTGCCAC CGTGACACAG 2400 

CTGAGCCCTG GCCACATCTA TGGTTTCCAG GTGCGGGCCC GGACTGCTGC CGGCCACGGC 2460 

10 CCCTACGGGG GCAAAGTCTA TTTCCAGACA CTTCCTCAAG GGGAGCTGTC TTOCCAGCTT 2520 

CCGGAAAGAC TCTCCTTGGT GATCGGCTCC ATCCTGGGGG CTTTGGCCTT CCTCCTGCTG 2S80 

GCAGCCATCA CCGTGCTGGC GGTCGTCTTC CAGCGGAAGC GGCGTGGGAC TGGCTACACG 2640 

GAGCAGCTGC AGCAATACAG CAGCCCAGGA CTCGGGGTGA AGTATTACAT OGACCCCTCC 2700 

ACCTACGAGG ACCCCTGTCA GGCCATCCGA GAACTTGCCC GGGAAGTCGA TCCTGCTTAT 2760 

15 ATCAAGATTG AGGAGGTCAT TGGGACAGGC TCTTTTGGAG AAGTGCGCCA GGGCCGCCTG 2820 

CAGCCACGGG GAGGGAGGGA GCAGACTGTG GCCATCCAGG CCCTGTGGGC CGGGGGCGCC 2880 

GAAAGCCTGC AGATGACCTT CCTGGGCOGG GCCGCAGTGC TGGGTCAGTT CCAGCACCCC 2940 

AACATCCTGC GGCTGGAGGG CGTGGTCACC AAGAGCCGAC CCCTCATGGT GCTGACGGAG 3000 

TTCATGGAGC TTGGCCCCCT GGACAGCTTC CTCAGGCAGC GGGAGGGCCA GTTCAGCAGC 3060 

20 CTGCAGCTGG TGGCCATGCA GCGGGGAGTG GCTGCTGCCA TGCAGTACCT GTCCAGCTTT 3120 

GCCTTCGTCC ATCGCTCGCT GTCTGCCCAC AGOGTGCTGG TGAATAGCCA CTTGGTGTGC 3180 

AAGGTGGCCC GTCTTGGCCA CAGTCCTCAG GGCCCAAGTT GTTTGCTTCG CTGGGCAGCC 3240 

CCAGAGGTCA TTGCACATGG AAAGCATACA ACATCCAGTG ATGTCTGGAG CTTTGGGATA 3300 

CTCATGTGGG AAGTGATGAG TTATGGAGAA CGGCCTTACT GGGACATGAG TGAGCAGGAG 3360 

25 GTACTAAATG CAATAGAGCA GGAGTTCCGG CTGCCCCCGC CTCCAGGCTG TCCTCCTGGA 3420 

TTACATCTAC TTATGTTGGA CACTTGGCAG AAGGACCGTG CCCGGOGGCC TCATTTTGAC 3480 

CAGCTGGTGG CTGCATTTGA CAAGATGATC CGCAAGCCAG ATACCCTGCA GGCTGGCGGG 3540 

GACCCAGGGG AAAGGCCTTC CCAGGCCCTT CTGACCCCTG TGGCCCTGGA CTTTCCTTGT 3600 

CTGGACTCAC CCCAGGCCTG GCTTTCAGCC ATTGGACTGG AGTGCTACCA GGACAACTTC 3660 

30 TCCAAGTTTG GCCTCTGTAC CTTCAGTGAT GTGGCTCAGC TCAGCCTAGA AGACCTGCCT 3720 

GCCCTGGGCA TCACCCTGGC TGGCCACCAG AAGAAGCTGC TGCACCACAT CCAGCTCCTT 3780 

CAGCAACACC TGAGGCAGCA GGGCTCAGTG GAGGTCTGAG AATGACGATA CCCGTGACTC 3840 

AGCCCTGGAC ACTGGTCCGA GAAGGGACAT GTGGGACGTG "AGCOGGGCTC CAACAGCCTC 3900 

TGTGAGAGAT GCCCCACACC AAACCCAACC CTCCCGATGG CTGCATTCCC TGGTCCTCCG 3960 

35 CCTCTCCACC AGCCCCCTCC TCATTAAAGG GAAAGAAGGG AATTTGCAAA 4010 



40 



75 



Seq ID NO: C213 UNA Sequence 
Nucleic Acid Accession ft: XM_043340. 
Coding sequence: 195,. 1067 



1 11 21 31 41 51 

I I I I I I 

GGGCGGCGCC CAATGGGCTG CGCGGAGCGT CACTTCCCGG CAGCGGGAGG CGAGTGGCGA 60 

GTGGCGAGTG GCGAGTGTCA GGGGGGCGGC CGGCGGGGGC GGGGCGGCCG GAGGAGGCGT 120 

45 TGGCAGCGGG CTCGGACCCA CGCGGOGCOG CGGCCCGCCT GGCCTGCAGC GCTCCCACCC 180 

CCGGCGGCGG CACGATGCCC TTTGACTTCA GGAGGTTTGA CATCTACAGG AAGGTGCCCA 240 

AGGACCTTAC GCAGCCAACG TACACCGGGG CCATTATCTC CATCTGCTGC TGGCTCTTCA 300 

TCCTCTTCCT CTTCCTCTCG GAGCTCACCG GATTTATAAC GACAGAAGTT GTGAACGAGC 360 

TCTATGTCGA TGACCCAGAC AAGGACAGCG GTGGCAAGAT CGACGTCAGT CTGAACATCA 420 

50 GTTTACCCAA TCTGCACTGC GAGTTGGTTG GGCTTGACAT TCAGGATGAG ATGGGCAGGC 480 

ACGAAGTGGG CCACATCGAC AACTCCATGA AGATCCOGCT GAACAATGGG GCAGGCTGCC 540 

GCTTCGAGGG GCAGTTCAGC ATCAACAAGG TCCCCGGCAA CTTCCACGTG TCCACACACA 600 

GTGCCACAGC CCAGCCACAG AACCCAGACA TGACGCATGT CATCCACAAG CTCTCCTTTG 660 

GGGACACGCT ACAGGTCCAG AACATCCACG GAGCTTTCAA TGCTCTCGGG GGAGCAGACA 720 

55 GACTCACCTC CAACCCCCTG GCCTCCCACG ACTACATCCT GAAGATTGTG CCCACGGTTT 780 

ATGAGGACAA GAGTGGCAAG CAGCGGTACT CCTACCAGTA CACGGTGGCC AACAAGGAAT 840 

ACGTCGCCTA CAGCCACACG GGCCGCATCA TCCCTGCAAT CTGGTTCCGC TACGACCTCA 900 

GCCCCATCAC GGTCAAGTAC ACAGAGAGAC GGCAGCCGCT GTACAGATTC ATCACCACGA 960 

TCTGTGCCAT CATTGGCGGG ACCTTCACOG TCGCCGGCAT CCTGGACTCA TGCATCTTCA 1020 

60 CAGCCTCTGA GGCCTGGAAG AAGATCCAGC TGGGCAAGAT GCATTGACGC CACACCCAGC 1080 

CTAATGGCCG AGGACCCTGG GCATCGCCAG CCTTGCCTCC AGTGCCCTGT CTCCTTTGGC 1140 

CCTCAATCTG GTCCCAAATC TGGCTGTGTC CCAAAGGGTG TGTGGGAAGT GGGGGGAAAG 1200 

TAGAGGATGG CTCGATGTTT TGCAGCTACC TCTTTTCCCC GTGTTTCTTT TTAGACAAAT 1260 

TACACTGCCT GAAGTTGCAG TTOCCCTTTC GCTGGGGAGC CCCAAGAACA GAGTCAGGCA 1320 

65 AGGGGTGGGG AGTCCAGGGG AACATCCCAG AATGCATATC GATCAGCTCT CAGCCAGGCT 1380 

TCGACAATCT CGCAGCCCCC ACTAGGTGGA CACATTAATG ATTTGGTTTC TCCCCTGGGC 1440 

AGCCAACCTG CCCCAGAGGC ACCAGACCTG GGCTTTCAGC TTTGGGACCA GGCTGCCCAA 1S00 

AGGTACTCCT TTATACACCC GGCACCTTCC ACGAAAGATG GTACTTCCCA AGCAAGCOCC 1560 

TATGATTTGT CACTATAGAT GGAAATGTGT GGCATGCACA TGAGTTGAAA TTCTTTTATG 1620 

70 CA TT TT TT TG AAGAAAAAAA AAAAAACAAC TCTGAGGACA TAGGGGATGT CAGTTTCCTA 1680 

TGGAAGAGAC ACCTCTGACC CGTTATTCTT ATAATCAAAA TCTGAAGGGA AAAAAATGTT 1740 

TTAGTTCTTT CCCCACTCGT TGGGTTCAAC TAGATTAAAA GGCTGATTTT CAG 1793 



Seq ID NO: C214 DNA Sequence 

Nucleic Acid Accession ft: NM_002151.1 

Coding sequence: 246.- 1499 



1 11 21 31 41 51 

o n I I I I I I 

OU TCGAGCCCGC TTTCCAGGGA CCCTACCTGA GGGCCCACAG GTGAGGCAGC CTGGCCTAGC 60 

AGGCCCCACG CCACCGCCTC TGCCTCCAGG CCGCCCGCTG CTGCGGGGCC ACCATGCTCC 120 

TGCCCAGGCC TGGAGACTGA CCCGACCCCG GCACTACCTC GAGGCTCCGC CCCCACCTGC 180 

TGGACCCCAG GGTCCCACCC TGGCCCAGGA GGTCAGCCAG GGAATCATTA ACAAGAGGCA 240 

GTGACATGGC GCAGAAGGAG GGTGGCCGGA CTGTGCCATG CTGCTCCAGA CCCAAGGTGG 300 
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CAGCTCTCAC TGCGGGGACC CTGCTACTTC TGACAGCCAT CGGGGCGGCA TCCTGGGCCA 360 

TTGTGGCTGT TCTCCTCAGG AGTGACCAGG AGCOGCTGTA CCCAGTGCAQ GTCAGCTCTG 420 

CGGACGCTCG GCTCATGGTC TTTGACAAGA GGGAAGGGAC GTGGOGGCTG CTGTGCTCCT 480 

5CGCGCTCCAA CGCCAGGGTA GCOGGACTCA GCTGCGAGGA GATGGGCTTC CTCAGGGCAC 540 

TGACCCACTC OGAGCTGGAC GTGCGAAOGG CGGGCGCCAA TGGCACGTOG GGCTTCTTCT 600 

GTGTGGACGA GGGGAGGCTG CCCCACACCC AGAGGCTGCT GGAGGTCATC TCCGTGTGTG 660 

ATTGCCCCAG AGGCGGTTTC TTGGCCGCCA TCTGCCAAGA CTGTGGCOGC AGGAAGCTGC 720 

CCGTGGACCG CATCGTGGGA GGCCGGGACA CCAGCTTGGG COGGTGGCGG TGGCAAGTCA 780 

1 ^ GCCTTCGCTA TGATGGAGCA CACCTCTGTG GGGGATCCCT GCTCTCCGGG GACTGGGTGC 840 

1U TGACAGCCGC CCACTGCTTC CCGGAGCGGA ACCGGGTCCT GTCCCGATGG CGAGTGTTTG 900 

CCGGTGCCGT GGCCCAGGCC TCTCCCCAOG GTCTGCAGCT GGGGGTGCAG GCTGTGGTCT 960 

ACCACGGGGG CTATCTTCCC TTTOGGGACC CCAACAGCGA GGAGAACAGC AACGATATTG 1020 

CCCTGGTCCA CCTCTCCAGT CCCCTGCCCC TCACAGAATA CATCCAGCCT GTGTGCCTCC 1080 

- _ CAGCTGCCGG CCAGGCCCTG GTGGATGGCA AGATCTGTAC GGTGAOGGGC TGGGGCAACA 1140 

ID CGCAGTACTA TGGCCAACAG GCCGGGGTAC TCCAGGAGGC TOGAGTCCCC ATAATCAGCA 1200 

ATGATGTCTG CAATGGCGCT GACTTCTATG GAAACCAGAT CAAGCCCAAG ATGTTCTGTG 1260 

CTGGCTACCC CGAGGGTGGC ATTGATGCCT GCCAGGGCGA CAGCGGTGGT CCCTTTGTGT 1320 

GTGAGGACAG CATCTCTCGG ACGCCACGTT GGCGGCTGTG TGGCATTGTG AGTTGGGGCA 1380 

OA CTGGCTGTGC CCTGGCCCAG AAGCCAGGCG TCTACACCAA AGTCAGTGAC TTCCGGGAGT 1440 

ZK) GGATCTTCCA GGCCATAAAG ACTCACTCCG AAGCCAGCGG CATGGTGACC CAGCTCTGAC 1500 

CGGTGGCTTC TCGCTGCGCA GCCTCCAGGG CCCGAGGTGA TCCCGGTGGT GGGATCCACG 1560 

CTGGGCCGAG GATGGGACGT TTTTCTTCTT GGGCCCGGTC CACAGGTCCA AGGACACCCT 1620 

CCCTCCAGGG TCCTCTCTTC CACAGTGGCG GGCCCACTCA GCCCCGAGAC CACCCAACCT 1680 

0 „ CACCCTCCTG ACCCCCATGT AAATATTGTT CTGCTGTCTG GGACTCCTGT CTAGGTGCCC 1740 

£3 CTGATGATGG GATGCTCTTT AAATAATAAA GATGGTTTTG ATT 1783 
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Seq ID NO i C21S DNA Sequence 
Nucleic Acid Accession #i AB037745.1 
Coding sequence: 26.. 1744 



1 11 21 31 41 51 

I I I i I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGAOG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

ACi CACTACCAOG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

4U GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCGAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGOGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG COGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG I860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC OUVACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCOGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2S80 

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

CCAT CTCAGC CTCTCAAGTA G CTGG GACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACGG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGOCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 
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TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3S40 

GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600 

AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

1U TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 

AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 

TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 

CAGTTCTGCC TCCATCACTA TCAfeGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 

CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 

GAAAGGGGAT GGCATTTGCA AGTpACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 

TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 

GGCCATCAAC AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 

TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320 

TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 450 0 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4S60 

ZJ ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGCTTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

ATACAAATGA AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG 5460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TA GT T TG TT A GAAGATGGGT S520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 
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Seq ID NO: C216 DMA Sequence 
Nucleic Acid Accession 0 NM_004864. 
45 Coding sequence! 26-952 
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1 11 21 31 41 51 

I I I I I I 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGOG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCOG AAGACTCCAG 180 

ATTCOGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGGGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGOGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGOGTC 420 

AAGGTCGTGG GACGTGACAC GACOGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTOGCAG TGGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGOGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTOOGCTC GGGCCOGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT GGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTOGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCOGACAOGG AGCCAGCGCC 840 

CTGCTGCGTG CCOGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACOGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTCCACCT GCGOGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTQGGGAC TOGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 

1204 



Seq ID NO: C432 DMA Sequence 

Nucleic Acid Accession #: NM_0S2858.l 

Coding sequence: 54.. 1259 *~ 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGT GTTGCCCTCA GGTCGCTCCC GGGGGCGGAC ACGGAACCCG GCCATGGAAG 60 

ATCCGTOGGG GGCTCGCGAG CCCCGGGCCC GGCCGAGAGA GCGGGACCCG GGACGGCGCC 120 

CCCACCCAGA CCAAGGCCGC ACCCACGATC GACOGCGGGA CCGACCCGGG GACCCGCGCA 180 

GGAAGCGAAG CAGCGACGGG AACCGGCGAA GGGAOGGGGA CCGGGACCCG AAGAGAGACC 240 

AGGAGAGGGA CGGGAACCGC GACCGGAACC GGGAOGGGGA GAGGGAGAGA GAGAGGGAAA 300 

GAGACCCGGA CCGAGGCCCC CGCCGGGACA CACACAGGGA CGCGGGCCCT GGCGCAGGTG 360 

AACACGGAGT TTGGGAAAAA COGCGCCAAA GCCGGACGCG GGAOGGAGCC CGGGGACTGA 420 

CCTGGGACGC AGCOGCGCCT CCTGGGCCCG CGCCCTGGGA AGCCCOSGAG CCGCCGCAGC 480 
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CGCAGAGGAA 
ATCTGCOCTC 
AAGGACTCCT 
TGCTGGAGGT 
CAGGGGGCTA 
CTTACAGTGG 
ACCAGCTAAA 
TCTGCTGCCT 
TGGTGACOGA 
TCTACTTCCA 
TGTACCAAAG 
CTGGAATCTT 
AGGGCTACCG 
GGTTTCTAAA 
TTCCCAAGAA 
GTGTGGTGGG 
CTTCTGGAGT 
TGGCCGGCCT 
ATCACTTGAG 
TAAAATACAA 
AGGCTGAGGC 
CGCCACTGCA 



GGGAGACCCC 
GACCCCCAGG 
GGAATGCCAC 
TCTCCTGAAC 
CACGGGCATC 
CTTTGATGGT 
GCTGCCCATG 
CTTCGTTGCC 
AGGCTTGTTG 
CTACCTCTCT 
CAAAGGCTAC 
TGCTGCCCTG 
AAAAGTTAGG 
ACGCTCTGAC 
TCCCTTGTTG 
CGGAGCTCCC 
CCTCTGTGAG 
CGGCGGCTCA 
GTCAGGAGCT 
AAAAATTAGC 
AGGAGAATCG 
CTCCAGCCCA 



GGGCGOCGGA 
CCTGGACGAG 
AAATGCAAAT 
TTGCTGATCC 
ACCAGCTTGG 
GCTGACGGGG 
GTCACTGTGG 
ATGGGTGTCC 
GACATGCTCA 
GCTGCCTATG 
AGCGGTTTCG 
GGCATTGTGG 
AAGCTAAAAG 
AGATGCAAGT 
TGGAAGTTTC 
AGTCGCATGG 
TGAGGGACCA 
CACCTGTAAC 
CGAGACCAGC 
CAGGCGTGGT 
CTTGAATCTG 
GGTGACAGAG 



GACCCGAAAG 
AGGAGGTGGA 
ACTTGTGCAC 
TGGCCTGCAG 
GGGGCATTTA 
AGAAGGCCCA 
CAATGGCCTG 
TGCGGGTCCC 
TCGCGGGGGG 
GCTCTCCTGT 
GCTGCAGTTT 
TCTTTGCCCT 
AGAAGCCAGC 
GGTGGTGGAA 
CAATGCTGGA 
AGCGGTGTTC 
ATCAAAATTA 
CCCAGCACTT 
TTGGCCAACA 
GGCGGGGGCC 
GGAGGOGGAG 
CGAGACTCCA 



TGAACCCCCT 
ATATTACCAG 
TGGGAGAGCC 
CTCTGTGTCT 
CTACTATCAG 
GCAACTGGAT 
TAGTGGAGCC 
GTGGCATTGT 
GTACATCCCG 
GTGTAAAGAG 
CCACGGAGCA 
GGGGGGGGTC 
AGAAATGTTT 
GGTAGTCTGA 
AAAGCAGCGA 
ATGGATGCAA 
TTTTTCAAAA 
TGGGAGGCTG 
TGGTGAGCCC 
TGTAATCCCA 
ATTGCAGTGA 
TCTCAAAAAA 



TOGGAGAGAT 
TCAGAGGCGG 
TGCTGCCAAA 
TACAGTTCCA 
TTCGGAGGGG 
GTCCAGTTCT 
CTCACAGCCC 
CCACTGTTGC 
GCCTTGTACT 
AGGCAGGCGC 
GATATAGGAG 
CTGGCCATAA 
GAATTTTAAG 
GCCACTGCCT 
GCCAGCGTTG 
CAGACCCTGG 
AGCAAAAAAA 
AGGTGGGTGG 
CCGTCTCTAC 
GCTACTTGGG 
GCCGAGATCC 
AAAAAAAAAA 



Seq ID NO: C434 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 261.. 2861 
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31 



51 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 

1740 

1800 



41 

I I I I I I 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG OGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGOGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGOQCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA IS 60 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGGAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

- AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACXG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC OGTGGGAATG TACTCOCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCOGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCAOGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC OGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 
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AAATGACTAG 
AACCCGCCCT 
AGCCCAGCCT 
TCTGCAGCTC 
CCAATTTGGC 
TGCTGCTTAA 
TAAGAGGGAG 
GTCTTGTGAT 
AGAGGGCCTG 
TTCCCCAGGT 
GGGAGGTGAC 
GGAAGGACTT 
TCCTTCTTGT 
GAACCACTTA 
TTCATTTAAA 
AGGATCTGTA 
TTTCTTGGTA 
TGTAAGACTT 
TTTTTCTGTT 
TGTAAACATT 
ATGTGCACTT 
GAGATGTCCT 
TTGGAAAAAT 
AAAAAAAAAA 



AGTAGAATGA 
CCCCTTGGTG 
GGGTGCACAG 
TACAGGTGAG 
TGATCTTGGG 
TGCCCTGCTC 
AACTCTATCT 
GAACTACATT 
CCTGGCTCCC 
GGGAGCCAAC 
CATAGGGCTC 
CTTCCAGGGA 
CCACGGTTTT 
GGATGTGATC 
AAAGATATCT 
CATAAAAGTT 
GCACAAATTT 
AAGTGAGTTA 
GCCGAAATAG 
TCTTGTAGGC 
CAAGAAGTCA 
TTGCATTGCT 
TTTGCTGTTA 
AAAAAAAAAA 



CAGCTAGCAG 
TTGGCGGTCC 
TAGCTGCAAC 
GCCCAGCAGA 
TGTCTGAACA 
TCTCCCTGGC 
GTGGTTTATA 
TATCCCCTTT 
TCCACCCAAC 
TGTCAGGGAG 
TGCTTTTAAA 
GATTAGTGGT 
GTTGAGTTTT 
ACTTTCAGGT 
ATTTGAAAGT 
TCTTTCCTAA 
TCTTATTGCT 
GGTCTTTAAG 
CTGGTCCTTT 
ATCACCATGA 
CTGTCAGAGA 
TGGAAGGGGT 
TTATAGTAAA 
AA 



ATCTCTTCCC 
CTGTGGCCTT 
TCCCCATTGG 
GGGAGTAGGG 
GCTATTGGGT 
CCACCTTATA 
ATCTTGCACG 
CCTGCCCCAA 
TGCACCCATG 
GTCTTTCOCA 
GATATGGCTG 
GATGGAGAGG 
CACTCTTCTA 
GGCCAGGAAT 
TCTCAGAGTT 
ACCATTCACC 
TAGAAAATTG 
GAAAGCAAGG 
TTCGGGAGTT 
ACAAAGATAT 
AATAAAGAAT 
GTACCTAGAG 
CATACAAAGG 



TCCTGCTCCC 
CACTTTGTTC 
TGCTACCTGG 
CTOGCCATGT 
CCACCCCAGT 
GAGAGCCCAA 
AGGCACCAGA 
CCACAAACTC 
AGACTCGGTC 
CCAAACATCT 
CTTCAAAGGC 
AGAGTTAAAA 
ATGCAAGGGT 
GTTGAATGTC 
GTACATATGT 
AAGAGCCAAT 
TCCTCCTTGT 
CTCCTCTGAA 
AGATGTATAG 
ATTTTCTATT 
TGTCTTAAAT 
CCAAGGAAAT 
ATGTCAAAAA 



AGCGCACACA 
ACTACCTGTC 
CTCTCCTGTC 
TTCTGGTGAG 
CCCTTTCAGC 
AGAGCTCCTG 
GTCTCCCTGG 
TTTCCTTCAA 
CAAGAGTCCA 
TTCAGCTGCT 
CAGAGTCACA 
TGACCTCATG 
CTCACACTGT 
TTTGGCTCAG 
TTCACAGTAC 
ATCTAGGCAT 
TATTTCTGTT 
ATGCTTGTCT 
AGTGTTTGTA 
TATTTATTAT 
GTCATGATTG 
TGGCTCTGGT 
AAAAAAAAAA 



Seg ID NO; C217 Protein Sequence 
Protein Accession ft: NP_00580S.l 

1 11 21 31 41 51 

I I 1 I I I 

MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK 
LLLTHTERW IWPFSNKNYI HGELYKNRVS I SNNAEQSDA SITIDQLTMA DNGTYECSVS 
LMSDLBGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI 
LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSNEEGTQ FCNITVAVRS PSMNVALYVG 
IAVGWAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD 
YRQEEQRSTG RESPDHLDQ 



Seq ID NO: C218 Protein Sequence 
Protein Accession #: Eos sequence 



MGSRTPESPL 
GPSVEFYRPG 



DNPTRILEYA 
IAESYYPEYL 
GYVTILNGSD 
EVGRVYVYLQ 
QQGWPVFPG 
VDKAWYRGR 
GFTVELQLDW 
LSPIHIALNF 
GEQNHVYLGD 
FAVNQSRLLV 
SFRLSVEAQA 
SQGVLELSCP 
SASSGPQILK 
YKALKMPYRI 
YKLGFFKRSL 



11 
I 

HAVQLRWGPR 
TDGVSVLVGA 
EEPVEYKSLQ 
PCRSDFSWAA 
INLVQGQLQT 
IRSLYNPSGE 
HPAGIEPTPT 
GPGGLGSKPS 
PIVSASASLT 
QRQXGGVRRA 
SLDPQAPVDS 
KNALNLTPHA 
CDLGNPMKAG 
QVTLNGVSKP 
QALEGQQLLY 
CPEAECFRLR 
LPRQLPQKER 
PYGTAMEKAQ 



21 
I 

RRPPLLPLLL 
PKANTSQPGV 
WFGATVRAHG 
GQGYCQGGFS 
RQASSIYDDS 
OMASYFGYAV 
LTLTGHDEPG 
QVLQPLWAAS 
IFPAMFNPBE 
LFLASRQATL 
HGLRPALHYQ 
QNVGEGGAYE 
ASLWGGLRFT 
EAVLFPVSDW 
VTRVTGLNCT 
CELGPLHQQE 
QVATAVQWTK 
LKPPATSDA 



31 
I 

LLLPPPPRVG 
LQGGAVYLCP 
SSILACAPLY 
AEFTKTGRW 
■YLGYSVAVGE 
AATDVNGDGL 
RFGSSLTPLG 
HTPDFFGSAL 
RSCSLEGNPV 
TQTLLIQNGA 
SKSRIEDKAQ 
AELRVTAPPE 
VPHLRDTKKT 
HPRDQPQKEE 
TNHPINPKGL 



41 
1 

GFNLDAEAPA 
WGASPTQCTP 



AEGSYGVPLW 



LGGPGSYFWQ 
FSGDDTEDFV 
DDLLVGAPLL 
DLDQDGYNDV 
RGGRDLDGNG 
ACINLSFCLN 
REDCREMKIY 
ILLDCGEDNI 
AEYSGLVRHP 
IQFDFQILSK 
DLGPAVHHVY 
ELDPEGSLHH 
WAKTFLQREH 
IIILAILFGL 



51 

I 

VLSGPPGSFF 
IEFDSKGSRL 
DPVGTCYIiST 
GQILSATQEQ 
AGVPKGNLTY 
MDRTPDGRPQ 
AIGAPFGGET 
YPDLIVGSFG 
ASGKHVADSI 



CVPDLQLEVF 
GNFS3LSCDY 
NLNNSQSDW 
ELINQGPSSI 
QQKREAPSRS 
QPFSLQCEAV 
LLLGLL1YIL 



Seq ID NOt C219 Protein Sequence 
Protein Accession ft: NP_002412.1 



MHSFPPLLUj 
VEKLKQMQEF 
YTPDLPRADV 
LAKAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PiO)IYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIKKAFQL 
IGGDAKFDED 
DDIDGIQAIY 
BVELNFISVF 
PRTVKHIDAA 
KKDGFFYPFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 



GTRQYKFDPK 



31 

i 

VDIiVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NUJRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 
I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSRST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



3360 
3420 
3460 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4702 



60 

120 

180 

240 

300 

319 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1049 



51 
I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDFP 



Seq ID NO: C220 Protein Sequence 
Protein Accession ft: Eos sequence 



41 



51 



1 11 21 31 

I I I I I I 

MHSFPPTiTJiTi LFWGWSHSP PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 
VEKLKQMQEF FOLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 
YTPDLPRADV DHAIKKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 
LAHAFQPGPG IGGDAKFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 



60 

120 

180 

240 

300 

360 

420 

469 



60 
120 
180 
240 



1336 



WO 03/042661 



TFSGDVQLAQ DDIDQIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQL.PNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGXTY FFVANKYWRY DBYKRSMDPG YPKMIAHDFP 420 

GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 469 



Seq ID NO: C221 Protein Sequence 
Protein Accession ft: NP_0S5146.1 



1 11 21 31 41 SI 

I I I I I I 

MVRKPWSTI SKGGYLQGKV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLI IRPAATA VISLAFGRYI LEPPFIQCEI PELAIKLITA VGITWMVLN 180 

SMSVSWSARI QIFLTFCKIiT AILIIIVPGV MQLIKGQTQM PKDAFSGRDS SITRLPLAPY 240 

YGMYAYAGWP YLNFVTEEVE KPEKTIPLAI CISMA1TIGV YVLTNVAYFT TINAEELLbS 300 

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAI* FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 

SEKITRTLQI ILEWPEEDK L 501 



Seq ID NO: C222 Protein Sequence 
Protein Accession #: NP_003237.1 



1 11 21 31 41 51 

I I I I I I 

MGLAWGLGVL FLMHVCGTNR IPESGGDMSV FDIFELTGAA RKGSGRRLVK GPDPSSPAPR 60 

IEDANLIPPV PDDKFQDLVD AVRAEKGFLL LASLRQMKKT RGTLLALERK DHSGQVPSW 120 

SNGKAGTLDL SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAEItD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVIiL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDELSSM VLELRGLRTI VTTLQDSIRK 300 

VTEENKEIAN ELRRPPLCYH NGVQYRNNEE MTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGBCCP RCWPSDSADD GWSPWSEWTS CSTSOGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 480 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNQIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRPTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIGDACDD DBDNDKIPDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVFNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKBLVQTV NCDPGLAVGY DEFNAVDFSG TPFINTERDD 1020 

DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSHRPKTGF IRWMYEGKK IMADSGPIYD 1140 

KTYAGGRLGIj FVFSQEMVFF SDLKYECRDP 1170 



Seq ID NO: C223 Protein Sequence 
Protein Accession #: NP_002183.1 



1 11 21 31 41 51 

I I 1 I I I 

MPLLWLRGPL LASCWIIVRS SPTPGSEGHS AAPDCPSCAL AALPKDVPNS QPEMVEAVKK 60 

HILNMLHLKK RPDVTQPVPK AALLNAIRKL HVGKVGENGY VEIEDDIGRR AEMNELMEQT 120 

SEIITFAESG TARKTLHFEI SKEGSDLSW ERAEVWLFLK VPKANRTRTK VTIRLPQQQK 180 

HPQGSLDTGE EAEEVGLKGE RSELLLSEKV VDARKSTWHV PPVSSSIQRL LDQGKSSLDV 240 

RIACEQCQES GASLVLLGKK KKKEEEGEGK KKGGGEGGAG ADEEKEQSHR PFLMLQARQS 300 

EDHPHRRRRR GLECDGKVNI CCKKQFFVSF KDIGWNDWII APSGYHANYC EGECPSHIAG 360 

TSGSSLSFHS TVINHYRMRG HSPPANLKSC CVPTKLRPMS MLYYDDGQNI IKKDIQNMIV 420 

426 



Seq ID NO: C224 Protein Sequence 
Protein Accession #: NP_000086.1 



1 11 21 31 41 51 

I 1 I 1 I I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQB TNAALQDVRD WLRQQVREIT 60 

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGPCPPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180 

INECBTGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 

BDVDRDGIGD ACDPDADGDG VpNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDPVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 

RDNCRLVPNP GQEDADRDGV GDVCQDDPDA DKWDKIDVC PENAEVTLTD FRAFQTWLD 540 

PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

BSQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 

RLGVFCFSQE NIIHANLRYR CNDTIPEDYE THQIiRQA 757 



Seq ID NO: C225 Protein Sequence 
Protein Accession ft: NP_6 12464 



1 11 21 31 41 51 

I I I I I I 

1337 



WO 03/042661 



PCT/DS02/36810 



5 
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15 
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80 



MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTCT NSVSRIIIEE 
LPK 

Seq ID NO: C226 Protein Sequence 
Protein Accession ft: NP_003216.1 

1 11 21 31 41 51 

I I 1 I I I 

MATMENKVIC ALVLVSMLAL GTLAEAQTET CTVAPRERQN CGFPGVTPSQ CANKGCCFDD 
TVRGVPWCFY PNTIDVPPEE ECEF 

Seq ID NO: C227 Protein Sequence 
Protein Accession 8: NP,_056234.1 

1 11 



60 
120 

iao 

240 
243 



60 
84 



51 



21 31 41 

I I I I I I 

MPKRAHWGAL SWLILLKGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 
NLGFNSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 
QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 
RLSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 
CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 
DGGSQLIIiEK FQLPQWSISL NHTDEHGNMV NLVCDIKKPM DVYKIHLNQT DP PD ID I NAT 
VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 
VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 
PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 
RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 
ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 
HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 
EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 
ERWADIIiAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 
SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 
TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 
EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 
STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 
RSSESEGQBS KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 
TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 
VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 
PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 
SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEBAGSS 
TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 
MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 
PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASEQLTRVPA KPILPTATVR 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QPTTPRLSST TIPLPLHMSK 
PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 
PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 
TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 
QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 
QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QIIASHYQDV TVYLGDTIAM ECLAKGTPAP 
QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 
HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 
GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 
CVARNKVGDD YWLKVDWM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISVJSLP 
DGSLVNSFKQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 
WTAPATIRN KTYliAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 
GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 
RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 
RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 
GTDLQSGQQL QRPYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 
ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 
PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 
KTTYIHVF 



Seq ID NO: C228 Protein Sequence 
Protein Accession ft: Eos sequence 



MPGTKLTRTG 
SRQYTVRYRE 
TPESAPTTAP 
FQNTFFHTPR 
SLCQQECSCT 
MCYEDP7VSS 
PQGRNAKDLL 
SDTQDQKRTL 
SASPAHHAST 
PSSVLRDRSS 
SSPLSKGGKD 



11 
I 

APADYRVILK 
KGELARWDYX 
ENLNVWPVNG 
LSNHLEQSPS 
QKDFSCLAYL 
LTGNSLKSVA 
LDLKNKILAN 
RPPSRHGHSV 
QGTSHRPSLP 
VHPGAKPASP 
GEDAPATNSN 



21 
I 

TSQEDELDVP 
QIANRRVLIE 
KPTWAASWD 
PILETLLLPW 
IDIQTKQVNK 
ASKADVQQNT 
GGAPRKPQLR 
VAPGRTAVRA 
ASLNDNDLVD 
ARRTPHSGAA 
AP8RSTMSSS 



31 

1 

DDISVRVMSS 
NLIPDTVYEF 
ALPETEGKVK 
WMVCSLGNAI 
DPQLEGSVFG 



AKKAEELDLQ 
RMPALPRREG 
SDEDERAVGS 
EEDSSASAPP 
VSSHLSSRTQ 



41 
I 

QSVLVSWVDP 
AVRISQGERD 
VCLLDTGLFS 
FSKSGPQTGE 
PCFLFYFLTF 
PSSPSPRAPA 
STEITGEEEL 
VDKPGFSLAT 
LHPKGAFAQP 
SRLSPPHGGS 
VSEGAEASDG 



51 
I 

VLEKOKKWA 
GKWSTSVFQR 
VSSFQPSAKS 
AHDLTPKPSL 
MLDIGGFSFI 
SSQHPSVPAS 
GSREDSPMSP 
QPRPGAPPSA 
RPALSPSRQS 
SRLLPTQPHL 



60 

120 

180 

240 

300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2828 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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1338 



WO 03/042661 



PCTAJS02/36810 
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GGRQAEATAQ 
AHPRVPSHSD 
ASUffiKEPIP 
SPARPPAARS 
QAGRPRPTSQ 
AARAKEAAAS 
GRSEPPSKRP 
GSLAKEERBP 
PWPRYTTRAP 
GSNGKPNGQR 
GTPWSPDGL 
TTPIiPTTTTP 
TTPKPTTPIP 
YDEDYEFETS 
DPSAPCSLTD 
ATPGDLVTGY 
ISPSVSFVTE 



TLRARPASGH 
SKPKLSSGIH 
ENPKSTGADT 
QQHPSVPRRM 
GRSHSSSDPY 
LPKHQQVESP 
LSSKSQQSVS 
AIALAPRGGS 
PGHFSTTPML 
IINGPQGTKW 
PLFGQGRHGT 
RPTTATTKQP 
TCPPGTLERH 
RPPTTTEPST 
ALDHFQVDSL 
LVYSASYEDF 
SDNPLLWRP 



PHLLRHKPFA 
GDEEDEKPLP 
HPQGKYSSLA 
TPGRAPEQQP 
TASSRGMLPT 
TGAGAGGOHR 
AEDEEEEDAG 
LAPVKRPLPP 
SLRQRMMHAR 
WDLDRGLVL 
PLANAQDKPI 
TTTTTPLPTT 
DDDGNLIMSS 
TATTPRVIPE 
DEIIPNDLKK 
IRNKPSTQAS 
PGGELSGSHS 



ANGRSPSRPS 
ATWNDHVPS 
SKAQDVQQST 
PPPVATSQHH 
ALQNQDEDAQ 
SQRGHAASPA 
PFKGGKEDLIj 
PPGSSPRASH 
FRNPLSRQPA 
NABGRYLQDS 
LSLGGKPLVG 
TPRPTTATTR 
NGIPECYABE 
EGAXSSPPEE 
SDLPPQHAPR 
SVTHLPIENL 
LSNMIPATRT 



IGRGPRLQPS 
SSRQPISRGW 
DADTEGHSPK 
PGPQSRDAGR 
GSYDDDSTEV 
RPSRPGGPQS 
SSSVPKHPSS 
VPSRPPPRSA 
RPSYRQGYNQ 
HGNPLRIKLG 
liEVIKKTTHP 
RTTTRRPTTT 
DEFSGLETDT 
EFDLAGRKRF 
NITWAVEGC 
KPNTRYYFKV 
AMDGNM 



SSPQSTVPSR 
EDLRRSPQRG 
AQPGSTDRHA 
SPSQPRLSLT 
EAQDVRAPAH 
RARVPSRAAP 
STPRGGKDAD 
ATVSPVAGTH 
RPNVEGKVLP 
GDGRTIVDLE 
PTTTMQPTTT 
VRTTTRTTTT 
AVPTEEAYVI 
VAPYVTYLNK 
HSFVIVDWDK 
QAQNPHGYGP 



Seq ZD NO: C229 Protein Sequence 
Protein Accession 8: NP_003005.l 

1 11 21 31 

I I I I 

MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT 
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV 
E5LACDELPV YDRGVCISPE AIVTDLPBDV KWIDITPDMM 
KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTWDV 
SSCQCPHZIiP HQDVLIKCYE WRSRKMLLEN CXjVEKWRDQL 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR 



Seq ID NO: C230 Protein Sequence 
Protein Accession #: NP 005931.1 



41 



51 
I 



RMPNHLHHST QENAILAIEQ 
CQRARDDCEP LMKMYNHSWP 
VQERPLDVDC KRLSPDRCKC 
KBIFKSSSPI PRTQVPLITN 
SKRSIQWEER LQEQRRTVQD 
TNPKRV 



1 
I 

MAPAAWLRSA 
PAPATQEAPR 
LVQEQVRQTM 
FFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFF 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11 

I 

AARALLPPML 
PASSLRPPRC 
AEALKVWSDV 
VHFDYDETWT 
RGVQHLYGQP 
FKAGFVWRLR 
VLGPAPLTEL 
GVPSEIDAAF 



21 
I 

LLLLQPPPLL 
GVPDPSDGLS 
TPLTFTEVHE 
IGDDQGTDIiL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRFPVHAA 
QDADGYAYFL 



31 
I 

ARALPPDVHH 
ARNRQKRFVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHWQGLP 
LVWGPEKNKI 
RGRLYWKFDP 



41 

I 

LHAERRGPQP 
SGGRWEKTDL 
RYWKGDDLPF 
LGLQHTTAAK 
EIAFLEPDAP 
SPVDAAFEDA 
YFFRGRDYWR 
VKVKALEGFP 



51 
I 

WHAALPSSPA 
TYRIIiRFPWQ 
DGPGGIliAHA 
ALMSAFYTFR 
PDACEASFDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC 



Seq ID NO: C231 Protein Sequence 
Protein Accession «i NP_076927 



MGENDPPAVE 
LALAIGLGIH 
TAASWKTMCS 
VTALHHSVYV 
LOGGSVITPL 
LGNDIALMKL 
VPLISNKICN 
GIGCAEVNKP 



11 
I 

APFSFRSLFG 



DDWKGHYANV 
REGCASGHW 
WIITAAHCVY 
AGPLTFKEMI 
HRDVYGGIIS 
GVYTRVTSPL 



21 
I 

LDDLKISPVA 
SSFKCIELIA 
ACAQLGFPSY 
TLQCTACGHR 
DLYLPKSWTI 
QPVCLPNSEE 
PSMLCAGYLT 
DWIHEQMERD 



31 
I 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVSS 
RGYSSRIVGG 
QVGLVSLLDN 
NFPDGKVCWT 
GGVDSCQGDS 



41 
I 

LSLLPIiKFFP 
GEDEYRCVRV 
LEGQFREEFV 
NMSLLSQWPW 
PAPSHLVEKI 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1666 



60 

120 

180 

240 

300 

346 



GGPLVCQERR 



51 
1 

IIVIGIIALI 
GGQNAVLQVF 
SIDHLLPDDK 
QASLQFQGYH 
VYHSKYKPKR 
DASPVLNHAA 
LWKLVGATSF 



Seq ID NO: C232 Protein Sequence 
Protein Accession #: NP_003211 



1 
1 

MLWKLTDNXK 
PYQPIYPQSQ 
LDPRRDYRRH 
SLSKSNSNAV 
PECLNASLLG 
ARDFGYVCET 
NSRPNPILEP 



11 
I 

YEDCBDRHDG 
DPYSHVNDPY 
EDLLHGPHAL 
SAIPINKDNL 
GVLRRAKSKN 
EFPAKAVAEF 
GIQSCLTHFN 



21 
I 

TSNGTARLPQ 
SIiNPLHAQPQ 
SSGLGDLSIH 
FGGWNPNEV 
GGRSLRBKLD 
LNRQHSDPNE 
LISHGFGSPA 



31 

I 

LGTVGQSPYT 
PQHPGNPGQR 
SLPHAIEEVP 
FCSVPGRLSL 
KIGLNLPAGR 
QVTRKNMLLA 
VCAAVTALQN 



41 

I 

SAPPLSHTPN 
QSQESGLLHT 
HVEDPGINIP 
LSSTSKYKVT 
RKAANVTLLT 
TKQICKEFTD 
YLTEALKAMD 



51 
I 

ADFQPPYFPP 
HRGLPHQLSG 
DQTVIKKGPV 
VAEVQRRLSP 
SLVEGEAVHL 
LLAQDRSPIiG 
KMYLSNNPNS 



Seq ID NO: C233 Protein Sequence 
Protein Accession #: NP_002 979.1 

1 11 21 31 41 51 

I I I I I I 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 



Seq ID NO: C234 Protein Sequence 
Protein Accession #: NP_0040S4.1 



60 

120 

180 

240 

300 

360 

420 

480 

488 



60 

120 

180 

240 

300 

360 

420 

453 



60 

120 

180 

240 

300 

360 

420 

437 



60 
89 



1339 



WO 03/042661 



1 11 21 31 41 51 

I I 1 1 I I 

MILQAHLHSL CLLMLYLATG YGQEGKFSGP LKPMTFSIYE GQEPSQIIFQ FKANPPAVTP 60 

ELTGETDNIF VIERBQLLYY NRALDRETRS THNLQVAALD ANGIIVEGPV PITIBVKDIN 120 

DNRPTFLQSK YFjGSVRQNSR PGKPFLYVNA TDLDDPATPN GQLYYQIVIQ LPMINNVMYF 180 

QINNKTGAIS LTREGSQELN PAKNPSYNLV ISVKDMGGQS EKSPSDTTSV DIIVTENIWK 240 

APKPVEMVEN STDPHPIKIT QVRWNDPGAQ YSLVDKEKLP RFPFSIDQEG DIYVTQPLDR 300 

EEKDAYVPYA VAKDBYGKPL 5YPLEIHVXV KDINDNPPTC PSPVTVFEVQ ENERLGNSIG 360 

TLTAHDRDEE NTANSFLNYR IVEQTPKLPM DGLFLIQTYA GMLQLAKQSL KKQDTPQYNL 420 

TIEVSDKDFK TLCFVQINVI DINDQIPIFE KSDYGNLTLA EDTNIGSTIL TIQATDADEP 480 

FTGSSKILYH IIKGDSEGRL GVDTDPHTNT GYVIIKKPLD FETAAVSNIV FKAENPEPLV 540 

FGVKYNASSF AKFTLIVTDV NEAPQFSQHV FQAKVSEDVA IGTKVGNVTA KDPEGLDISY 600 

SLRGDTRGWL KIDHVTGEIF 3VAPLDREAG SPYRVQWAT EVGGSSLSSV SEFHLILMDV 660 

NDNPPRLAKD YTGLPFCHPI* SAPGSLIFEA TDDDQHLFRG PHPTFSI/3SG SLQNDWEVSK 720 

INGTHARLST RHTEFEEREY WLIRINDGG RPPLEGIVSL PVTFCSCVEG SCFRPAGHQT 780 

GIPTVGMAVG ILLTTLLVIG IILAWFIRI KKDKGKDNVE SAQASEVKPL RS 832 

Seq ID NO: C235 Protein Sequence 
Protein Accession #: NP_004434.1 

1 11 21 31 41 51 

I I I I I I 

MARARPPPPP SPPPGLLPLL PPLLLLPLLL LPAGCRALEE TLMDTKWVTS ELAWTSHPBS 60 

GWEEVSGYDE AMNPIRTYQV OJVRESSQNN WLRTGFIWRR DVQRVYVELK FTVRDCNSIP 120 

NIPGSCKETF NLFYYEADSD VASASSPFWM ENPYVKVDTI APDESFSRLD AGRVNTKVRS 180 

FGPLSKAGFY LAFQDQGACM SLISVRAFYK KCASTTAGPA LFPETLTGAE PTSLVIAPGT 240 

CIPNAVEVSV PLKLYCNGDG EWMVPVGACT CATGHEPAAK ESQCRPCPPG SYKAKQGEGP 300 

CLPCPPNSRT TSPAASICTC HNNFYRADSD SADSACTTVP SPPRGVISNV NET SI>I LEWS 360 

EPRDLGGRDD LLYNVICKKC HGAGGASACS RCDDNVEFVP RQLGLTBRRV HISHLLAHTR 420 

YTFEVQAVNG VSGKSPLPPR YAAVNITTNQ AAPSEVPTLR LHSSSGSSLT LSWAPPERPN 480 

GVILDYEMKY FEKSEGIAST VTSQMNSVQL DGLRPDARYV VQVRARTVAG YGQYSRPAEF 540 

ETTSERGSGA QQLQEQLPLI VGSATAGLVF WAVWIAIV CLRKQRHGSD SEYTEKLQQY 600 

IAPGMKVYID PFTYEDPNEA VREFAKEIDV SCVKIEEVIG AGEFGEVCRG RLKQPGRREV 660 

PVAIKTLKVG YTERQRRDPL SEASIMGQFD HPNIIRLEGV VTKSRPVMIL TEFMENCALD 720 

SFLRLNDGQF TVIQLVGMLR GIAAGMKYLS EMNYVHRDLA ARNILVNSNL VCKVSDFGLS 780 

RFLEDDPSDP TYTSSLGGKI PIRWTAPEAI AYRKFTSASD VWSYGIVMWE VMSYGBRPYW 840 

DMSNQDVINA VEQDYRLPPP MDCPTALHQL MLDCWVRDRN LRPKPSQIVN TLDKLIRNAA 900 

SLKVIASAQS GMSQPLLDRT VPDYTTFTTV GDWLDAIKKG RYKESPVSAG FASFDLVAQM 960 

TAEDLLRIGV TLAGRQKKIL SSIQDMRLQM NQTLPVQV 998 

Seq ID NO: C236 Protein Sequence 
Protein Accession #: NP_001795.1 

1 11 21 31 41 SI 

I I I I I I 

MYVGYYLDKD SPVYPGPARP ASLGLGPANY GPPAPPPAPP QYPDFSSYSH VEPAPAPPTA 60 

WGAPFPAPKD DWAAAYGPGP AAPAASPASL AFGPPPDFSP VPAPPGPGPG LLAQPLGGPG 120 

TPSSPGAQRP TPYEWMRRSV AAGGGGGSGK TRTKDKYRW YTDHQRLELE KEFEYSRYIT 180 

IRRKSELAAN LGLTERQVKI WFQNRRAKBR KVNKKKQQQQ QPPQPPMAHD ITATPAGPSL 240 

GGLCPSNTSL LATSSPMPVK EEFLP 265 

Seq ID NO: C237 Protein Sequence 
Protein Accession #» NP_06B813,1 

1 11 21 31 41 51 

I I I I I I 

MGSDRARKGG GGPKDFGAGL KYNSRHEKVN GLEEGVEFLP VNNVKKVEKH GPGRWWLAA 60 

VLIGLLLVLL GIGFLVWHLQ YRDVRVQKVF NGYMRITNEN FVDAYBNSNS TBPVSLASKV 120 

KDALKLLYSG VPFLGPYHKE SAVTAFSEGS VIAYYWSEFS IPQHLVBEAB RVMAEERWM 180 

LPPRARSLKS PWTSWAFP TDSKTVQRTQ DNSCSFGLHA RGVELMRFTT PGFPDSPYPA 240 

HARCQWALRG DADSVLSLTF RSFDLASCDE RGSDLVTVYN TLSPMEPHAL VQLCGTYPPS 300 

YNLTFHSSQN VLLITLITNT BRRHPGFEAT FFQLPRMSSC GGRLRKAQGT FNSPYYPGHY 360 

PPNIDCTWNI EVPNNQHVKV RFKFFYLLEP GVPAGTCPKD YVEINGEKYC GERSQPWTS 420 

NSNKITVRFH SDQSYTDTGF LAEYLSYDSS DPCPGQFTCR TGRCIRKBLR CDGWADCTDH 480 

. SDELNCSCDA GHQFTCKNKF CKPLFWVCDS VNDCGDNSDE QGCSCPAQTF RCSNGKCLSK 540 

SQQCNGKDDC GDGSDEASCP KVNVVTCTKH TYRCLNGLCL SKGNPECDGK EDCSDGSDBK 600 

DCDCGLRSFT RQARWGGTD ADEGEWPWQV SLHALGQGHI CGASLISPNW LVSAAHCYID 660 
DRGPRYSDPT QWTAFLGLHD QSQRSAPGVQ ERRLKRIISH PFFNDFTFDY DIALLELEKP 720 
AEYSSMVRPI CLPDASHVPP AGKAIWVTGW GHTQYGGTGA LILQKGEIRV INQTTCENLL 780 
PQQITPRMMC VGFLSGGVDS OQGDSGGPLS SVEADGRIFQ AGWSWGDGC AQRNKPGVYT 840 
RLPLFRDWIK ENTGV 855 

Seq ID NO: C238 Protein Sequence 
Protein Accession 8: Eos sequence 

1 11 21 31 41 51 

I I 1 I t I 

MPPFLLLEAV CVFLFSRVPP SLPLQBVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 
SVGKGSPBRS KHFAITVCDG LDISPBRVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 
MVFKGGRTET BLALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 
FAVGVRFPRW EBLHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 
PCEHRTLEMV REPAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSbEC RVDLLFLLDS SAGTTLDGFL 360 

1340 



WO 03/042661 



PCT/US02/36810 



RAKVFVKRPV RAVLSEDSRA RVGVATYSRB LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTBSHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLPNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSFVRSCALQ FBVNPDVTQV GLWYGSQVQ TAFGLDTRPT RAAMLRAISQ 600 

5 APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRKNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK RPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWBGPHCENR FLRRP 755 

1 _ Seq ID NO: C239 Protein Sequence 
1U Protein Accession 8: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVPLPSRVPP SLPLQEVRVS" KETIGKISAA SKMKWCSAAV DIMFLLDGSN 60 

15 SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

20 RAKVFVKRPV RAVLSEDSRA RVGVATYSRB LLVAVPVGEY QDVPDLVWSL DGIPPRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLPNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAWVLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 

RTPPSNYREG LGTEMVPTFW NVCAPGP 807 



25 
30 
35 
40 
45 
50 
55 



Seq ID NO: C240 Protein Sequence 
Protein Accession #: XPJ>97386.1 

1 11 21 31 41 51 

i i i i i r 

MPKSEPLGCL SPASRAPGSA AATGAWLPAA SGGPGPLGPP CTCPPRSLGR GRAGSRAGSS 60 

PSGCVCVSGI LRWSVGDPA SRRWVDLDSN SEDLSLLLTP MIVGTGGVGG GWARGWVPAQ 120 

EKEVAEGSGH AGRGNGRRLQ RVYGARSWIL GRKPCLQRLL PASGGPVQPQ PCPSPATACR 180 

WGFKFGVAFW GAAQHPPLCR LGGGRAVPSA TRTLDGP 217 

Seq ID NO: C241 Protein Sequence 
Protein Accession #: CAC03433 

1 11 21 31 41 51 

I I I I I 1 

MLSSTDFTFA SWELWRVDH PNEEQQKDVT LRVSGDLHVG GVMLKLVEQI NISQDWSDFA 60 

LWWEQKHCWL LKTHWTLDKY GVQADAKLLF TPQHKMLRLR LPNLKMVRLR VSFSAWFKA 120 

VSDICKILNI RRSEELSLLK PSGDYFKKKK KKDKNNKEPI IEDILNLESS PTASGSSVSP 180 

GLYSKTMTPI YDPINGTPAS STMTWFSDSP LTEQNCSILA FSQPPQSPEA LADMYQPRSL 240 

VDKAKLNAGW LDSSRSLMEQ GIQEDEQLLL RFKYYSFFDL NPKYDAVRIN QLYEQARWAI 300 

LLEEIDCTEE EMLIFAALQY HISKLSLSAE TQDFAGBSEV DEIEAALSNL EVTLEGGKAD 360 

SLLEDITDIP KLADNLKLFR PKKLLPKAFK QYWFIFKDTS IAYFKNKELE QGEPLEKLNL 420 

RGCEWPDVN VAGRKFGIKL LIPVADGMNE MYLRCDHENQ YAQWMAACML ASKGKTMADS 480 

SYQPEVLNIL SFLRMKNRNS ASQVASSLEN MDMNPECFVS PRCAKKHKSK QLAARILEAH 540 

QNVAQMPLVE AKLRFIQAWQ SLPEFGLTYY LVRFKGSKKD DILGVSYNRL IKIDAATGIP 600 

VTTWRFTNIK QWNVNWETRQ WIEFDQNVF TAFTCLSADC KIVHEYIGGY IFLSTRSKDQ 660 

NETLDEDLFH KLTGGQD 677 



60 
65 
70 
75 
80 



Seq ID NO: C242 DNA Sequence 
Nucleic Acid Accession ft: NM_005170 
Coding sequence: 337.. 918 

1 11 21 31 41 51 

1 I I I I I 

GGGCGTGAGA AAGGCGACGG CGGCGGCGCG GAGGAGGGTT ATCTATACAT TTAAAAACCA 60 

GCCGCCTGCG CCGCGCCTGC GGAGACCTGG GAGAGTCCGG CCGCACGCGC GGGACACGAG 120 

CGTCCCACGC TCCCTGGCGC GTAOGGCCTG CCACCACTAG GCCTCCTATC CCCGGGCTCC 180 

AGACGACCTA GGACGCGTGC CCTGGGGAGT TGCCTGGCGG CGCCGTGCCA GAAGCCCCCT 240 

TGGGGCGCCA CAGTTTTCCC CGTCGCCTCC GGTTCCTCTG CCTGCACCTT CCTGCGGCGC 300 

GCCGGGACCT GGAGCGGGCG GGTGGATGCA GGCGCGATGG ACGGCGGCAC ACTGCCCAGG 360 

TCCGCGCCCC CTGCGCCCCC CGTCCCTGTC GGCTGCGCTG CCCGGCGGAG ACCCGCGTCC 420 

CCGGAACTGT TGCGCTGCAG CCGGCGGCGG CGACCGGCCA CCGCAGAGAC CGGAGGCGGC 480 

GCAGCGGCCG TAGCGCGGCG CAATGAGCGC GAGCGCAACC GCGTGAAGCT GGTGAACTTG 540 

GGCTTCCAGG CGCTGCGGCA GCACGTGCCG CACGGCGGCG CCAGCAAGAA GCTGAGCAAG 600 

GTGGAGACGC TGCGCTCAGC CGTGGAGTAC ATCCGCGCGC TGCAGCGCCT GCTGGCCGAG 660 

CACGACGCCG TGCGCAACGC GCTGGCGGGA GGGCTGAGGC CGCAGGCCGT GCGGCCGTCT 720 

GCGCCCCGCG GGCCGCCAGG GACCACCCCG GTCGCCGCCT CGCC C TCCOG CG CTTCTT CG 780 

TCCCCGGGCC GCGGGGGCAG CTCGGAGCCC GGCTCCCCGC GTTCCGCCTA CTCGTCGGAC 840 

GACAGCGGCT GCGAAGGCGC GCTGAGTCCT GCGGAGCGCG AGCTACTCGA CTTCTCCAGC 900 

TGGTTAGGGG GCTACTGAGC GCCCTCGACC TA 932 

Seq ID NO: C243 Protein Sequence 
Protein Accession #: NP_060233.1 

l 11 21 31 41 51 

1341 



WO 03/042661 



PCT/US02/36810 
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MSGGHQLQLA 
LNLTLEGVFA 
SLASKARMAG 
KAHVRIELKE 
SQLATRRYQA 
PWLHQHRTCP 
PSRSAVARPP 
LQSTSQHPAA 
NCTDISLQGV 
KTQVSSHVHY 
SNSAAPSGRL 
SEPTPGSRPO 
NSQPVWLCLT 
QAV 



I 

ALWPWLLMAT 
GVAEITPAEG 
ERGASAVLFD 
PPAWPDYDVW 
SCRQARGEWP 
LCVFNITEGD 
RPGPFLPSQE 
CPVPLRRARP 
HGSSSTFCSS 
HRHRHHHYKK 
SNPQCPRALP 
DATVHPACQI 
PRQPLEPHPP 



I 

LQAGFGRTGL 
KLMQSHPLYL 
ITEDRAAAEQ 
ILMTWGTIF 
DSGSSCSSAP 
SFSQSLGPSR 
PGMGPRHHRF 
FDSSGSGBSY 
LSSDPDPLVY 
RFQWRGRKPG 
EPAPGPVDAS 
FPHYTPSVAY 



I 

VLAAAVESER 
CNASDDDNLE 
LQQPU3LTWP 
VIILASVLRI 
VCAICLEEFS 
SYQEPGRRLH 
PRAAHPRAPG 
CTERSGYLAD 
CSPKGDPQRV 
PETGVPQSRP 
SICPSTSSLF 
PWSPEAHPLI 
TAEGRPCPYP 



I 

SAEQKAVIRV 
PGFISIVKLE 
WLIWGNDAB 
RCRPRHSRPD 
EGQELRVISC 
LIRQHPGHAH 
EQQRLAGAQH 



I 

IPLKKDPTGK 
SPRRAPRPCL 
KLMEFVYKNQ 
PLQQRTAWAI 
LHEFHRNCVD 
YKLPAAYLLG 
PYAQGWQlSH 



DMQPSVTSRP 
PIPRTQPQPE 
NLQKSSLSAR 
CGPPGLDKRL 
HCQVLSAQPG 



RSLDSWPTG 
PPSPDQQVTG 
HPQRKRRGGP 
LPETPGPCYS 



60 
120 

ieo 

240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
783 



Seq ID NO: C244 DNA Sequence 
Nucleic Acid Accession 8: NMJ5042B9 
Coding sequence: 493.. 1695 

1 11 21 31 41 51 

I I I I I I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 
GAGAATGGGG TACTAAGAGA AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 
AATGAAGAAA GGGTGTCAGC CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 
AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TAGAAATGAG 
AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 
TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 
ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 
CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 
CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 
AATCCTGAGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 
CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 
CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 
GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 
TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 
ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 
GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 
CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 
GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 
ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 
GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 
AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 
GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 
GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 
CTGCTACTTG AATAACTCAG TTAAOGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 
CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 
GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 
AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 
AAATTTTGCT TTCT 



Seq ID NO: C245 Protein Sequence 
Protein Accession #: NP 004433 



l 
I 

MALRRLGAAL 
VCNVPESSQN 
DSATKTFPNW 
YGGCMSLIAV 
CNGDGEWLVP 
TNCVCRNGYY 
ICKSCGSGRG 
SPQFASVNIT 
YNATAIKSPT 
LPLIIGSSAA 
FTYEDPNEAV 
TBKQRRDFLS 
VIQLVGMLRG 
YTSAIiGGKIP 
EQDYRLPPPM 
INLPLLDRTI 
LAGHQKKILN 



11 

I 

LLLPLLAAVE 
NWLRTKFIRR 
MENPWVKVDT 
RVFYRKCPRI 
IGRCMCKAGF 
RADLDPLDMP 
ACTRCGDNVQ 
TNQAAPSAVS 
NTVTVQGLKA 
GLVFLIAVW 
REFAKSIDIS 
EASIKGQFDH 
IAAGMKYliAD 
IRWTAPEAIQ 
DCPSALHQLK 
PDYTSFNTVD 
SIQVMRAQMN 



21 
I 

ETLMDSTTAT 
RGAHRIHVEM 
IAADESFSQV 
IQNGAIFQET 
EAVENGTVCR 
CTTIPSAPQA 
YAPRQLGLTE 
IMHQVSRTVD 
GAIYVFQVRA 
IAIVCNRRRG 
CVKIEQVIGA 
PNVIHLEGW 
KNYVHRDLAA 
YRKFTSASDV 
LDCVJQKDRNH 
EWLEAIKMGQ 
QIQSVEV 



31 
I 

AELGWMVHPP 
KFSVRDCSSI 
DLGGRVMKIN 
LSGAESTSLV 
GCP9GTFKAN 
VISSVNETSIi 
PRIYISDLLA 
8ITLSWSQPD 
RTVAGYGRY8 
FERADSEYTD 
GEFGEVCSGH 
TKSTPVMIIT 
RNILVNSNLV 
WSYGIVMWBV 
RPKFGQIVNT 
YKESFANAGF 



41 
I 

SGWEEVSGYD 
PSVPGSCKET 
TEVRSFGPVS 
AARGSCIANA 
QGDEACTHCP 
MLEWTPPRD3 
HTQYTFEIQA 
QPNGVIIiDYE 
GKMYFQTMTE 
KLQHYTSGHM 
LKLPGKREIF 
EFMENGSLDS 
CKVSDFGLSR 
MSYGERPYWD 
LDKMIRNPNS 
TSFDWSQMM 



51 
I 

ENMNTIRTYQ 
FNLYYYEADF 
RSGPYLAFQD 
EEVDVPIKLY 
INSRTTSEGA 
GGREDLVYNI 
VNGVTDQSPF 
LQYYEKELSE 
AEYQTSIQEK 
TPGMKIYIDP 
VAIKTIiKSGY 
FLRQNBGQFT 
FLEDDTSDPT 
MTNQDVINAI 
LKAMAPLSSG 
MEDILRVGLT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2174 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

987 



Seq ID NO: C246 Protein Sequence 
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10 
15 



25 



40 



65 



Protein Accession »: HP_114148.1 

1 11 21 31 41 SI 

I I I I I I 

MDARRVPQKD LRVKKNLKKF RYVKLISMBT SSSSDDSCDS FASDNFANTR LQSVREGCRT 60 

RSQCRHSQPL RVAMKFPARS TRGATNKKAE SROPSENSVT DSNSDSEDES GMNFLEKRAL 120 

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEGGV GERLQQFSKR RYITVHWALL VINAVRRLLI PKQTAETQTA GAFEASSVAP 300 

AFETVMVKRS GMLCWIRTGI ARLVEESATA VSAGSEMDGV RLGSLCI 347 

Seq ID NO: C247 Protein Sequence 
Protein Accession #: NP 036577.1 



11 21 31 41 51 

1 I I I I I 

MENPSPAAAL GKALCALLLA TLGAAGQPLG GESICSARAP AKYSITFTGK WSQTAFPKQY 60 

PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFAER GBAWALMKEI EAAGEALQSV 120 

HAVFSAPAVP SGTGOTSAEL EVQRRHSLVS FWRIVPSPD WFVGVDSLDL CDGDRWREQA 180 

20 ALDLYPYDAG TDSGFTFSSP NFATIPQDTV TEITSSSPSH PANSPYYPRL KALPPIARVT 240 

LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCKV SLWSSWGLCG GHCGRIGTKS 300 

RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 331 



Seq ID NO: C248 Protein Sequence 
Protein Accession 8: NP_063 947.1 



1 11 21 31 41 51 

1 I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW VLIKVILDKY 60 

30 YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPPFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTROJADD 360 

35 AYQGEVTEKM MCAGIPEGGV DTOQGDSGGP LMYQSDQWHV VGXVSWGYGC GGPSTPGVYT 420 

KVSAYLNWIY NVWKAEL 437 



Seq ID NO: C249 Protein Sequence 
Protein Accession fi: NP 003036.1 



1 11 21 31 41 51 

1 I I I I I 

MGCKVLLNIG QQMLRRKWD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE 60 

NAGPAIVISF LIAALASVLA GLCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGWNLILS 120 

45 YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IILILTGLLT 180 

LGVKESAMVN KIFTCINVLV LGPIMVSGFV KGSVKNWQLT EEDFGNTSGR LCLNNDTKEG 240 

KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIPVGI VASLLICFIA 300 

YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCALSAS LLGSMFPMPR 360 

VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLLAY 420 

50 SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 480 

LSPKNMEPSK ISGLIVNIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 540 

WTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMLIGFIIY 600 

FGYGLWHSEE ASLDADQART PDGNLDQCK 629 

55 Seq ID NO: C250 Protein Sequence 
Protein Accession ft: NPJ)02767.1 

1 11 21 31 41 51 

, n I I I I I I 

OU MRAPHLHLSA ASGARALAKL LPLLMAQLWA AEAALLPQND TRLDPEAYGA PCARGSQPVfQ 60 

VSLFNGLSFH CAGVLVDQSW VLTAAHCGNK PLWARVGDDH LLLLQGEQLR RTTRSWHPK 120 

YHQGSGPILP RRTDEHDLML LKLARPWPG PRVRALQLPY RCAQPGDQCQ VAGWGTTAAR 180 

RVKYNKGLTC SSITILSPKE CEVFYPGWT NNMICAGLDR GQDPCQSDSG GPLVCDBTLQ 240 

GILSWGVYPC GSAQHPAVYT QICKYMSWIN KVIRSN 276 



Seq ID NO: C251 Protein Sequence 
Protein Accession ft: XP 095088.3 



„ 1 11 21 31 41 51 

70 | | | | | I 

MTRAATAEPG RVSPASPARS TAGLPRAFLQ SLRTLLDILD DWQRGCVHLR EIQSLWVEAR 60 

EliPSGVLEGL SQRRGPQPGA AVRSRRGGAV PRGARAVPER CAGTETRRGR RCSGLQRLGG 120 

GFRGCPADPC ARGEHRRHTI TSGVDOGLLK QMKELEQEKE VLLQGLEHMA QGRDWYQQQL 180 

_ _ QQVQERQCRL GQSRASADFG AVGSPRPUGR LLPKVQEVAR WLGELLAEAC AGRALPTSSS 240 

75 GPPCSALTST SSPGWQQQII LMLKEQNRLL TQEVTEKSER ITQLEQKSAL IKQLFEARAL 300 

SQQDGGLSPA GPHIEPLTRF RLPVLTWAGA LLSPHSPQLL LFLSADSGGP LHELPDTWFP 360 

AVLLWVPSPG KRTAHARLHF HQRPABGAWQ LGCGAEAAPE TCGTLPHFES HKTTCEPDSL 420 

GGPCPQEGDR SWSHLGAAFD VAPAVAKVTP NREDAAGSRH GDICPLCPKG LLTFRDIAXE 480 

OA FSLAEWQCLD HAQQNLYRDV MLENYRNLFS LGMTVSKPDL IACLEQNKEP QNIKRNEMAA 540 

O0 KHPVTCSHFN QDLQPEQSIK DSIiQKVIPRT YGKCGHBNLQ LKKCCKRVDB CEVHKGGYND 600 

LNQCLSNTQN KIFQTHKCVK VFSKFSNSNR HNARYTGKKH LKCKKYGKSF CMPSHLNQHQ 660 

IIHTKEKSYK CEECGKSFNH SSSGTTHKRI LTGEKPYRCE EOGKAFRWPS NLTRHKRIHT 720 

GEKPYACEEC GQAFRRSSTL TNHKRIHTGE RPYKCEECGK AFSVSSALIY HKRIHTGBKP 780 

YTCEEOGKAP NCSSTUCTHK IIKTGEKPYT CEECGRTFNC SSTVKAHKRI HTGBKPYKCB 840 
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ECDKAFKWHS SLAKHKI IHT GEKPYKCSDS KALAKSSEVQ KVYSGDGENG IRVHKKKBTQ 900 

GWLVRNKNEN RTGLFQIRAA VRPNRDPSWG QQBGSLTDPI QRKEEPDLQN HYOHQNALED 960 

QRNTGVGGLL TFRDWIEFS LEEWQCLDHA QQNLYRDVML ENYRNLVSLG IAVSKPDLIT 1020 

CLEQNKEPWN IKRNEMVTKH PDLPPBLGIK DSLQKVIPRR YGKSGHDNLQ VKTCKSMGEC 1080 

EVQKGGCNEV NQCLSTTQNK IPQTHKCVKV FGKFSNSNRH KTRHTGKKHF KCKKYGKSFC 114 0 

MVSQLHQHQI IHTRENSYQC KECGKPFNCS STLSKHKRIH TGEKPYRCEE CGKAFTWSST 1200 

LTKHRRIHTG EKPYTCEECG QAFSRSSTLA NHKRIHTGEK PYTCEECGKA PSLSSSLTYH 1260 

KRIHTGEKPY TCEECGKAFN CSSTLKKHKI IHTGEKPYKC KECGKAFAFS STLNTHKRIH 1320 

TGEEPYKCEE CDKAFKWSSS LANHKSMHTG EKPYKCE 13 57 



Seq ID NO: C2S2 Protein Sequence 
Protein Accession ft: NP_114433.1 

1 11 21 31 41 51 

I I I I I I 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWPYHKSN CYGYFRKLRN WSDAELECQS 60 

YGNGAHIASI LSLKEASTIA BYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120 

KSMGGNKHCA EMSSNNNFLT NSSNECNKRQ HFLCKYRP 158 



Seq ID NO: C253 Protein Sequence 
Protein Accession ft: XP__051860.2 

1 11 21 31 41 51 

1 1 I I I I 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60 

LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180 

HMGQQIiVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVKGS NGLLIKDWG 240 

YNSLGHCFFT EDGFEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKPY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FIiSIISARYS PHQDADPLKP REPAIIRHPI 420 

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQBIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540 

IiAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYQP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 840 

TKDHFL5VKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVWIDGNQ GRWSHTSFR 900 

NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAPV 960 

GFKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL 996 



Seq ID NO: C254 Protein Sequence 
Protein Accession ft: NP_055188.1 

1 11 21 31 41 51 

1 I I I I 1 

HTALSSENCS FQYQLRQTNQ PLDVNYLLPL IILGKILLNI LTLGMRRKNT CQNPMEYPCI 60 

SliAPVDLLLL VNISIILYFR DFVLLSIRPT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 120 

CLNPSKTTKL SPKOQKLPYF FTVILIWISV LAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 180 

QSYWLSFFMV MILFVAFITC WEEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 240 

LSKLIVCFLS TWLPFVLLQV IIVLLKVQIP AYIEMNIPWL YFVNSFLIAT VYWFNCHKLN 300 

LKDIGLPLDP PVNWKCCPIP LTIPNLEQIE KPISIMIC 338 



Seq ID NO i C25S Protein Sequence 
Protein Accession ft: Eos sequence 



1 11 21 31 41 51 

I I I I I ! 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

I FLYWQPRD FPEDTLRKFX QKAYESKIDY DKIVYYEAGI ILCCVLGLLF IIUIPLVGYF 120 

FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240 

IPVLDEIKSM ATAIKBTKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300 

HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 360 

RVQRQTTTW AGIKRVIAISI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420 

YDSYWWLGGL VICSLLTLTV I FYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540 

SKMKLTPEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600 

LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 660 

LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 720 

NTSSVIIEET KKYGRTI IGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780 

PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 840 

HVYGIHNPVM TSPSQH 856 



Seq ID NO: C256 Protein Sequence 
Protein Accession ft: NPJL49038.1 



1 11 21 31 41 51 

I I I I I I 

MKAIIHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSBSTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 
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LHSEITSLPK DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LABTTSDNBK 300 

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VAS SLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNOQKCAFGY SGLDCKDKFQ 420 

LILTIVGT1A GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 460 

J GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 512 

Seq ID NO: C257 Protein Sequence 
Protein Accession ft: NPJ101423.1 

10 1 11 21 31 41 51 

I I I t I | 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSONCTA LVQTEDNPRV 60 

AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN YCRCEVGYTG VRCEHFFLTV HQPLSKEYVA 120 

LTVILIIliFL ITWGSTYYF CRWYRNRKSK EPKKEYERVT SGDPELPQV 169 



15 



60 



Seq ID NO: C258 Protein Sequence 
Protein Accession ft: AAC63 902.1 



OA \ 11 21 31 41 51 

MDRSKENCIS GPVKATAPVG GPKRVLVTQQ IPOQNPLPVN SGQAQRVLCP SNSSQRVPLQ 60 

AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 120 

EESKfcRQWAL EDFEIGRPLG KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAGVEHQLRR 180 

„ EVEIQSHLRH PNILRXiYGYF HDATRVYLIL EYAPLGTVYR ELQKLSKFDB QRTATYITEL 240 

ZD ANALSYCHSK RVIHRDIKPE NLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 

IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360 

SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNOQNKESAS KQS 403 

„ Seq ID NO: C2S9 Protein Sequence 
D\J Protein Accession #-. NP_037504.1 

1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEEGSFQS CSPCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 



IDLD 



Seq ID NO: C261 Protein Sequence 
Protein Accession ft: NP 000575.1 



184 



. - Seq ID NO: C260 Protein Sequence 
4U Protein Accession ft: Eos sequence 

1 11 21 31 41 si 

I I I 1 I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RPTNDLKNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGPESVQV TQFRNGSIVA GYEWGSSSA SELLS AIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

- n LSLLEELNKN FSMIVGNATE AAVSSPVQNL SVIIRQNPST TVGNLASWS ILSNISSLSL 360 

JU ASHFRVSNST MEDVISIADN ILNSASVTNW TVIiLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPPVPST IFPWKNITY VGLGISIGSL 600 

r<- ILCLIIEALF WK Q I KKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

J J VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNIAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA PSHTGDSSDN 900 

IMLTQFVSNE g 10 



>;r J- 11 21 31 41 51 

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60 

CANTEIIVKL SDGRELCLDP KENWVQRWE KFLKRAENS 99 

n Seq ID NO: C262 Protein Sequence 
/U Protein Accession ft: NP_005594.1 

} 11 21 31 41 51 

1 I I I I ! 

MSTERDSBTT FDEDSQPNDB WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 60 

/O ECTWQVKAND RKYHEQPHFM NTKFLCIKES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 120 

NLYPLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180 

IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSLCYVETAB LDGETNLKFK 240 

MSLEITDQYL QREDTLATFD GFIECBEPNN RLDKFTGTLP WRNTSFPLDA DKILLRGCVI 300 

On RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 360 

OU YWEAQVGNSS WYLYDGEDDT PSYRGFLIFW GYIIVLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480 

DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGALVN AARNFGFAFL ARTQNTITIS ELGTERTYNV LAILDFNSDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIPANET LRTLCLCYKE 660 
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IEEKEFTEKN KXFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 720 

LAKADIK.IWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLHARME NQRNRGGVYA 780 

KFAPPVQESP FPPGGNRALI ITGSWLNEIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDIACEC SAVICCRVTP KQKAMWDLV KRYKKAITLA IGDGANDVNM 900 

5 IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNFA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VLTSMILPFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 1080 

VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QPTGTASNAL 1140 

t n RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVPRR 1200 

1U GVSTRRSAYA PSHQRGYADL 1SSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 1251, 
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Seq ID NO: C263 Protein Sequence 
Protein Accession 8: XM_044S33 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS HLAAPHGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL €0 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAPSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWIjQDPAF 240 

VASAYIPESL GSLQGODDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVIjQQRW 300 

TSFLKAQLLC SRPDDGFPFN VUJDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSPVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 

PLDHRGYQSL SDSPPGSRVP TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 837 

Seq ID NO s C264 Protein Sequence 
Protein Accession #: NPJJ08950.1 

1 11 21 31 41 51 

I I I I I I 

MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQBLM TLKMSGDKGI SAFPESDNLF 60 

KWVGTIHGAA GTVYEDLRYK LSLEFPSGYP YNAPTVKFLT PCYHPNVDTQ GNICLDILKE 120 

KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELWKNPT AFKKYLQETY SKQVTSQEP 179 

Seq ID NO; C265 Protein Sequence 
-Protein Accession #: NP_055399.1 



.-1 11 21 31 41 51 

45 | 1 | | | i 

MGRGWGFLFG LLGAVWLLSS GHGEEQPPET AAQRCFCQVS GYLDDCTCDV ETIDRFNNYR 60 

LFPRLQKLLE SDYFRYYKVN LKRPCPFWND ISQCGRRDCA VKPCQSDBVP DGIKSASYKY 120 

SEEANNLIEE CEQABRLGAV DESLSEETQK AVLQWTKHDD SSDNFCEADD IQSPEAEYVD 180 

LLLNPERYTG YKGPDAWKIW NVIYEENCFK PQTIKRPLNP LASGQGTSEE NTPYSWLEGL 240 

CVEKRAFYRL ISGLHASINV HLSARYLLQE TWLEKKWGHN ITEFQQRFDG ILTEGEGPRR 300 

LKNLYFLYLI BLRALSKVLP FFERPDFQLF TGNKIQDBEN KMLLLEILHE IKSFPLHFDE 360 

NSFFAGDKKE AHKLKEDFRL HFRNISRIMD CVGCFKCRLW GKLQTQGLGT ALKILFSEKL 420 

IANMPESGPS YEFHLTRQEI VSLFNAFGRI STSVKELENP RNLLQNIH 468 



Seq ID NO: C266 Protein Sequence 
Protein Accession #i NP_002B79.1 

1 11 21 31 41 51 

I I I I I I 

MQPRRQRLPA PWSGPRGPRP TAPLLALLLL LAPVAAPAGS GGPDDPGQPQ DAGVPRRLLQ 60 

QKARAALHFF NFRSGSPSAL RVLAEVQEGR AWINPKEGCK VHWFSTERY NPESLLQEGE 120 

GRLGKCSARV FFKNQKPRPT INVTCTRLIE KKKRQQEDYL LYKQHKQLKN PLEIVSIPDN 180 

HGHIDPSLRL IWDIAPLGSS YVMWEMTTQV SHYYLAQLTS VRQWVRKT 228 

Seq ID NO: C267 Protein Sequence 
Protein Accession #: NP 005400.1 



1 11 21 31 41 51 

7n I ! I I I I 

/U MSVKGMAIAL AVILCATWQ GFPKFKRGRC LCIGPGVKAV KVADIEKASI KYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 94 



Seq ID NO: C268 Protein Sequence 
Protein Accession #: FGENBSH predicted 

1 11 21 31 41 51 

I I I I I I 

MLRQVLRRGL QSFCHRLGLC VSRHPVPFLT VPAVLTITFG LSALNRFQPE GDLERLVAPS 60 

HSLAKIERSli ASSLFPLDQS KSQLYSDLHT PGRYGRVIIiL SPTGDNILLQ AEGILQTHRA 120 

VLEMKVNHKG YNYTPSHLCV LRNQDKKCVL DDIISVLEDL RQAAVSNKTT ARVQVRYPNT 180 

KLKVCSFCML, LPIKEAALHP LP 202 

Seq ID NO: C269 Protein Sequence 
Protein Accession ft: NP_002429.1 
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1 11 21 31 41 51 

I I I I I I 

MRLPLLLVFA SVIPGAVLLL DTRQFLIYNE DHKRCVDAVS PSAVQTAACN QDAESQKFRW 60 

VSESQIMSVA FKLCLGVPSK TDWVAITLYA CDSKSBFQKW BCKNDTLLGI KGEDLFFNYG 120 

NRQEKNIMLY KGSGLWSRWK IYGTTDNLCS RGYEAMYTLL GNANGATCAF PFKFENKWYA 180 

DCTSAGRSDG WLWCGTTTDY DTDKLFGYCP LKFEGSESLW NKDPLTSVSY QINSKSALTH 240 

HQARKSCQQQ NAELLSITEI HEQTYLTGLT SSLTSGLWIG LNSLSFNSGW QWSDRSPFRY 300 

LNWLPGSPSA EPGKSCVSLN PGKNAKWENL ECVQKLGYIC KKGKTTLNSF VIPSESDVPT 360 

HCPSQWWPYA GHCYKIHRDE KKIQRDALTT CRKEGGDLTS IHTIEELDPI ISQLGYEPND 420 

ELWIGLKDIK IQMYFEHSDG TPVTFTKWLR GEPSHEKNRQ EDCWMKGKD GYWADRGCEW 480 

PLGYICKMKS RSQGPE1VEV EKGCRKGWKK HHFYCYMIGH TLSTFAEANQ TCNNENAYLT 540 

TIEDRYEQAF LTSFVGLRPE KYFWTGLSDI QTKGTFQWTI EEEVRFTHWN SDMPGRKPGC 600 

VAMRTGIAGG LWDVLKCDEK AKFVCKHWAE GVTHPPKPTT TPEPKCPEDW GASSRTSLCF 660 

KLYAKGKHEK KTWFESRDFC RALGGDLASI NNKBEQQTIW RLZTASGSYH KLFWLGLTYG 720 

SPSEGFTWSD GSPVSYENWA YGEPNNYQNV EYOGELKGDP TMSWNDINCB HUraWICQIQ 780 

KGQTPKPEPT PAPQDNPPVT EDGWVIYKDY QYYFSKEKET MDNARAFCKR NFGDLVSIQS 840 

ESEKKFLWKY VNRNDAQSAY FIGLLISLDK KFAWMDGSKV DYVSWATGEP NFANEDENCV 900 

TMYSNSGFWN DINCGYPNAF ICQRHNSSIN ATTVMPTMPS VPSGCKEGWN FYSNKCFKIF 960 

GFMEBERKNVJ QEARKACIGF GGNLVSIQNE KEQAFLTYHM KDSTPSAWTG LNDVNSEHTF 1020 

LWTDGRGVHY TNWGKGYPGG RRSSLSYEDA DCWIIGGAS NEAGKWMDDT CDSKRGYICQ 1080 

TRSDPSLTNP PATIQTDGFV KYGKSSYSLM RQKFQWHEAE TYCKLHNSLI ASILDPYSNA 1140 

FAWLQMETSN ERVWIALNSN LTDNQYTWTD KWRVRYTNWA ADEPKLKSAC VYLDLDGYWK 1200 

TAHCNESFYF LCKRSDEIPA TEPPQLPGRC PBSDHTAWIP FHGHCYYIES SYTRNWGQAS 1260 

LECLRMGSSL VSIESAAESS FLSYRVEPLK SKTNFWIGLF RNVEGTWLWI NNSPVSFVNVJ 1320 

NTGDPSGERN DCVALHASSG FWSNIHCSSY KGYICKRPKI IDAKPTHELL TTKADTRKMD 1380 

PSKPSSNVAG WIIVILLIL TGAGLAAYFF YKKRRVHLPQ EGAFENTLYF NSQSSPGTSD 1440 

MKDLVGNIEQ NEHSVI 1456 

Seq ID NO: C270 Protein Sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

1 I I I I I 

MVLLHWCLLW LLFPLSSRTQ KLPTRDEELF QMQIRDKAFP HDSSVIPDGA EISSYLFRDT 60 

PKRYPPWEE DNTPLSVTVT PCDAPLEWKL SLQELPEDRS GEGSGDLEPL EQQKQQIINE 120 

EGTELFSYKG NDVEYFISSS SPSGLYQLDL LSTEKDTHFK VYATTTPESD QPYPELPYDP 180 

RVDVTSLGRT TVTLAWKPSP TASLLKQPIQ YCWINKEHN FKSLCAVEAK LSADDAFMMA 240 

PKPGLDFSPF DFAHFGFPSD NSGKERSFQA KPSPKLGRHV YSRPKVDIQK ICIGNKNIFT 300 

VSDLKPDTQY YFDVFWNIN SNMSTAYVGT FARTKEEAKQ KTVBLKDGKI TDVFVKRKGA 360 

KFLRFAPVSS HQKVTFPIHS CLDAVQIQVR RDGKLLLSQN VEGIQQFQLR GKPKAKYLVR 420 

LKGNKKGASM LKILATTRPT KQSPPSLPED TRIKAFDKLR TCSSATVAWL GTQERNKFCI 480 

YKKEVDDNYN EDQKKREQMQ CLGPDIRKKS EKVLCKYFHS QNLQKAVTTE TIKGLQPGKS 540 

YLLDVYVIGH GGHSVKYQSK WKTRKFC 568 

Seq ID NO: C271 Protein Sequence 
Protein Accession ft: AAH34229.1 



1 11 21 31 41 51 

50 | | | | | | 

MEKVQLEFEM QEMEKKLQEF RSTRNKEKED RESSEYYWKS GKVGKLVNQS YMMSQNKGNV 60 

VKFSAGKVKL KLLKEQIQEP VKPTVNYKMA NSSECEKPKI NGKVGGQCEN KAALLVCLEC 120 

GEDYCSGCFA NVHQKGALKL HRTTLLQARS QILFNVLDVA HQFIKDVNPD EPKEENNSTK 180 

ETSKIQHKPK SVLLQRSSSE VEITTMKRAQ RTKPRKSLLC EGSPDEBASA QSFQEVLSQW 240 

RTGNHDDNKK QNLHAAVKDS LEECEVQTNIi KIWREPLNIE LKEDILSYME KLWLKKHRRT 300 

PQEQIiFKCYQ IRSHIHMKPL VMHSVLKMKT MKIVMVRRPK YNTQLFYCQ 349 



Seq ID NO: C272 Protein Sequence 
Protein Accession ft: NP_078963.1 



1 11 21 31 41 51 

I I ! I I I 

MEKLWIiKKHR RTPQEQLFKM LSDTFPHPHE TTGDAQCSQN ENDED SDGEE TKVQBTALLL 60 

PVETLNIERP EPSLKTVELD DTYEEEPEEA ENIVPYKVKL ADADSQRSCA FHDCQKNSPP 120 

DJ YENGIHQHHV PDKGKRDFLN LCLRNSSTYY KDNSKGETSN TDFDNTVDPD VYSSDIEKIE 180 

ESTSFERNLK EKNIGLESNQ KSDDSCVSLE SKDTLLGRDL EKAPIEEKLS QDIKESLELS 240 

NLYKRPSFEE SKTTKSSLLL QEIACRSKPI TKQYQGLERF FIFDTNERLN LLPSHRLECN 300 

NSSTRITLAE DREWIPDHSL SEYADNATVL GVLQGAQSPS SSRKQQKMGQ KSQRPSTANF 360 

PLSNSVKESS SCLSSSHPRS RSAAAQSSSR AASEISEIEY IDITDQNELS LDDTTDQHTL 420 

DNLEKELQVL RSLADTSEKL YSLTSEEFPD FSSQSLNISQ ISTDFLKTSH VRGPCGVEEL 480 

SCSGRDTKIQ SLLSLSESST DEEEEDFLNK QHVITLPWSK ST 522 



Seq ID NO: C273 Protein Sequence 
Protein Accession ft: NP_005399.l 

1 11 21 31 41 51 

I I I I I I 

MKVSAVLLCL LLMTAAFNPQ GLAQPDALNV PSTGCFTFSS KKISLQRLKS YVITTSRCPQ 60 

KAVIFRTKLG KEICADPKEK WVQNYMKHLG RKARTLKT 98 

Seq ID NO: C274 Protein Sequence 
Protein Accession ft: BAC05158.1 

1 11 21 31 41 SI 
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I I I I I I 

MFLLTGGVSL KSAEKNPDPT WLQDKSWEEI CRASEFPAFR GLRQHFCEHI YEWREIYDSK 60 

EPHNAKFPAP MDKNLNELQK IIILRCLRPD KITPAITNYV TDKLGKKFVE PPPFDLTKSY 120 

LDSNCTIPLI FVLSPGADPM ASLLKFANDK SMSGNKFQAI SLGQGQGPIA AKMIKAAIEE 180 

5 GTWCLQNCH LAVSWMPMLE KICEDFTSET CNSSFRLWLT SYPSSKPPVT ILQNGVKMTN 240 

EPPTGLRLNL LQSYLTDPVS DPEFFKGCRG KELLPINEYD TIPFEAISYL TGECNYGGRV 300 

TDDWDRRLLL TMLADFYNLY IVENPHYKFS PSGNYFAPPK GTYEDYIEFI KKLPFTQHPE 360 

IPGLHENVDI SKDLQQTKTL FESLLLTQGG SKQTGASGST DQILLEITKD ILNKLPSDFD 420 

IEMALRKYPV RYEESMNTVL VQEMERFNNL IITIRNTLRD LEKAIKGWV MDSALEALSS 480 

1U SLLVGKVPEI WAKRSYPSLK PLGSYITDFL ARLNFLQDWY NSGKPCVFWL SGFFFTQAFL 540 

TGAMQNYARK YTTPIDLLGY EFEVIPSDTS DTSPEDGVYI HGLYLDGARW DRESGLLAEQ 600 

YPKLLFDLMP IINIKPTQKS RIIKSDAYVC PLYKTSERKG TLSTTGHSTN FVIAMLLKTD 660 

QPTRHWIKRG VALLCQLDD 679 

15 Seq ID NO: C275 Protein Sequence 
Protein Accession 8: AAA60212.1 



20 



1 11 21 31 41 51 

1 I I I I I 

MAESHLLQWL LLLltPTLCQP GTAAWTTSSL ACAQGPEFWC QSLEQALQCR ALGHCLQEW1 60 

GHVGADDLCQ ECEDIVHILN KMAKEAIFQD TMRKFLEQEC NVLPLKLLMP QCNQVLDDYF 120 

PLVIDYFQNQ TDSNGICMHL GLCKSRQPEP EQEPGMSDPL PKPLRDPLPD PLLDKLVLPV 180 

LPGALQARPG PHTQDLSEQQ PPIPLPYCWL CRALIKRIQA MIPKGALAVA VAQVCRWPL 240 

VAGGICQCLA ERYSVILLDT LLGRMLPQLV CRLVLRCSMD DSAGPRSPTG EWLPRDSECH 300 

25 LCKSVTTQAG NSSEQAIPQA MLQACVGSWL DREKCKQFVE QHTPQLLTLV PRGHDAHTTC 360 

QALGVCGTMS SPIOCIHSPD L 381 
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Seq ID NO: C276 Protein Sequence 
Protein Accession 8: NP_631911.1 

1 11 21 * 31 41 51 

I I I I I I 

MLGCGIPALG LLLLLQGSAD GNGIQGFFYP WSCEGDIWDR ESCGGQAAID SPNLCLRLRC 60 

CYRNGVCYHQ RPDENVRRKH MWALVWTCSG LLLLSCSICL PWWAKRRDVL HMPGPLAGPC 120 

DMSKSVSLLS KHRGTKKTPS TGSVPVALSK ESRDVEGGTE GEGTEEGEET EGEEEED 177 

Seq ID NO: C277 Protein Sequence 
Protein Accession 8: NP_473364.1 

1 11 21 31 41 51 

I - I t I I I 

MKLVTIFLLV TISLCSYSAT AFLINKVPLP VDKLAPLPLD NILPFMDPLK LLLKTLGISV 60 

EHLVEGLRRC VNELGPEASE AVKKLLEALS HLV 93 

Seq ID NO: C278 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MPLSYAYKNA ETLAGRHTSS WMSRGAYQRR NTRAAGRPEE CTDRNWEAGR TRGIWLGQLE 60 

ERCSDVFGVS FFWWVRGLAG SGAKLQTFTP AQEGAPTVQR QAEALLKCRQ SGRPGRGGAE 120 

SBRARDASML SPLSAAMRNY PTSSTIPPRR* SYSPTEIAHK SYSCSLPDMK ISMAESGPSL 180 

DSLDILEDGE SGSPFLVTHL YFLGWTTGM EQLDFETGPN IFDLQIYVKD EVGVTDLQVL 240 

TVQVTDVNEP PQFQGNLAED HLRADQPHFN AHSHTYVRW ATALARHRLR SSIGSPFLGT 300 

FCVWGMQYF LISPPKSFRM SANGTLFSTT ELDPEAGHRS FHLIVEVRDS GGLKASTELQ 360 

VNIVNLNDEV PRFTSPTRVY TVLEBLSPGT IVANITAEDP DDEGFPSHLL YSITTVSKYF 420 

MINQLTGTIQ VAQRIDRDAG BLRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 480 

TGQKFTFRSS LHPALCSKTL TWMDTVLDCF HAADKDIPVT GRFTKERGLI GLTVPHGWGS 540 

LTIMAEGKEE QVTSYMDGSR QRDRACVGKL LLIKPSDLMR LSHYHENNSG KTCPHDSISS 600 

YQVPPTTCRN SRIQATNNED TSSVTVTVNI LEEKDEKPIC TPNSYFLALP VDLKVGTNIQ 660 

NFKLTCTDLD SSPRSFRYSI GPGNVNNHFT FSPNAGSNVT RLLLTSRFDY AGGFDKIWDY 720 

KLLVYVTDDN LMSDRKKAEA LVETGTVTLS IKVTPHPTTI ITTTPRPRVT YQVLRKNVYS 780 

PSAWYVPFVI TLGSILLLGL LVYLWLLAK AIHRHCPOCT GKNKEPI/TKK GETKTAERDV 840 

WETIQMNTI FDGEAIDPEP EQASLELYAL LPSCCDPSPV TLRKVQVCGB SBETGQCSGH 900 

ITLPGKIPVD DPRKQETGLQ GDPEVWTLCP AVKWVGSPQ AERCIRLALS LKKYSSD 957 

Seq ID NO: C279 Protein Sequence 
Protein Accession #i XP_168571.1 

1 11 21 31 41 51 

I I 1 I I I 

MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 60 

TOQKFTFSIM VPESTAKGTL LLDLNKFCFD DDSEAPNNRF NFTMPSGVGS GSRFLQDPAG 120 

SGKXVLIGDL DYENPSNLAA GNKYTVIIQV QDVAPPYYKN NVYVYILTSP ENEFPLIFDR 180 

PSYVFDVSER RPAQGHLSGP BEKRLLSICH VRAVCHHFGL HIASGSPRVP GRPIGQSHPQ 240 

TLPLQDWEEQ GTSOKERRNE DCRERRRGGN YPDEHYL 277 

Seq ID NO: C280 Protein Sequence 
Protein Accession 8: NP_005257.2 

1 11 21 31 41 51 

I I I I I I 

MGDWSFLGNF LEEVHKHSTV VGKVWLTVLF IFRMLVLGTA AESSWGDEQA DFRCDTIQPG 60 
CQNVCYDQAF PISHIRYWVL QIIFVSTPSL VYMGHAMHTV RMQEKRKLRE AERAKEVRGS 120 
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GSYEYPVAEK AELSCWEEGN GRIALQGTLL NTYVCSILIR TTMBVGFIVG QYPIYGIFLT 180 

TLHVCRRSPC PHPVNCYVSR PTEKNVFIVF MLAVAALSLL LSLABLYHLG WKKIRQRFVK 240 

PRQHMAKOQL SGPSVGIVQS CTPPPDFNQC LENGPGGKFF NPFSNNMASQ QNTDNLVTEQ 300 

VRGQEQTPGE GFIQVRYGQK PEVPNGVSPG HRLPHGYHSD KRRLSKASSK ARSDDLSV 358 

Seq ID NO: C281 Protein Sequence 
Protein Accession 8: NP_0S5274-2 

1 11 21 31 41 51 

I I I I I I 

MYLSICCCFL LWAPALTLKN LNYSVPEEQG AGTVIGNIGR DARLQPGLPP AERGGGGRSK 60 

SGSYRVLENS APHlxLDVDAD SGLLYTKQRI DRESLCRHNA KCQLSLEVFA NDKEICMIKV 120 

EIQDINDNAP SFSSDQIEMD ISENAAPGTR FPLTSAHDPD AGENGLRTYL LTRDDHGLFG 180 

LDVKSRGDGT KFPELVIQKA 'LDREQQNHHT LVLTALDGGE PPRSATVQIN VKVIDSNDNS 240 

PVFEAPSYLV ELPENAPLGT WIDLNATDA DEGPNGEVLY SFSSYVPDRV RELFS IDPKT 300 

GLIRVKGNLD YEENGMLEID VQARDLGPNP IPAHCKVTVK LIDRNDNAPS IGFVSVRQGA 360 

LSEAAPPGTV IALVRVTDRD SGKNGQLQCR VLGGGGTGGG GGLGGPGGSV PFKLEENYDN 420 

FYTWTDRPL DRETQDEYNV TIVARDGGSP PLNSTKSFAI KILDENDNPP RFTKGLYVLQ 480 

VHENNIPGEY LGSVLAQDPD LGQNGTVSYS ILPSHIGDVS IYTYVSVNPT NGAIYALRSF 540 

NFEQTKAFEF KVLAKDSGAP AHLESNATVR VTVLDVNDNA PVIVLPTLQN DTAELQVPRN 600 

AGLGYLVSTV RALDSDFGES GRLTYEIVDG NDDHLFEIDP SSGEIRTLHP FWEDVTPWE 660 

LWKVTDHGK PTLSAVAKLI IRSVSGSLPE GVPRVNGEQH HWDMSLPLIV TLSTISIILL 720 

AAMITIAVKC KRENKEIRTY NCRIAEYSHP QLGGGKGKKK KINKNDIMLV QSEVEERNAM 780 

NVMNWSSPS LATSPMYFDY QTRLPLSSPR SEVMYLKPAS NNLTVPQGKA GCHTSFTGQG 840 

TNASETPATR MSIIQTDNFP AEPMYMGSRQ QFVQSISVAP RLRTQKEPA 889 

Seq ID NO: C282 Protein Sequence 
Protein Accession #: NPj)0S592.l 

1 11 21 31 41 51 

I I I I I I 

MELCRSLALL GGSLGLMFCL IALSTDFWFE AVGPTHSAHS GLWPTGHGDI ISGYIHVTQT 60 
FSIMAVLWAL VSVSFLVLSC FPSLFPPGHG PLVSTTAAFA AAISMWAMA VYTSERWDQP 120 
PHPQIQTFFS WSFYLGWVSA ILLLCTGALS LGAHCGGPRP GYETL 165 

Seq ID NO: C263 Protein Sequence 
Protein Accession #: NP_006424.2 

1 11 21 31 41 51 

1 I I I I I 

MATWALLLLA AMLLGNPGLV FSRLSPEYYD LARAHLRDEE KSCPCLAQEG PQGDLLTKTQ 60 

ELGRDYRTCL TIVQKLKKMV DKPTQRSVSN AATRVCRTGR SRWRDVCRNF MRRYQSRVTQ 120 

GLVAGETAQQ ICEDLRLCIP STGPL 145 

Seq ID NO: C284 Protein Sequence 
Protein Accession #: NPJ>05594.1 

1 11 21 31 41 51 

I I I I 1 I 

MKVSAAALAV ILIATALCAP ASASPYSSDT TPCCFAYIAR PLPRAHIKEY FYTSGKCSNP 60 

AWFVTRKNR QVCANPEKKW VRBYINSLBM S 91 

Seq ID NO: C285 Protein Sequence 
Protein Accession g: NP_071437.1 

1 11 21 31 41 51 

I I I I I I 

MAPGRAVAGIi LLLAAAGLGG VAEGPGLAFS EDVLSVFGAN LSLSAAQLQH LLEQMGAASR 60 

VGVPKPGQLH FNQCLTAEEI FSLHGFSNAT QITSSKFSVI CPAVLQQI2JF HPCEDRPKHK 120 

TRPSHSEVWG YGFLSVTIIN LASLLGLILT PLIKKSYFPK ILTFFVGLAI GTLFSNAIFQ 180 

LIPEAFGFDP KVDSYVEKAV AVFGGFYLLF FFERMLKMLL KTYGQNGHTH FGNDNFGPQE 240 

KTHQPKALPA INGVTCYANP AVTEANGHIH FDNVSWSLQ DGKKEPSSCT CLKGPKLSEI 300 

GTIAWMITLC DALHNFIDGL AIGASCTLSL LQGLSTSIAI LCEEFPHELG DFVILLNAGM 360 

STRQALLFNF LSACSCYVGL AFGILVGNNF APNIIFALAG GMFLYISLAD MFPEMNDMLR 420 

EKVTGRKTDF TFFMIQNAGM LTGFTAILLI TLYAGEIELE 460 

Seq ID NO: C286 Protein Sequence 
Protein Accession #: NP_00417S.l 

1 11 21 31 41 51 

I I I t i I 

MPNSEPASLL ELFNSIATQG ELVRSLKAGN ASXDEIDSAV KMLVSLKMSY KAAAGEDYKA 60 

DCPPGNPAPT SNHGPDATEA EEDFVDPWTV QTSSAKGIDY DKLIVRFGSS KIDKELINRI 120 

ERATGQRPHH FLRRGIFFSH RDMNQVLDAY ENKKPFYLYT GRGPSSEAMH VGHLIPFIFT 180 

KWLQDVFNVP LVIQMTDDEK YLWKDLTLDQ AYGDAVENAK DIIACGFDIN KTFIFSDLDY 240 

MGMSSGFYKN WKIQKHVTF NQVKGIFGFT DSDCIGKISF PAIQAAPSFS NSFPQIFRDR 300 

TDIQCLIPCA IDQDPYFRMT RDVAPRIGYP KPALLHSTFF PALQGAQTKM SASDPNSSIF 360 

LTDTAKQIKT KVNKHAFSGG RDTIEEHRQF GGNCDVDVSF KYLTFFLEDD DKLEQIRKDY 420 

TSGAMLTGEL KKALIEVLQP LIAEHQARRK EVTDEIVKEF MTPRKLSFDF Q 471 



Seq ID NO: C287 Protein Sequence 



1349 



WO 03/042661 



Protein Accession #t NPJJ04929.1 



KTVFRQENVD 
IEREVSILKE 
KQILNGVYYL 
PEFVAPEIVN 
FSNTSALAKD 
ARKKWKQSVR 
QHLLGSLSNY 
RHGHVDTLKF 
TPLHCAAWHG 
CDKDGHIALH 
NCNLDISNKY 
GLLARLRKDT 
RPRLSSTNSS 
PTHHPHCSAD 
FSLEEPYEIQ 
GEFGYDKDTS 
PPMTHLCBKI 
EINIMQSETV 
ELLQILDAMD 
PPCGIFHKVQ 
GLETEKIKCC 
AQTLKETSLT 
PDPLGKDWCL 
LRELGRRDAA 



11 
I 

DYYDTGEEIX3 
IQHPNVITLH 
HSLQIAHFDL 
YEPLGLEADM 
FIRRLLVKDP 
LISLOQRLSR 
DVNQPNKHGT 
LSENKCPLDV 
YYSVAKALCE 
LAVRRCQMEV 
GRTPLHLAAN 
HRGLFIQQLR 
RFPPSPIASK 
DQSTKAIDIQ 
LNPVIFWLSF 
LLKEIRNRFG 
ISTLPSWRKL 
QDVLLLDPRW 
ICARDLSSGT 
VNLCRWIHQQ 
LbLDSVCSTI 
NTMGGYKESF 
LAMNLGLPDL 
DLLLKASSVF 



21 
1 

SGQFAWKKC 
EVYENKTDV1 
KPENIMLLDR 
WSIGVITYIL 
KKRMTIQDSL 
SFLSRSNMSV 
PPLLIAAGCG 
KDKSGEMALH 
AGCNVNIKNR 
IKTLLSQGCF 
NGILDWRYL 
PTQNLQPRIK 
PTVSVSINNL 
NAYLNGVGDP 
LKSLVPVEEP 
NDLHISNKLF 
NGPNQLMSLQ 
LCTNVLGKliL 
MVDVPALIKT 
STEGDADIRL 
ENVMATTLPG 
SSIMCFGCHD 
VAKYNTONGA 
KINLDGNGQE 



31 

I 

REKSTGLQYA 
LXLELVAGGE 
NVPKPRIKII 
LSGASPFLGD 
QHPWIKPKDT 
ARSDDTLDEE 
NIQILQLiLIK 
VAARYGHADV 
EGSTPLLTAS 
VDYQDRHGNT 
CLMGASVEAL 
LKLFGHSGSG 
YPGCENVSVR 
SVWEFSGNPV 
IAFGGKLKNP 
VLDAGASGSK 
QFVYDVQDQL 
SVETPRALHH 
DNLHRSWADE 
WVNGCKLANR 
LLTVKHYLSP 
VYSQASLGMD 
PKDFLPSPLH 
AYASSCNSGT 



41 
I 

AKFIKKRRTK 
LFDFLAEKES 
DFGLAHKIDF 
TKQETLANVS 
QQALSRKASA 
DSFVMKAIIH 
RGSRIDVQDK 
AQVTCAASAQ 
ARGYHDIVEC 
PLHVACKDGN 
TTDGKTAEDL 
KTTLVESLKC 
SRSMMFEPGL 
YFCCYDYFAA 
LQWLVATHA 
DMKVLRNHLQ 
NPLASEEDLR 
YRGRYTVEDI 
EDEVMVYGGV 
GAELLVLLVN 
QQLREHHEPV 
IHASDLNLLT 
ALLREWTTYP 
SYNSISSWS 



LTEEEATEFL 
GNEFKNIFGT 
AVNYEFEDEY 
VNMEKFKKFA 
AINDDNVPGL 
GGSNAVYWAA 
IPISRTKEEE 
LAEHGADLNA 
MPIWAliCEA 
ARSEQHEHVA 
GLLRSFFRRR 
TKGMLEVFVA 
NDPTSIHWV 
DIMNVPRPAG 
EIRSQIVSVC 
RIAQQLHSTG 
QRLVPDSDVE 
RIVPVEHLTP 
KGQGIEVQVR 
MIYQPRDFFR 
RRKLSRLLDP 
ESTVGTLMSK 
R 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1431 



Seq ID NO j C288 Protein Sequence 
Protein Accession #; NP_002072.1 



MELRARGWWL 
ICPQGYTCCT 
TLQATFPGAF 
QLLLPDDYLD 
PliGPECSRAV 
TDKFWGTSGV 
RGKLAPRERP 
RGRYLPEVMG 
ASDDGSGSGS 



11 
1 

LCAAAALVAC 



GELYTQNARA 
CLGKQAEALR 
MKLVYCAHCL 
ESVIGSVHTW 
PSGTLEKLVS 
DGLANQrNNP 



21 
I 

ARGDPASKSR 
SHAELETALR 
FRDLYSELPX 
PFGEAPRELR 
GVPGARPCPD 
IiAEAINALQD 
EAKAQLRDVQ 
EVEVDITKPD 



31 
1 

SCGEVRQIYG 
DSSRVLQAML 
YYRGANLHLE 
LRATRAFVAA 
YCRNVLKGCL 
NRDTLTAKVI 
DFWI8LPGTL 
MTIRQQIMQL 



41 

I 

AKGFSLSDVP 
ATQLRSFDDH 
ETLAEFWARL 
RSFVQGLGVA 
ANQADLDAEW 
QGCGNPKVMP 
CSEKMALSTA 
KIMTNRLRSA 



51 

I 

QAEISGEHLR 
FQHLLNDSER 
LERLFXQLHP 
SDWRKVAQV 
RNLLDSMVLI 
QGPGPEEKRR 
SDDRCWNGMA 
YNGNDVDFQD 



60 

120 

180 

240 

300 

360 

420 

480 

490 



Seq ID NO: C289 Protein Sequence 
Protein Accession #: AAH30205.1 

1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 

EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120 

ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 

DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDBLPDDIIS TGNVMTLKFL SDASVTAGGF 240 

QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 277 

Seq ID NO: C290 Protein Sequence 
Protein Accession &: NP_001973.l 



MRANDALQVL 
GNLEIVLTGH 



11 
I 

GLLFSIiARGS 
NADLSFLQWI 
LRQLRI*TQLT 



RSCPPCHEVC 
TDCPACRHFN 
CVRACPPDKM 
LDFLITGLNG 
RSLYNRGFSIi 
RLDIKHNRPR 
REFAHEAKCF 
YKYPDVQNEC 
GTFLYWRGRR 
GTVHKGVWIP 
SSLQLVTQYL 
LLKSPSQVQV 
VWELMTFGAE 
LANEFTRMAR 
NXiATTTLGSA 
PVSLHPMPRG 
TPVTPLSPPG 
NRRRRHSPPH 
EYMNRQRDGG 
SAFDNPDYWH 



DSGACVPRCP 
EVDKNGLKMC 
DPWHKIPALD 
LIMKNLNVTS 
RDCVAEGKVC 
SCHPECQPMG 
RPCHENCTQG 
IQNKRAMRRY 
EGESIKIPVC 
PLGSLLDHVR 
ADFGVADLLP 
PYAGLR1AEV 
DPPRYLVIKR 
LSLPVGTLNR 
CLASSSSEGH 
LEBEDVNGYV 
PPRPSSLEEL 
GPGGDYAAMG 
SRLFPKANAQ 



21 
I 

EVGNSQAVCP 
REVTGYVLVA 
BILSGGVYIE 
DOQTLTKTIC 
QPLVYNKLTF 
EPCGGLCPKA 
PEKLNVFRTV 
LGFRSLKEIS 
DPLCSSGGCW 
OTATCNGSGS 
CKGPELQDCL 
LERGESIEPL 
IKVIEDKSGR 
QHRGALGPQL 
PDDKQLLYSE 
PDLLEKGERI* 
ESGPGIAPGP 
PRGSQSLLSP 
VTGSEAELQE 
MPDTHLKGTP 
GYEYMDVGSD 
ACPASEQGYE 
RT 



31 

I 

GTLNGLSVTG 
MNEFSTLPLP 
KNDKLCHMDT 



QLEPNPHTKY 
CEGTGSGSRF 
REITGYLNIQ 
AGRIYISANR 
GPGPGQCLSC 
DTCAQCAHFR 
GQTLVLIGKT 
DFSEKANKVL 
QSFQAVTDHM 
LLNWGVQIAK 
AKTPIKWMAL 
AQPQICTIDV 
BPHGLTNKKL 
SSGYMPMNQG 
KVSMCRSRSR 
SSREGTLSSV 
LSASLGSTQS 
EMRAFQGPGH 



41 
I 

DAENQYQTLY 
NLRWRGTQV 
IDWRDIVRDR 
PNPNQCCHDE 
QYGGVCVASC 
QTVDSSNIDG 
SWPPHMHNFS 
QLCYHHSLNW 
RNYSRGGVCV 
DGPHCVSSCP 
HLTMALTVIA 
ARIFKETELR 
LAIGSLDHAH 
GMYYLEEHGM 
ESIHFGKYTH 
YMVMVKCWMI 
EEVELBPELD 
NLGGSGQESA 
SRSPRPRGDS 
GLSSVLGTEE 
CPLHPVPIMP 
QAPHVHYARL 



51 
I 

KLYERCEWM 
YDGKFAIFVM 
DAEIWKDNG 
CAGGCSGPQD 
PHNFWDQTS 
FVNCTKILGN 
VFSNLTTIGG 
TKVLRGPTEE 
THCNPLNGEP 
HGVLGAKGPI 
GLWIFMMLG 
KLKVLGSGVF 
IVRLLGXjCPG 
VHRNLAARNV 
QSDVWSYGVT 
DEMIRPTFKE 
LDLDLEAEED 
VSGSSBRCPR 
AYHSQRHSLL 
EDEDEEYEYM 
TAGTTPDEDY 
KTLRSLEATD 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1342 



1350 



WO 03/042661 



PCT/US02/36810 



Seq ID NO: C291 Protein Sequence 
Protein Accession ft: NP_001207.1 

_ 1 11 21 31 41 51 

5 I | | | | I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSEED SPRBEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 1B0 

ELI/3PQUPL PELRLRNNGH SVQLTLPPGIi EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

10 VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAPLE EGPEENSAYE QLLSRLEEIA 300 

EEGSETQVPG LDISALLPSD PSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASP PAGVDSSPRA AEPVQUJSCL AAGDILALVF 420 

GLLFAVTSVA PLVQMRRQHR RGTKGGVSYR PAEVAETGA 459 

15 Seq ID NO: C292 Protein Sequence 
Protein Accession ft: NP_004198.1 

1 11 21 31 41 51 

on I I ! I I I 

ZO MGGAWDEGP TGVKAPDGGW GWAVLFGCFV ITOPSYAFPK AVSVFFKELI QEFGIGYSDT 60 

AWISSILLAM LYGTGPLCSV CVNRFGCRPV MLVGGLFASL GMVAASPCRS I IQVYLTTGV 120 

ITGLGLALNP QPSLIMLNRY FSKRRPMANG LAAAGSPVFL CALSPLGQLL QDRYGWRGGP 180 

LILGGLLLNC CVCAALMRPL WTAQPGSGP PRPSRRLLDL SVFRDRGFVL YAVAASVMVL 240 

GLPVPPVFW SYAKDLGVPD TKAAPLLTIL GFIDIFARPA AGFVAGLGKV RPYSVYLFSF 300 

25 SMFFNGLADL AGSTAGDYGG LWFCIFFGI SYGMVGALQF EVLMAIVGTH KFSSAIGLVL 360 

LMEAVAVLVG PPSGGKLLDA THVYMYVFIL AGAEVLTSSL ILLLGNFFCI RKKPKEPQPE 420 

VAAAEEEKLH KPPADSGVDL REVEHPLKAE PEKNGEWHT PETSV 465 

A Seq ID NO: C293 Protein Sequence 
30 Protein Accession ft: NPJJ00349.1 

1 11 21 31 41 51 

I I I I I I 

MAXiFVRLLAL ALALALGPAA TLAGPAKSPY QLVLQHSRLR GRQHGPNVCA VQKVIGTNRK 60 

35 YFTNCKQWYQ RKICGKSTVI SYECCPGYEK VPGEKGCPAA LPLSNLYETL GWGSTTTQL 120 

YTDRTEKLRP EMEGPGSFTI FAPSNEAWAS LPAEVLDSLV SNVNIELLHA LRYHMVGRRV 180 

LTDELKHGMT LTSMYQNSNI QIHHYPNGIV TVNCARLLKA DHHATNGWH LIDKVISTIT 240 

NNIQQUEIE DTFETLRAAV AASGLNTMLE GNGQYTLLAP TNEAFEKIPS ETLNRILGDP 300 

EALRDLLNNH ILKSAMCAEA IVAGLSVETL EGTTLEVGCS GDMLTINGKA IISNKDILAT 360 

40 NGVIHYIDEL LIPDSAKTLF ELAAESDVST AIDLPRQAGL GNHLSGSERL TLLAPLNSVF 420 

KDGTPPIDAH TRNLLRNHII KDQLASKYLY HGQTLETLGG KKLRVFVYRN SLCIENSCIA 480 

AHDKRGRYGT LFTMDRVLTP PMGTVMDVLK GDNRFSMLVA AIQSAGLTET LNREGVYTVP 540 

APTNEAFRAL PPRERSRLLG DAKELANILK YHIGDEILVS GGIGALVRLK SLQGDKLEVS 600 

LKNNWSVNK EPVAEPDIMA TNGWHVITN VLQPPANRPQ ERGDELADSA LEIFKQASAF 660 

45 SRASQRSVRL APVYQKLLER MKH 683 

Seq ID NO: C294 Protein Sequence 
Protein Accession ft: NP_006527.1 

50 i 11 21 31 41 51 

I 1 1/1 I I 

MTQRSIAGPI CNLKFVTLLV AI*SSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 
IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 
GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VPVHEWAHLR WGVFDEYNND 180 
55 KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPOENCIISK LFKEGCTFIY NSTQNATASI 240 
MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFBHSFP MNGTELPPPP 300 
TPSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLK2CGF EWEKLNGKA YGSVMILVTS 420 
GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 
60 SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 
AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPBTGD PVTLRLLDDG 660 
AGADVIKNDG IYSRYFPSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIPTFS 840 
PQISTNGPEH QPNGBTHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 943 



65 



„ Seq ID NO: C295 Protein Sequence 
70 Protein Accession S: Eos sequence 



75 
80 



1 11 21 31 41 51 

I I I 1 I I 

MKFLLILLLO ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHP LGLKVTGQLD TSTLEMMHAP RCGVPDVHHP REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV HSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMPPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSIiWPTLPS GIBAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

BPNYPKSIHS PGFPNFVKKI DAAVPNPRFY RTYFPVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFB YDPLLQRITK TLKSNSWFGC . 470 

Seq ID NO: C296 Protein Sequence 
Protein Accession ft: Eos sequence 

1351 



WO 03/042661 



1 11 21 31 41 SI 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKP YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLPLTAVHAI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVPLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDSRRQMM DPGYPKIiITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 470 

Seq ID MO: C297 Protein Sequence 
Protein Accession #i NP_008883.1 

1 11 21 31 41 SI 

I I I I I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 

FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLHLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 

LCWTFWVLLG TMFYWSRIEY 260 

Seq ID NO: C298 Protein Sequence 
Protein Accession 8: NP_001784.2 

1 11 21 31 41 51 

I I I I I 1 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEFGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSL.KER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK I FYS I TG PGA DSPPEGVFAV EKETGWLLLK KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVUCL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA HDNGSPPTTG TGTLLLTLID S40 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AGLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP VfKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLKR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 

DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 829 

Seq ID NO: C299 Protein Sequence 
Protein Accession 8: NPJ>05620.1 

1 11 21 31 41 51 

I I I I I I 

MAKKSAENGI YSVSGDEKXG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 60 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 120 

INVWNICPLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLPKA TCGHTWNTPD 1B0 

CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGII YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALIKSG TSFFAGFWF 360 

SILGFMAAEQ GVHISKVAES GPGLAFIAYP RAVTLMPVAP LWAALFFFML LLLGLDSQFV 420 

GVEGFITGIiL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLLWQAFWE CWVAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNW 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL IiRAKGTMAER WQHLTQPIHG 600 

LHHLEYRAQD ADVRGLTTLT PVSESSKVW VESVM 635 

Seq ID NO: C300 Protein Sequence 
Protein Accession #: NPJJ06S07.1 

1 11 21 31 41 51 

I I I I I I 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEPYNQTWVH RYGESILPTT 60 
LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM GFSKLGKSFE 120 
MLILGRFIIG VYCGLTTGFV PMYVGEVSPT APRGALGTLH QLGIWGILI AQVFGLDSIM 180 
GNKDLWPLLL SIIFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240 
LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAWLQL SQQLSGINAV FYYSTSIFEK 300 
AGVQQPVYAT IGSGIVNTAF TWSLFWER AGRRTIiHLIG LAGMAGCAIL MTIALALLEQ 360 
LPWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNFIVGM 420 
CFQYVEQLOG PYVFIIFTVL LVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480 
BLFHPLGADS QV 492 

Seq ID NO: C301 Protein Sequence 
Protein Accession #: XP_035292.2 

1 11 21 31 41 51 

I I I I I I 

MAGAGPKRRA LAAPAAEEKB EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60 
GTIIGSGIFV TPTGVLKEAG SPGLALWHA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 
LEVYGSLPAF LKLMIELLII RPSSQYIVAL VFATYLLKPL PPTCPVPEEA AKLVACLCVL 180 
LLTAVNCYSV KAATRVQDAP AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 



1352 
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GNIVLALYSG LPAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 
STBQMLSSEA VAVDFGNYHL GVMSNIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 
SILSMIHPQL LTPVPSLVFT CVHTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 
RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 
KPKWLLQGIF STTVLCQKLM QWPQBT 

Seq ID NO: C302 Protein Sequence 
Protein Accession #: NP 00S2S9.1 



11 



21 



31 41 51 

III 
MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRKREAHG ENSGRLYLNP 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SPYPKYILPP WKCHADPCP NIVDCFISKP 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: C303 Protein Sequence 
Protein Accession #: NP 005121.1 



41 



51 



1 11 21 31 

I I I I 1 i 

MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKF 
VTKDQANCRW AATEQBEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKGVARNfc 
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 
PSSLAVTQTM ATKAPBCVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC 

Seq ID NO j C304 Protein Sequence 
Protein Accession #: AAH22542 

1 11 21 31 41 SI 

I I I i I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI RQDIPAGLYV DPYELASLRE 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 
ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 



Seq ID NO: C305 Protein Sequence 
Protein Accession #: NP 004985.1 



MSLWQPLVLV 
RGESKSLGPA 
ITYWIQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 



PMYRFTEGPP 
PTAGPTGPPS 



LDKLGLGADV 
THDVFQYREK 



11 
! 

LXiVLGCCFAA 
LLLLQKQLSL 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYFCQDRFYW 



21 

! 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHFDDDELH 
NYDTDDRFGF 



HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 



31 
I 

FPGDLRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGVWPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



41 
I 

DRQLAEEYLY 
VPDLGRFQTF 
SRDADIVIQF 
RFGNADGAAC 
GNADGKPOQF 
NSAGELCVFP 



51 
I 

RYGYTRVAEM 



PTAPPTVCPT 
FDAIAEIGNQ 
FFSGRQVWVY 
MVDPRSASEV 
ILQCPED 



GVAEHGDGYP 
HFPFIFEGRS 
PFIFQGQSYS 
FTFLGKEYST 
HSSVPEALMY 
GPPTVHPSER 
LYLFKDGKYW 
TGASVLGPRR 
DRMFPGVPLD 



Seq ID NO: C306 Protein Sequence 
Protein Accession #: NP 000204 



KAGPRPSPWA 
CNTQABLLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAPNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSliYT 
LKRAEEWVR 
LLPLLALLLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKOVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQL»SNPKFGA 
RKIHFNWLPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
EELDLRRVTW 



11 
I 

RLLLAALISV 
GCQRESIWM 
DliYILMDFSN 
KEPHPNSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
NCSTGSLSDI 



DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
RLPPELIPRL 



21 
I 

SLSGTLANRC 
ESSFQITEET 
SMSDDIjDNIiK 
FSFKNVISLT 
FSTBSAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
SDGLKMDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACLALLPCCN 
PGFATHAASI 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRLAFNV 
RMLLIENLRE 
EDYDSFLMYS 



31 
I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
EDVDEFRNKL 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGSGEOQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEPQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMIiSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
EAPTAPRTTA 



41 
I 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 
MSRNDERCHL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 



300 
360 
420 
480 
507 



60 

120 

180 

240 

273 



60 
120 
180 
234 



60 
120 
180 
225 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

707 



51 
I 

CTDEMFRDRR 
RPGEERHFEL 
FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEEVDG 
NGDFVOGQCV 
YEGQFCEYDN 



HKKKDCPPGS 
HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDOVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
ASPAETNGBI 
RNGAGWGPER 
QRPSVSDDTG 
ARAGRAAAVP 



CNFKVKMVDE 
FWWLIPLLLL 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
OGWKFEPLLG 
RSATPGPPGE 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



1353 



WO 03/042661 



PCT/US02/36810 



HLVNGRMDFA FPGSTKSLHR NTTTSAAAYG THLSPHVPKR VLSTSSTLTR DYNSLTRSEH 1500 

SHSTTLPRDY STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCBRPLQGY 1560 

SVEYQLLNGG ELHRLNIPNP AQTSVWEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES 1620 

5QVHPQSPLCP LPG5AFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM 1680 

AQGGGPATAF RVDGDSPESR LTVPGLSENV PYKFKVQART TEGPGPEREG IITIESQDGG 1740 

PFPQLGSRAG LFQHPLQSEY SSITTTHTSA TEPFLVDQLT LGAQHLEAOG SLTRHVTQEF 1800 

VSRTLTTSGT LSTHMDQQFF QT 1822 

Seq ID NO: C307 Protein Sequence 
1U Protein Accession 8: NP_076404.1 

1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES RNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIPVASILLN 60 

ID GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITPTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRXRKH NQSIRVWAV FPTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 

FliSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 358 



20 



Seq ID NO: C308 Protein Sequence 
Protein Accession »: NP 065840.1 



0 c 1 11 21 31 41 51 

MVWCLG1AVL SLVISQGADG RGKPEWSW GRAEESWLG CDLLPPAGRP PLHVIEWLRF 60 

GFLLPIFIQF GLYSPRIDPD YVGRVRLQKG ASLQIEGLRV EDQGWYECRV FFLDQHIPED 120 

DFANGSWVHL TVNSPPQFQE TPPAVLEVQE LEPVTLRCVA RGSPLPHVTW KLRGKDLGQG 180 

QGQVQVQNGT LRIRRVERGS SGVYTCQASS TEGSATHATQ LLVLGPPVIV VPPKNSTVNA 240 

JU SQDVSLACHA EAYPANLTYS WFQDNINVFH ISRLQPRVQI LVDGSLRLLA TQPDDAGCYT 300 

CVPSNGLLHP PSASAYLTVL CMPGVIRCPV RANPPLLFVS WTKDGKALQL DKFPGWSQGT 360 

EGSLI IALGN EDALGEYSCT PYNSLGTAGP SPVTRVLLKA PPAFIERPKE EYFQEVGREL 420 

LIPCSAQGDP PPWSWTKVG RGLQGQAQVD SNSSLILRPL TKEAHGHWEC SASNAVARVA 480 

~r TSTNVYVLGT SPHWTNVSV VALPKGANVS WEPGFDGGYL QRFSVWYTPL AKRPDRMHHD 540 

3D WVSLAVPVGA AHLLVPGLQP HTQYQFSVLA QNKLGSGPFS EIVLSAPEGL PTTPAAPGLP 600 

PTEIPPPLSP PRGLVAVRTP RGVLLHWDPP ELVPKRLDGY VLEGRQGSQG WEVLDPAVAG 660 

TETELLVPGL IKDVLYEFRL VAFAGSFVSD PSNTANVSTS GLEVYPSRTQ LPGLLPQPVL 720 

AGWGGVCFL GVAVLVSILA GCLLNRRRAA RRRRKRLRQD PPLIFSPTGK SAAPSALGSG 780 

SPDSVAKLKL QGSPVPSLRQ SLLWGDPAGT PSPHPDPPSS RGPLPLEPIC RGPDGRFVMG 840 

4U PTVAAPQERS GREQAEPRTP AQRLARSFDC SSSSPSGAPQ PLCIEDISPV APPPAAPPSP 900 

LPGPGPLLQY LSLPFFREMN VDGDWPPLEE PSPAAPPDYM DTRRCPTSSF LRSPETPPVS 960 

PRESLPGAW GAGATAEPPY TALADWTLRE RLLPGLLPAA PRGSLTSQSS GRGSASFLRP 1020 

PSTAPSAGGS YLSPAPGDTS SWASGPERWP RREHWTVSK RRNTSVDENY EWDSEPPGDM 1080 

. _ ELLETLHLGL ASSRLRPEAE TELGVKTPEE GCULNTAHVT GPEARCAALR EEFLAFRRRR 1140 

43 DATRARLPAY RQPVPHPEQA TLL 1163 

Seq ID NO: C309 Protein Sequence 
Protein Accession #: 

50 1 li 21 31 41 51 

I I I I 1 I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 60 

KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 120 

_ - PAQDLFDYIT EKGPLGEGPS RCFFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 180 

33 IDPGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGILLYD MVCGDIPFER 240 

DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 300 

RPCPFGLVLA TLSLAWPGLA PNGQKSHPMA MSQG 334 

^ n Seq ID NO: C310 Protein Sequence 
OU Protein Accession #: NP_002501.l 

1 11 21 31 41 51 

I I I I I I 

^ MECLYYFLGF LLLAARLPLD AAXRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

03 VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAKSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFETL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

„ HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLBLS RIPDENOQIN 360 

/U RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTOQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSIALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVF 540 

FPGNQEKDPL LKNQEFKGVS 560 

75 Seq ID NO: C311 Protein Sequence 
Protein Accession th Eos seq 

1 11 21 31 41 51 

OA I i I I 1 ! 

OU MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS BHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPPILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCBVLTKQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

1354 



J 

WO 03/042661 



PCT/US02/36810 



TGKEEIHEAV CSSEPENVQA DPENYTSLLV 
HBPLTDGYQD LGAILNNLLP NMSYVLQIVA 
LIGTEEIIKE EEEGKDIBEG AIVNPGRDSA 
RSPTRGSEPS GKGDVPNTSL NSTSQPVTKL 
5 GSKTVLRSPH MNLSGTAESL NTVSITEYEE 
ENISQGYIFS SENPETITYD VLIPBSAHNA 
TAQPDVGSGR ESPLQTNYTE IRVDESEKTT 
TEVTPHAPTP SSRQQDLVST VNWYSQTTQ 
LVIVSALTFI CLWLVGILI YWRKCFQTAH 

10 IKHFPKHVAD LHASSGPTEE FEBVQSCTVD 
KLAQLAEKDG KLTDYINANY VDGYNRPKAY 
NLVEKGRRKC DQYWPADGSE EYGNFLVTQK 
GRWTQYHYT QWPDMGVPEY SLPVLTFVRK 
SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ 

15 YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ 
SSLSGEGTDY INASYIMGYY QSNEFIITQH 
AEDEPVYWPN KDEPINCESF KVTLMAEEHK 
CPKWPNPDSP ISKTFBLISV IKEEAANRDG 
SVDVYQVAKM INLMRPGVFA DIEQYQFLYK 

20 AESLESLV 



TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

ICTNGLYGKY SDQLIVDMPT DNPELDLPFE 420 

TNQIRKKBPQ ISTTTHYNRI GTKYNEAKTN 480 

ATEKDISLTS QTVTBLPPHT VEGTSASLND S40 

ESLLTSFKLD TGAEDSSGSS PATSAIPPIS 600 

SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

KSFSAGPVMS QGPSVTDLEM PHYSTPAYFP 720 

PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

AAYAKRHAVG PVWHCSAGV GRTGTYIVLD 1080 

TEEQYVFIHD TLVEAILSKB TEVLDSHIHA 1140 

SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

PLLHTIKDFW RMIWDHNAQL WMIPDGQNM 1260 

CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

PMIVHDEKGG VTAGTFCALT TLMHQLEKEN 1380 

VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 

1448 



Seq ID NO: C312 Protein Sequence 
Protein Accession «: XP 031379 



25 

30* 

35 

40 

45 
50 
55 
60 
65 



MRILKRPLAC 
QSPINIDEDL 
PKASKITPHW 
ILFEVGTEEN 
TDTVDWIVPK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEPS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMFSQVEP 
SLPSGPSHIP 
PVSVAEFTYT 
LNASLQBTSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKL 
PHRDGSVTST 
KCMSCSSYRE 
SPGKSPSANG 
ENETSTDFSF 
HESRIGLAEG 
VISTPPTPIF 



11 

I * 

IQLLCVCRLD 
TQVNVNLKKL 
GKOJMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVPIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 
VHDEHGGVTA 
SLVSTRQEEN 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLLFPSKATS 
SQEKVMNDSD 
LSQKHNDGKE 
ADTNEKDADG 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
BAILS KETEV 
KNRTSSIIPV 
WDHNAQLWM 
ILEATQDDYV 
GTFCALTTLM 
PSTSLDSNGA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
SSGPEPSYAL 
ASLLQPTHAL 
LSKSEIIYGN 
GSLAHTTTKV 
LSPSTQLIiFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
THENSLMDQN 
ENDIQTGSAL 
ILAAGDSEIT 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHXHAYVN 
ERSRVGISSL 
IPDGQNMAED 
LEVRHFQCPK 
HQLEKENSVD 
ALPDGNIAES 



31 
I 

KLVEEIGWSY 
ENTPIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVL 



BTELQIPSFN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTF 
GLVGGGEDGD 
NPISYSLSEN 
LPLSPESKAW 
PGFPQSPTSS 
CLWLVGILI 
LHASSGPTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYXNA 
BFVYWPNKDE 
WPNPDSPISK 
VYQVAKMINL 
LESLV 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV 
SDSEFLLPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVFDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEBSG 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPB 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPPIS 
NVWFPSSTDI 
PHYSTPAYFP 
VTPLLLDNQI 
LFRHLHTVSQ 
EFGSESGVLY 
HDSVGVTYQG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 
SSDSQTGMDR 
SGQGTSDSLN 



YWRKCFQTAH 
FETLKEFYQB 
DYINANYVDG 
WPADGSBEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLBKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
TFELISVIKE 
MRPGVFADIE 



FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EPIITQHPLL 
LMABEHKCLS 
EAANRDGPMI 
QYQFLYKVIL 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2315 



Seq ID NO: C313 Protein Sequence 
Protein Accession ftt NP_002842 

70 1 11 21 31 41 51 

MRILKRPLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKOJMSSDGS EHSLEGQKFP LEMQIYCPDA DRFSSFEEAV KGKGKLRALS 180 

75 ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPB 420 

LIGTEEIIKE EEEGKDIBEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

80 RSPTRGSEPS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPPIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYPP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESBKKAVI 780 
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10 



45 



75 



PLVIVSALTP ICLWLVGIL IYWRKCFQTA HPYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE BFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAB DFWRMIWEHN 960 

VEVTVMITNL VEKGRRKCDQ YWPADGSEBY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV N ALL I PGP AG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

MIPDGQMMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLI IQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 

AALPDGNIAE SLESLV 1456 



Seq ID NO: C314 Protein sequence 
15 Protein Accession fc: Eos sequence 

1 11 21 31 41 SI 

I 1 I ' I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

20 QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGV6EMV 120 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 180 

LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 240 

PCTDTDWIVF KDTVSISESQ LAVFCEVLTM QQSGYVMLMD YLQNNFREQQ YKFSRQVFSS 300 

YTGKEEIHEA VCSSEPENVQ ADPENYTSLL VTWERPRWY DTMIEKFAVL YQQLDGEDQT 360 

25 KHEFLTDGYQ DLGAILNNLL PNMSYVLQIV AICTNGLYGK YSDQLIVDMP TDNPELDLFP 420 

ELIGTEEIIK EEEEGKDIEE GATVNPGRDS ATNQIRKKEP QISTTTHYNR IGTKYNEAKT 480 

NRSPTRGSEF SGKGDVPNTS LNSTSQPVTK LATEKDISLT SQTVTELPPH TVEGTSASLN 540 

DGSKTVLRSP HMNLSGTAES LNTVSITEYE EESLLTSFKL DTGAEDSSGS SPATSAIPFI 600 

SENISQGYIF SSENPETITY DVLIPESARN ASEDSTSSGS EESLKDPSME GNVWFPSSTD 660 

30 ITAQPDVGSG RESFLQTNYT EIRVDESEKT TKSPSAGPVM SQGPSVTDLE MPHYSTFAYF 720 

PTEVTPHAFT PSSRQQDLVS TVNWYSQTT QPVYNEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLWLVGIL IYWRKCFQTA HPYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFBTLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

35 VEVIVMITNL VEKGRRKCDQ YWPADGSBBY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

40 MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLI IQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 

AALPDGNIAE SLESLV 1456 



Seq ID NO: C315 Protein Sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

JO MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

55 TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

rA GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

60 ENISQGYIFS SENPBTITYD VLIPESARNA 8EDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESBKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

^ IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

65 AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV BKGRRKCDQY WPADGSEBYG NPLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTPVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

70 SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNKAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 

RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIABSL ESLV 1494 



Seq ID NO: C316 Protein Sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

80 | j I i i i 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

1356 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



TDTVDWIVF1C 
TGKEEIHEAV 
HEFLTDGYQD 
ESRIGLAEGL 
ISTPPTPIFP 
ADSSNHPDMK 
GPLKSTAEDP 
LAYYTVRNFT 
KRHAVGPVW 
YVFIHDTLVE 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAQ 
LVSTRQEENP 



DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
ESEKKAVIPL 
ISDDVGAIPI 
HKNRYINIVA 
WRMIWEHNVE 
LRNTKIKKGS 
HCSAGVGRTG 
AILSKETEVL 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSKGAA 



AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
VTVSALTFIC 
KHFPKHVADL 
YDHSRVKLAQ 
VIVMITNLVB 
QKGRPSGRW 
TYIVLDSMLQ 
DSHIHAYVNA 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
LWLVGILIY 
HASSGFTEEF 
LAEKDGKLTD 
KGRRKCDQYW 
TQYHYTQWPD 
QIQHEGTVNI 
LLIPGPAGKT 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
BSL 



LQNNPREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
WRKCFQTAHF 
ETLKEFYQEV 
YINANYVDGY 
PADGSEEYGN 
MGVPEYSLPV 
FGFLKHIRSQ 
KLEKQFQLLS 
YIMGYYQSNE 
IKCESFKVTL 
PELISVIKEE 
RPGVFADIEQ 



KFSRQVFSSY 
QQLDGEDQTK 
DNPEASNSSH 
YLEDSTSPRV 
QSCTVDLGIT 
NRPKAYIAAQ 
FLVTQKSVQV 
LTFVRKAAYA 
RNYLVQTEEQ 
OSNIQQSDYS 
FIITQHPLLH 
MAEEHRCLSN 
AANRDGPMIV 
YQFLYKVILS 



Seq ID NO: C317 Protein Sequence 
Protein Accession Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFIP 
LVIVSALTFI 
IKHFPKHVAD 
AHDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
SRVARTILLS 
YIMGYYQSNE 
IKCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



11 
1 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKA1IDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



QKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
BSFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV 
YQFLYKVILS 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNPREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPRT 
TGAEDSSGSS 



KSFSAGPVMS 



LVSTRQEENP 



FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDB 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINTV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
BSL 
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600 
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Seq ID NO: C318 Protein Sequence 
Protein Accession #: Eos sequence 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
DGRNFLCSDN 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
BSFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 



QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDB 
PYAPTRKRKF 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPBYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
RGCLPGSQDD 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNPREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTBLPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 



FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 
QSDEARSLC 



VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 



51 
I' 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINTV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSIIPV 
WDHNAQLWM 
ILEATQAWRS 



Seq ID NO: C319 Protein Sequence 
Protein Accession #: XP 002914.4 



51 



1 11 21 31 41 

I I I I I 1 

MKDIDIGKEY IIPSPGYRSV RBRTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 
LDASMHSQLR ILDBEHPKGK YKHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 
VAHKKGELSM EDVWSLSKHB SSDVNCRRLB RLWQEELNBV GPDAASLRRV VWIFCRTRLI 

1357 
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1260 

1320 

1359 



60 

120 

180 



WO 03/042661 



LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TBSNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVPILFY PAMMPASRLT AYPRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQS ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKBKV RQLQRTEHQA 540 

VXiAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLBIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA I SGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DBVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAPSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRA5SRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV PPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDBATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVH.SNDSS RFYAMFAAAE NKVAVKG 1437 

Seq ID NO: C320 Protein Sequence 
Protein Accession #: NP_00S679.1 

1 11 21 31 41 51 

11)111 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALRP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVPILFY PAMMPASRLT AYPRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAPSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA S40 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA I SGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAPSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV PPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDBATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437 

Seq ID NO: C321 Protein Sequence 
Protein Accession 8: NP_005553.1 

1 11 21 31 41 51 

I I I I I I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY PVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVB QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEBW 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM BALISKAQGG DGWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLD9VSRLQG VSDQSFQVEB AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKBEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLRBFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL BRKELBFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 

MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 1193 

8eq ID NOt C322 Protein Sequence 
Protein Accession #: NP_066924.1 

1 11 21 31 41 51 

I I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 
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QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 

LLCCSCPRKT TSYPTPRPYP KPAPSSQKDY V 211 

Seq ID NO: C323 Protein Sequence 
Protein Accession #: AAM77876 

1 11 21 31 41 51 

I I 1 I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IPIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NPTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHPAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYBDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LPLLKLARHS KFGMKGFVLP HKIPLDG 477 

Seq ID NO: C324 Protein Sequence 
Protein Accession fl: NP_00617l.l 

1 ll 21 31 41 51 

I i I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IPIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NPTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHPAPTIT FLESPTSDHH 300 

WCIPPTVKGN PKPALQWPYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYBDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LPLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IBNPQYFGIT NSQLKPDTFV QHIKRHNIVL 540 

KRBLGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 

EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 822 

Seq ID NO: C325 Protein Sequence 
Protein Accession fh Eos sequence 

1 11 21 31 41 51 

I I I i I t 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NPTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWPYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYBDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LPLLKLARHS KFGMKGFVLP HKIPLDG 477 

Seq ID NO: C326 Protein Sequence 
Protein Accession #* NP_570843.1 

1 11 21 31 41 51 

I I I I I I 

MPLKHYIJ.LT, vgcqawgagl AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60 

LNTHITBLNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120 

FQGLDSLESL LLSSNQLLQI QPAHPSQCSN LKBLQLHGNH LEYIPDGAFD HLVGLTKLNL 180 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTPDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAPNGL TELRELSLHT NALQDLDGNV 360 

FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420 

YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 

VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 

IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C 581 

Seq ID NO: C327 Protein Sequence 
Protein Accession 8: NP_002649.1 

1 11 21 31 41 51 

I I I I I I 

MRALLARLLL CVLWSDSKG SNBLHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PBELKFQCGQ KTLRPRPKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCPIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKPEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 
PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 
CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 

SHTKEENGLA L 431 

Seq ID NO: C328 Protein Sequence 
Protein Accession #: XP_087254.l 
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1 11 21 31 41 51 

I I i I I I 

MQFRECSING MKYQEINGRL VPBGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE 
NETELIKEHD LPFKAVSLCH TVQI SNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 
AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILBFDSDR RRMSVIVQAP SGEKLLFAKG 
AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 
REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 
SLSCGHFHRT KNILELINQK SDSECAEQLR QLARRITEDH VIQHGLWDG TSLSIALREH 
EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 
GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 
QPYCLPSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDZSKNRLIi 
SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMF GKWTFGTLVF TVMVITVTVK 
MALETHFWTW INHLVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 
ILMWTCLFL DIIKKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 
LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 

Seq ID NO» C329 Protein Sequence 
Protein Accession fh XP_087461.1 

1 11 21 31 41 51 

I I I I I I 

MLPLLAALLA AACPLPPVRG GAADAPGLLG VPSNASVNAS SAASPSPRGC WPRRPPGPPS 
ARARRRRRRR RRLCNISVQR QMLSSLLVRW GRPRGFQCDL LLFSTNAHGR AFFAAAFHRV 
GPPLLIEHLG LAAGGAQQDL RLCVGCGWVR GRRTGRLRPA AAPSAAAATA GAPTALPAYP 
AAEPPGPLWL QGEPLHFCCL DFSLEELQGE PGWRLNRKPI ESTLVACFMT LVIWWSVAA 
LIWPVPIIAG FLPNGMEQRR TTASTTAATP AAVPAGTTAA AAAAAAAAAA AVTSGVATK 

Seq ID NO: C330 Protein Sequence 
Protein Accession #: XP_OS1522.2 

1 11 21 31 41 51 

I I I I I I 

MDLHLFDYSE PGNFSDISWP CNSSDCIWD TVMCPNMPKK SVLLYTLSFI YIFIFVIGMI 
ANSVWWVNI QAKTTGYDTH CYILNIAIAD LWWLTIPVW WSLVQHNQW PMGELTCKVT 
HLIFSINLFG SIFFLTCMSV DRYLSITYFT NTPSSRKKMV RRWCILVWL LAFCVSLPDT 
YYIiKTVTSAS NNETYCRSFY PEHSIKEWLI GMELVSWLG FAVPFSIIAV FYFLLARAIS 
ASSDQEKHSS RKIIFSYVW FLVCWLPYHV AVLLDIFSIL HYIPFTCRLE HALFTALHVT 
QCLSLVHCCV NPVLYSFINR NYRYELMKAF IFKYSAKTGL TKLIDASRVS ETEYSALEQS 
TK 

Seq ID NO: C331 Protein Sequence 
Protein Accession #: NP_000341.1 

1 ll 21 31 41 51 

I I I I I 1 

MGFVRQIQLL LWKNWTLRKR QKIRFWELV WPLSLFLVLI WLRNANPLYS HHECHFPNKA 60 

MPSAGMLPWL QGIFCNVNNP CFQSPTPGES PGIVSNYNNS ILARVYRDFQ ELLMNAPESQ 120 

HLGRIWTELH ILSQFMDTLR THPERIAGRG IRIRDILKDE ETLTLFLIKN IGfcSDSWYL 180 

LINSQVRPEQ FAHGVPDLAL KDIACSEALL ERPIIPSQRR GAKTVRYALC SLSQGTLQWI 240 

EDTLYANVDF FKLFRVLPTL LDSRSQGINL. RSWGGILSDM SPRIQEFIHR PSMQDLLWVT 300 

RPLMQNGGPE TFTKLMGILS DLLCGYPEGG GSRVLSFNWY EDNNYKAFLG IDSTRKDPIY 360 

SYDRRTTSFC NALIQSLESN PLTKIAWRAA KPLLMGKILY TPDSPAARRI LKNANSTPEE 420 

LEHVRKLVKA WEEVGPQIWY FFDNSTQMNM IRDTLGNPTV KDFLNRQLGB EGITAEAILN 480 

FLYKGPRESQ ADDMANFDWR DIFNITDRTL RLVNQYLECL VLDKFESYND ETQLTQRALS 540 

LLEENMFWAG WFPDMYPWT SSLPPHVKYK IRMDHJWEK TNKIKDRYWD SGPRADPVBD 600 

FRYIWGGFAY LQDMVEQGIT RSQVQAEAPV GIYLQQMPYP CFVDDSFMII LNRCFPIFMV 660 

LAWIYSVSMT VKSIVLEKEL RLKETLKNQG VSNAVIWCTW PLDSPSIMSM SIPLLTIFIM 720 

HGRILHYSDP FILFLFLLAF STATIMLCFL LSTFFSKASL AAACSGVIYF TLYLPHILCF 780 

AWQDRMTAEL KXAVSLLSPV AFGFGTEYLV RFEEQGLGLQ WSNIGNSPTB GDEFSFLLSM 840 

QMMLLDAACY GLIAWYLDQV FPGDYGTPLP WYFLLQESYW LSGBGCSTRE ERALEKTEPL 900 

TEETEDPEHP EGIHDSPFER EHPGWVPGVC VKNLVKIPEP OGRPAVDRLN ITFYENQITA 960 

FLGHNGAGKT TTLSILTGLL PPTSGTVLVG GRDI ETSLDA VRQSLGKCPQ HNILFHHLTV 1020 

AEHMLFYAQL KGKSQEEAQL EMEAMLEDTG LHHKRNEEAQ DLSGGMQRKL SVAIAFVGDA 1080 

KWILDEPTS GVDPYSRRSI WDI*LLKYRSG RTIIMPTHHM DEADHQGDRI AIIAQGRLYC 1140 

. SGTPLFLKNC FGTGLYLTLV RKMKNIQSQR KGSEGTCSCS SKGFSTTCPA HVDDLTPEQV 1200 

LDGDVNELMD WLHHVPEAK LVECIGQELI FXtLPNKNFKH RAYASLFREL EETLADLGLS 1260 

SFGISDTPLB EIFLKVTEDS DSGPLFAGGA QQKRENVNPR HPCLGPREKA GQTPQDSNVC 1320 

SPGAPAAHPE GQPPPEPECP GPQLNTGTQL VLQHVQALLV KRFQHTIRSH KDFLAQIVLP 1380 

ATFVFLALML SIVILPFGEY PALTLHPWIY GQQYTFFSMD EPGSEQFTVL ADVLLNKPGF 1440 

GNRCLKEGWL PEYPCGNSTP WKTPSVSPNI TQLFQKQKHT QVNPSPSCRC STREKLTMLP 1S00 

ECPEGAGGLP PPQRTQRSTE ILQDLTDRNI SDFLVKTYPA LIRSSLKSKP WVNEQRYGGI 1560 

SIGGKLPWP ITGEAiVGFL SDLGRIMNVS GGPITREASK EIPDFLKHLE TEDNIKVWFN 1620 

NKGHHALVSF LNVAHNAILR ASLPKDRSPB EYGITVISQP LNLTKEQLSE ITVLTTSVDA 1680 

WAICVIPSM SFVPASFVLY LIQERVNKSK HLQFISGVSP TTYWVTNPLW DIMNYSVSAG 1740 

LWGIPIGFQ KKAYTSPENlr PALVALLLLY GVfAVIPMMYP ASFLFDVPST AYVALSCANL 1800 

PIGINSSAIT FILELFDNNR TLLRFNAVLR KLLIVFPHFC LGRGLIDLAL SQAVTDVYAR I860 

FGEEHSANPF HWDLIGKNLF AMWEGWYF LLTLLVQRHF FLSQWIAEPT KEPIVDEDDD 1920 

VAEERQRIIT GGNKTDILRL HELTKIYLGT SSPAVDRLCV GVRPGECFGL LGVNGAGKTT 1980 

TFKMLTGDTT VTSGDATVAG KSILTNISEV HQNMGYCPQF DAIDELLTGR EHLYLYARLR 2040 

GVPAEEIEKV ANWSIKSLGL TVYADCLAGT YSGGNKRKLS TAIALIGCPP LVLLDEPTTG 2100 

KDPOARRMLH NVIVSIIRKG RAWLTSHSM BECEALCTRL AIMVKGAFRC MGTIQHLKSK 2160 

FGDGYIVTMK IKSPKDDLLP DLNPVBQFFQ GNFPGSVQRE RHYNMLQFQV SSSSLARIFQ 2220 

LLLSHKDSLL IEBYSVTQTT LDQVFVNFAK QQTESHDLPL HPRAAGASRQ AQD 2273 
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Seq ID NO: C332 Protein Sequence 
Protein Accession B: NP_006662.2 

5 1 11 21 31 41 51 

I I I I I I 

MVPHAILARG RDVCRRNGLL ILSVLSVIVG CLLGFFLRTR RLSPQEISYP QFPGELLMRM 60 

LKMMILPLW SSIHSGLASL DAKTSSRLGV LTVAYYLWTT FMAVIVGIFM VSIIHPGSAA 120 

QKETTEQSGK PI MS SAD ALL DLIRNMFPAN LVEATPKQYR TKTTPWKSP KVAPEEAPPR 180 

10 RILIYGVQEB NGSHVQNFAL DLTPPPEWY KSBPGTSDGM NVLGIVFFSA TMGIMLGRKG 240 

DSGAPLVSFC QCLKESVMKI VAVAVWYFPF GIVFLIAGKI LEMDDPRAVG KKLGFYSVTV 300 

VCGLVLHGLF ILPLLYFFIT KKNPIVFIRG I LQ ALL I ALA TSSSSATLPI TFKCLLENNH 360 

IDRRIARFVL PVGATINMDG TALYBAVAAI FIAQVNNYEL DFGQIITISI TATAASIGAA 420 

GIPQAGLVTM VIVLTSVGLP TDDITLIIAV DWALDRFRTM INVLGDALAA GIMAHICRKD 480 

15 FARDTGTEKL LPCETKPVSL QEIVAAQQNG CVKSVAEASE LTLGPTCPHH VPVQVERDEE 540 

LPAASLNHCT IQISELETNV 560 



20 



50 



60 



Seq ID NO: C333 Protein Sequence 
Protein Accession 8: NP 005680.1 



1 11 21 31 41 51 

i I I I 1 I 

MVTVGNYCEA EGPVGPAWMQ DGLSPCFFFT LVPSTRMALG TLALVLALPC RRRERPAGAD 60 

SLSWGAGPRI SPYVLQLLLA TLQAALPLAG LAGRVGTARG APLPSYLLLA SVLESLAGAC 120 

25 GLWLLWERS QARQRLAMGI WIKFRHSPGL LLLWTVAFAA ENLALVSWNS PQWWWARADL 180 

GQQVQFSLWV LRYWSGGLP VLGLWAPGLR PQSYTLQVHE EDQDVERSQV RSAAQQSTWR 240 

DFGRKLRLLS GYLWPRGSPA LQLWLICLG LMGLERALNV LVPIFYRNIV NLLTEKAPWN 300 

SLAWTVTSYV FLKFLQGGGT GSTGFVSNLR TFLWIRVQQF TSRRVELLIF SHLHELSLRW 360 

HLGRRTGEVL RIADRGTSSV TGLLSYLVFN VIPTLADIII GIIYFSMFFN AWFGLIVFLC 420 

30 MSLYLTLTIV VTEWRTKFRR AMNTQENATR ARAVDSLLNF ETVKYYNAES YEVERYREAI 480 

IKYQGLEWKS SASLVLLNQT QNLVIGLGLL AGSLLCAYFV TEQKLQVGDY VLFGTYIIQL 540 

YMPLNWFGTY YRMIQTNFID MENMFDLLKE ETEVKDLPGA GPLRPQKGRI EFENVHFSYA 600 

DGRETLQDVS FTVMPGQTLA LVGPSGAGKS TILRLLFRFY DISSGCIRID GQDISQVTQA 660 

SLRSHIGWP QDTVLFNDTI ADNIRYGRVT AGNDEVEAAA QAAGIHDAIM AFPEGYRTQV 720 

35 GERGLKLSGG EKQRVAIART ILKAPGIILL DEATSALDTS NERAIQASLA KVCANRTTIV 780 

VAHRLSTWN ADQILVIKDG CIVBRGRHEA LLSRGGVYAD MWQLQQGQEE TSEDTKPQTM 840 

BR 842 

Seq ID NO: C334 Protein Sequence 
40 Protein Accession #: NPJ>00667.1 

1 11 21 31 41 51 

I I I I I I 

MLLETQDALY VALELVIAAL SVAGNVLVCA AVGTANTLQT PTNYFLVSLA AADVAVGLFA 60 

45 IPFAITISLG FCTDFYGCLF LACFVLVLTQ SSIFSLLAVA VDRYLAICVP LRYKSLVTGT 120 

RARGVIAVLW VLAFGIGLTP FLGWNSKDSA TNNCTEPWDG TTNESCCLVK CLFENWPMS 180 

YMVYFNFFGC VLPPLLIMLV IYIKIFLVAC RQLQRTELMD ESRTTLQREI HAAKSLAMIV 240 

GIFALCWLPV HAVNCVTLFQ PAQGKNKPKW AMNMAILLSH ANSWNPIVY AYRNRDFRYT 300 

FHKIISRYLL CQADVKSGNG QAGVQPALGV GL 332 



Seq ID NO: C33S Protein Sequence 
Protein Accession #: NP 443164 



1 11 21 31 41 51 

55 1 | | | 1 | 

MGLGARGAWA ALLLGTLQVL ALLGAAHESA AMAETLQHVP SDHTNETSNS TVKPPTSVAS 60 

DSSNTTVTTM KPTAASNTTT PGMVSTNMTS TTLKSTPKTT SVSQNTSQIS TSTMTVTHNS 120 

SVTSAASSVT ITTTMHSEAK KGSKFDTGSF VGGIVLTLGV LSILYIGCKM YYSRRGIRYR 180 

TIDEHDAII 189 



Seq ID NO: C336 Protein Sequence 
Protein Accession #: NP_004186.1 



1 11 21 31 41 51 

65 | | i | | | 

MAQHGAMGAF RALGGLALLC ALSLGQRPTG GPGCGPGRLL LGTGTDARCC RVHTTRCCRD 60 

YPGBECCSEW DCMCVQPEFH CGDPCCTTCR HHPCPPGQGV QSQGKFSFGF QCIDCASGTF 120 

SGGHEGHCKP WTDCTQFGFL TVPPGNKTHN AVCVPGSPPA EPLGWLTWL LAVAACVLLL 180 

„ T3AQLGLHIW QLRSQCMWPR ETQLLLEVPP STBDARSCQF PBEBRGERSA EEKGRLGDLW 240 

70 V 241 

Seq ID NO: C337 Protein Sequence 
Protein Accession ft: BAC03767.1 

75 1 11 21 31 41 51 

I I I I I I 

MGCDGRVSGL LRRNLQPTLT YWSVPPSFGL CIAFLGPTLL DLRCQTHSSL PQISWVFFSQ 60 

QLCLLLGSAL GGVFKRTLAQ SLWALFTSSL AISLVFAVIP FCRDVKVLAS VMALAGLAMG 120 

OA CIDTVANMQL VRMYQKDSAV FLQVLHFFVG FGALLSPLIA DPFLSEANCL PANSTANTTS 180 

SO RGHLFHVSRV LGQHHVDAKP WSNQTFPGLT PKDGAGTRVS YAFWIMALID LPVPMAVLML 240 

L8KERLLTCC PQRRPLLLSA DELALBTQPP BKEDASSLPP KFQSHLGHED LFSCGQRKNL 300 

RGAPYSPFAI HITGALVLFM TDGLTGAYSA FVYSYAVEKP LSVGHKVAGY LPSLFWGFIT 360 

LGRLLSIPIS SRMKPATMVF INWGVWTP LVLLIFSYNV VFLPVGTASL GLFLSSTFPS 420 

MLAYTEDSLQ YKGCATTVLV TGAGVGEMVL QMLVGSIFQA QGSYSFLVCG VIFGCLAFTF 480 
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YILLLFFHRM HPGLPSVPTQ DRSIGMENSE CYQR 514 

Seq ID NO: C338 Protein Sequence 
^ Protein Accession ft: NP_0 02 194.1 

1 11 21 31 41 51 

1)1111 

MGPERTGAAP LPLLLVLALS QGILNCCLAY NVGLPEAKIF SGPSSEQFGY AVQQPINPKG 60 

NWUiVGSPWS GFPENRM6DV YKCPVDLSTA TCEKLNLQTS TSIPNVTEMK TNMSLGLILT 120 

10 RNHGTGGFLT CGPLWAQQCG NQYYTTGVCS DISPDFQLSA SFSPATQPCP SLLDWWCD 180 

ESNSIYPWDA VKNFLEKFVQ GLDIGPTKTQ VGLIQYANNP RWFNLNTYK TKBEMIVATS 240 

QTSQYGGDLT NTFGAIQYAR KYAYSAASGG RRSATKVMW VTDGESHDGS MLKAVIDQCN 300 

HDNILRFGIA VLGYLNRNAL DTKNLIKEIK AIASIPTERY FFNVSDEAAL LEKAGTLGBQ 360 

IFSIEGTVQG GDKFQMEMSQ VGFSADYSSQ NDILMLGAVG AFGWSGTIVQ KTSHGHLIFP 420 

15 KQAFDOILQD RNHSSYLGYS VAAISTGEST HFVAGAPRAN YTGQIVLYSV NENGNITVIQ 480 

AHRGDQIGSY FGSVLCSVDV DKDTITDVLL VGAPMYMSDL KKBEGRVYLF TIKKGILGQH 540 

QFLEGPEGIE NTRFGSAIAA LSDINMDGFN DVIVGSPLEN QNSGAVYIYN GHQGTIRTKY 600 

SQKILGSDGA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA PGQWQLWSQ SIADVAIEAS 660 

FTPEKITLVN KNAQIILKLC FSAKFRPTKQ NNQVAIVYHI TLDADGFSSR VTSRGLFKEN 720 

20 NERCLQKNMV VNQAQSCPEH IIYIQEPSDV VNSLDLRVDI SLENPGTSPA LEAYSETAKV 780 

FSIPFHKDCG EDGLCISDLV LDVRQIPAAQ EQPFIVSNQN KRLTFSVTLK NKRESAYKTG 840 

IWDFSENLF FASFSLPVDG TEVTCQVAAS QKSVACDVGY PALKREQQVT FTINFDFNLQ 900 

NLQNQASLSF QALSESQEEN KADNLVNLKI PLLYDAEIHL TRSTNINFYE ISSDGNVPSI 960 

VHSFEDVGPK FIFSLKVTTG SVPVSMATVI IHIPQYTKEX NPLMYLTGVQ TDKAGDISCN 1020 

25 ADINPLKIGQ TSSSVSFKSE NFRHTKELNC RTASCSMVTC WLKDVHMKGE YFVNVTTRIW 1080 

NGTFASSTFQ TVQLTAAAEI NTYNPEIYVI EDNTVTIPLM IMKPDEKAEV PTGVIIGSII 1140 

AGILLLLALV AILWKLGFFK RKYEKMTKNP DEIDETTELS S 1181 

Seq ID NO: C339 Protein Sequence 
30 Protein Accession ft: NP 113648.1 

1 11 21 31 41 51 

i i i i i i 

MYRPRARAAP EGRVRGCAVP STVIJiLLAYI* AYIALGTGVF WTLEGRAAQD SSRSFQRDKW 60 

ELLQNFTCLD RPALDSLIRD WQAYKNGAS LLSNTTSMGR WELVGSFFFS VSTITTIGYG 120 

35 NLSPNTMAAR LFCIFFALVG IPLNLWLNR LGHLMQQGVN HWASRLGGTW QDPDKARWLA 180 

GSGALLSGLL LFLIiLPPLLF SHMEGWSYTE GFYFAFITLS TVGFGDYVIG MNPSQRYPLW 240 

YKNMVSLWIL FGMAWLAIiI 1 KLILSQLETP GRVCSCCHHS SKEDFKSQSW RQGPDREPES 300 

HSPQQGCYPE GPMGI IQHLE PSAHAAGCGK DS 332 

40 Seq ID NO: C340 Protein Sequence 
Protein Accession ft: NPJJ04145.1 

1 11 21 31 41 51 

A c I I i I I I 

4j MEWDNGTGQA LGLPPTTCVY RENFKQLLLP PVYSAVLAAG LPLNICVITQ ICTSRRALTR 60 

TAVYTLNLAL ADLLYACSLP LLIYNYAQGD HWPFGDFACR LVRFLFYANL HGSILFLTCI 120 

SFQRYLGICH PLAPWHKRGG RRAAWIiVCVA VWLAVTTQCL PTAIFAATGI QRNRTVCYDL 180 

SPPALATHYM PYGMALTVIG FLLPFAALLA CYCLLACRLC RQDGPAEPVA QERRGKAARM 240 

AWVAAAFAI SFLPFHITKT AYIAVRSTPG VPCTVLEAFA AAYKGTRPFA SANSVLDPIL 300 

50 PYFTQKKFRR RPHELLQKLT AKWQRQGR 328 

Seq ID NO: C341 Protein Sequence 
Protein Accession #: NP_00 9128.1 

55 1 11 21 31 41 SI 

1 I 1 I 1 I 

MQRPGPRLWL VLQVMGSCAA ISSMDMERPG DGKCQPIEIP MCKDIGYNMT RMPNLMGHEN 60 

QREAAIQLHE FAPLVEYGCH GHLRFFLCSL YAPMCTEQVS TPIPACRVMC EQARLKCSPI 120 

MEQFNFKWPD SLDCRKLPNK NDPNYLCMEA PNNGSDEPTR GSGLFPPLFR PQRPHSAQEH 180 

60 , PLKDGGPGRG GCDNPGKFHH VEKSASCAPL CTPGVDVYWS REDKRFAWW LAIWAVLCFF 240 

SSAFTVLTFL IDPARFRYPE RPIIFLSMCY CVYSVGYLIR LPAGAESIAC DRDSGQLYVI 300 

QEGLESTGCT LVFLVLYYFG MASSLWWWL TLTWFIAAGK KWGHEAIEAN SSYFHLAAWA 360 

IPAVKTIIilL VMRRVAGDEL TGVCYVGSMD VNALTGFVLI PLACYLVIGT SFILSGFVAL 420 

FHIRRVMKTG GENTDKLEKL MVRIGLFSVL YTVPATCVIA CYFYERLNMD YWKILAAQHK 480 

65 CKMNNQTKTL DCLMAASIPA VEIFMVKIFM LLWGITSGM WIWTSKTDQS WQQVCSRRLK 540 

KKSRRKPASV ITSGGIYKKA QHPQKTHHGK YEIPAQSPTC V 581 



70 



Seq ID NO: C342 Protein Sequence 
Protein Accession ft: NP 005752.1 



1 11 21 31 41 51 

I I I I I I 

MEVSRRKAPP RPPRPAAPLP LLAYLLALAA PGRGADEPVW RSEQAIGAIA ASQEDGVFVA 60 

SGSCLDQLDY SLEHSLSRLY RDQAGNCTEP VSIAPPARPR PGSSFSKLLL PYREGAAGLG 120 

75 GLLLTGWTFD RGACEVRPLG NLSRNSLRNG TEWSCHPQG STAGWYRAG RNNRWY1AVA 180 

ATYVLPEPET ASRCNPAASD HDTA1ALKDT EGRSLATQEL GRLKLCEGAG SLHFVDAFLW 240 

NGSIYFPYYP YNYTSGAATG WPSMARIAQS TEVLFQGQAS LDCGHGHFDG RRLLI>SSSLV 300 

EALDVWAGVF SAAAGEGQER RSPTTTALCL FRMSEIQARA KRVSWDFKTA ESHCKEGDQP 360 

orv ERVQPIASST LIHSDLTSVY GTWMNRTVL FLGTGDGQLL KVILGENLTS NCPEVIYEIK 420 

80 EETPVFYKLV PDPVKNIYIY LTAGKEVRRI RVANCNKHKS CSECLTATDP HCGWCHSLQR 480 

CTFQGDCVHS ENLENWLDIS SGAKKCPKIQ IIRSSKEKTT VTMVGSFSPR HSKCMVKNVD 540 

SSRELCQNKS QPNRTCTCSI PTRATYKDVS WNVMFSFGS HNLSDRFNFT NCSSLKECPA 600 

CVETGCAWCK SARRCIHPFT ACDPSDYERN QEQCPVAVEK TSGGGRPKEN KGNRTNQALQ 660 

VFYIKSIEPQ KVSTLGKSNV IVTGANFTRA SNITMILKGT STCDKDVIQV SHVLNDTHMK 720 

1362 



WO 03/042661 



PCTYUS02/36810 



PSLPSSRKEM KDVCIQFDGG NCSSVGSLSY IALPHCSLIP PATTWISGGQ NITMKGRNFD 780 

VIDNLIISHE LKGNINVSEY CVATYCGFLA PSLKSSKVRT NVTVKLRVQD TYLDCGTLQY 840 

REDPRPTGYR VESBVDTELE VKIQKENDNF NISKKDIEIT LFHGENGQLN CSFENITRNQ 900 

DLTTILCKIK GIKTASTIAN SSKKVRVKLG NLELYVEQES VPSTWYPLIV LPVLLVIVIF 960 

5 AAVGVTRHKS KELSRKQSQQ LELLESELRK EIRDGFAELQ MDKLDWDSF GTVPFLDYKH 1020 

FALRTFFPES GGFTHIFTED MHNRDANDKN ESLTALDALI CNKSPLVTVI HTLEKQKKFS 1080 

VKDRCLFASF LTIALQTKLV YLTSILEVLT RDLMEQCSNM QPKLMIiRRTE SWEKLLTNW 1140 

MSVCLSGFLR ETVGBPFYLL VTTLNQKINK GPVDVITCKA LYTLNEDWLL WQVPEFSTVA 1200 

LNWFEKIPE NESADVCRNI SVNVLDCDTI GQAKEKIFQA PLSKKGSPYG IX5LNEIGLEL 1260 

10 QMGTRQKELL DIDSSSVILE DGITKLNTIG HYEISNGSTI KVPKKIANFT SDVEYSDDHC 1320 

HLILPDSEAP QDVQGKRHRG KHKPKVKEMY LTKLLSTKVA IHSVLBKLFR SIWSLPNSRA 1380 

PFAIKYFFDP LDAQAENKKI TDPDWHIWK TNSLPLRFWV N1LKNPQFVP DIKKTPHIDG 1440 

CLSVIAQAFM DAFSLTEQQL GKEAPTKKLL YAKDIPTYKE EVKSYYKAIR DLPPLSSSEM 1500 

EEFLTQESKK HENEFNEEVA LTEIYKYIVK YPDEILNKLE RERGLEEAQK QLLHVKVLFD 1560 

15 EKKKCKWM 1S68 

Seq ID NO: C343 Protein Sequence 
Protein Accession ft: NP_002176.1 

20 1 11 21 31 41 51 

I I I i I I 

MTILGTTFGM VFSLLQWSG ESGYAQNGDL EDAELDDYSF SCYSQLEVKG SQHSLTCAP^E 60 
DPDVNTTNLE FEICGALVEV KCLNFRKLQE IYFIETKKFL LIGKSNICVK VGEKSLTCKK 120 
IDLTTIVKPE APFDLSVIYR EGANDFWTF NTSHLQKKYV KVLMHDVAYR QEKDENKWTH 1B0 
25 VNLSSTKLTL LQRKLQPAAM YEIKVRSIPD HYFKGFWSEW SPSYYPRTPE INNSSGEMDP 240 
ILLTISILSF FSVALLVILA CVLWKKRIKP IVWPSLPDHK KTLEHLCKKP RKNLNVSPNP 300 
ESFLDCQIHR VDDIQARDEV EGFLQDTFPQ QLEESEKQRL GGDVQSPNCP SEDVWTPES 360 
PGRDSSLTCXi AGNVSACDAP ILSSSRSLDC RESGKNGPHV YGDLLLSLGT TNSTLPPPPS 420 
LQSGILTLNP VAQGQPILTS LGSNQEEAYV TMSSFYQNQ 459 



30 



40 



Seq ID NO: C344 Protein Sequence 
Protein Accession #: NP 002713.1 



1 11 21 31 41 51 

35 | | | | | I 

HAAARLCLSL LLLSTCVALL LQPLLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINML 60 
TRPRYGKRHK EDTLAFSEWG SPHAAVPREIi SPLDL 95 



Seq ID NO i C345 Protein Sequence 
Protein Accession #: NP_115934.1 



1 11 21 31 41 51 

I I I I I 1 

MTWRHHVRLL FTVSLALQH NLGNSYQREK HNGGREEVTK VATQKHRQSP LNW TSSH FGE 60 

45 VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHPT GRYCBHDQRR 120 

SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DPLASHAHGP SAGGAPSLLL 180 

LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL 223 

Seq ID NO: C346 Protein Sequence 
50 Protein Accession ft: NP J) 0652 4.1 

1 11 21 31 41 51 

I I I I.I I 

MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 60 

55 FLTIHRGQW YVFSKLKGRG RLFWGGSVQG DYYGDLAARL GYPPSSIVRE DQTLKPGKVD 120 

VKTDKWDPYC Q 131 



Seq ID NO: C347 Protein Sequence 
60 Protein Accession ft; Eos sequence 

1 11 21 31 41 51 

! I I I I I 

^ KTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 

65 TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH GVPTGSFTLI 120 

TSRTKLSSIT SEATGNBSHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMSQ IPSPR 235 



70 Seq ID NO i C348 Protein Sequence 
Protein Accession ft: NP_543146.1 

1 11 21 31 41 51. 

75 MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 120 

TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 235 

80 Seq ID NOt C349 Protein Sequence 

Protein Accession ft: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

1363 



WO 03/042661 



PCT/US02/36810 



MWPRLAFCCW GLALVSGWAT FQQMSPSRNF SFRLiFPETAP GAPGSIPAPP APGDEAAGSR 60 

VERLGQAFRR RVRLLRELSE RLELVFLVDD SSSVGBVNFR SELMPVRKLL SDFPWPTAT 120 

RVAIVTFSSK NYWPRVDYI STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL 180 

LHARENSTKV VFL1TDGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGNIRB LNDMASTPKE 240 

5 EHCYLLHSFE EFEALARRAL HEDLPSGSPI QDDMVHCSYL CDEGKDCCDR MGSCKOGTRT 300 

GHFECICEKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 360 

VCREGYRASG QTCELVHCPA LKPPENGYFI QNTCNNHFNA ACGVRCHPGF DLVGSSIILC 420 

LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY RLEGSDKLTC 480 

1A QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGFILSGVKE 540 

10 MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ QDSANVTWQI PTAKDNSGEK 600 

VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 660 

VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDIiFPQGET IVQYTATDPS GNNRTCDIHI 720 

VIKGSPCEIP FTPVNGDFIC TPDNTGVNCT LTCLEGYDFT EGSTDKYYCA YEDGVWKPTY 780 

TTEWPDCAKK RFANHGFKSF EMFYKAARCD DTDLMKKFSE AFETTLGKMV PSFCSDAEDI 840 

15 DCRLEENLTK KYCLEYNYDY ENGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 900 

SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWEKQQ RLLQTLETIT NKLKRTLNKD 960 

PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNL EHFTCBSCRI 1020 

GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSY9GLETCE SCPLGTYQPK 1080 

FGSRSCLSCP ENTSTVKRGA VNISACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAFC 1140 

20 LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLEI»LN CPSEVFHECF FNPCHNSGTC 1200 

QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCLNNGVCKD LVGEFICECP SGYTGQRCEE 1260 

NINECSSSPC LNKGICVDGV AGYRCTCVKG PVGLHCETEV NECQSNPCLN NAVCEBQVGG 1320 

FLCKCPPGFL GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCELNINECQ 1380 

SNPCRNQATC VDELNSYSCK CQPGPSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 1440 

25 HALTCTFKMK SSDDMNYGTP ISYAVDNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 1500 

RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGEGFSPAE 1560 

SFVGSISQLN LWDYVLSPQQ VKSLATSCPE ELSKGNVLAW PDFLSGIVGK VKIDSKSIFC 1620 

SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 1680 

SCGVPPPLEN GFHSADDFYA GSTVTYQCNN GYYLLGD^RM FCTDNGSWNG VSPSCLDVDE 1740 

30 CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKKCAEPIK CKAPGNPENG HSSGBIYTVG 1800 

AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPEKGCI EEIAFTFGSK 1860 

VTYRCNKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENINNGKYIL SGLTYLSTAS 1920 

YSCDTGYSLQ GPSIIECTAS GIWDRAPPAC "HLVFCGEPPA IKDAVITGNN FTFRNTVTYT 1980 

CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDEPPIVD HASPETAHRL FGDIAFYYCS 2040 

35 DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCEKPPS VSYSILESVS KAKPAAGSW 2100 

SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGE PPSIMNGYAS GSNYSFGAMV 2160 

AYSCNKGFYI KGEKKSTCEA TGQWSSPIPT CHPVSCGEPP KVENGFLEHT TGRIFESEVR 2220 

YQCNPGYKSV GSPVFVCQAN RHWHSESPLM CVPLDCGKPP PIQNGFMKGE NFEVGSKVQF 2280 

FCNEGYELVG DSSWTCQKSG KWNKKSNPKC MPAKCPEPPL LENQLVLKEL TTEVGWTFS 2340 

40 CKEGHVLQGP SVLKCLPSQQ WNDSFPVCKI VLCTPPPLIS FGVPIPSSAL HFGSTVKYSC 2400 

VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIIDVQGLAY LSTALYTCKP 2460 

GFELVGNTTT LCGENGHWLG GKPTCKAIEC LKPKEILNGK FSYTDLHYGQ TVTYSCNRGF 2520 

RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYSYGAII IYSCFPGFQV 2580 

AGRAMQTCEE SGWSSSIPTC MPIDCGLPPH IDFGDCTKLK DDQGYFEQED DMMEVPYVTP 2640 

45 HPPYHUGAVA KTWENTKESP ATHSSNFLYG TMVSYTCNPG YELLGNPVLI OQEDGTHKGS 2700 

APSCISIECD LPTAPENGFL RPTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP 2760 

RCEAISCKKP NPVMNGSIKG SNYTYLSTLY YECDPGYVLN GTERRTOQDD KNWDBDEPIC 2820 
IPVDCSSPPV SANGQVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS WSGATPDCVP 2880 
VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILHGAPK LTCQSDGNWD AEIPLCKPVN 2940 
50 CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS 3000 
TPVISYGTVN GTDFDGGXAA RIQCPKGFKZi LGLSEITCEA DGQWSSGFPH CEHTSCGSLP 3060 
MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG VWSQPYPVCE PLSCGSPPSV 3120 
ANAVATGBAH TYESEVKLRC LEGYTMDTDT DTPTOQKDGR WFPERISCSP KKCPLPENIT 3180 
HILVHGDDPS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWB PPFSDBSCSP VSCGKPESPE 3240 
55 HGFWGSKYT FESTIIYQCE PGYELEGNRE RVCQENRQWS GGVAICKETR CETPLEFLNG 3300 
KADIENRTTG PNWYSQJRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VPFVI PENAL 3360 
LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 3420 
GVHYQYGDMI TYSCYSGYML EGFLRSVCLE NGTWTSPPIC RAVCRFPCQN GGICQRPNAC 3480 
SCPEGWMGRI* CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVOQS PCLNGGKCVR 3540 
PNRCHCLSSW TGHNCSR 3557 



60 
65 
70 



Seq ID NO: -C3S0 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR OGDKIYNPLE 60 

QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 

SKCERGRIC 129 

Seq ID NO i C351 Protein Sequence 
Protein Accession ft: AAH35671.1 



„ 1 11 21 31 41 51 

75 I 1 | ill 

MVPGARGGGA LARAAGRGLL ALLLAVSAPL RLQAEELGDG CGHLVTYQDS GTMTSKNYPG 60 

TYPNHTVCEK TITVPKGKRL ILRLGDLDIE SQTCASDYIiL FTSSSDQYGP YCGSKTVPKE 120 

LLLNTSBVTV RFESGSHISG RGFLLTYASS DHPDLITCLE RASHYLKTEY SKFCPAGCRD 180 

on VAGDISGNMV DGYRDTSLLC KAAIHAGIIA DELGGQISVL QRKGISRYEG ILANGVLSRD 240 

OO GSLSDKRFLF TSNGCSRSLS FEPDGQIRAS SSWQSVNBSG DQVHWSPGQA RLQDQGPSWA 300 

SGDSSNNHKP REWLEIDLGE KKKITGIRTT GSTQSNFNFY VKSFVMNFKN NNSKWKTYKG 360 

IVNNEEKVFQ GNSNFRDPVQ NNFIPPIVAR YVRWPQTWH QRIALKVELI GOQITQGNDS 420 

LVWRKTSQST SVSTKKEDET ITRPIPSEET STGINITTVA IPLVLLWLV PAGMGIFAAF 480 

RKKKKKGSPY GSAEAQKTDC WKQIKYPFAR HQSAEFTISY DNEKEMTQKL DLITSDMAG 539 
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Seq ID NO: C3S2 Protein Sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I I 1 I 

MGFGAGQRLR PVPAPRSSAB EAARPGQLRL GIRRGEAEIA KLAPSGVMVP GARGGGALAR 60 

AAGRGLLALL LAVSAPLRLQ AEELGDGCGH LVTYQDSGTM TSKNYPGTYP NHTVCEKTIT 120 

VPKGKRLILR LGDLDIESQT CASDYLLFTS SSDQYGPYOG SMTVPKELLL NTSEVTVRFB 180 

SGSHISGRGF LLTYASSDHP DLITCLERAS HYLKTBYSKF CPAGCRDVAG DISGNMVDGY 240 

RDTSLLCKAA I HAG 1 1 AD EL GGQISVLQRK GISRYEGILA NGVLSRDGSL SDKRFLFTSN 300 

GCSRSLSFEP DGQIRASSSW QSVNESGDQV HWSPGQARLQ DQGPSWASGD SSNNHKPREW 360 

LEIDLGEKKK ITGIRTTGST QSNFNFYVKS FVMNFKNNNS KWKTYKGIVN NEBKVFQGNS 420 

NFRDPVQNNP IPPIVARYVR WPQTWHQRI ALKVELIGCQ ITQGNDSLVW RKTSQSTSVS 480 

TKKEDETITR PIPSEETSTG INITTVAIPL VLLWLVFAG MGIFAAFRKK KKKGSPYGSA 540 

EAQKTDCWKQ IKYPFARHQS AEFTISYENE KEMTQKLDLI TSDMAG 586 

Seq ID NO: C353 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I 1 I 

MFQRQERFLD LSSAEAVAAW ILHQHPDIIN KGDGCGHLVT YQDSGTMTSK NYPGTYPNHT 60 

VCEKTITVPK GKRLILRLGD LDIESQTCAS DYLLFTSSSD QYGMQKEEET EVLCLSVAGA 120 

QRVDIPVQLL PSFLEGWKGH ADARGPYCGS MTVPKELLLN TSEVTVRFES GSHISGRGFL 180 

LTYASSDHPD LITCLERASH YLKTEYSKFC PAGCRDVAGD ZSGNMVDGYR DTSLLCKAAI 240 

HAGIIADELG GQISVLQRKG ISRYEGILAN GVLSRDGSLS DKRFLFTSNG CSRSLSFEPD 300 

GQIRASSSWQ SVNESGDQVH WSPGQARLQD QGPSWASGDS SNNHKPREWL BIDLGEKKKI 360 

TGIRTTGSTQ SNFNFYVKSF VMNFKNNNSK WKTYKGIVNN EEKVFQGNSN PRDPVQNNFI 420 

PPIVARYVRV VPQTWHQRIA LKVELIGCQI TQGNDSLVWR KTSQSTSVST KKEDETITRP 480 

IPSEETSTDA MPVQIVGDHT QMISQRENLG PDEGKIPFKG TAESMVRWF AWVNDLGML 540 

FLAHTPEEDI DHYCWKQIKY PFARHQSAEF TISYDNEKEM TQKLDLITSD MADYQQPLMI 600 

GTGTVTRKGS TFRPMDTDAE EAGVSTDAGG HYDCPQRAGR HBYALPLAPP EPEYATPIVE 660 

RHVLRAHTFS AQSGYRVPGP QPGHKHSLSS GGFSPVAGVG AQDGDYQRPH SAQPADRGYD 720 

RPKAVSALAT ESGHPDSQKP PTHPGTSDSY SAPRDCLTPL NQTAMTALL 769 

Seq ZD N0t C354 Protein Sequence 
Protein Accession #i NP_004607.1 

1 11 21 31 41 51 

I I I 1 I I 

MAGVSACIKY SMFTFNFLFW LCGILILALA IWVRVSNDSQ AIFGSEDVGS SSYVAVDILI 60 

AVGAIIMILG FLGCCGAIKE SRCMLLLFFI GLLLILLLQV ATGILGAVFK SKSDRIVNET 120 

LYENTKLLSA TGESEKQFQE AIIVFQEEFK CCGLVNGAAD WGNNFQHYPE LCACLDKQRP 180 

CQSYNGKQVY KETCISFIKD FLAKNLIIVI GISFGIAVIE ILGLVFSMVL YOQIGNK 237 

Seq ID NO: C355 Protein Sequence 
Protein Accession ft: NPJJ04608.1 

1 11 21 31 41 51 

I I I I I I 

MCTGGCARCL GGTLIPLAFF GFLANILLFF PGGKVIDDND HLSQEIWFFG GILGSGVLMI 60 

FPALVFLGLK NNDCCGCCGN EGOGKRFAMF TSTIFAWGF LGAGYSFIIS AISINKGPKC 120 

LMANSTWGYP FHDGDYLNDE ALWNKCREPL NWPWNLTLF SILLWGGIQ MVLCAIQWN 180 

GLLGTLCGDC QCOGCCGGDG PV 202 

Seq ID NO: C356 Protein Sequence 
Protein Accession #: NP_002372.l 

1 11 21 31 41 51 

I I I I I I 

MPRPAPARRL PGLL.LLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60 

PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 120 

RVAWNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGIAIQTAM DEAFTVEAGA 180 

. REPSSNIPKV AIXVTDGRPQ DQVNEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE 240 

HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQOQHVCI SDGEGKHHCE CSQGYTLNAD 300 

KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360 

HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG CQHICVSDGA ASYHCDCYPG 420 

YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YLQRLNTKLD DILEKLKINE 480 

YGQIHR 486 

Seq ID NO: C357 Protein Sequence 
Protein Accession #: NPJJ57723.1 

1 11 21 31 41 51 

I I 1 I I I 

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 129 

Seq ID NO: C358 Protein Sequence 
Protein Accession #: NP_001810.l 
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1 11 21 31 41 SI 

I I 1 I I I 

MQPTLLLSLL GAVGLAAVNS MPVDNRNHNE GHVTRCIIBV LSNALSKSSA PPITPECRQV 60 

UCTSRKDVKD KETTENENTK FEVRLLRDPA DASEAHESSS RGEAGAPGEE DIQGPTKADT 120 

EKWAEGGGHS RERADEPQWS LYPSDSQVSE EVKTRHSEKS QRBDEEEEEG ENYQKGERGE 180 

DSSEEKHLEE PGETQNAFLN ERKQASAIKK EELVARSETH AAGHSQEKTH SRBKSSQESG 240 

EEAGSQENHP QESKGQPRSQ EESEEGEEOA TSEVDKRRTR PRHHHGRSRP DRSSQGGSLP 300 

SEEKGHPQEE SEESNVSMAS LGEKRDHHST HYRASBEEPE YGEEIKGYPG VQAPEDLEWE 360 

RYRGRGSEEY RAPRPQSEES WDEEDKRNYP SLELDKMAHG YGEESEEERG LEPGKGRHHR 420 

GRGGEPRAYF MSDTREEKRF LGEGHHRVQE NQMDKARRHP QGAWKELDRN YLNYGEEGAP 400 

GKWQQQGDLQ DTKENREEAR FQDKQYSSHH TAEKRKRLGE LFNPYYDPLQ WKSSHFERRD S40 

NMNDNFLEGE EENELTLNEK NFFPEYNYDW WEKKPFSBDV NWGYEKRNLA RVPKLDLKRQ 600 

YDRVAQLDQL LHYRKKSAEF PDFYDSEEPV STHQEAENEK DRADQTVLTE DEKKELENLA 660 

AMDLELQKIA EKFSQRG 677 

Seq ID NO: C3S9 Protein Sequence 
Protein Accession fh XP_0 93082.1 

1 11 21 31 41 51 

I I I I I I 

MKLLCEGLKQ PNCVLQTLRW YRCLISSASC GALAAVLSTS QWLTEIiEFSS TKLEASALKL 60 

LYGGLKDPNC KLQKLNLQFS LSVTAAKLPV GMVGNCSGFS GSLVQSHFGY CQDSSFKCDL 120 

CKLLWPSTRV AAAKDCGSPK SFLSEGLNWA GRIiEAVEEVL GLGVLVQPGD PASQGGGHCE 180 

NYGSFRDLVD LEVKAEPSLR KGGMDLQRPT LQWLLCKIF SLKLFLFIAL PNSPGQVSW 240 

QVTIPDGFVN VTVGSNVTLI CIYTTTVASR EQLSIQWSFF HKKEMEPISS PWEEGKWPDV 300 

EAVKGTLDGQ QAELQIYFSQ GGQAVA1GQP KDRITGSNDP GNASITISHM QPADSGIYIC 360 

DVNNPPDFLG QNQGILNVSV LVKPSKPLCS VQGRPETGHT ISLSCLSALG TPSPVYYWHK 420 

LEGRDIVPVK ENFNPTTGIL VIGNLTNFEQ GYYQCTAINR LGNSSCEIDL TSSHPEVGII 480 

VGALIGSLVG AAIIISWCF ARNKAKAKAK ERNSKTIAEL EPMTKINPRG ESEAMPREDA S40 

TQLEVTLPSS IHETGPDTIQ EPDYEPKPTQ EPAPEPAPGS EPMAVPDU3I ELELEPETQS 600 

ELEPEPEPEP ESEPGWVEP LSEDEKGWK A 631 

Seq ID NO: C360 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I 1 

MVFAFWKVFL ILSCLAGQVS WQVTIPDGF VNVTVGSNVT LICIYTTTVA SREQLSIQWS 60 

FFHKKEMEPI SSPWEEGKWP DVEAVKGTLD GQQAELQIYF SQGGQAVAIG QFKDRITGSN 120 

DPGNASITIS HMQPADSGIY ICDVNNPPDF LGQNQGILNV SVLVKPSKPL CSVQGRPETG 180 

HTISLSCLSA LGTPSPVYYW HKLEGRDIVP VKENFNPTTG ILVIGNLTNF EQGYYQCTAI 240 

NRLGNSSCEI Dl/TSSHPEVG IIVGALIGSL VGAAIIISW CFARNKAKAK AKERNSKTIA 300 

ELEPMTKINP RGESEAMPRE DATQLEVTLP SSIHETGPDT IQEPDYEPKP TQBPAPEPAP 360 

GSEPMAVPDL DIELELEPET QSELEPEPEP EPESEPGWV EPLSEDEKGV VKA 413 

Seq ID NO: C361 Protein Sequence 
Protein Accession #: NP_0030ll.l 

1 11 21 31 41 51 

I I I I I I 

MVSRMVSTML SGLLFWLASG WTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 60 

PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTQ DNIPKDFSED QGYPDPPNPC 120 

PVGKTDDGCL ENTPDTAEFS REFQLHQHLF DPKHDYPGLG KWNKKLLYEK MKGGERRKRR 180 

SVNPYLQGQR LDNWAKKSV PHFSDEDKDP E 211 

Seq ID NO: C362 Protein Sequence 
Protein Accession 8: NP_076926.2 

1 11 21 31 41 51 

I 1 I I I I 

MTTMQGMEQA MPGAGPGVPQ bGNMAVIHSH LWKGLQEKFL KGEPKVLGW QILTALMSLS 60 

MGITMMCMAS NTYGSNPISV YIGYTIWGSV MFIISGSLSI AAGIRTTKGL VRGSLGMNIT 120 

SSVLAASGIL INTFSLAFYS FHHPYCNYYG NSNNCHGTMS ILMGLDGMVL LLSVLEFCIA 180 

VSLSAFGCKV LCCTPGGWL ILPSHSHMAE TASPTPLNEV 220 

Seq ID NO: C363 Protein Sequence 
Protein Accession #: NP_002082.1 

1 11 21 31 41 51 

I I i I I 1 

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKKYPR GNHWAVGHLM GKKSTGESSS 60 

VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS MDSEDSSNFK 120 

DVGSKGKVGR LSAPGSQREG RNPQLNQQ 148 

Seq ID NO: C364 Protein Sequence 
Protein Accession #: NP_036393.1 

1 11 21 31 41 51 

I I I I I I 

MDLQGRGVPS IDRLRVLLMb FHTMAQIMAE QEVENLSGLS TNFEKDIFW RENGTTCU4A 60 

BFAAKFIVPY DVWASNYVDL ITBQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120 

PVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHiS ALVTPAGKSY 180 

EOQAQQTI8L ASSDPQKTVT MILSAVHIQP FDIISDPVFS EEHKCPVDER EQLEETLPLI 240 

LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 280 
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Seq ID NO: C365 Protein Sequence 
Protein Accession ft: NP_003217.l 

X 11 21 31 41 51 

I I I I I I 

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 
WCFKPLTRKT ECTF 

Seq ID NO: C366 Protein Sequence 
Protein Accession ft: np 002984.1 



51 



1 11 21 31 41 

I I I I I i 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASUCNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: C367 Protein Sequence 
Protein Accession ft: NP 005233.2 



1 
I 

MRSPSAAWLL 
VDEFSASVLT 
DLLSVIWFPL 
MGHSRKKANI 
NYFLSLAIGV 
TPSNLLLWH 
CRSVRTVRQM 



11 
I 

GAAILLAASL 
GKLTTVPLPI 
KIAYHIHANN 
AIGISLAIWL 
FLFPAPLTAS 
YFLIKSQGQS 
QVSLTSKKHS 



21 
I 

SCSGTIQGTN 
VYTIVFWGL 
WIYGEALCNV 
LILLVTIPLY 
AYVLMIRMLR 
HVYALYIVAL 



31 
I 

RSSKGRSLIG 
PSNGMALWVF 
LIGFFYGNMY 
WKQTIFIPA 



CLSTLNSCID 
TTVKTSY 



Seq ID NO: C368 Protein Sequence 
Protein Accession ft: NP 003460.1 



MAEAKTHWLG 
YIENLRQQAH 
AENEPQSAPK 
RTNEIVEEQY 
YEDWGGEDW 
DQLSDDVSKV 
PPEDLIEMLK 
RAGTEALPDG 
AAWIPHVENR 
EQIEQAIKEH 
KAEKGREHIA 



11 
I 

AALSLIPLIF 
KEESSPDYNP 
ENKPYALNSE 
TPQSLATLES 
NPVEEKIESQ 
IAYUCRLVNA 
TGEKPNGSVE 
LSVEDILNLL 
QMAYENLNDK 
LNQGSSQETD 
KRAMENM 



21 
I 

LISGAEAASF 
YQGV8VPLQQ 
KNPPMDMSDD 
VFQEXiGKLTG 
TQEEVRDSKE 
AGSGRLQNGQ 
PERELDLPVD 
GMESAANQKT 
DQELGEYLAR 
KLAPVSKRFP 



31 

i 

QRNQLLQKEP 



41 
1 

KVDGTSHVTG 
LFRTKKKHPA 
CSILFMTCLS 
LNITTCHDVL 
KRKRAIKLIV 
PFVYYFVSHD 



41 
I 

DLRLENVQKF 



YETQQWPERK 
PNNQKRERMD 
NIGKNEQIND 
NGERATRiiFE 
LDDISEADLD 
SYFPNPYNQE 
MLVKYPBIIN 
VGPPKNDDTP 



LKHMQFPPMY 
EEQKLYTDDE 
EMKRSGQLGI 
KPLDSQSXYQ 
HPDLFQNRML 
KVLPRLPYGA 
SNQVKRVPGQ 
NRQYWDEDLL 



51 
I 

KGVTVETVFS 
VIYMANLALA 
VQRYWVIVNP 
PEQLLVGDMF 
TVLAMYLICF 
FRDHAKNALL 



51 
I 

PSPEMIRALE 
MRIILEALRQ 
EENSRDNPPK 
DDIYKANNIA 



LIEISRNLQI 
SKSGYPKTPG 
GRSRSNQLPK 
GSSEDDLQEE 
MKVLEYLNQE 



60 
74 



60 
114 



60 

120 

180 

240 

300 

360 

397 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

617 



Seq ID NO: C369 Protein Sequence 
Protein Accession ft: NP_112217.1 



1 
I 

MPCAQRSWLA 
DASGHFLSYG 
KRYGNLSHVK 
KHPLVEGGYH 
SISKERWVET 
LEEEEQGI»KI 
LGLSHLSGMC 
RQLOYDPTPI* 
QYGPNATFCQ 
IPGGHGRWSP 
PTFRQMQCSE 



LIPKGARDIR 
EKLMATGPTN 
TGIRRQTAHC 
GEKKRTVLCI 
GGVRIRSVTC 
VPPPTSRPRM 
ISTGSTSQPI 
PPYNTLTKGP 
PDLSRESWWP 
KTANRNHLKL 
WSECSTTCGIj 
GGFKIREXQC 
GVQERGVFCP 
EGNKTEDQDQ 
TVRAECCPSC 



11 
I 

NLSWAQLLN 
LHYPITSSRR 
MMASSAPLCH 
PHIVYRRQKV 
LWADTKMIE 
VHHAEKTLSS 
QPHRSCNINE 
TWSKCSEEYI 
EVENVCQTIiW 
WSHCSRTCGA 
FDTVPYKNEL 
GICKMVGCDY 
VMEIEGAGNF 
ESVWIQLLFQ 
IKKGRGMVKA 
QTMVSDEQAL 
AKNHDEPCDV 
LTTPTGPESM 
LTSQSLSIQP 
EMEIHSGSGE 
PPSTVMEGLL 
PNNKNOTKSS 
GAYWKRVECT 
VDSRDHRNLR 
GGLCDWTKRP 
CLCDRKPRPP 
PQTHITHTQR 



21 
I 

FGALCYGRQP 
KRDLDGSEDN 
LSGTVLQQGT 
PETKEPTCGL 
YHGSENVESY 
FCKWQKSINP 
DSGLPLAFTI 
TRFLDRGWGF 
CSVKGPCRSK 
GVQSAERLCN 
YHWFPIFNPA 



LAIRSEDPEK 
VTNPGIKYEY 
TPCDPETOPN 
PPTDCQHLLK 
TRKPNSRALC 
STSTPAISSP 



31 

I 

QPGPVRPPDR 
VYYRI SHEER 
RVGTAALSAC 
KDSVNISQKQ 
ILTIMNMVTG 
KSDLNPVHHD 
AHELGHSFGI 
CLDDIPKKKG 
LDAAADGTQC 
NPEPKFGGKY 
HPCELYCRPI 
CGVCLGDGSS 
YYUIGGFIIQ 
TIQKDGLDND 



EREQPEDKDE 
PSQRPTTSET 
EPVLTEEDAT 
TQMDSDCAAI 
PFHOQFIiAGI 
TSTMSCNEHL 
BFKKCNQQAC 
QRRQRLLQKS 



PKTLLSCNRD 
GLQQCPSSRR 
SPTTASKEGD 
GPTSEGGLVA 
SNPVIWTKIR 
GTPRVEGMVT 
SLITEGFLLN 
QRPDPAKRCH 
PPPLSMSOIP 
CCHWATGNWD 
KKSADLLCTK 
KEL 



41 51 
I I 

RQEHFIKGLP EYHWGPVRV 60 

DLFFNLTVNQ GFLSNSYIME 120 

HGLTGFFQLP EGDFFIEPVK 180 

ELWREKWERH NLPSRSLSRR 240 

LFHNPSIGNA IHIVWRLIL 300 

VAVLLTRKDI CAGFNRPCET 360 

QHDGKENDCE PVGRHPYIMS 420 

UCSKVIAPGV IYDVHHQOQL 480 

GEKKWCMAGK CITVGKKPES 540 

CTGERKRYRL CNVHPCRSEA 600 

DGQFSEKMLD AVIDGTPCFE 660 

CQTVRKMFKQ KEGSGYVDIG 720 

WNGNYKLAGT VFQYDRKGDL 780 

VEQMYFWQYG HWTECSVTCG 840 

CPPRWWAGEW EACSATCGPH 900 

ILCPSDWTVG NWSECSVSCG 960 

VLKPNKGTIS NGKNPPTLKP 1020 

LGGKQWQDSS TQPELSSRYL 1080 

TTTSGSGLSS SRNPITWPVT 1140 

VPGNDAPVES TEMPLAPPXjT 1200 

EKPANTLLPL GGDHQPEPSG 1260 

ASNYKQLTNG HGSAHWIVGN 1320 

LRPCAGWKVG NWSKCSRNCS 1380 
EPCEAWQVEP WSQCSRSCGG 1440 
LCSTSOGGGF QKRIVQCVPS 1500 
DKLSASFOQT LKAMKKCSVP 1560 
1593 



Seq ID NO: C370 Protein Sequence 
Protein Accession ft: NP_00l0S3.l 



li 
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31 
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I 1 1 I I I 

MRQSHQLPLV GLLLFSFIPS QLCBICEVSB ENYIRLKPLL NTMIQSNYNR GTSAVNWLS 
LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSGELALIIL ALGVCRNAEB NLIYDYHLTD 
KLENKFQAEI ENMEAHNGTP LTNYYQLSLD VLALCLFNGN YSTAEWNHF TPENKNYYFG 
SQFSVDTGAM AVLALTCVKK SLINGQIKAD EGSLKNISIY TKSLVEKILS EKXENGLIGN 
TFSTGEAMQA LFVSSDYYNE NDWNCQQTLN TVLTEISQGA FSNPNAAAQV LPALMGKTFL 
DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTNV TVLKGSVFLS 
VMEKAQKMND TIFGFTMEER SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYW 
RNGENLEVRW SKY 

Seq ID NO: C371 Protein Sequence 
Protein Accession ft: NP_004S82.1 

1 11 21 31 41 51 

I I I I I I 

MCCTKSLLLA ALMSVLLLHL CGESEAASNF DCCLGYTDRI LHPKFIVGFT RQLANEGCDI 
NAIIFHTKKK LSVCANPKQT WVKYIVRLLS KKVKNM 

Seq ID NO: C372 Protein Sequence 
Protein Accession ft: NP 037403.1 



11 



21 



51 



31 41 

i i i r I 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARAIA KLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSBL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: C373 Protein Sequence 
Protein Accession ft: NP 002236.1 



51 



1 11 21 31 41 

I I 1 I I I 

MLQSLAGSSC VRLVERHRSA WCFGFLVLGY LLYLVFGAW FSSVELPYED LLRQELRKLK 
RRFLEEHECL SEQQLEQFLG RVliEASNYGV SVLSNASGNW NWDFTSALFF ASTVLSTTGY 
GHTVPLSDGG KAFCIIYSVI GIPFTLLFLT AWQRITVHV TRRPVLYFHI RWGFSKQWA 
IVHAVLLGFV TVSCFFFIPA AVFSVLEDDW NFLESFYFCF ISLSTIGLGD YVPGEGYNQK 
FRELYKIGIT CYLLLGLIAM LWLETFCEL HELKKFRKMF YVKKDKDEDQ VHIIEHDQLS 
FSSITDQAAG MKEDQKQNEP FVATQSSACV DGPANH 

Seq ID NO: C374 Protein Sequence 
Protein Accession ft: NP_005463.1 

1 11 21 31 41 51 

I I I I I I 

METTNGTETW YESLHAVLKA LNATLHSNLL CRPGPGLGPD NQTEERRASL PGRDDNSYMY 
ILFVMFLFAV TVGSLILGYT RSRKVDKRSD PYHVYIKNRV SMI 

Seq ID NO: C375 Protein Sequence 
Protein Accession fti NP 005236.1 



60 

120 

180 

240 

300 

360 

420 

433 



60 
96 



60 

120 

180 

240 

260 



60 

120 

180 

240 

300 

336 



60 
103 



1 
I 

MGRHLALLLL 
VYITHPAWEV 
KALEKNTNVE 
NGEFYYSFKD 
AKLTVHIEQA 
QQFRTVRSFP 
KAGPVKFEKD 
SILEPVKRQQ 
TIMSLSAVDP 
SDWGLPYRRE 
LVQYQIEAGN 
ITVAASHKLV 
PTGIQVKENQ 
DRETTDKYTL 
IIOVEATDKD 
AREEPQLFST 
QVRYSLLDHG 
EWDVNENLH 
VFKIGEETGV 
EPVYYPEIME 



11 



51 



DRERNARREP 
DILSIKAVDN 
SVEPPGIPLW 
QVFIKVIDTN 
SLKKFRLDPA 
WFTASSYKGR 
SIKTAKELDR 
ETVSIGSFVG 
QGTNMAGIiST 



21 31 41 

I I I I I 

LLLLFQHFGD SDGSQRLEQT PLQFTHUEYN VTVQENSAAK TYVGHPVKMG 60 

RYKIVSGDSE NLFKABEYIL GDFCFLRIRT KGGNTAILNR EVKDHYTLIV 120 

ARTKVRVQVL DTNDLRPLFS PTSYSVSLPE NTAIRTSIAR VSATDADIGT 180 

RTDMFAIHPT SGVIVLTGRL DYLBTKLYEM EIIAADRGMK LYGSSGISSM 240 

NECAPVITAV TLSPSELDRD PAYAIVTVDD CDQGANGDIA SLSIYAGDLL 300 

GSKEYKVKAI GDIDWDSHPF GYNLTLQAKD KGTPPQFSSV KVTHVTSPQF 360 

VYRABISEFA PPNTPWMVK AIPAYSHLRY VFKRTPGKAK FSLNYNTGLI 420 

AAHFELEVTT SDRKASTKVL VKVLGANSNP PBFTQTAYKA AFDENVPIGT 480 

DEGENGYVTY SIANLNHVPF AIDHFTGAVS TSENLDYELM PRVYTLRIRA 540 

VEVLATITLN NLNDNTPLFB KINCEGTIPR DLGVGEQITT VSAIDADELQ 600 

ELDLFSLNPN SGVLSLKRSL MDGLGAKVSF HSLRITATDG ENFATPLYIN 660 

NLQCEETGVA KKLAEKLLQA NKLHNQGEVE DIFFDSHSVN AHIPQFRSTL 720 

PVGSSVIFMN STDLDTGFNG KLVYAVSGGN EDSCFMIDME TGMLKILSPL 780 

NITVYDLGIP QKAAWRLLHV WVDANDNPP EFLQESYFVE VSEDKEVHSE 840 

LGPNGHVTYS ILTDTDTFSI DSVTGWNIA RPLDRELQHE HSLKIEARDQ 900 

WVKVSLEDV NDNPPTFIPP NYRVKVREDL PEGTVIKWLE AHDPDLGQSG 960 

EGNFDVDKLS GAVRIVQQLD FEKKQVYNLT VRAKDKGKPV SLSSTCYVEV 1020 

PPVFSSFVEK GTVKEDAPVG SLVKTVSAHD BDAGRDGEIR YSIRDGSGVG 1080 

IETSDRLDRE STSHYKLTVF ATDQGWPLS SFIEIYIEVE DVNDNAPQTS 1140 

NSPKDVSWQ IEAFDPDSSS NDKLMYKITS GNPQGFFSIH PKTGLITTTS 1200 

HILEVTVTDN GSPPKSTIAR VIVKILDEND NKPQFLQKFY KIRLPEREKP 1260 

LYRVIATDKD EGPNAEISYS IEDGNEHGKF FIEPKTGWS SKRFSAAGEY 1320 

GRPQKSSTTR LHIEWISKPK QSLEPI6FEE SFPTFTVMES DPVAHMIGVI 1380 

FDITGGNYDS HFDVDKGTGT IIVAKPLDAE QKSNYNLTVB ATDGTTTILT 1440 

DHRPQFSTSK YEWIPEDTA PETE I LQ ISA VDQDEKNKLI YTLQSSRDPL 1500 
TGSLYTSEKL DHEAVSPAHL TVMVRDODVP VKRNFARTW NVSDTNDHAP 1560 
VYESAAVGSV VliQVTALDKD KGKNAEVLYS IBSGNIGNIG NSFMIDPVLG 1620 
SNQAEYDLMV KATDKGSPPM SEITSVRIFV TIADNASPKF TSKEYSVELS 1680 
MVTAHSQSSV VYEIKDGNTG DAFDINPHSG TIITQKAU)P ETLPIYTLII 1740 
NTTVLVHLQD ENDNAPVFMQ AEYTGLISES ASINSWLTD RNVPLVIRAA 1800 
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DADKDSNALL VYHTVEPSVH TYFAIDSSTG AIHTVLSLDY EETSIFHFTV QVHDKGTPRL 1860 

FAEYAANVTV HVIDINDCPP VFAKPLYEAS LLLPTYKGVK VITVNATDAD SSAFSQLIYS 1920 

ITEGNIGEKP SMDYKTGALT VQNTTQLRSR YELTVRASDG RFAGLTSVKI NVKBSKESHL 1980 

KPTQDVYSAV VKENSTEABT LAVITAIGSP INEPLFYHIL NPDRRFKISR TSGVLSTTGT 2040 

PFDREQQEAF DVWEVIEEH KPSAVAHWV KVIVEDQNDN APVFVNLPYY AWKVDTEVG 2100 

HVIRYVTAVD RDSGRNGEVH YYLKEHHEHF QIGPLGEISL KKQFELDTLN KEYLVTWAK 2160 

DGGNPAFSAE VIVPITVMNK AMPVFEKPPY SAEIAESIQV HSPWHVQAN SPEGLKVFYS 2220 

ITDGDPFSQP TINFNTGVIN VIAPLDFEAH PAYKLSIRAT DSLTGAHAEV PVDIIVDDIN 2280 

DNPPVFAQQS YAVTLSEASV IGTSWQVRA TDSDSEPNRG ISYQMFGNHS KSHDHPHVDS 2340 

STGLISLLRT LDYEQSRQHT IFVRAVDGGM PTLSSDVIVT VDVTDLNGNP PLFEQQIYEA 2400 

RISEHAPHGH FVTCVKAYDA DSSDIDKLQY SILSGNDHKH FVIDSATGII TLSNLHRHAL 2460 

KPFYSLNLSV SDGVFRSSTQ VHVTVIGGNL HSPAFLQNEY EVELAENAPL HTLVMEVKTT 2520 

DGDSGIYGHV TYHIVNDFAK DRFYINERGQ IPTLEKLDRE TPAEKVISVR LMAKDAGGKV 2580 

AFCTVNVILT DDNDNAPQFR ATKYEVNIGS SAAKGTSWK SASDADEGSN ADITYAIEAD 2640 

SESVKENLEI NKLSGVITTK ESLIGLENEF FTFFVRAVDN GSPSKESWL VYVKILPPEM 2700 

QLPKPSBPFY TFTVSEDVPV GTEIDLIRAE HSGTVLYSLV KGNTPESNRD ESFVIDRQSG 2760 

RLKLEKSU5H ETTKWYQFSI LARCTQDDHE MVASVDVSIQ VKDANDNSPV FESSPYBAPI 2820 

VENLPGGSRV IQIRASDADS GTNGQVMYSL DQSQSVEVIE SFAINMETGW ITTLKELDHE 2880 

KRDNYQIKW ASDHGEKIQL SSTAIVDVTV TDVNDSPPRF TAEIYKGTVS EDDPQGGVIA 2940 

ILSTTDADSE EINRQVTYFI TGGDPLGQFA VETIQNBWKV YVKKPLDREK RDNYLLTITA 3000 

TDGTFSSKAI VEVKVLDAND NSPVCEKTLY SDTIPEDVLP GKLIMQISAT DADIRSKAEI 3060 

TYTLLGSGAE KFKLNPDTGE LKTSTPLDRE EQAVYHLLVR ATDGGGRFCQ ASIWTLEDV 3120 

NDNAPEPSAD PYAITVFENT EPGTLLTRVQ ATDADAGLNR KILYSLIDSA DGQFSINELS 3180 

GIIQLEKPLD RELQAVYTLS LKAVDQGLPR RLTATGTVIV SVLDINDNPP VFEYREYGAT 3240 

VSEDILVGTE VLQVYAASRD IEANAEITYS IISGNEHGKF SIDSKTGAVF IIENLDYESS 3300 

HEYYLTVEAT DGGTPSLSDV ATVNVNVTDI NDNTPVFSQD TYTTVISEDA VLEQSVITVM 3360 

ADDADGPSNS HIHYSIIDGK QGSSPTIDPV RGEVKVTKLL DRETISGYTL TVQASDNGSP 3420 

PRVNTTTVNI DVSDVNDNAP VFSRGNYSVI IQENKPVGFS VLQLWTDED SSHNGPPFPP 3480 

TIVTGNDEKA FEVNPQGVLL TSSAIKRKEK DHYLLQVKVA DNGKPQLSSL TYIDIRVIEE 3540 

SIYPPAILPL EIFITSSGEE YSGGVIGKIH ATDQDVYDTL TYSLDPQMDN LFSVSSTGGK 3600 

LIAHKKLDIG QYLLNVSVTD GKFTTVADIT VHIRQVTQEM LNHTIAIRFA NLTPEEFVGD 3660 

YWRNFQRALR NILGVRRNDI QIVSLQSSEP HPHLDVLLFV EKPGSAQIST KQLLHKINSS 3720 

VTDIEEIIGV RILNVFQKLC AGLDCPWKFC DEKVSVDESV MSTHSTARLS FVTPRHHRAA 3780 

VCLCKEGRCP PVHHGCEDDP CPEGSECVSD PWEEKHTCVC PSGRFGQCPG SSSMTLTGNS 3840 

YVKYRLTENE NKLEMKLTMR LRTYSTHAW MYARGTDYSI LEIHHGRLQY KFDCGSGPGI 3900 

VSVQSIQVND GQWHAVALEV NGNYARLVLD QVHTASGTAP GTLKTLNLDN YVFFGGHIRQ 3960 

QGTRHGRSPQ VGNGFRGCMD SIYLNGQELP LNSKPRSYAH IEESVDVSPG CPLTATEDCA 4020 

SNPCQNGGVC NPSPAGGYYC KCSALYIGTH CEISVNPCSS NPCLYGGTCV VDNGGFVCQC 4080 

RGLYTGQRCQ LSPYCKDEPC KNGGTCFDSL DGAVGQCDSG FRGERCQSDI DECSGNPCLH 4140 

GALCENTHGS YHCNCSHEYR GRHCEDAAPN QYVSTPWNIG LAEGIGIWF VAGIFLLVW 4200 

FVLCRKMISR KKKHQAEPKD KHLGPATAFL QRPYFDSKLN KNIYSDIPPQ VPVRPISYTP 4260 

SIPSDSRNNL DRNSFEGSAI PEHPBFSTFN PBSVHGHRKA VAVCSVAPNL PPPPPSNSPS 4320 

DSDSIQKPSW DFDYDTKWD LDPCLSKKPL EEKPSQPYSA RESLSEVQSL SSPQSESCDD 4380 

NGYHWDTSDW MPSVPLPDIQ EFPNYEVIDE QTPLYSADPN AIDTDYYPGG YDIESDPPPP 4440 

PEDFPAADEL PPLPPEFSNQ FESIHPPRDM PAAGSLGSSS RNRQRFNLNQ YLPNFYPLDM 4500 

SEPQTKGTGE KSTCREPHAP YPPGYQRHFE APAVESMPMS VYASTASCSD VSACCEVESE 4560 

VMMSDYESGD DGHFEEVTIP PLDSQQHTEV 4590 

Seq ID NO: C376 Protein Sequence 
Protein Accession #: NP_055035.1 

1 11 21 31 41 51 

I I I I I I 

MCYGKCARCI GHSLVGLALL CIAANILLYF PNGETKYASE NHLSRFVWFP SGIVGGGLLM 60 

LLPAFVPIGL EQDDCCGCCG HENCGKRCAM LSSVLAALIG IAGSGYCVIV AALGLAEGPL 120 

CLDSLGQWNY TFASTEGQYL LDTSTWSECT EPKHIVEWNV SLFSILIALG GIEFILCLIQ 180 

VINGVLGGIC GFCCSHQQQY DC 202 



Seq ID NO: C377 Protein Sequence 
Protein Accession #* NP 003750.1 



1 
I 

KSTENVEGKP 
PPQLFTELDE 
. MEKGSIMLDR 
ADIGKTVSSA 
SNVLVGEVDF 
ATLMSDEVFH 
NMYSGGENVQ 
LNIQALSAIL 
ILSSTGPVLV 
BEGFSSLISF 
LAPEYLPTMS 
TSSMALKKFK 
NRGWFVPPFG 
WVAILMVICS 
ILTGLSVFMA 
VPLRRVHLFT 
VIPEKDKKKK 
DRERSPTFLE 



LXiAVDGQEMB 
EASSLPQLVE 
SRMFTNPDNG 
LDTPFIAFVR 
DIAYKAKDRH 
MNGDTPHDGG 
PIYLATVTNA 
FERIiLFNFSK 
IFIYDAFKKM 
STDKYHNTTP 
TSPYFPTTAR 
ENPWWVCLAA 
LMALPWYVAA 
PILKFIPMPV 
FLQVLCLALL 
EDBKKKKKKK 
RHTSC 



21 
I 

SSTFLRWQP 
WKETARHIKF 
MIVDHQIETG 
SPAMTHRNLT 
LQQAVMLGAL 
DLIAGIDBFL 
HGGGGHGDCB 
ITFGGLLGDA 
DNNFDYLBFR 
IKLADYYPIN 
DWAFLSKKEC 
KLISDFAIIL 
AIPALLVTIL 
TVISIAHIDS 
LYGVFLYMGV 
WILKSTVAAI 



31 
I 

MFNHSIFTSA 



LLKPELKDKV 
SSSLNDISDK 
TBVPVPTRFIi 
DEVIVLPPGE 
ELQRTGRFCG 
TDNMQGVLES 
LWIGLWSAFL 
SNFKVGYNTL 
SKYGGNLVGN 
SILIFCVIDA 
1FMDQQITAV 
LKMETBTSAP 
ASLNGVQFMD 
IFPVMI1ALV 
DCPYSBKVPS 



41 
I 

VSPAAERIRF 
VfSKPHVATLS 
TYTLLRKHRH 
PEKDQLKNKF 
FILLGPKGKA 
WDPAIRIBPP 
GLIKDIKRKA 
FLGTAVSGAI 
CLILVATDAS 
FSCTCVPPDP 
NCNFVPDITL 
LVGVDTPKLI 
IVKRKEHKLK 
GEQPKFLGVR 
RLKLLLMPLK 
AVRKGMDYLP 
IKIPMDIMEQ 



51 
I 

ILGEEDDSPA 
LHSLFELRTC 
QTKKSNLRSL 
MKKLPRDAEA 
KSYHEIGRAI 
KSLPSSDKRK 
PFFASDFYDA 
FCLFAGQPLT 
FLVQYFTRFT 
ANISISNDTT 
MSFILFLGTY 
VPSEFKPTSP 
KGAGYHLDLF 
EQRVTGTLVF 
HQPDFIYIiRH 
SQHDLSFLDD 
QPPLSDSKPS 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1035 



Seq ID NO i C378 Protein Sequence 
Protein Accession fl: NP_000949.1 



1369 



WO 03/042661 



1 11 21 31 41 51 

I I I I I I 

MSTPGVNSSA SLSPDRLNSP VTIPAVHFIF GWGNLVAIV VLCKSRKEQK ETTFYTLVCG 60 

LAVTDLLGTL LVSPVTIATY MKGQWPGGQP LCEYSTFILL FFSLSGLSII CAMSVBRYLA 120 

INHAYFYSHY VDKRLAGLTL FAVYASNVLF CALPNKGLGS SRLQYPDTWC PIDWTTNVTA 180 

HAAYSYMYAG FSSFLILATV LCNVLVCGAL LRMHRQFMRR TSLGTEQHHA AAAASVASRG 240 

HPAASPALPR LSDFRRRRSF RRIAGAEIQM VILLIATSLV VLICSIPLW RVFVNQLYQP 300 

SLEREVSKNP DLQAIRIASV NPILDPWIYI LLRKTVLSKA IEKIKCLFCR IGGSRRERSG 360 

QHCSDSQRTS SAMSGHSRSF ISRELKEISS TSQTLLPDLS LPDLSENGLG GRNLLPGVPG 420 

MGLAQEDTTS LRTLRISETS DSSQGQDSES VLLVDEAGGS GRAGPAPKGS SLQVTFPSET 480 

LNLSEKCI 488 



Seq ID NO: C379 Protein Sequence 
Protein Accession #: NP_002650.1 



1 11 21 31 41 51 

I I I I I I 

MGHPPLLPLL LLLHTCVPAS WGLRCMQCKT NGDCRVEECA LGQDLCRTTI VRLWEEGEEL 60 

ELVEKSCTHS EKTNRTLSYR TGLKITSLTE WCGLDLCNQ GNSGRAVTYS RSRYLECISC 120 

GSSDMSCERG RHQSLQCRSP EEQCLDWTH WIQEGEEGRP KDDRHLRGCG YLPGCPGSNG 180 

FHNNDTFHFL KCCNTTKCNE GPILELENLP QNGRQCYSCK GNSTHGCSSE ETFLIDCRGP 240 

MMQCLVATGT HEPKNQSYMV RGCATASMCQ HAHLGDAFSM NHIDVSCCTK SGCNHPDLDV 300 

QYRSGAAPQP GPAHLSLTIT LLMTARLWGG TLLWT 335 



Seq ID NO: C380 Protein Sequence 
Protein Accession ft: EAB55406.1 



1 11 21 31 41 51 

I I I I 1 I 

MDFESGQVDP LASVILPPNL LENLSPEDSV LVRRAQFTFF NKTGLFQDVG PQRKTLVSYV 60 

MACSIGNITI QNLKDPVQIK IKHTRTQEVH HPICAFWDLN KNKSFGGWNT SGCVAHRDSD 120 

ASETVCLCNH FTHFGVLMDL PRSASQLDAR NTKVLTFISY IGCGISAIPS AATLLTYVAF 180 

EKLRRDYPSK ILMNL STALL FLNLLFLLDG WITSFNVDGL CIAVAVLLHF FLLATFTWMG 240 

LEAIHMYIAL VKVFNTYIRR YILKFCIIGW GLPALWSW LASRNNNBVY GKESYGKEKG 300 

DEFCWIQDPV IFYVTCAGYP GVMFFLNIAM FIWMVQICG RNGKRSNRTL REEVLRNLRS 360 

WSLTFLLGM TWGPAFPAWG PLNIPFMYLF SIFNSLQGLF IFIFHCAMKE NVQKQWRRHL 420 

CCGRFRLADN SDWSKTATNI IKKSSDNLGK SLSSSSIGSN STYLTSKSKS SSTTYFKRNS 480 

HTDNVSYEHS FNKSGSLRQC FHGQVLVKTG PC 512 



Seq ID NO: C381 Protein Sequence 
Protein Accession ft: NP_000565.1 



1 11 21 31 41 51 

I I I I I I 

MTVARPSVPA ALPLLGELPR LLLLVLLCLP AVWGDOGLPP DVPNAQPALE GRTSPPEDTV 60 

ITYKCEESFV KIPGEKDSVI CLKGSQWSDI EEFCNRSCEV PTRLNSASLK QPYITQNYFP 120 

VGTWEYECR PGYRREPSLS PKLTCLQNLK WSTAVEFCKK KSCPNPGEIR NGQIDVPGGI 180 

LPGATISFSC NTGYKLFGST SSFCLISGSS VQWSDPLPEC REIYCPAPPQ IDNGIIOGER 240 

DHYGYRQSVT YACNKGFTMI GEHSIYCTVN NDEGEWSGPP PBCRGKSLTS KVPPTVQKPT 300 

TVNVPTTEVS PTSQKTTTKT TTPNAQATRS TPVSRTTKHF HETTPNKGSG TTSGTTRLLS 360 

GHTCFTLTGL LGTLVTMGLL T 381 



Seq ID NO i C382 Protein Sequence 
Protein Accession ft: Eos sequence 



1 11 21 31 41 51 

I I i I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH I PDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PBLRTLTLNG ASQITEFPDL 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

EIYBIKVDTF QQLLSLRSLN LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 

LTHLKLTGNH ALQSLISSEN FPBLKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480 

DLHKKDAGMF QAQDERDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 

WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 

ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGP SVKYSAKFET KAPFSSLKVI 660 

ILLCALLALT KAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAP LSFSSLINLT FISPEVIKFI 780 

LLVWPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS B40 

TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAFV PCL 883 



Seq ID NO: C383 Protein Sequence 
Protein Accession ft: NP_003658.1 



1 11 21 31 41 51 

1 I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH I PDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPBKAFVGN PSLITIHFYD NPIQPVGRSA 300 

1370 



WO 03/042661 



FQHLPELRTL TLNGASQITE FPDLTGTANL ESUTLTGAQI SSLPQTVCNQ LPNLQVLDLS 

YNLLBDLPSF svcqklqkid lrhneiyeik vdtfqqllsl rslnlawnki aiihpnafst 

LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 
AFGVCENAYK ISNQWNKGDN SSHDDLHKKD AGMFQAQDER DLEDFLLDFE BDLKALHSVQ 
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKLLIGVIA 
AVNMLTGVSS AVLAGVDAFT FGSFARHGAW WENGVGCHVI GFLSIPASES SVFLLTIAAL 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 
PVAFLSFSSL XNLTFISPEV IKFILLWVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 
VAFVPCL 

Seq ID NO j C384 Protein Sequence 
Protein Accession #: NP_0 034 97.1 

1 11 21 31 41 51 

I I I 1 ! I 

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 
FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 
CAPPCPNMYF KSDBLEFAKS F1GTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY FFTMAGTVWW 
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL, KKFMIRIGVF 
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PEIALFMIKY 
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR KVLQESCEFF LKHNSKVKHK 
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQBTLTEIQ TSPETSMREV 
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: C385 Protein Sequence 
Protein Accession fl: NP_000573 

1 11 21 31 41 51 

I I I I I i 

MRIAVICFCL LGITCAIPVK QADSGSSEBK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 

DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDBS 240 

NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 300 

Seq ID NO: C386 Protein Sequence 
Protein Accession 8: NP_002812 

1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PRRLPLLSVL LLPLLGGTOT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLiLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARS ANAS FN 120 

IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180 

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 240 

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHl»A EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360 

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNTTVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720 

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH PPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETIiVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKLNHANWR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVP THGENPHGGQ ADDEVLADLQ 1020 

AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 1070 

. Seq ID NO: C387 Protein Sequence 
Protein Accession fh NP_002300.1 

1 11 21 31 41 51 

MKVLAAGWP LLLVLHWKHG AGSPLPITPV NATCAIRHPC HNNLMNQIRS QLAQLNGSAN 60 

ALFILYYTAQ GEPFPNNLDK LCGPNVTDPP PFHANGTEKA KLVELYRIW YIXSTSI/ailT 120 

RDQKILNPSA LSLHSKLNAT ADILRGLLSN VLCRLCSKYH VGHVDVTYGP DTSGKDVFQK 180 

KKLGCQLLGK YKQIIAVLAQ AF 202 

Seq ID NO: C388 Protein Sequence 
Protein Accession #: XPJ>97S08 

1 11 21 31 41 51 

MGRPRLTLVC HVSIIISARD LSMNNLTELQ PGLFHHLRFL EELRLSGNHL SHIPGQAFSG 60 

LYSLKILMLQ NNQLGGIPAE ALWELPSLQS LRLDANLISL VPERSFEGLS SLRHIiWLDDN 120 

ALTEIPVRAIi NNLPALQAMT LALNRISHIP DYAFQNLTSL WLHLHNNRI QHLGTHSFEG 180 

LHNLETLDLN YNKLQEFPVA IRTLGRLQBL GFHNNNIKAI PEKAFMGNPL LQTIHFYDNP 240 

1371 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
907 
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IQFVGRSAFQ YLPKLHTLSL NGAMDIQEPP DLKGTTSLEI LTLTRAGIRL LPSGMCQQLP 300 

RLRVLELSHN QIEELPSLHR CQKLEEIGLQ HNRIWEIGAD TFSQLSSLQA LDLSWNAIRS 360 

IHPEAFSTLH SLVKLDLTDN QLTTLPLAGL GGLMHLKLKG NLALSQAPSK DSFPKLRILE 420 

VPYAYQCCPY GMCASFFKAS GQHEAEDLHL DDEESSKRPL GLLARQAENH YDQDLDELQL 480 

EMEDSKPHPS VQCSPTPGPF KPCEYIiPESW GIRIAVMAIV LLSVLCNGLV LLTVFAGGPV 540 

PLPPVKFWG AIAGANTLTG ISCGLLASVD ALTFGQFSEY GARWETGLGC RATGFLAVLG 600 

SEASVLLLTL AAVQCSVSVS CVRAYGKSPS LGSVRAGVLG CLAIAGLAAA LPLASVGEYG 660 

ASPLCLPYAP PEGQPAALGF TVAtiVMMNSF CFLWAGAYI KLYCDLPRGD FEAVWDCAMV 720 

RHVAWLIFAD GLLYCPVAFL SFASMLGLFP VTPEAVKSVL LWLPLPACL MPLLYLLFNP 780 

HFRDDLRRLR PRAGDSGPLA YAAAGELEKS SCDSTQALVA FSDVDLILEA SEAGRPPGLE 840 

TYGFPSVTLI SCQQPGAPRL EGSHCVEPEG NHFGNPQPSM DGELLLRAEG STPAGGGLSG 900 

GGGFQPSGtiA FASHV 915 



Seq ID NO: C389 Protein Sequence 
Protein Accession #: np_570901 



l 11 21 31 41 51 

1111)1 

MASLVSLELG LLLAVLWTA TASPPAGLLS LLTSGQGALD QEALGGliLNT LADRVHCTNG 60 

PCGKCLSVED ALGLGBPEGS GLPPGPVLEA RYVARLSAAA VLYLSNPEGT CEDTRAGLWA 120 

SHADHLLALL ESPKALTPGL SWLLQRMQAR AAGQTPKTAC VDXPQLLEEA VGAGAPGSAG 180 

GVLAALLDHV RSGSCFHALP SPQYFVDFVF QQHSSEVPMT LAELSALKQR LGVGREAHSD 240 

HSHRHRGASS RDPVPLISSS NSSSWfDTVC LSARDVMAAY GLSEQAGVTP EAWAQLSPAL 300 

LQQQLSGACT SQSRPPVQDQ LSQSBRYLYG SLATLLICLC AVFGLLLLTC TGCRGVAHYI 360 

LQTFLSLAVG ALTGDAVLHL TPKVLGLHTH SEEGLSPQPT WRLLAMLAGL YAFFLFENLF 420 

NLLLPRDPED LEDGPCGHSS HSHGGHSHGV SLQLAPSELR QPKPPHEGSR ADLVAEESPE 480 

LLNPEPRRLS PELRLLPYMI TLGDAVHNFA DGLAVGAAFA SSWKTGLATS LAVFCHELPH 540 

ELGDFAALLH AGLSVRQALL LNLASALTAF AGLYVALAVG VSEESEAWIL AVATGLFLYV 600 

ALCDMLPAML KVRDPRPWLL FLLHNVGLLG GWTVLLLLSL YEDDITF 648 



Seq ID NO: C390 Protein Sequence 
Protein Accession #: NP_061844 



1 11 21 31 41 51 

I I I I I 1 

MANASEPGGS GGGEAAALGL KLATLSLLLC VSLAGNVLFA LLIVRERSLH RAPYYLLLDL 60 

CLADGLRALA CLPAVMLAAR RAAAAAGAPP GALGCKLLAF LAALFCFHAA FLLLGVGVTR 120 

YLAIAHHRFY AERLAGWPCA AMLVCAAWAL ALAAAFPFVL DGGGDDEDAP CALEQRPDGA 180 

PGALGFLLLL AVWGATHLV YLRLLFFIHD RRKMRPARLV PAVSHDWTFH GPGATGQAAA 240 

NWTAGFGRGP TPPALVGIRP AGPGRGARRL LVLEBFKTEK RLCKMFYAVT LLFLLLWGPY 300 

WASYLRVLV RPGAVPQAYL TASVWLTFAQ AGINPWCFL FNRELRDCFR AQFPCCQSPR 3 SO 

TTQATHPCDL KGIGL 375 



Seq ID NOr C391 Protein Sequence 
Protein Accession #: NP_005622 



1 11 21 31 41 51 

I I 1 1 I 1 

MAAARPARGP ELPLLGLLLL LLLGDPGRGA ASSGNATGPG PRSAGGSARR SAAVTGPPPP 60 

LSHOGRAAPC EPLRYNVCLG SVLPYGATST LLAGDSDSQE EAHGKLVLWS GLRNAPRCWA 120 

VIQPLLCAVY MPKCENDRVE LPSRTLCQAT RGPCAIVERE RGWPDFLRCT PDRFPEGCTN 180 

EVQNIKFNSS GQCEVPLVRT DNPKSWYEDV EGCGIQOQNP LFTEAEHQDM HSYIAAPGAV 240 

TGLCTLFTLA TFVADWRNSN RYPAVILFYV NACFPVGSIG WIAQFMDGAR REIVCRADGT 300 

MRLGEPTSNE TLSCVIIFVI VYYALMAGW WFWLTYAWH TSFKALGTTY QPLSGKTSYF 360 

HLLTWSLPFV LTVAILAVAQ VDGDSVSGIC FVGYKNYRYR AGFVLAPIGL VLIVGGYFLI 420 

RGVMTLFSIK SNHPGLLSEK AASKINETML RLGIFGFLAF GPVLITFSCH FYDFFNQAEW 480 

ERSFRDYVLC QANVTIGLPT KQPIPDCEIK NRPSLLVEKI NLFAMFGTGI AMSTWVWTKA 540 

TLLIWRRTWC RLTGQSDDEP KRIKKSKMIA KAFSKREELL QNPGQELSFS MHTVSHDGPV 600 

AGLAFDLNEP SADVSSAWAQ HVTKMVARRG AILPQDISVT PVATPVPPEE QANLWLVEAE 660 

ISPELQKRLG RKKKRRKRKK EVCPLAPPPE LHPPAPAPST IPRLPQLPRQ KCLVAAGAWG 720 

AGDSCRQGAW TLVSNPFCPE PSPPQDPFLP SAPAPVAWAH GRRQGLGPIH SRTNLMDTEL 780 

MDADSDF 788 



Seq ID NO: C392 Protein Sequence 
Protein Accession #: BAC04382 

1 11 21 31 41 51 

I 1 I I I I 

MGARSGARGA LLLMJiLCWD PRLSQAGRKR SGEVLPDSFP SAPAEPLPYF LQEPQDAYIV 60 

KNKPVELRCR AFPATQIYFK CNGEWVSQND HVTQEGLDEA TLGARGGLRV REVQIEVSRQ 120 

QVEELFGLED YWCQCVAWSS AGTTKSRRAY VRIAYLRKNF DQEPLGKEVP LDBEVLLQCR 180 

PPEGVPVAEV EWLKNEDVID PTQDTNFLLT IDHNLIIRQA RLSDTANYTC VAKNIVAKRR 240 

STTATVIVYV NGGWSSWAEW SPCSNRCGRG WQKRTRTCTN PAPLNGGAFC EGQAFQKTAC 300 

TTICFVDGAW TEWSKWSACS TECAHWRSRE CMAPPPQNGG RDCSGTLLDS KNCTDGLCMQ 360 

NKKTI*SDPNS HLLEASGDAA LYAGLWAIF VWAILMAVG VWYRRNCRD FDTDITDSSA 420 

ALTGGFHPVN FKTARPSNPQ LLHPSVPPDL TASAGIYRGP VYALQDSTDK IPMTNSPLLD 480 

PLPSLKVEVY SSSTTGSGPG LADGADLLGV LPPGTYPSDF ARDTHFiaLR SASLGSQQLL S40 

GLPRDPGSSV SGTFGCLGGR LSIPGTGVSL LVPNGAIPQG KFYEMYLLIN KAESTLPLSE 600 

GTQTVLSPSV TCGPTGLLLC RPVILTMPHC AEVSARDWIF QLKTQAKQGH WEEWTLDEE 660 

TLNTPCYCQL EPRACHILLD QLGTYVFTGE SYSRSAVKRL QLAVFAPALC TSLEYSLRVY 720 

CLEDTPVALK EVLELBRTLG GYLVEEPKPL MFKDSYHNLR LSLHDLPHAH WRSKLLAKYQ 780 

EIPFYHIWSG SQKALHCTFT LERHSLASTE LTCKICVRQV EGEGQIFQLH TTLAETPAGS 840 

LDTLCSAPGS TVTTQLGPYA FKIPLSIRQK ICNSLDAPNS RGNDWRMLAQ KLSMDRYLNY 900 

FATKASPTGV ILDLWEALQQ DDGDLNSIAS ALEEMGKSEM LVAVATDGDC 950 
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Seq ID NO: C393 Protein Sequence 
Protein Accession ft: NP_004616 

I 11 21 31 41 51 

1 I I 1 I 1 

MNRKARRCLG HLFLSLGMVY LRIGGPSSW ALGASIICNK IPGLAPRQRA ICQSRPDAII 60 

VIGEGSQMGL DEOQFQFRNG RWNCSALGER TVFGKELKVG SREAAFTYAI IAAGVAHAIT 120 

AACTQGNLSD CGCDKEKQGQ YHRDBGWKWG GCSADIRYGI GFAKVFVDAR EIKQNARTLM 180 

NLHNNEAGRK ILEENMKLEC KCHGVSGSCT TKTCWTTLPQ FRELGYVLKD KYNEAVHVEP 240 

VRASRNKRPT FLKIKKPLSY RKPMDTDLVY IEKSPNYCEB DPVTGSVGTQ GRACNKTAPQ 300 

ASGCDLMCCG RGYNTHQYAR VWQCMCKFHW CCYVKCNTCS ERTEMYTCK 349 

Seq ID NO: C394 Protein Sequence 
Protein Accession #: NP_003777 

1 11 21 31 41 51 

1 1 I I I I 

MDALCGSGEL GSKFWDSNLS VHTENPDLTP CPQNSLLAWV PCIYLWVALP CYLLYLRHHC 60 

RGYIILSHLS KLKMVLGVLL WCVSWADLFY SFHGliVHGRA PAPVFFVTPL WGVTMLLAT 120 

LLIQYERLQG VQSSGVLIIF WFLCWCAIV PFRSKILLAK AEGEISDPFR FTTFYIHFAL 180 

VLSALILACF REKPPFFSAK NVDPNPYPET SAGFLSRLFF WWFTKMAIYG YRHPLREKDL 240 

WSLKEEDRSQ MWQQLLEAW RKQEKQTARH KASAAPGKNA SGEDEVLLGA RPRPRKPSFL 300 

KALLATFGSS FLISACFKLI QDLLSFINPQ LLSILIRFIS NPMAPSWWGF LVAGLMFLCS 360 

MMQSLILQHY YHYIFVTGVK FRTGIMGVIY RKALVITNSV KRASTVGEIV NLMSVDAQRF 420 

MDLAPFLNLL WSAPLQIILA IYFLWQNLGP SVLAGVAFMV LLIPLNGAVA VKMRAFQVKQ 480 

MKLKDSRIKL MSEILNGIKV LKLYAWEPSF LKQVEGIRQG ELQLLRTAAY LHTTTTFTWM 540 

CSPFLVTLIT LWVYVYVDPN NVLDAEKAFV SVSLFNILRL PI2JMLPQLIS NLTQASVSLK 600 

RIQQFLSQEE LDPQSVERKT ISPGYAITIH SGTFTWAQDL PPTLHSLDIQ VPKGALVAW 660 

GPVGCGKSSL VSALLGEMEK LEGKVHMKGS VAYVPQQAWI QNCTLQENVL FGKALNPKRY 720 

OQTLEACALL ADLEMLPGGD QTEIGEKGIN LSGGQRQRVS LARAVYSDAD IFLLDDPLSA 780 

VDSHVAKHIF DHVIGPEGVL AGKTRVLVTH GISFLPQTDF IIVLADGQVS EMGPYPALLQ 840 

RNGSFANFLC NYAPDEDQGH LEDSWTALEG AEDKEALLIE DTLSNHTDLT DNDPVTYWQ 900 

KQFMRQLSAL SSDGEGQGRP VPRRHLGPSE KVOVTEAKAD GALTQEEKAA IGTVELSVFW 960 

DYAKAVGLCT TLAICLLYVG QSAAAIGANV WLSAWTNDAM ADSRQNNTSL RLGVYAALGI 1020 

LQGFLVMLAA MAMAAGGIQA ARVLHQALLH NKIRSPQSFF DTTPSGRILN CFSKDIYWD 1080 

EVLAPVILML LNSFFNAIST LWIMASTPL FTWILPLAV LYTLVQRFYA ATSRQLKRLE 1140 

SVSRSPIYSH FSETVTGASV IRAYNRSRDF EIISDTKVDA NQRSCYPYII SNRWLSIGVE 1200 

FVGNCWLFA ALFAVIGRSS LNPGLVGLSV SYSLQVTFAL NWMIRMMSDL ESNIVAVERV 1260 

KEYSKTETEA PWWEGSRPP EGWPPRGEVB FRNYSVRYRP GLDLVLRDLS LHVHGGEKVG 1320 

IVGRTGAGKS SMTLCLFRIL EAAKGEIRID GLNVADIGLH DLRSQLTIIP QDPILFSGTL 1380 

RMNLDPFGSY SEED I WW ALE LSHLHTFVSS QPAGLDFQCS EGGENLSVGQ RQLVCLARAL 1440 

LRKSRILVLD EATAAIDLET DNLIQATIRT QFDTCTVLTI AHRLNTIMDY TRVLVLDKGV 1500 

VAEFDSPANL IAARGIFYGM ARDAGLA 1527 

Seq ID NO: C395 Protein Sequence 
Protein Accession #: NP_004617 

1 11 21 31 41 51 

1 I I I 1 I 

MRARPQVCEA LLFALALOTG VCYGIKWLAL SKTPSALALN QTQHCKQLEG LVSAQVQIiCR 60 

SNLELMHTW HAAREVMKAC RRAFADKRWN CSSIELAPNY LLDLERGTRE SAFVYALSAA 120 

AISHAIARAC TSGDLPGCSC GPVPGEPPGP GNRWGGCADN LSYGLLMGAK FSDAPMKVKK 180 

TGSQANKLMR LHNSEVGRQA LRASLEMKCK CHGVSGSCSI RTCWKGLQBL QDVAADLKTR 240 

YLSATKWHR PMGTRKHLVP KDLDIRPVKD SELVYLQSSP DFCMKNEKVG SHGTQDRQCN 300 

KTSNGSDSCD LMCCGRGYNP YTDRWERCH CKYHWCCYVT CRRCERTVER YVCK 354 

Seq ID NO: C396 Protein Sequence 
Protein Accession #: NP_114072 

1 11 21 31 41 51 

I I I I I I 

MEWGYLLEVT SLLAALALLQ RSSGAAAASA KELACQEITV PLCKGIGYNY TYMPNQFNHD 
TQDEAGLEVH QFWPLVEIQC SPDLKFFLCS MYTPICLEDY KKPLPPCRSV CERAKAGCAP 
LMRQYGFAWP DRMRCDRLPE OGNPDTLCMD YNRTDLTTAA PSPPRRLPPP PPGEQPPSGS 
. GHGRPPGARP PHRGGGRGGG GGDAAAPPAR GGGGGGKARP PGGGAAPCEP GCQCRAPMVS 
VSSERHPLYN RVKTGQIANC ALPCHNPFFS QDERAFTVFW IGLWSVLCFV STFATVSTFI* 
IDMERFKYPE RPIIFLSACY LFVSVGYLVR LVAGHEKVAC SGGAPGAGGA GGAGGAAAGA 
GAAGAGAGGP GGRGEYEBLG AVEQHVRYET TGPALCTWF LLVYFFGMAS SIWWILSLT 
WFLAAGMKHG NEAIAGYSQY FHLAAWLVPS VKSIAVLALS SVDGDPVAGI CYVGNQSLDK 
LRGFVLAPLV IYLFIGTMFL LAGFVSLFRI RSVIKQQDGP TKTHKLEKLM IRLGLFTVLY 
TVPAAVWAC IiFYEQHNRPR WEATHNCPCL RDLQPDQARR PDYAVFMLKY FMCLWGITS 
GVWVWSGKTL ESWRSLCTRC CWASKGAAVG GGAGATAAGG GGGPGGGGGG GPGGGGGPGG 
GGGSLYSDVS TGLTWRSGTA SSVSYPKQMP LSQV 

Seq ID NO: C397 Protein Sequence 
Protein Accession 8: XPJ>50625 

1 11 21 31 

(ill 
MLQGPGSLLL LFLASHCCLG SARGLFLFGQ PDFSYKRSNC 
PNLLGHETMK EVLEQAGAWI PLVMKQCHPD TKKFLCSLFA 
VKDRCAPVMS AFGFPWPDML ECDRFPQDND LCIPLASSDH 
DDNDIMETLC KNDFALKIKV KEITYINRDT KIILETKSKT 



41 51 
1 I 

KPIPANLQLC HGIEYQNMRL 60 

PVCLDDLDET IQPCHSLCVQ 120 

LLPATEBAPK VCEACXNKND 180 

IYKLNGVSER DLKKSVLWLK 240 
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DSLQCTCEEM NDINAPYLVM GQKQGGBLVI TSVKRWQKGQ REFKRISRSI RKLQC 295 

Seq id NO: C398 Protein Sequence 
Protein Accession ft: NPJJ01297.1 

1 11 21 31 41 51 

I I I I 1 I 

MSMGLEITGT ALAVLGWLGT IVCCALPMWR VSAFIGSNII TSQNIWEGLW MNCWQSTGQ 60 

MQCKVYDSLL ALPQDLQAAR ALIWAILLA AFGLLVALVG AQCTNCVQDD TAKAKITIVA 120 

GVLPLLAALL TLVPVSWSAN TIIRDFYNPV VPEAQKREMG AGLYVGWAAA ALQLLGGALL 1B0 
CCSCPPREKK YTATKWYSA PRSTGPGASL GTGYDRKDYV 220 

Seq ID NO: C399 Protein Sequence 
Protein Accession ft: NP_036581.1 

1 11 21 31 41 51 

I I I I I I 

MESRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADBPDCPSE 60 

LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 120 

VSITLIALVY LPGVIAAIVQ LHNGTKYKKP PHWLDKWMLT RKQFGLLSPF PAVLHAIYSL 180 

SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL ALLAVTSIPS 240 

VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQPVWYTPPT FMIAVFLPIV 300 

VLIPKSILFL PCLRKKIUCI RHGWEDVTKI NKTEICSQL 339 



25 Seq ID NO: C400 Protein Sequence 
Protein Accession ft: NP 001766.1 



1 11 21 31 41 51 

I I I I I I 

MANCEFSPVS GDKPCCRLSR RAQLCLGVSI LVL I LVWLA VWPRWRQTW SGPGTTKRFP 60 

ETVLARCVKY TEIHPEMRHV DCQSVWDAFK GAPISKHPCN ITEEDYQPLM KLGTQTVPCN 120 

KILLWSRIKD LAHQPTQVQR DMFTLEDTLL GYLADDLTWC GEFNTSKINY QSCPDWRKDC 180 

SNNPVSVFWK TVSRRFAEAA CDWHVMLNG SRSKIFDKNS TFGSVEVHNL QPEKVQTLEA 240 

WVIHGGREDS RDLCQDPTIK ELESIISKRN IQFSCKNIYR PDKFLQCVKN PEDSSCTSEI 300 

Seq ID NO* C401 Protein Sequence 
Protein Accession ft: XP 120513.2 



Ari 1 11 21 31 41 51' 

40 i | | III 

MVSCTFSGPL RETNENVKKF YALRAFMFRM SSEAAMLGES RTPKPRKHRA TTRAKIFKRF 60 

FSEGSESNSR LVEELAVIHT YSDDPAPTTS PSSVQPREFG VMQGAPRARF GSRTPPAAAE 120 

ASSPHLGIGE AAOQSGARAA APRAGARRCQ FQRQAAAAAA TAQTHTLPHA RTRADPAGRR 180 

RRHPRSPAPG GEGTCSEGPA PRRRMEEEMQ PAEEGPSVPK IYKQRSPYSV LKTPPSKRPA 240 

LAKRYERPTL VELPHGHLRT PAQPPPASPA ASSSSSFAAV VRLGAPPRPP RRGPRARGTI 300 

PPLLPAPGVA GTLLPPPTSS SPPSPRPRPW BAAAPRGGTS HTHMWRSQST LPGSDTMVSV 360 

FGLMAQRRWQ HRSLKQFEWG ILGSWGTWPC GQDWLEKEGQ VAVLLPRSEG NTAPKKSRMI 420 

LDAFAQQCSR VLSLLNCGGK LLDSNHSQSM ISCVKQEGSS YNERQEHCHI GKGVHSQTSD 480 

NVDIEMQYMQ RKQQTSAFLR VFTDSLQNYL LSGSFPTPNP SSASEYGHLA DVDPLSTSPV 540 

HTLENISLDS TASLCKSRHL SREPPVKSDF PNPLQQALAG GASRPFSGAQ QSIAYRVNSE 600 

LEDGIRSPVP LSCEALEMDL TSIX3SKQLLN NYPVYITSKQ WDEAVNSSKK DGRRLLRYLI 660 

RFVFTTDELK YSCGLGKRKR SVQSGETGPE RRPLDPVKVT CLRGTASFRS VSPSVISFHR 720 

IGCGSPRTSV QPSVF 735 



Seq ID NO: C402 Protein Sequence 
Protein Accession ft: BAA92562.1 

1 11 21 31 41 51 

I I I I I I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 

EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 120 

FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 

FSALANTVTL AGGPSPTSKG LKYFBHFTLS LCGNQGRKMS VCTDNV7DLR IPEGESGFSK 300 

SITAYVCQAV IIPPEVTGYX AGVSSQFVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLWES 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 

ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 540 

LIFTSKKSLP GKIKSFTSKQ PAPVTISLSE DS 572 

Seq ID NO: C403 Protein Sequence 
Protein Accession ft: NP 0SS139.1 



1 11 21 31 41 51 

75 | | | | | | 

MALQGISWE LSGLAPGRXC AMVLADFGAR WRVDRPGSR YDVSRLGRGK RSLVLDLKQP 60 

REPRAAASVQ AVGCAAGALP PRCHGETPAG PRDSAAGKSK AYIjOQAEHIH PVQSSFCRLA 120 

GHDINYLALS GVLSKIGRSG ENPYAPLNLV ADFAGGGLMC ALGIIMALFD RTRTDKGQVI 180 

DAKMVEGTAY LSSPLWKTQK SSLWEAPRGQ NMLDGGAPPY TTYRTADGEF MAVGAIBPQF 240 

YELLIKGLGL KSDELPNQMS TDDWPEMKKK PADVFAKKTK AEWCQIPDGT DACVTPVLTP 300 

EEWHHDHNK ERGSFITSEE QDVSPRLAPL LUTTPAIPSS KGDPFIGEHT EEILEEFGFS 360 

REEIYQUISD KIIESNKVKA SL 382 

Seq ID NO: C404 Protein Sequence 
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Protein Accession #; XP_091332.l 

1 11 21 31 41 51 

I I I I I 1 

MQRWTLWAAA FLTLHSAQAF PQTDISISPA LPELPLPSLC PLFWMEFKGH CYRFFPLNKT 60 

WAEADLYCSE FSVGRKSAKL ASIH5WBENV FVYDLVNSCV PGIPADVWTG LHDHRQEGQF 120 

EWTDGSSYDY SYWDGSQPDD GVHADPEEED CVQIWYRPTS EQLQAPEPQL PLSISEATDV 180 

YLPEDFPAEP KLMDQSWVSR KSLKPSKSHL MBPPTPVAKH QKAKTRHRSL RGVWWPSGKA 240 

GSWKERMNAD YGRRKRSAPR QEGRLRCRER RLRAASGQGR PEGQRKQRQQ ERQERGWEEL 300 

GGVSPMRGAQ AWQHGLGAGS QRGAAPECGE NHQAPBLGST WRGQRXjQPQT AALCHFALRK 360 

LPGNAHGLAA AFVQPALQVQ EEKNNRTRFS GAYFTMSDPT CDQDSKEQSL RRHGREAEKD 420 

GPYRLVKKKR GPVACPSSFE LQSGGEVCLD FPVELRAGTW IAREPP 466 



Seq ID NO; C40S Protein Sequence 
15 Protein Accession 9: XP 054869.2 
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1 11 21 31 1 41 51 

I I I I I I 

MHTCCPPVTL EQDLHRKMHS WMLQTLAFAV TSLVLSCAET IDYYGEICDN ACPCEEKDGI 60 

LTVSCENRGI ISLSEISPPR FPIYHLLLSG NLLNRLYPNE FVNYTGAS1L HLGSNVIQDI 120 

ETGAFKGLRG LRRLHLNNNK LELLRDDTFL GLENLEYLQV DYNYISVIEP NAFGKLHLLQ 180 

VLILNDNLLS SLPNNLFRFV PLTHLDLRGN RLKLLPYVGL LQHMDKWEL QLEENPWNCS 240 

CELISLKDWL DSISYSALVG DWCETPFRL HGRDLDBVSK QELCPRRLIS DYEMRPQTPL 300 

STTGYLHTTP ASVNSVATSS SAVYKPPLKP PKGTRQPNKP RVRPTSRQPS KDLGYSNYGP 360 

SIAYQTKSPV PLECPTACSC NLQISDLGLN VNCQERKIES IAELQPKPYN PKKMYLTENY 420 

IAWRRTDPL EATGLDLLHL GMNRISMIQD RAFGDLTNLR RLYLNGNRIE RLSPELFYGL 480 

QSLQYLFLQY NLIREIQSGT FDPVPNLQLL FLNNNLLQAM PSGVFSGLTL LRLNLRSNHF 540 

TSLPVSGVLD QLKSLIQIDL HDNPWDCTCD IVGMKLWVEQ LKVGVLVDEV ICKAPKKFAE 600 

TDMRSIKSEL LCPDYSDWV STPTPSSIQV PARTSAVTPA VRLMSTGAPA SLGAGGGASS 660 

VPLSVLILSL LLVPIMSVFV AAGLFVLVMK RRKKNQSDHT STNNSDVSSF NMQYSVYGGG 720 

GGTGGHPHAH VHHRGPALPK VKTPAGHVYE YIPHPLGHMC KNPIYRSREG NSVEDYKDLH 780 

ELKVTYSSNH HLQQQQQPPP PPQQPQQQPP PQLQLQPGEE ERRESHHLRS PAYSVSTIEP 840 

REDLLSPVQD ADRFYRGILE PDKHCSTTPA GNSLPEYPKF PCSPAAYTFS PNYDLRRPHQ 900 

YLHPGAGDSR LREPVLYSPP SAVFVEPNRN EYLELKAKLN VEPDYLEVLE KQTTFSQF 958 

Seq ID NO: C406 Protein Sequence 
Protein Accession NP 000784.2 



Ark 1 11 21 31 41 51 

40 | | | | | | 

MGILSVDLLI TLQILPVFFS NCLFLALYDS VILLKHWLL LSRSKSTRGE WRRMLTSEGL 60 

RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 120 

ERPLWNFGS ATUPPFTSQL PAFRKLVEEF SSVADPLLVY IDBAHPSDGW AIPGDSSLSF 180 

EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CIVQRQKIAY 240 

LGGKGPFSYN LQEVRHWLEK NFSKRUKKTR LAG 273 



Seq ID NO; C407 Protein Sequence 
Protein Accession ft: NP_006540.2 

1 11 21 31 41 51 

I I I I 1 I 

MSSCVSSQPS SNRAAPQDBL GGRGSSSSES QKPCEALRGL SSLSIHLGME SFIWTECEP 60 

GCAVDLGLAR DRPItEADGQE VPLDSSGSQA RPHLSGRKLS LQERSQGGLA AGGSLDMNGR 120 

CICPSLPYSP VSSPQSSPRL PRRPTVESHH VSITGMQDCV QLNQYTLKDE IGKGSYGWK 180 

LAYNENDNTY YAMKVLSKKK LIRQAGFPRR PPPRGTRPAP GGCIQPRGPI EQVYQEIAIL 240 

KKLDHPNWK LVEVLDDPNE DHLYMVFELV NQGPVMEVPT LKPLSEDQAR FYFQDLIKGI 300 

EYLKYQKIIH RDIKPSNLLV GEDGHIKIAD FGVSNEFKGS DALLSNTVGT PAFMAPESLS 360 

ETRKIFSGKA LDVWAKGVTL YCFVFGQCPP MDERIMCLHS KIKSQALEFP DQPDIAEDLK 420 

DLITRMLDKN PESRIWPEI KLHPWVTRHG AEPLPSEDEN CTLVEVTEEE VENSVKHIPS 480 

LATVILVKTM IRKRSFGNPF EGSRREERSL SAPGNliLTKK PTRECESItSE LKEARQRRQP 540 

PGHRPAPRGG GGSAL.VRGSP CVESCWAPAP GSPARMHPLR PEEAMEPE 588 

Seq ID NO: C408 Protein Sequence 
Protein Accession #: NPJ)61116.2 



1 11 21 31 41 51 

I I I I I I 

MGLSLPKBKG LILCLWSKFC RWFQRRESWA QSRDBQNLLQ QKRIWESPLL LAAKDNDVQA 60 

LNKLLKYEDC KVHQRGAMGE TALHIAALYD NLEAAMVLME AAPBLVFEPM TSELYEGQTA 120 

LHIAWNQNM NLVRALLARR ASVSARATGT AFRRSPCNLI YFGEHPLSFA ACVNSEEIVR 180 

LLIEHGADIR AQDSLGNTVL HILILQPNKT FAOQMYNLLL SYDRHGDHLQ PLDLVPNHQG 240 

LTPFKLAGVE GNTVMFQHLM QKRKHTQWTY GPLTSTLYDL TEIDSSGDEQ SLLELIITTK 300 

KREARQILDQ TPVKELVSLK WKRYGRPYFC MLGAIYLLYI ICFTHCCIYR PLKPRTNNRT 360 

SPRDNTLLQQ KLLQEAYMTP KDDIRLVGEL VTVIGAIIIL LVEVPDIFRM GVTRFFGQTI 420 

75 LGGPFHVLII TYAFMVLVTM VMRLISASGE WPMSFALVL GWCNVMYFAR GFQMLGPFTI 480 

MIQKMIFGDL MRFCWLMAW ILGFASAFYI IFQTEDPEEL GHFYDYPMAL PSTFELFLTI 540 

IDGPANYNVD LPFHYSITYA AFAIIATLLM LNLLIAMMGD THWRVAHERD ELWRAQIVAT 600 

TVMLERKLPR CLWPRSGICG REYGLGDRWF LRVEDRQDLN RQRIQRYAQA FHTRGSEDLD 660 

KDSVEKLELG CPFSPHLSLP MPSVSRSTSR SSANWERLRQ GTLRRDLRGI INRGLEDGES 720 

OU WEYQI 725 

Seq ID NO; C409 Protein Sequence 
Protein Accession 8: NP_068710.l 
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1 11 21 31 41 51 

I I I I 1 I 

MQKVTLGLLV FLAGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSEW 60 

RSSGEQAGRG WGSPPLTTQL SPTGAKCKCK FGQKSGHHPG BTPPLITPGS AOS 113 

Seq ID NO: C410 Protein Sequence 
Protein Accession #: NPJ>05962.1 

1 11 21 31 41 51 

I I 1 I I I 

MQKVTLGLLV FLAGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSAK 60 

CKCKFGQKSG HHPGETPPLI TPGSAQS 87 

Seq ID NO: C411 Protein Sequence 
Protein Accession ft: NP_004952.1 

1 11 21 31 41 51 

I I I I I I 

MLSKVLPVLL GILLILQSRV BGPQTBSKNE ASSRDWYGP QPQPLENQLL SEETKSTBTB 60 

TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 120 

IFSQTWYDER LCYNDTFESL VLNGNWSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD 180 

GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSPSSPSYPE NEMIYKWENF KLEINEKNSW 240 

KLFQFDFTGV SNKTEHTTP VGDFWVMTIP FNVSRRFGYV AFQNYVPSSV TTMLSWVSFW 300 

IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEPAV 360 

LNFLIYNQTK AHASPKLRHP RINSRAKART RARSRACARQ HQEAPVOQIV TTEGSDGEER 420 

PSCSAQQPPS PGSPEGPRSL CSKLACCEWC KRFKKYFCMV PDCEGSTWQQ GRLCIHVYRL 480 

DNYSRWPPV TFFFFNVLYW LVCLNL 506 

Seq ID NO: C412 Protein Sequence 
Protein Accession ft: NP_068B19.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIPS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRPP MDSHSCPLSP SSFSYPENEM IYKWENFKLE 120 

INBKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSPWIKT ESAPARTSLG ITSVLTMTTL GTPSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 

CIHVYRLDNY SRWPPVTPF FFNVLYWLVC LNL 393 

Seq ID NO: C413 Protein Sequence 
Protein Accession #: NP_068822.1 

1 11 21 31 41 51 

I I I I 1 I 

MEYTIDIIPS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 

SHSCPLSFSS FSYPENEMIY KWENPKLEIN EKNSWKLFQF DFTGVSNKTE I ITTPVGDFM 120 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 240 

AHARTRARSR ACARQHQBAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 300 

CCEWCKRFKK YPCMVPDCEG STWQQGRLCI HVYRLDNYSR WFPVTPFFF NVLYWLVCLN 360 

L 361 

Seq ID NO: C414 Protein Sequence 
Protein Accession #: NP_068830.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRPP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INBKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTPSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTNQQGRL 360 

CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 393 

Seq ID NO: C415 Protein Sequence 
Protein Accession 8: NP_068S91.1 

1 11 21 31 41 51 

i I I I I I 

MPAVSGPGPL PCLLLLLLDP HSPBTGCPPL RRFBYKLSFK GPRLALPGAG IPFWSHHGDA 60 

ILGLEEVRLT PSMRNRSGAV WSRASVPFSA WEVEVQMRVT GLGRRGAHGM AVWYTRGRGH 120 

VGSVLGGLAS WDGIGIFFDS PAEDTQDSPA IRVLASDGHI PSEQPGDGAS QGLGSCHWDF 180 

RNRPHPFRAR ITYWGQRLRM SLNSGLTPSD PGEPCVDVGP LLLVPGGFFG VSAATGTLAD 240 

DHDVLSFLTF SLSEPSPBVP PQPPLEMQQL RLARQLEGLW ARLGLGTRED VTPKSDSEAQ 300 

GEGERLFDLE ETLGRHRRIL QALRGLSKQL AQAERQWKKQ LGPPGQARPD GGWALDASCQ 360 

IPSTPGRGGH LSMSLNKDSA KVGALLHGQW TLLQALQEMR DAAVRMAAEA QVSYLPVGIE 420 

HHFLELDHIL GLLQEELRGP AKAAAKAPRP PGQPPRASSC LQPGIFLPYL LIQTVGFFGY 480 

VHFRQELNKS LQECLSTGSL PLGPAPHTPR ALGILRRQPL PASMPA 526 

Seq ID NO: C416 Protein Sequence 
Protein Accession ft: XP_117036.1 
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10 



15 



25 
30 
35 
40 
45 
50 
55 
60 



70 
75 
80 



1 11 21 31 41 SI 

I I I I I I 

MERRTRGALG SRRPPPPLPA LRHLCTGLQA AGMAWPGTLW RHTCQGRAXA AEGPWGLFRP 60 

HRCPREAGQA PVGPSPETQG VAHVCSRARV SVDBRBPGGG AYAMHVTPRW KGCHRHSGRT 120 

VRGSVSWKRP EQAAPETGRG PAVARGSGDG NECGWG 156 

Seq ID NO: C417 Protein Sequence 
Protein Accession #: XP_167803.2 

1 11 21 31 41 51 

I I I I I I 

MPGKGQRKTA TNKPGGLPGA PGVGIGGHCL YVLECKCFIK NKTKTHHHKK KNFAAKRNEE 60 

KLKKKKKQEK KMHTKPFHHT YPLSQQDFLF AKSYFCGNGP CFLWQGLF 108 

Seq ID NO: C418 Protein Sequence 
Protein Accession #: NP 079056.1 



„ 1 11 21 31 41 51 

20 | | | | | | 

MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCYYWLNT VALSGEECWE TLIGQDIYRL 60 

LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL VWIGIFFCPL 120 

LPFIQMIMLF IMPYSKNISL MMNFQPPSKA WRASQMMTFF IFLLFFPSPT GVLCTLAITI 180 

WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH FFFILTLIVL 240 

IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP SSLVLERREV 300 

EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 330 



Seq ID NO: C419 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MLSDDHVNEI IIQVENVSSG VQSHPSSNQI FQEKVLLDSS INMVLSISDI DVIDSQTVSK 60 

RNDQKGNQVL RFSTSLNESM SQTLHSLECM GIDTPGSSHE TVQGQKLIAS LIPMTSRDRI 120 

KAIRNQPRTM EEKRNLRKIV DKEKSKQTHR ILQLNCCIQC LNSISRAYRR SKNSLSEILN 180 

SISLWQKTLX IIGGKFGTSV LSYFNFLRWL LKFNIPSFIL NFSFIIIPQF TVAKKNTLQF 240 

TGLEFFTGVG YFRDTVMYYG FYTNSTIQHG NSGASYNMQL AYIFTIGACL TTCFFSLLFS 300 

MAKYFRNNFI NPHIYSGGIT KLIFCWDFTV THEKAVKLKQ KNLSTEIREN LSELRQENSK 360 

LTFNQLLTRF SAYMVAWWS TGVAIACCAA VYYLAEYNLE FLKTHSNPGA VLLLPFWSC 420 

INLAVPCIYS MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCYYWLNT VALSGEECWE 480 

TLIGQDIYRL LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL 540 

VWIGIFFCPL LPFIQMIMLF IMPYSKNISL MMNFQPPSKA WRASQMMTFP IFLLFFPSFT 600 

GVLCTLAITI WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH 660 

FFFILTLIVL IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP 720 

SSLVLERREV EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 760 

Seq ID NO: C420 Protein Sequence 
Protein Accession #: NP__002241.1 

1 11 21 31 41 51 

I I I I ! I 

MGGDLVLGLG ALRRRKRLLE QEKSLAGWAL VLAGTGIGLM VLHAEMLWFG GCSWALYLFL 60 

VKCTISISTF LLLCLIVAFH AKEVQLFMTD NGLRDWRVAL TGRQAAQIVL ELWCGLHPA 120 

PVRGPPCVQD LGAPLTSPQP WPGFLGQGBA LLSLAMLLRL YLVPRAVLLR SGVLLNASYR 180 

SIGALNQVRF RHWFVAKLYM NTHPGRLLLG LTLGLWLTTA WVLSVAERQA VNATGHLSDT 240 

LWLIPITFLT IGYGDWPGT MWGKIVCLCT GVMGVCCTAL LVAWARKLE FNKAEKHVHN 300 

FMMDIQYTKE MKESAARVLQ EAWMFYKHTR RKESHAARRH QRKLLAAINA FRQVRLKHRK 360 

LREQVNSMVD ISKMHMILYD LQQNLSSSHR ALEKQIDTLA GKLDALTELL STALGPRQLP 420 

427 



Seq ID NO: C421 Protein Sequence 
Protein Accession *h NP_079533.l 



~- 1 U 21 31 41 51 

65 | | | | | | 

. MGGKQRDEDD EAYGKPVKYD PSFRGPIKNR SCTDVICCVL FLLFILGYIV VGIVAWLYGD 60 

PRQVLYPRNS TGAYCGMGEN KDKPYLLYFN IFSCILSSNI ISVAENGLQC PTPQVCVSSC 120 

PEDPWTVGKN EFSQTVGEVF YTKNRNFCLP GVPWNMTVIT SLQQELCPSF LLPSAPALGR 180 

CFPWTNITPP ALPGITNDTT IQQGISGLID SLNARDISVK IFEDFAQSWY WILVALGVAL 240 

VLSLLFILLL RLVAGPLVLV LILGVLGVLA YGIYYCWEEY RVLRDKGASI SQLGFTTNLS 300 

AYQSVQETWL AALXVLAVLB AILLLVLIFL RQRIRIAIAL LKEASKAVGQ MMSTMFYPLV 360 

TFVLLLICIA YWAMTALYPL PTQPATLGYV LWASNISSPG CEKVPINTSC NPTAHLVNSS 420 

CPGLMCVFQG YSSKGLIQRS VFNLQIYGVL GLFWTLNWVL ALGQCVLAGA FASFYWAFHK 480 

PQDIPTFPLI SAFIRTLRYH TGSLAFGALI LTLVQIARVI LEYIDHKLRG VQNPVARCIM 540 

CCFKCCLWCL EKFIKFLNRN AYIMIAIYGK NFCVSAKNAF MLLMRNIVRV WLDKVTDLL 600 

LFFGKLLWG GVGVLSFFFF SGRIPGLGKD FKSPHLNYYW LPIMTSILGA YVIASGFFSV 660 

FGMCVDTLFL CFLEDLERNN GSLDRPYYMS KSLLKILGKK NEAPPDNKKR KK 712 



Seq ID NO: C422 Protein Sequence 
Protein Accession #: NP_057264.1 

1 11 21 31 41 51 

I 1 I I I I 

KGSNSGQAGR HIYKSLADDG PFDSVEPPKR PTSRLIMHSM AMFGREFCYA VEAAYVTPVL 60 
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LSVGLPSSLY SIVWFLSPII* GFLLQPWGS ASDHCRSRWO RRRPYILTLG VMMLVGMALY 120 

LNGATWAAL IANPRRKLVW AISVTMIGW LFDFAADFID GPIKAYLFDV CSHQDKBKGL 180 

HYHALFTGFG GALGYLLGAI DMAHLBLGRL LGTEPQVMPP FSALVLTLCF TVHLCSISEA 240 

PLTEVAKGIP PQQTPQOPPL SSDGMYEYGS IEKVKNGYVN PELAMQGAKN KNHAEQTRRA 300 

MTLKSLLRAL VNMPPHYRYL CISHLIGWTA FLSNMLFFTD FMGQIVYRGD PYSAHNSTEF 360 

LIYERGVEVG CWGFCINSVF SSLYSYFQKV LVSYIGLKGL YFTGYLLFGL GTGPIGLPPN 420 

VYSTLVLCSL FGVMSSTLYT VPFNL1TEYH REEEKBRQQA PGGDPDNSVR GKGMDCATLT 480 
CMVQLAQILV GGGLGFLVNT AGTWWVIT ASAVALIGCC FVALFVRYVD 530 



Seq ID NO: C423 Protein Sequence 
Protein Accession #: NP_003264.1 



1 11 21 31 41 51 

I I i I I I 

MEGFGGVGGR GTRGFAAKGV WRGRAEEGPV LGAAERGFMV STGSRRRVFE GPGGGGLRWT 60 

PGKGTGRQRG AWGPRABDGV RRRTLGMPRG SRRDVRAPCG PAGSWGARGG RRRDGPSRRR 120 

RGSATAAARH HVPPAPGGPF GPRAPAGSTR VPARAGGAVE PTGAAAVARL ARPAGGALPT 180 

AGAQGAGPAR GRSGEGSEWA RRGKGRPGPY QSPIX3PAVAB GQELKDKSRL RYPINGFQAL 240 

VLTALLVGLG MSAGLPLGAL PEMLLPLAFV ATLTAFIFSL FLYMKAQVAP VSALAPGGNS 300 

GNPIYDFFLG RBLNPRICFF DFKYFCELRP GLIGWVLINIi ALLMKEAELR GSPSLAMWLV 360 

NGFQLLYVGD ALWHEEAVLT TMDITHDGFG FMLAFGDMAW VPFTYSLQAQ FLLHHPQPLG 420 

LPMASVICLI NATGYYIFRG ANSQKNTFRK NPSDPRVAGL ETISTATGRK LLVSGWWGMV 480 

RHPNYLGDLI MALAWSLPCG VSHLLPYFYL LYFTALLVHR EAROERSACR STAWPGRSTA 540 

GVCLTASCPT STEAAPPPQV GHVPTHPPAH PGPGASTHLG LKGLHPTQP 589 



Seq ID NO: C424 Protein Sequence 
Protein Accession #: NP_056535.l 



1 11 21 31 41 51 

I I I ! I I 

MGRLLRAARIi PPLLSPLLLL LVGGAPLGAC VAGSDEPGPE GLTSTSLLDL LLPTGLEPLD 60 

SEEPSETMGL GAGLGAPGSG FPSEENEESR ILQPPQYFWE EEEELNDSSL DLGPTADYVF 120 

PDLTEKAGSI EDTSQAQELP NLPSPliPKMN LVEPPWHMPP REEEEEEEEE EEREKEEVEK 180 

QEEEEEEELL PVNGSQEEAK PQVRDFSLTS SSQTPGATKS RHEDSGDQAS SGVEVESSMG 240 

PSLLLPSVTP TTVTPGDQDS TSQEAEATVL PAAGLGVEFE APQEASEEAT AGAAGLSGQH 300 

EEVPALPSFP QTTAPSGAEH PDEDPLGSRT SASSPLAPGD MELTPSSATL GQEDLNQQLL 360 

EGQAAEAQSR IPWDSTQVIC KDWSNLAGKN YIILNMTENI DCEVFRQHRG PQLLALVEEV 420 

IiPRHGSGHHG AWHISLSKPS EKEQHLLMTL VGEQGWPTQ DVLSMLGDIR RSLEEIGIQN 480 

YSTTSSCQAR ASQVRSDYGT LFWLWIGA ICIIIIALGL LYNCWQRRLP KLKHVSHGEE 540 

LRFVENGCHD NPTLDVASDS QSEMQEKHPS LNGGGALNGP GSWGALMGGK RDPEDSDVFE 600 

EDTHL 605 



Seq ID NO: C425 Protein Sequence 
Protein Accession #: NP_001188.1 



1 11 21 31 41 51 

I I I 1 I I 

MSEVRPLSRD ILMETLLYEQ LLEPPTMEVL GMTDSEEDLD PMEDFDSL5C MEGSDALALR 60 
LACIGDEMDV SLRAPRLAQL SEVAMHSLGL AFIYDQTEDI RDVLRSFMDG FTTLKENIMR 120 
FWRSPNPGSW VSCEQVLLAL LLLLALLLPL LSGGLHLLLK 160 



Seq ID NO: C426 Protein Sequence 
Protein Accession #: AAP76225.1 



1 11 21 31 41 51 

I I I I I I 

MATPLPPPSP RRLRLLRLLL SGLVLGAALR GAAAGHPDVA ACPGSLDCAL KRRARCPPGA 60 

HACGPCLQPF QEDQQGLCVP RMRRPPGGGR PQPRLEDEID FLAQELARKE SGQSTPPLPK 120 

DRQRLPEPAT LGFSARGQGIi ELGItPSTPGT PTPTPHTSLG SPVSSDPVHM SPLEPRGGQG 180 

DGLAIiVLIIiA FCVAGAAALS VASLCWCRLQ REIRLTQKAD YATAKAPGSP AAPRISPGDQ 240 

RLAQSAEMYH YQHQRQQMLC LERHKEPPKE LDTASSDEEN EDGDFTVYEC PGLAPTGEME 300 

VRNPLFDHAA LSAPLPAPSS PPALP 325 



Seq ID NO: C427 Protein Sequence 
Protein Accession #: NP__004436.1 



1 11 21 31 41 51 

11(11! 

MVCSLWVLLL VSSVLALEEV LLDTTGETSE IGWLTYPPGG WDEVSVLDDQ RRLTRTFEAC 60 

HVAGAPPGTG QDNWLQTHFV ERRGAQRAHI RLHFSVRACS SLGVSGGTCR ETFTLYYRQA 120 

EEPDSPDSVS SWHLKRWTKV DTIAADESFP SSSSSSSSSS SAAWAVGPHG AGQRAGLQLN 180 

VKERSFGPLT QRGFYVAFQD TGACLALVAV RLFSYTCPAV LRSFASFPET QASGAGGASL 240 

VAAVGTCVAH ABPEEDGVGG QAGGSPPRLH CNGEGKWMVA VGGCRCQPGY QPARGDKACQ 300 

ACPRGLYKSS AGNAPCSPCP ARSHAPNPAA PVCPCLEGFY RASSDPPEAP CTGPPSAPQB 360 

LWFEVQGSAL MLHWRLPREL GGRGDLLFNV VCKECEGRQE PASGGGGTCH RCRDEVHFDP 420 

RQRGLiTESRV LVGGLRAHVP YILEVQAVNG VSBLSPDPPQ AAAINVSTSH EVPSAVPWH 480 

QVSRASNSIT VSWPQPDQTN GNILDYQLRY YDQAEDESHS FTLTSETNTA TVTQLSPGHI 540 

YGFQVRARTA AGHGPYGGKV YFQTLPQGEL SSQLPBRLSL VIGSILGALA FLLLAAITVL 600 

AWFQRKRRG TGYTEQLQQY SSPGIXJVKYY IDPSTYEDPC QAIRELAREV DPAYIKIEEV 660 

IGTGSFGEVR QGRLQPRGRR EQTVAIQALW AGGAESLQMT FliGRAAVLGQ FQHPNILRLE 720 

GWTKSRPLM VLTEFMEI/3P LDSFLRQREG QFSSLQLVAM QRGVAAAKQY LSSFAFVHRS 780 

LSAHSVLVNS HLVCKVARLG HSPQGPSCLL RWAAPEVIAH GKHTTSSDVW SFGILMWEVM 840 

SYGERPYWDM SEQEVLNAIE QEFRLPPPPG CPPGLHLLML DTWQKDRARR PHFDQLVAAF 900 

DKMIRKPDTL QAGGDPGBRP SQALLTPVAL DFPCLDSPQA WLSAIGLECY QDNFSKFGLC 960 
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TFSDVAQLSL EDLPALGITL AGHQKKLLHH IQLLQQHLRQ QGSVEV 1006 

Seq ID NO: C428 Protein Sequence 
Protein Accession ft: XP_043340.2 

1 11 21 31 41 Si 

I I I I I I 

MPFDFRRFDI YRKVPKDLTQ PTYTGAIISI CCCLPILFLF LSELTGFITT EWNELYVDD €0 

PDKDSGGKID VSLNISLPNL HCELVGLDIQ DEMGRHEVGH IDNSMKIPLN NGAGCRFEGQ 120 

FSINKVPGNF HVSTHSATAQ PQNPDMTHVI HKLSFGDTLQ VQNIHGAFNA LGGADRLTSN 180 

PLASHDYILK IVPTVYEDKS GKQRYSYQYT VANKEYVAYS HTGRIIPAIW FRYDLSPITV 240 

KYTERRQPLY RFITTICAII GGTFTVAGIL DSCIFTASEA WKKIQLGKMH 290 

Seq ID NO: C429 Protein Sequence 
Protein Accession ft: NP_002142.1 

1 11 21 31 41 SI 

I I 'I I I I 

MAQKEGGRTV PCCSRPKVAA LTAGTLLLLT AIQAASWAIV AVLLRSDQEP LYPVQVSSAD 60 

ARLMVFDKTE GTWRLLCSSR SNARVAGLSC EEMGFLRALT HSELDVRTAG ANGTSGFFCV 120 

DEGRLPHTQR LLEVISVCDC PRGRFLAAIC QDCGRRKLPV DRTVGGRDTS LGRWPWQVSL 180 

RYDGAHLCGG SLLSGDWVLT AAHCFPERNR VLSRWRVFAG AVAQASPHGL QLGVQAWYH 240 

GGYLPFRDPN SEENSNDIAL VHLSSPLPLT EYIQPVCLPA AGQALVDGKI CTVTGWGNTQ 300 

YYGQQAGVLQ EARVPIISND VCNOADFYGN QIKPKMFCAG YPEGGIDACQ GDSGGPFVCE 360 

DSISRTPRWR LCGIVSWGTG CALAQKPGVY TKVSDFREWI FQAIKTHSEA SGMVTQL 417 

Seq ID NO: C430 Protein Sequence 
Protein Accession ft: BAA92562.1 

1 11 21 31 41 51 

I I I I I 1 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 

EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 120 

FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 

FSALANTVTL AGGPSFTSKG LKYFHHFTIiS LC6NQGRKMS VCTDNVTDLR IPEGESGFSK 300 

SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLHES 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 

ISAGTCTAII* LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 540 

LIFTSKKSLF GKIKSFTSKQ PAPVTISLSE DS 572 

Seq ID NO: C431 Protein Sequence 
Protein Accession ft: NP_004855.1 

1 11 21 31 41 51 

I I 1 I I I 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SIAEASRASF PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120 

HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 

LAKDCHCI 308 

Seq ID NO: C433 Protein Sequence 
Protein Accession ft: NP_443090.1 

1 11 21 31 41 • 51 

I I I I I ! 

MEDPSGAREP RARPRERDPG RRPHPDQGRT HDRPRDRPGD PRRKRSSDGN RRRDGDRDPK 60 

RDQBRDGNRD RNRDRERSRB RERDPDRGPR RDTHRDAGPR AGEHGWBKP RQSRTRDGAR 120 

GLTWDAAAPP GPAPWEAPEP PQPQRKGDPG RRRPESEPPS ERYLPSTPRP GREEVEYYQS 180 

EAEGLLECHK CKYLCTGRAC CQKLEVLLNL LILACSSVSY SSTGGYTGIT SLGGIYYYQF 240 

GGAYSGFDGA DGEKAQQLDV QFYQLKLPMV TVAMACSGAL TALCCLFVAM GVLRVPWHCP 300 

LLLVTEGLLD MLIAGGYIPA LYFYFHYLiSA AYGSPVCKER QALYQSKGYS GFGCSFHGAD 360 

IGAGIFAALG IWFALGAVL AIKGYRKVRK LKEKPAEMPE P 401 

Seq ID NO: C435 Protein Sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

1111)1 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLUjTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYIiNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKU3SKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGBY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR PLOGKPVRPK LTVTIDTNVN 420 

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGS 480 

EIDGVDMRAB VGLLSRNIIV MGEMEDKCYP YRNHICNP7D PDTPGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSIX3 HCFFTEDGPE BRNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EBTGFWFIPH HVPTGPSVGM YSPGYSEHIP 720 
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LGKFYKNRAH SNYKAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPUCPREPAI 780 
IRHPIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGPLLTG MKAGGILLGG DEAASGMAQG 840 
FSPPCRCLLK LVTTGSPFAH VSLAHS 866 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication, accession number, or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 1 . A method for detennining the presence or absence of a pathological cell in a 

2 patient, said method comprising detecting a nucleic acid comprising a sequence at least 80% 

3 identical to a sequence as described in Tables 2A-80 in a biological sample from said patient, 

4 thereby determining the presence or absence of said pathological cell. 

1 2. The method of Claim 1 , wherein: 

2 a) said pathology is described in Table 1, including a cancer; and/or 

3 b) said biological sample comprises isolated nucleic acids. 

j 1 3. The method of Claim 1, wherein said biological sample is tissue from an organ 

2 which is affected by said pathology of Table 1, including a cancer. 

1 4. The method of Claim 2, wherein said nucleic acids are mRNA 

1 5. The method of Claim 2: 

2 a) further comprising a step of amplifying nucleic acids before said step of detecting 

3 said nucleic acid; or 

4 b) where said detecting is of a protein encoded by said nucleic acid. 

1 6. The method of Claim 1, wherein said nucleic acid comprises a sequence as 

2 described in Tables 2A-80. 

1 7. The method of Claim 2, wherein: 

2 a) said detecting step is carried out by: 

3 i) using a labeled nucleic acid probe; 

4 ii) utilizing a biochip comprising a sequence at least 80% identical to a sequence 

5 as described in Tables 2A-80; or 

6 iii) detecting a polypeptide encoded by said nucleic acid; or 

7 b) said patient is: 

8 i) undergoing a therapeutic regimen to treat said pathology of Table 1 ; or 

9 ii) is suspected of having said pathology or cancer. 

1 8. An isolated nucleic acid molecule comprising a sequence as described in 

2 Tables 2A-80. 
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1 9. The nucleic acid molecule of Claim 8, which is labeled. 

1 10. An expression vector comprising the nucleic acid of Claim 8. 

1 1 1 . A host cell comprising the expression vector of Claim 1 0. 

1 12. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 comprising a sequence as described in Tables 2A-80. 

1 13. An antibody that specifically binds a polypeptide of Claim 12. 

1 14. The antibody of Claim 13: 

2 a) conjugated to an effector component; 

3 b) conjugated to a detectable label, including a fluorescent label, a radioisotope, or a 

4 cytotoxic chemical; 

5 c) which is an antibody fragment; or 

6 d) which is a humanized antibody. 

1 15. A method for specifically targeting a compound to a pathological cell in a 

2 patient, said method comprising administering to said patient an antibody of Claim 13, 

3 thereby providing said targetting. 

1 1 6. A method for determining the presence or absence of a pathological cell in a 

2 patient, said method comprising contacting a biological sample with an antibody of Claim 13. 

1 1 7. The method of Claim 1 6, wherein: 

2 a) said antibody is conjugated to: 

3 i) an effector component; or 

4 ii) a fluorescent label; or 

5 b) said biological sample is a blood, serum, urine, or stool sample. 

1 1 8. A method for identifying a compound that modulates a pathology-associated 

2 polypeptide, said method comprising the steps of: 
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3 a) contacting said compound with a pathology-associated polypeptide, said 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

5 at least 80% identical to a sequence as described in Tables 2A-80; and 

6 b) determining the functional effect of said compound upon said polypeptide. 

1 1 9. A drug screening assay comprising the steps of: 

2 a) administering a test compound to a mammal having a pathology of Table 1 or a 

3 cell isolated therefrom; and 

4 b) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as described in 

6 Tables 2 A-80 in a treated cell or mammal with the level of gene expression of said 

7 polynucleotide in a control cell or mammal, wherein a test compound that 

8 modulates said level of expression of the polynucleotide is a candidate for the 

9 treatment of said pathology. 
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